0% found this document useful (0 votes)

25 views10 pages

Probabilistic Modeling of Deep Features For Out-of-Distribution and Adversarial Detection

Uploaded by

nishant.roy

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

25 views10 pages

Probabilistic Modeling of Deep Features For Out-of-Distribution and Adversarial Detection

Uploaded by

nishant.roy

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 10

Probabilistic Modeling of Deep Features for

Out-of-Distribution and Adversarial Detection

Nilesh. A. Ahuja Ibrahima Ndiour Trushant Kalyanpur Omesh Tickoo

Intel Labs Intel Labs Intel Labs
arXiv:1909.11786v1 [stat.ML] 25 Sep 2019

Abstract
We present a principled approach for detecting out-of-distribution (OOD) and
adversarial samples in deep neural networks. Our approach consists in modeling
the outputs of the various layers (deep features) with parametric probability dis-
tributions once training is completed. At inference, the likelihoods of the deep
features w.r.t the previously learnt distributions are calculated and used to derive
uncertainty estimates that can discriminate in-distribution samples from OOD
samples. We explore the use of two classes of multivariate distributions for model-
ing the deep features - Gaussian and Gaussian mixture - and study the trade-off
between accuracy and computational complexity. We demonstrate benefits of our
approach on image features by detecting OOD images and adversarially-generated
images, using popular DNN architectures on MNIST and CIFAR10 datasets. We
show that more precise modeling of the feature distributions result in significantly
improved detection of OOD and adversarial samples; up to 12 percentage points in
AUPR and AUROC metrics. We further show that our approach remains extremely
effective when applied to video data and associated spatio-temporal features by
detecting adversarial samples on activity classification tasks using UCF101 dataset,
and the C3D network. To our knowledge, our methodology is the first one reported
for reliably detecting white-box adversarial framing, a state-of-the-art adversarial
attack for video classifiers.

1 Introduction
Deep neural networks (DNN) have gained widespread popularity in the last decade, starting with the
winning of ILSVRC-2010 challenge by AlexNet [12]. Since then, research in this area has led to a
proliferation of novelties in methods and architectures that have resulted in dramatic improvements
in accuracy [18, 9] and scalability [7]. An important area of active research is the ability of DNNs
to estimate predictive uncertainty measures, which quantify how much trust should be put in DNN
results. This is a critical requirement for perceptual sub-systems based on deep learning, if we
are to build safe and transparent systems that do not adversely impact humans (e.g. in fields such
as autonomous driving, robotics, or healthcare). Additional imperatives for estimating predictive
uncertainty measures relate to results interpretability, dataset bias, AI safety, and active learning.
Typically, deep networks do not provide reliable confidence scores for their outputs. Softmax is the
most popular score used. Interpreted as a probability, it is a posterior probability and provides a
relative ranking of each output with respect to all other outputs, rather than an absolute confidence
score. By relying solely on softmax scores as a confidence measure, deep neural networks tend to
make overconfident predictions. This is especially true when the input does not resemble the training
data (out-of-distribution), or has been crafted to attack and “fool” the network (adversarial examples).
In this paper, we consider the problem of detecting out-of-distribution (OOD) samples and adversarial
samples in DNNs. Recently, there has been substantial work on this topic from researchers in the
Bayesian deep learning community [5, 11]. In this class of methods, the network’s parameters are

Preprint. Under review.

represented by probability distributions rather than single point values. Such parameters are learned
using variational training. At inference, multiple stochastic forward passes are required to generate
a distribution over the outputs, instead of the typical single forward pass needed in a traditional
(non-Bayesian) DNN. This significantly increases the complexity and requirements in terms of model
representation, computational cost and memory.
Another class of methods attempt to solve this problem by estimating uncertainty directly from a
trained DNN (non-Bayesian). Hendrycks and Gimpel [10] proposed using probabilities from the
softmax distributions to detect misclassified or OOD samples. Liang et al. [14] showed that by
introducing a temperature scaling parameter to the softmax function, the OOD detection performance
could be greatly enhanced relative to [10]. Both these methods use posterior softmax distribution to
perform OOD detection. By contrast, Lee et al. [13] adopted a generative approach and proposed
fitting class-conditional multivariate Gaussian distributions to the pre-trained features of a DNN. The
confidence score was defined as the Mahalanobis distance with respect to the closest class conditional
distribution. Using this confidence score, they obtained impressive results, outperforming both the
previous methods on detecting OOD and adversarial samples. In contrast to Bayesian Deep Learning
based approaches, this class of methods can be applied to existing pre-trained networks, does not
require weights to be represented by distributions, and does not entail the computational overhead of
requiring multiple forward passes during inference.

Contribution: We present an approach for detecting OOD and adversarial samples in deep neural
networks based on probabilistic modeling of the deep-features within a DNN. Conceptually, our
method is most similar to the generative approach in [13] in that we fit class-conditional distributions
to the outputs of the various layers (deep-features), once training is completed. In [13], however, it is
hypothesized that the class-conditional distributions can be modeled as multivariate Gaussians with
shared covariance across all classes. We show that such an assumption is not valid in general; instead,
we adopt a more principled approach in choosing the type of density function to model with. To this
end, we explore the use of two additional types of distributions to model the deep-features: Gaussian
(with separate covariances for each class) and Gaussian mixture models (GMM). We demonstrate that
a more precise modeling of the distributions of features results in significantly improved detection of
OOD and adversarial samples; in particular, we see an improvement of up to 12 percentage points in
the AUPR and AUROC metrics in these tasks.
We also investigate the numerical and computational aspects of this approach. In particular, fitting
distributions to very high-dimensional features can result in severe ill-conditioning during estimation
of the densities parameters. We demonstrate empirically that such issues can be resolved with the
application of dimensionality reduction techniques such as PCA and average pooling.
Finally, in addition to demonstrating the effectiveness of the approach on standard image datasets, we
further show that our approach remains extremely effective when applied to video data and associated
spatio-temporal features by detecting adversarial samples generated by both white-box and black-box
attacks on activity classification tasks on the UCF101 dataset [19] using the C3D network [8]. To our
knowledge, our methodology is the first one reported for reliably detecting white-box adversarial
framing [22], a state-of-the-art adversarial attack for video classifiers.

2 Approach
Suppose we have a deep network trained to recognize samples from N classes, {Ck }, k = 1, . . . , N .
Let fi (x) denote the output at the ith layer of the network, and ni its dimension. As described earlier,
our approach consists of fitting class-conditional probability distributions to the features of a DNN,
once training is completed. By fitting distributions to the deep features induced by training samples,
we are effectively defining a generative model over the deep feature space.
At test time, the log-likelihood scores of the features of a test sample are calculated with respect
to these distributions and used to derive uncertainty estimates that can discriminate in-distribution
samples (which should have high likelihood) from OOD or adversarial samples (which should have
low likelihood). These per-layer likelihoods can also be used for classification in lieu of the softmax
output of the network and give classification accuracy as good as the softmax classifier.
Choice of density function: Lee et al. [13] assumed that the class-conditional densities p(fi (x)|Ck )
are multivariate Gaussian with shared covariance across all classes. The justification was based on

2
Figure 1: Scatterplot and the corresponding density histogram of the logits. It is clearly seen that the
covariances of the two clusters are different.

the following connection between LDA (linear discriminant analysis) and the softmax classifier: in
a generative classifier in which the underlying class-conditional distributions are Gaussians with
tied covariance, the posterior distribution p(Ck |fi (x)) is equivalent to the softmax function with
linear separation boundaries [1]. As we demonstrate empirically via a simple example, the use of a
softmax classifier in a DNN does not automatically imply that the underlying distributions will be
well represented by a Gaussian with tied covariance. In this example, we constructed and trained a
CNN architecture which we call MNET (shown in Figure 3) to classify only two digits (’0’ and ’1’)
from the MNIST dataset. Since only two classes are considered, the final FC-10 layer is replaced by
FC-2. A 2D density histogram of the features from the FC-2 layer is shown in Figure 1. It is obvious
even without performing any goodness-of-fit tests that if a 2D Gaussian was fitted to each cluster, the
covariance of one would be significantly different from that of the other; forcing these to be the same
would result in a poorer fit. Further, even if the assumption of tied-covariance was valid, it would
apply only to the features of the final layer of the network, on which softmax is performed. It would
not be applicable to the inner layers.
In this work, therefore, we relax the assumption of tied covariance, and instead employ more general
parametric probability distributions. The first type is a separate multivariate Gaussian distribution for
each class without the assumption of a tied covariance matrix. Note that this corresponds to the more
general QDA (quadratic discriminant analysis) classifier, which is hence capable of representing a
larger class of distributions. The second type is a Gaussian Mixture Model (GMM). The choice of
GMM is also motivated by the fact that high-dimensional naturally occurring data may not necessarily
occupy all dimensions in the Euclidean space, but may in fact reside on or close to an underlying
sub-manifold [20]. Owing to the more general nature of the GMM, it is a better choice to model
such a distribution. In the toy example shown in Figure 2 data is distributed along the boundary of
an ellipse. It is clear that the GMM is able to model such a distribution well, while a multivariate
Gaussian is a very poor modeling choice for it. It would be interesting to apply more sophisticated
manifold learning techniques, but these are significantly more complex to implement practically;
their use will be explored in future work.
Estimating parameters: The parameters of the class-conditional densities are estimated from the
training set samples by maximum-likelihood. If the chosen density is a multivariate Gaussian
(separate covariance), the maximum-likelihood values of the mean and covariance for class k are
given by the sample mean and sample covariance:

1 X 1 X T
µk = f (x), Σk = (f (x) − µk )(f (x) − µk ) (1)
Mk Mk
x∈Ck x∈Ck

where f (x) are the feature values from the network and the layer subscript i has been dropped for
notational convenience. If the covariance is assumed to be tied across all classes, then all samples x
in the training set are used to estimate the covariance, rather than only x ∈ CK . The estimation of
the mean remains unchanged. If the chosen density is a GMM, its parameters are estimated using
an expectation-maximization (EM) procedure. To choose the number of components in the GMM
(i.e. model selection), we adopt the Bayesian Information Criteria (BIC) to penalize complex models;

3
details on EM and BIC can be found in [1].

input (28 x 28)

conv3-64
maxpool
conv3-32
maxpool
FC-128
FC-10

Figure 3: MNET
Figure 2: Fitting distributions to points on the boundary of an ellipse. architecture
The left side has been fitted with a 10-component GMM, is a clearly
better fit than the right, which has been fitted with a single 2D Gaussian.
Scoring samples: As described earlier, the log-likelihood values are used to measure the closeness
of a sample w.r.t a probability distribution. For an n-dimensional multivariate Gaussian N (µ, Σ), the
log-likelihood of a feature f (x) is given by [1]:

2L = − n log 2π + log(det |Σ|) + (f (x) − µ)T Σ−1 (f (x) − µ)

(2)

Under the assumption of tied covariance, the term n log 2π + log(det |Σ|) is the same for all the class-
conditional distributions and can then be ignored. The remaining term (f (x) − µ)T Σ−1 (f (x) − µ),
which is the Mahalanobis distance, is then adequate to measure the closeness of a test sample to the
modeled distribution. If the covariances are not assumed to be tied, we cannot use the Mahalanobis
distance, and should instead use the full log-likelihood term (ignoring the additive and multiplicative
constants). For GMM, the log-likelihood is a weighted sum of exponential terms.

3 Computational aspects

The use of more general distributions such as multivariate Gaussian (without tied covariance) or
GMMs to model the class-conditional densities brings its own set of challenges. The obvious one is
increased computational complexity, both during modeling and during inference, especially when
using GMMs. The other challenge is the lack of sufficient training data from which to estimate
the parameters of the modeled distributions. For n-dimensional features, if the number of training
samples available, M , is less than n, then the maximum-likelihood estimate of the covariance as
given in Eq. (1), will have a rank M < n and be singular. The problem is exacerbated if GMMs are
used, since a covariance matrix for each component of each class needs to be estimated, dramatically
increasing the number of parameters to be estimated. In such situations, the assumption of tied
covariance can prove helpful, since the number of samples used to estimate the covariance matrix
would increase by a factor of N (the number of classes), and the covariance matrix can hence be
estimated without risk of rank deficiency so long as mN > n. However, as we demonstrate, by
applying appropriate dimensionality-reduction techniques, we can not only mitigate these issues, but
actually improve the eventual detection and classification scores by enabling the use of more general
distributions.
Further, as the dimensionality of the features being modeled increases, it poses numerical challenges
which result in highly ill-conditioned covariance matrices. For this reason too, application of some
form of dimensionality reduction is recommended. Here, we follow a two-fold approach: average
pooling of very high-dimensional layers and applying PCA for projecting onto a lower dimensional
subspace. In our experiments, we average pool by a factor of 4. This number was chosen empirically,
primarily to enhance computational efficiency. While applying PCA, one can specify the fraction
of the variance of the original data that should be retained in the lower-dimensional subspace. We
choose a high value of 0.995, i.e. we retain 99.5% of the original variance. This resulted in a dramatic
reduction in the feature dimensions, at times up to 90%, indicating that 99.5% of the information in
the features is actually contained in a much lower- dimensional subspace.

4
Table 1: Classification accuracy
MNIST CIFAR10(Resnet) CIFAR10(densenet)
GMM Sep Tied GMM Sep Tied GMM Sep Tied
Layer 0 98.9 98.6 98.6 90.2 90.5 89.9 90.3 90.0 89.8
Layer 1 98.2 98.6 98.6 89.9 90.5 90.0 89.1 88.7 89.7
Layer 2 86.0 97.4 98.3 90.1 90.5 90.0 88.9 88.9 89.9
Softmax 98.99 89.09 89.14

4 Experiments and Results

4.1 Applications in Image Classification tasks

Experimental setup and evaluation metrics We use MNIST and CIFAR10 as the in-distribution
datasets. For MNIST, we use FashionMNIST and EMNIST Letters [3] as the OOD datasets. For
CIFAR10, we use SVHN dataset [16] and a resized version of the LSUN datasets [21] as the OOD
datasets. To test against adversarial attacks, we use the FGSM attack introduced by Goodfellow
et al. [6]. In all experiments, the parameters of the fitted density function are estimated from the
training split of the in-distribution dataset, while performance metrics (accuracy, AUPR, AUROC)
are calculated on the test split.
For MNIST, use the MNET architecture as shown in Figure 3. For CIFAR10, we use two publicly
available architectures: Resnet50 and Densenet-BC. For reasons of computational efficiency, we
perform our experiments on 3 layers of the networks listed above. In MNET, these are the final 3
layers; in Densenet and Resnet, these are the outputs of the corresponding final 3 dense or residual
blocks. The layers are labelled as 0, 1, and 2, with 0 being the outermost layer, and 1,2 being inside
the network. Layers further inside can easily be included too, but these typically have outputs of very
high-dimensions and require aggressive dimensionality reduction in order to process them efficiently.
During testing, the log-likelihood scores of the features generated by a test sample are calculated.
These can then be used to distinguish between in-distribution and out-of-distribution data, effectively
creating a binary classifier. The performance of this classifier can be characterized by typical methods
such as the precision-recall (PR) curve or the receiver operating characteristics (ROC curve). Davis
and Goadrich [4] showed that although the PR and ROC curves are equivalent, maximizing the area
under ROC (AUROC) is not equivalent to maximizing area under precision-recall (AUPR). We report,
therefore, both metrics in our results. To calculate these metrics, we used the scikit-learn library [17].

Results We want to first demonstrate the effectiveness of our approach by using it to perform
classification based solely on the log-likelihood scores w.r.t the class-conditional distributions of a
particular layer. The classification accuracy using this scheme is measured on the test set for each
layer individually. The results are summarized in Table 1. It is seen that the classification accuracy
using the proposed method is comparable, if not slightly better, than the softmax-based accuracy,
indicating that our scheme is as good as softmax for classification of in-distribution samples.
To see the performance on OOD samples, we calculate the AUPR and AUROC scores as described
earlier. In particular, we examine the change in AUPR and AUROC values obtained by using the
more general distribution types (outlined in Section 2) relative to those obtained by using the baseline
distribution (multivariate Gaussian with tied covariance). The results are presented in Tables 2 and 3.
It is seen that the use of the more general distribution types results in improvements, often significant,
in the AUPR and AUROC scores over the baseline distribution. On the few instances in which the
baseline distribution achieves the best score, it is by a small margin. It is further interesting to examine
the improvements in scores over the baseline distribution as a function of the layer being modeled.
These results are summarized in Table 4, which shows the average change (across all tested datasets)
in the AUPR and AUROC scores per layer. For all layers, switching to a more general distribution
produces an improvement in the scores. However, the extent of the improvement increases the further
we are from the final output layer. This is consistent with the reasoning described in Section 2 that
the assumption of a Gaussian with tied covariance is a valid approximation for the output layer only,
and not the inner layers.

5
Table 2: AUPR (%) scores from three different density functions: GMM, Sep (Gaussian with separate
covariance per class), Tied (Gaussian with tied covariance). Best values are shown in bold.
FashionMNIST EMNIST FGSM, = 0.2
MNIST
GMM Sep Tied GMM Sep Tied GMM Sep Tied
Layer 0 86.4 84.8 81.8 66.5 61.1 53.4 96.0 95.3 94.8
Layer 1 84.5 81.3 53.5 67.5 72.8 65.4 96.4 95.3 88.0
Layer 2 88.4 90.9 58.1 72.0 77.7 58.1 97.7 97.7 86.1
CIFAR10 SVHN LSUN FGSM, = 0.1
(Resnet) GMM Sep Tied GMM Sep Tied GMM Sep Tied
Layer 0 95.0 93.8 91.5 64.4 62.8 56.0 92.5 90.7 89.8
Layer 1 95.2 94.9 93.4 57.9 68.8 68.0 93.2 92.8 92.4
Layer 2 95.1 94.9 93.4 57.9 68.8 68.0 93.2 92.8 92.4
CIFAR10 SVHN LSUN FGSM, = 0.1
(Densenet) GMM Sep Tied GMM Sep Tied GMM Sep Tied
Layer 0 79.1 77.2 79.6 29.5 28.9 21.9 85.8 86.1 84.6
Layer 1 86.8 85.6 75.9 77.7 80.4 79.3 90.6 90.2 87.0
Layer 2 80.4 80.4 57.1 37.2 37.2 27.1 87.2 87.2 78.1

Table 3: AUROC scores (%) from three different density functions: GMM, Sep (Gaussian with
separate covariance per class), Tied (Gaussian with tied covariance). Best values are shown in bold.
FashionMNIST EMNIST FGSM, = 0.2
MNIST
GMM Sep Tied GMM Sep Tied GMM Sep Tied
Layer 0 92.9 91.8 92.1 94.0 93.2 91.9 87.2 85.9 84.4
Layer 1 92.9 93.5 75.3 96.2 96.3 93.4 88.6 85.7 66.8
Layer 2 97.0 97.5 89.0 96.7 97.2 93.1 92.0 92.0 62.6
CIFAR10 SVHN LSUN FGSM, = 0.1
(Resnet) GMM Sep Tied GMM Sep Tied GMM Sep Tied
Layer 0 94.2 93.1 90.0 78.2 80.8 79.8 91.0 89.0 87.9
Layer 1 94.3 94.3 92.3 75.3 82.9 84.3 92.5 91.9 91.8
Layer 2 94.2 94.3 92.3 75.7 82.9 84.3 92.4 91.9 91.8
CIFAR10 SVHN LSUN FGSM, = 0.1
(Densenet) GMM Sep Tied GMM Sep Tied GMM Sep Tied
Layer 0 76.7 75.2 77.2 69.5 69.9 64.8 85.7 86.4 83.6
Layer 1 85.2 84.7 73.3 94.8 95.2 95.4 92.7 92.5 90.3
Layer 2 78.1 78.1 51.1 78.2 78.2 74.8 88.0 88.0 80.5

Table 4: Average improvements in scores over the baseline distribution

AUPR change AUROC change
GMM Sep GMM Sep
Layer 0 4.64 3.00 1.97 1.51
Layer 1 5.21 6.60 5.51 6.01
Layer 2 10.1 12.1 8.09 8.96

6
Figure 4: t-SNE visualization of (spatio-temporal) feature embeddings for UCF101 using C3D
Resnet101 (Layer 1).

Finally, note that the improvements obtained by using a multivarate Gaussian (separate covariance)
and GMM are very comparable. This observation is consistent with the reasoning in Section 3 that
the larger number of parameters to be estimated for a GMM can cause more ill-conditioning during
estimation, especially in the case of high-dimension and limited training samples, which might lead
to sub-optimal parameter estimates.

4.2 Application to Activity Classification in Videos

While there is extensive work on adversarial attacks against image classifiers, the reported cases
of video classifier attacks remains very limited. Here, we consider one such case on a video-based
human activity classifier. The setup is the following: we use the original UCF101 dataset (split into
training and testing subsets) to train a 3D deep neural network, C3D ResNet101 [8], for human
action classification. With our trained model, we obtain 84.1% accuracy on the test set. We then
use a state-of-the-art video attack method known as adversarial framing [22] to generate adversarial
samples. Adversarial framing is a simple, yet very potent video attack method that operates by
keeping most of the video frames intact, but adds a 4-pixel wide framing at the border of each frame.
We employ both a white-box attack where we assume full knowledge of the action recognition
classifier (architecture and parameters) and a black-box attack where no such knowledge is available.
These allow us to generate two sets of adversarial frames of the UCF101 dataset that are fed as inputs
to the action classifier. The classifier’s recognition accuracy drops to 63.1% and 4.2% for black-box
and white-box attacks respectively. For the sake of brevity, we show visualizations only for the
white-box attack, but results for both types of attacks are fully reported in Table 5.
For this experiment, we fit distributions to the features from the logit layer (output of last layer before
softmax) and the preceding layer (feature embeddings), denoted as Layers 0 and 1 respectively. Figure
4 provides a visualization of the feature embeddings via t-SNE [15] for the genuine data and white-
box attacked data, showing the potency of the white-box adversarial framing attack. Subsequently,
the adversarial samples are passed through the network and the corresponding uncertainty (log-
likelihood) scores at layers 1 and 0 are calculated. Figure 5 shows the density histogram of these
scores for the in-distribution data and white-box attacked data. Note that while the recognition
accuracy dramatically plummeted to 4.2% for the white-box adversarial samples, the softmax scores
shows even more confidence: the network outputs are wrong, yet the network is ever more confident.
Using our approach, Figure 5 shows that while the network still provides wrong classification results,
it is now able to produce reliable uncertainty metrics showing poor confidence in the generated
outputs. Moreover, the discrimination between in-distribution and OOD samples (adversarial here) is
clearly improved with the more general choices of distributions (GMM and Gaussian with separate
covariances). These improvements are captured quantitatively with AUPR and AUROC metrics in
Table 5 for both white-box and black-box attacks.

7
(a) Softmax (b) Gaussian with tied covariance

(c) Gaussian with separate covariances (d) Gaussian mixture

Figure 5: Density histogram of softmax and log-likelihood scores for Layer 0 (in-distribution samples
in green, white-box adversarial samples in red).

Table 5: Quantitative metrics (AUROC, AUPR) for video attack detection

White box attack Black box attack
AUROC AUPR AUROC AUPR
GMM Sep Tied GMM Sep Tied GMM Sep Tied GMM Sep Tied
Layer 0 97.6 99.4 83.2 97.7 99.3 88.4 86.6 90.8 83.6 90.3 93.2 87.6
Layer 1 91.3 92.3 91.7 92.9 93.7 93.9 86.4 86.9 83.6 90.4 90.8 87.3
Softmax 12.0 32.8 72.7 76.8

5 Conclusions and Future Work

This paper presented a method for modeling the outputs of the various DNN layers (deep-features)
with parametric probability distributions, with applications to adversarial and out-of-distribution
sample detection. We showed that accurate modeling of the class-conditional distributions can enable
the derivation of reliable uncertainty scores. The methodology was theoretically motivated, and
experimentally proven by showing improvements to out-of-distribution detection, and adversarial
sample detection on both image and video data. In particular, we report adversarial sample detection
against a state-of-the-art video classifier attack.
While this work performed feature modeling based on a trained model, future work will seek to
analyze the evolution of the feature distributions during training. Given the complexities arising
from parameter estimation on high-dimensional spaces, we will also consider fitting distributions
on features induced by larger pre-training datasets (e.g. ImageNet, Sports1M, Kinetics [2]) and
subsequently use the estimated parameters as priors for modeling the features on the (smaller) dataset
of interest.

8
References
[1] Bishop, C. M. (2006). Pattern recognition and machine learning. springer.
[2] Carreira, J. and Zisserman, A. (2017). Quo vadis, action recognition? a new model and
the kinetics dataset. In Proceedings of the IEEE Conference on Computer Vision and Pattern
Recognition (CVPR), page 4724–4733.
[3] Cohen, G., Afshar, S., Tapson, J., and van Schaik, A. (2017). Emnist: an extension of mnist to
handwritten letters. arXiv preprint arXiv:1702.05373.
[4] Davis, J. and Goadrich, M. (2006). The relationship between precision-recall and roc curves. In
Proceedings of the 23rd international conference on Machine learning, pages 233–240. ACM.
[5] Gal, Y. and Ghahramani, Z. (2016). Dropout as a bayesian approximation: Representing model
uncertainty in deep learning. In international conference on machine learning, pages 1050–1059.
[6] Goodfellow, I. J., Shlens, J., and Szegedy, C. (2015). Explaining and harnessing adversarial
examples.
[7] Guo, Y., Yao, A., and Chen, Y. (2016). Dynamic network surgery for efficient dnns. In Advances
in Neural Information Processing Systems, pages 1379–1387.
[8] Hara, K., Kataoka, H., and Satoh, Y. (2018). Can spatiotemporal 3d cnns retrace the history of
2d cnns and imagenet? In Proceedings of the IEEE Conference on Computer Vision and Pattern
Recognition (CVPR), pages 6546–6555.
[9] He, K., Zhang, X., Ren, S., and Sun, J. (2016). Deep residual learning for image recognition. In
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages
770–778.
[10] Hendrycks, D. and Gimpel, K. (2017). A baseline for detecting misclassified and out-of-
distribution examples in neural networks.
[11] Kendall, A. and Gal, Y. (2017). What uncertainties do we need in bayesian deep learning for
computer vision? In Advances in neural information processing systems, pages 5574–5584.
[12] Krizhevsky, A., Sutskever, I., and Hinton, G. E. (2012). Imagenet classification with deep
convolutional neural networks. In Advances in neural information processing systems, pages
1097–1105.
[13] Lee, K., Lee, K., Lee, H., and Shin, J. (2018). A simple unified framework for detecting
out-of-distribution samples and adversarial attacks. In Advances in Neural Information Processing
Systems, pages 7167–7177.
[14] Liang, S., Li, Y., and Srikant, R. (2018). Enhancing the reliability of out-of-distribution image
detection in neural networks.
[15] Maaten, L. v. d. and Hinton, G. (2008). Visualizing data using t-sne. Journal of machine
learning research, 9(Nov):2579–2605.
[16] Netzer, Y., Wang, T., Coates, A., Bissacco, A., Wu, B., and Ng, A. Y. (2011). Reading digits in
natural images with unsupervised feature learning.
[17] Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M.,
Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J., Passos, A., Cournapeau, D., Brucher,
M., Perrot, M., and Duchesnay, E. (2011). Scikit-learn: Machine learning in Python. Journal of
Machine Learning Research, 12:2825–2830.
[18] Simonyan, K. and Zisserman, A. (2015). Very deep convolutional networks for large-scale
image recognition.
[19] Soomro, K., Zamir, A. R., and Shah, M. (2012). Ucf101: A dataset of 101 human actions
classes from videos in the wild. arXiv preprint arXiv:1212.0402.

9
[20] Tenenbaum, J. B., De Silva, V., and Langford, J. C. (2000). A global geometric framework for
nonlinear dimensionality reduction. science, 290(5500):2319–2323.
[21] Yu, F., Zhang, Y., Song, S., Seff, A., and Xiao, J. (2015). Lsun: Construction of a large-scale
image dataset using deep learning with humans in the loop. arXiv preprint arXiv:1506.03365.
[22] Zajac,
˛ M., Żołna, K., Rostamzadeh, N., and Pinheiro, P. (2019). Adversarial framing for image
and video classification. In AAAI Conference on Artificial Intelligence.

NeurIPS 2018 A Simple Unified Framework For Detecting Out of Distribution Samples and Adversarial Attacks Paper
No ratings yet
NeurIPS 2018 A Simple Unified Framework For Detecting Out of Distribution Samples and Adversarial Attacks Paper
11 pages
NeurIPS-2020-towards-maximizing-the-representation-gap-between-in-domain-out-of-distribution-examples-Paper
No ratings yet
NeurIPS-2020-towards-maximizing-the-representation-gap-between-in-domain-out-of-distribution-examples-Paper
12 pages
Approximations To The Fisher Information Metric of Deep Generative Models For Out-Of-Distribution Detection
No ratings yet
Approximations To The Fisher Information Metric of Deep Generative Models For Out-Of-Distribution Detection
32 pages
2001.05419v3
No ratings yet
2001.05419v3
14 pages
Exploring Adversarial Training For Out-of-Distribution Detection
No ratings yet
Exploring Adversarial Training For Out-of-Distribution Detection
6 pages
2211.11255
No ratings yet
2211.11255
19 pages
SSD: A U F S - S O D: Nified Ramework FOR ELF Upervised Utlier Etection
No ratings yet
SSD: A U F S - S O D: Nified Ramework FOR ELF Upervised Utlier Etection
17 pages
1876 Diffusion Based Probabilistic
No ratings yet
1876 Diffusion Based Probabilistic
27 pages
G E R A D: Enerative Nsembles For Obust Nomaly Etection
No ratings yet
G E R A D: Enerative Nsembles For Obust Nomaly Etection
10 pages
Exploring Feature Sparsity For Out-Of-Distribution Detection
No ratings yet
Exploring Feature Sparsity For Out-Of-Distribution Detection
14 pages
A Survey On Uncertainty Estimation in Deep Learning Classification Systems From A Bayesian Perspective
No ratings yet
A Survey On Uncertainty Estimation in Deep Learning Classification Systems From A Bayesian Perspective
35 pages
1806.01768v3
No ratings yet
1806.01768v3
12 pages
2106.05964
No ratings yet
2106.05964
72 pages
2022-xu-huang-zheng-wornell-entropy
No ratings yet
2022-xu-huang-zheng-wornell-entropy
28 pages
Deep_Generative_Models_to_Counter_Class_Imbalance_A_Model-Metric_Mapping_With_Proportion_Calibration_Methodology
No ratings yet
Deep_Generative_Models_to_Counter_Class_Imbalance_A_Model-Metric_Mapping_With_Proportion_Calibration_Methodology
19 pages
VMIFGSM
No ratings yet
VMIFGSM
18 pages
Killing It With Zero-Shot Adversarially Robust Novelty Detection
No ratings yet
Killing It With Zero-Shot Adversarially Robust Novelty Detection
5 pages
Logit Disagreement: OoD Detection with Bayesian Neural Networks
No ratings yet
Logit Disagreement: OoD Detection with Bayesian Neural Networks
14 pages
A Survey of Uncertainty in Deep Neural Networks
No ratings yet
A Survey of Uncertainty in Deep Neural Networks
41 pages
Novos Classificadores
No ratings yet
Novos Classificadores
38 pages
RMC- Asha Sharmani Sem 2 (1)
No ratings yet
RMC- Asha Sharmani Sem 2 (1)
11 pages
Monte Carlo Averaging for Uncertainty Estimation i
No ratings yet
Monte Carlo Averaging for Uncertainty Estimation i
13 pages
978-3-031-27481-7_36 (1)
No ratings yet
978-3-031-27481-7_36 (1)
13 pages
On the Dilemma of Out-of-distribution Detection
No ratings yet
On the Dilemma of Out-of-distribution Detection
30 pages
Vol 8 No 0103
No ratings yet
Vol 8 No 0103
5 pages
2507.01831v1
No ratings yet
2507.01831v1
26 pages
Out-Of-Distribution Detection in Long-Tailed Recognition
No ratings yet
Out-Of-Distribution Detection in Long-Tailed Recognition
9 pages
1050_vos_learning_what_you_don_t_kn
No ratings yet
1050_vos_learning_what_you_don_t_kn
21 pages
50 Breakthrough AI Concepts in 500 Words Each: In 500 words, #17
From Everand
50 Breakthrough AI Concepts in 500 Words Each: In 500 words, #17
Nietsnie Trebla
No ratings yet
NeurIPS 2023 Dream the Impossible Outlier Imagination With Diffusion Models Paper Conference
No ratings yet
NeurIPS 2023 Dream the Impossible Outlier Imagination With Diffusion Models Paper Conference
24 pages
Final Presentation
No ratings yet
Final Presentation
18 pages
UQ Review
No ratings yet
UQ Review
129 pages
Chen Scoring Your Prediction On Unseen Data CVPRW 2023 Paper
No ratings yet
Chen Scoring Your Prediction On Unseen Data CVPRW 2023 Paper
10 pages
Sampath Et Al. - 2021 - A Survey On Generative Adversarial Networks For Im
No ratings yet
Sampath Et Al. - 2021 - A Survey On Generative Adversarial Networks For Im
60 pages
arXiv 2412.00278
No ratings yet
arXiv 2412.00278
7 pages
Adversarial Examples Are Misaligned in Diffusion Model Manifolds
No ratings yet
Adversarial Examples Are Misaligned in Diffusion Model Manifolds
23 pages
New Adversarial Image Detection Based On Sentiment Analysis
No ratings yet
New Adversarial Image Detection Based On Sentiment Analysis
15 pages
Magdiff:: Covariate Data Set Shift Detection Via Activation Graphs of Deep Neural Networks
No ratings yet
Magdiff:: Covariate Data Set Shift Detection Via Activation Graphs of Deep Neural Networks
19 pages
Choi Et Al. - 2022 - Imbalanced Data Classification Via Cooperative Int
No ratings yet
Choi Et Al. - 2022 - Imbalanced Data Classification Via Cooperative Int
14 pages
Machine Learning - Advanced Concepts
From Everand
Machine Learning - Advanced Concepts
Derrick Mwiti
No ratings yet
Wang 2021
No ratings yet
Wang 2021
11 pages
1910.04851
No ratings yet
1910.04851
11 pages
Secure Machine Learning Against Adversarial Samples at Test Time
No ratings yet
Secure Machine Learning Against Adversarial Samples at Test Time
15 pages
AdversarialML GiovIEIIT 20230413
No ratings yet
AdversarialML GiovIEIIT 20230413
27 pages
Evidence Reconciled Neural Network For Out-of-Distribution Detection in Medical Images
No ratings yet
Evidence Reconciled Neural Network For Out-of-Distribution Detection in Medical Images
11 pages
Data Augmentation in Network Intrusion Detection Based on S-DCGAN
No ratings yet
Data Augmentation in Network Intrusion Detection Based on S-DCGAN
6 pages
failing_loudly
No ratings yet
failing_loudly
38 pages
Averly Unified Out-Of-Distribution Detection a Model-Specific Perspective ICCV 2023 Paper
No ratings yet
Averly Unified Out-Of-Distribution Detection a Model-Specific Perspective ICCV 2023 Paper
11 pages
Main
No ratings yet
Main
9 pages
Out-of-Distribution Detection With Deep Nearest Neighbors: Lee Et Al. 2018 Tack Et Al. 2020 Sehwag Et Al. 2021
No ratings yet
Out-of-Distribution Detection With Deep Nearest Neighbors: Lee Et Al. 2018 Tack Et Al. 2020 Sehwag Et Al. 2021
14 pages
ids deep network belief
No ratings yet
ids deep network belief
15 pages
COCL Arxiv
No ratings yet
COCL Arxiv
13 pages
CVPR 2016 Imbalanced
No ratings yet
CVPR 2016 Imbalanced
10 pages
Information Fusion: Sciencedirect
No ratings yet
Information Fusion: Sciencedirect
55 pages
Cost Effective Transfer of Reinforcement Learn - 2024 - Expert Systems With Appl
No ratings yet
Cost Effective Transfer of Reinforcement Learn - 2024 - Expert Systems With Appl
15 pages
Adversarial_Examples_Presentation
No ratings yet
Adversarial_Examples_Presentation
29 pages
thesis 2020-Towards machine self-awareness - A Bayesian framework for uncerta
No ratings yet
thesis 2020-Towards machine self-awareness - A Bayesian framework for uncerta
97 pages
Full Text
No ratings yet
Full Text
15 pages
Towards Open Set Deep Networks: Abhijit Bendale, Terrance E. Boult University of Colorado at Colorado Springs
No ratings yet
Towards Open Set Deep Networks: Abhijit Bendale, Terrance E. Boult University of Colorado at Colorado Springs
14 pages
Advancing Out-of-Distribution Detection Through Data Purification and Dynamic Activation Function Design
No ratings yet
Advancing Out-of-Distribution Detection Through Data Purification and Dynamic Activation Function Design
10 pages
Feature Extraction Techniques
No ratings yet
Feature Extraction Techniques
32 pages
PDF Big Data Iot and Machine Learning Tools and Applications Internet of Everything Ioe 1St Edition Rashmi Agrawal Editor Ebook Full Chapter
100% (4)
PDF Big Data Iot and Machine Learning Tools and Applications Internet of Everything Ioe 1St Edition Rashmi Agrawal Editor Ebook Full Chapter
54 pages
In The Eye of The Beholder: A Survey of Models For Eyes and Gaze
No ratings yet
In The Eye of The Beholder: A Survey of Models For Eyes and Gaze
23 pages
Chapter 4 - Dimension Reduction: Data Mining For Business Intelligence
No ratings yet
Chapter 4 - Dimension Reduction: Data Mining For Business Intelligence
24 pages
MLPPT 5
No ratings yet
MLPPT 5
97 pages
Cheat Sheet-Building Unsupervised Learning Models
No ratings yet
Cheat Sheet-Building Unsupervised Learning Models
3 pages
A Concise Review On Recent Developments of Machine Learning For The Prediction of Vibrational Spectra-NA
No ratings yet
A Concise Review On Recent Developments of Machine Learning For The Prediction of Vibrational Spectra-NA
12 pages
Malicious URL Detection Using Machine Learning 2
No ratings yet
Malicious URL Detection Using Machine Learning 2
24 pages
Kernel Multivariate Analysis Framework F
No ratings yet
Kernel Multivariate Analysis Framework F
12 pages
Data Mining: Dimensionality Reduction Pca - SVD
No ratings yet
Data Mining: Dimensionality Reduction Pca - SVD
33 pages
ACS-Recognition-of-Prior-Learning-(RPL)-Form-2024-v2 (1)
No ratings yet
ACS-Recognition-of-Prior-Learning-(RPL)-Form-2024-v2 (1)
17 pages
UNIT-3: Face Recognition
No ratings yet
UNIT-3: Face Recognition
47 pages
Unit-Iv Material
No ratings yet
Unit-Iv Material
24 pages
Machine Learning in Python For Process Systems Engineering: Ankur Kumar, Jesus Flores-Cerrillo
No ratings yet
Machine Learning in Python For Process Systems Engineering: Ankur Kumar, Jesus Flores-Cerrillo
352 pages
Data modification and predictive analytics_MCQ_1_2 (1)
No ratings yet
Data modification and predictive analytics_MCQ_1_2 (1)
24 pages
dm1
No ratings yet
dm1
52 pages
ML Module
No ratings yet
ML Module
129 pages
Dimension Reduction _ Dimensionality Reduction Techniques
No ratings yet
Dimension Reduction _ Dimensionality Reduction Techniques
5 pages
ML Unit-1
No ratings yet
ML Unit-1
15 pages
A New Method For Dimensionality Reduction Using K-Means Clustering Algorithm For High Dimensional Data Set
No ratings yet
A New Method For Dimensionality Reduction Using K-Means Clustering Algorithm For High Dimensional Data Set
6 pages
Fake News Analysis
No ratings yet
Fake News Analysis
46 pages
AIML-5th-Sem_Pattern Recognition_Dr. Sudipta Chakrabarty
No ratings yet
AIML-5th-Sem_Pattern Recognition_Dr. Sudipta Chakrabarty
73 pages
Python Course Outline
No ratings yet
Python Course Outline
24 pages
EE769-11 Dimension Reduction
No ratings yet
EE769-11 Dimension Reduction
16 pages
ISO IEC 23053-2022 (2)
No ratings yet
ISO IEC 23053-2022 (2)
44 pages
Research_Paper3
No ratings yet
Research_Paper3
10 pages
Adoc - Pub Irfan Abbas Vincent Suhartono Stefanus Santosa Abs
No ratings yet
Adoc - Pub Irfan Abbas Vincent Suhartono Stefanus Santosa Abs
15 pages
Articulo en Ingles Sobre IA
No ratings yet
Articulo en Ingles Sobre IA
43 pages
Data Modeling - Cheatsheet
No ratings yet
Data Modeling - Cheatsheet
9 pages
Business Analytics and Data Mining Modeling Using R
No ratings yet
Business Analytics and Data Mining Modeling Using R
6 pages

Probabilistic Modeling of Deep Features For Out-of-Distribution and Adversarial Detection

Uploaded by

Probabilistic Modeling of Deep Features For Out-of-Distribution and Adversarial Detection

Uploaded by

Probabilistic Modeling of Deep Features for

Out-of-Distribution and Adversarial Detection

Nilesh. A. Ahuja Ibrahima Ndiour Trushant Kalyanpur Omesh Tickoo

Preprint. Under review.

input (28 x 28)

2L = − n log 2π + log(det |Σ|) + (f (x) − µ)T Σ−1 (f (x) − µ)

4 Experiments and Results

4.1 Applications in Image Classification tasks

Table 4: Average improvements in scores over the baseline distribution

4.2 Application to Activity Classification in Videos

(c) Gaussian with separate covariances (d) Gaussian mixture

Table 5: Quantitative metrics (AUROC, AUPR) for video attack detection

5 Conclusions and Future Work

You might also like