-
HLoOP -- Hyperbolic 2-space Local Outlier Probabilities
Authors:
Clémence Allietta,
Jean-Philippe Condomines,
Jean-Yves Tourneret,
Emmanuel Lochin
Abstract:
Hyperbolic geometry has recently garnered considerable attention in machine learning due to its capacity to embed hierarchical graph structures with low distortions for further downstream processing. This paper introduces a simple framework to detect local outliers for datasets grounded in hyperbolic 2-space referred to as HLoOP (Hyperbolic Local Outlier Probability). Within a Euclidean space, wel…
▽ More
Hyperbolic geometry has recently garnered considerable attention in machine learning due to its capacity to embed hierarchical graph structures with low distortions for further downstream processing. This paper introduces a simple framework to detect local outliers for datasets grounded in hyperbolic 2-space referred to as HLoOP (Hyperbolic Local Outlier Probability). Within a Euclidean space, well-known techniques for local outlier detection are based on the Local Outlier Factor (LOF) and its variant, the LoOP (Local Outlier Probability), which incorporates probabilistic concepts to model the outlier level of a data vector. The developed HLoOP combines the idea of finding nearest neighbors, density-based outlier scoring with a probabilistic, statistically oriented approach. Therefore, the method consists in computing the Riemmanian distance of a data point to its nearest neighbors following a Gaussian probability density function expressed in a hyperbolic space. This is achieved by defining a Gaussian cumulative distribution in this space. The HLoOP algorithm is tested on the WordNet dataset yielding promising results. Code and data will be made available on request for reproductibility.
△ Less
Submitted 6 December, 2023;
originally announced December 2023.
-
A Robust and Flexible EM Algorithm for Mixtures of Elliptical Distributions with Missing Data
Authors:
Florian Mouret,
Alexandre Hippert-Ferrer,
Frédéric Pascal,
Jean-Yves Tourneret
Abstract:
This paper tackles the problem of missing data imputation for noisy and non-Gaussian data. A classical imputation method, the Expectation Maximization (EM) algorithm for Gaussian mixture models, has shown interesting properties when compared to other popular approaches such as those based on k-nearest neighbors or on multiple imputations by chained equations. However, Gaussian mixture models are k…
▽ More
This paper tackles the problem of missing data imputation for noisy and non-Gaussian data. A classical imputation method, the Expectation Maximization (EM) algorithm for Gaussian mixture models, has shown interesting properties when compared to other popular approaches such as those based on k-nearest neighbors or on multiple imputations by chained equations. However, Gaussian mixture models are known to be non-robust to heterogeneous data, which can lead to poor estimation performance when the data is contaminated by outliers or follows non-Gaussian distributions. To overcome this issue, a new EM algorithm is investigated for mixtures of elliptical distributions with the property of handling potential missing data. This paper shows that this problem reduces to the estimation of a mixture of Angular Gaussian distributions under generic assumptions (i.e., each sample is drawn from a mixture of elliptical distributions, which is possibly different for one sample to another). In that case, the complete-data likelihood associated with mixtures of elliptical distributions is well adapted to the EM framework with missing data thanks to its conditional distribution, which is shown to be a multivariate $t$-distribution. Experimental results on synthetic data demonstrate that the proposed algorithm is robust to outliers and can be used with non-Gaussian data. Furthermore, experiments conducted on real-world datasets show that this algorithm is very competitive when compared to other classical imputation methods.
△ Less
Submitted 22 May, 2023; v1 submitted 28 January, 2022;
originally announced January 2022.
-
Reconstruction of Sentinel-2 Time Series Using Robust Gaussian Mixture Models -- Application to the Detection of Anomalous Crop Development in wheat and rapeseed crops
Authors:
Florian Mouret,
Mohanad Albughdadi,
Sylvie Duthoit,
Denis Kouamé,
Guillaume Rieu,
Jean-Yves Tourneret
Abstract:
Missing data is a recurrent problem in remote sensing, mainly due to cloud coverage for multispectral images and acquisition problems. This can be a critical issue for crop monitoring, especially for applications relying on machine learning techniques, which generally assume that the feature matrix does not have missing values. This paper proposes a Gaussian Mixture Model (GMM) for the reconstruct…
▽ More
Missing data is a recurrent problem in remote sensing, mainly due to cloud coverage for multispectral images and acquisition problems. This can be a critical issue for crop monitoring, especially for applications relying on machine learning techniques, which generally assume that the feature matrix does not have missing values. This paper proposes a Gaussian Mixture Model (GMM) for the reconstruction of parcel-level features extracted from multispectral images. A robust version of the GMM is also investigated, since datasets can be contaminated by inaccurate samples or features (e.g., wrong crop type reported, inaccurate boundaries, undetected clouds, etc). Additional features extracted from Synthetic Aperture Radar (SAR) images using Sentinel-1 data are also used to provide complementary information and improve the imputations. The robust GMM investigated in this work assigns reduced weights to the outliers during the estimation of the GMM parameters, which improves the final reconstruction. These weights are computed at each step of an Expectation-Maximization (EM) algorithm by using outlier scores provided by the isolation forest algorithm. Experimental validation is conducted on rapeseed and wheat parcels located in the Beauce region (France). Overall, we show that the GMM imputation method outperforms other reconstruction strategies. A mean absolute error (MAE) of 0.013 (resp. 0.019) is obtained for the imputation of the median Normalized Difference Index (NDVI) of the rapeseed (resp. wheat) parcels. Other indicators (e.g., Normalized Difference Water Index) and statistics (for instance the interquartile range, which captures heterogeneity among the parcel indicator) are reconstructed at the same time with good accuracy. In a dataset contaminated by irrelevant samples, using the robust GMM is recommended since the standard GMM imputation can lead to inaccurate imputed values.
△ Less
Submitted 9 May, 2022; v1 submitted 22 October, 2021;
originally announced October 2021.
-
Outlier detection at the parcel-level in wheat and rapeseed crops using multispectral and SAR time series
Authors:
Florian Mouret,
Mohanad Albughdadi,
Sylvie Duthoit,
Denis Kouamé,
Guillaume Rieu,
Jean-Yves Tourneret
Abstract:
This paper studies the detection of anomalous crop development at the parcel-level based on an unsupervised outlier detection technique. The experimental validation is conducted on rapeseed and wheat parcels located in Beauce (France). The proposed methodology consists of four sequential steps: 1) preprocessing of synthetic aperture radar (SAR) and multispectral images acquired using Sentinel-1 an…
▽ More
This paper studies the detection of anomalous crop development at the parcel-level based on an unsupervised outlier detection technique. The experimental validation is conducted on rapeseed and wheat parcels located in Beauce (France). The proposed methodology consists of four sequential steps: 1) preprocessing of synthetic aperture radar (SAR) and multispectral images acquired using Sentinel-1 and Sentinel-2 satellites, 2) extraction of SAR and multispectral pixel-level features, 3) computation of parcel-level features using zonal statistics and 4) outlier detection. The different types of anomalies that can affect the studied crops are analyzed and described. The different factors that can influence the outlier detection results are investigated with a particular attention devoted to the synergy between Sentinel-1 and Sentinel-2 data. Overall, the best performance is obtained when using jointly a selection of Sentinel-1 and Sentinel-2 features with the isolation forest algorithm. The selected features are VV and VH backscattering coefficients for Sentinel-1 and 5 Vegetation Indexes for Sentinel-2 (among us, the Normalized Difference Vegetation Index and two variants of the Normalized Difference Water). When using these features with an outlier ratio of 10%, the percentage of detected true positives (i.e., crop anomalies) is equal to 94.1% for rapeseed parcels and 95.5% for wheat parcels.
△ Less
Submitted 5 March, 2021; v1 submitted 17 April, 2020;
originally announced April 2020.
-
Statistical modeling and classification of reflectance confocal microscopy images
Authors:
Abdelghafour Halimi,
Hadj Batatia,
Jimmy Le Digabel,
Gwendal Josse,
Jean-Yves Tourneret
Abstract:
This paper deals with the characterization and classification of reflectance confocal microscopy images of human skin. The aim is to identify and characterize the lentigo, a phenomenon that originates at the dermo-epidermic junction of the skin. High resolution confocal images are acquired at different skin depths and are analyzed for each depth. Histograms of pixel intensities associated with a g…
▽ More
This paper deals with the characterization and classification of reflectance confocal microscopy images of human skin. The aim is to identify and characterize the lentigo, a phenomenon that originates at the dermo-epidermic junction of the skin. High resolution confocal images are acquired at different skin depths and are analyzed for each depth. Histograms of pixel intensities associated with a given depth are determined, showing that the generalized gamma distribution (GGD) is a good statistical model for confocal images. A GGD is parameterized by translation, scale and shape parameters. These parameters are estimated using a new estimation method based on a natural gradient descent showing fast convergence properties with respect to state-of-the-art estimation methods. The resulting parameter estimates can be used to classify clinical images of healthy and lentigo patients. The obtained results show that the scale and shape parameters are good features to identify and characterize the presence of lentigo in skin tissues.
△ Less
Submitted 9 June, 2017;
originally announced July 2017.
-
An unsupervised bayesian approach for the joint reconstruction and classification of cutaneous reflectance confocal microscopy images
Authors:
Abdelghafour Halimi,
Hadj Batatia,
Jimmy Le Digabel,
Gwendal Josse,
Jean-Yves Tourneret
Abstract:
This paper studies a new Bayesian algorithm for the joint reconstruction and classification of reflectance confocal microscopy (RCM) images, with application to the identification of human skin lentigo. The proposed Bayesian approach takes advantage of the distribution of the multiplicative speckle noise affecting the true reflectivity of these images and of appropriate priors for the unknown mode…
▽ More
This paper studies a new Bayesian algorithm for the joint reconstruction and classification of reflectance confocal microscopy (RCM) images, with application to the identification of human skin lentigo. The proposed Bayesian approach takes advantage of the distribution of the multiplicative speckle noise affecting the true reflectivity of these images and of appropriate priors for the unknown model parameters. A Markov chain Monte Carlo (MCMC) algorithm is proposed to jointly estimate the model parameters and the image of true reflectivity while classifying images according to the distribution of their reflectivity. Precisely, a Metropolis-whitin-Gibbs sampler is investigated to sample the posterior distribution of the Bayesian model associated with RCM images and to build estimators of its parameters, including labels indicating the class of each RCM image. The resulting algorithm is applied to synthetic data and to real images from a clinical study containing healthy and lentigo patients.
△ Less
Submitted 4 March, 2017;
originally announced March 2017.
-
A Hierarchical Bayesian Model Accounting for Endmember Variability and Abrupt Spectral Changes to Unmix Multitemporal Hyperspectral Images
Authors:
Pierre-Antoine Thouvenin,
Nicolas Dobigeon,
Jean-Yves Tourneret
Abstract:
Hyperspectral unmixing is a blind source separation problem which consists in estimating the reference spectral signatures contained in a hyperspectral image, as well as their relative contribution to each pixel according to a given mixture model. In practice, the process is further complexified by the inherent spectral variability of the observed scene and the possible presence of outliers. More…
▽ More
Hyperspectral unmixing is a blind source separation problem which consists in estimating the reference spectral signatures contained in a hyperspectral image, as well as their relative contribution to each pixel according to a given mixture model. In practice, the process is further complexified by the inherent spectral variability of the observed scene and the possible presence of outliers. More specifically, multi-temporal hyperspectral images, i.e., sequences of hyperspectral images acquired over the same area at different time instants, are likely to simultaneously exhibit moderate endmember variability and abrupt spectral changes either due to outliers or to significant time intervals between consecutive acquisitions. Unless properly accounted for, these two perturbations can significantly affect the unmixing process. In this context, we propose a new unmixing model for multitemporal hyperspectral images accounting for smooth temporal variations, construed as spectral variability, and abrupt spectral changes interpreted as outliers. The proposed hierarchical Bayesian model is inferred using a Markov chain Monte-Carlo (MCMC) method allowing the posterior of interest to be sampled and Bayesian estimators to be approximated. A comparison with unmixing techniques from the literature on synthetic and real data allows the interest of the proposed approach to be appreciated.
△ Less
Submitted 18 November, 2017; v1 submitted 25 September, 2016;
originally announced September 2016.
-
EEG reconstruction and skull conductivity estimation using a Bayesian model promoting structured sparsity
Authors:
Facundo Costa,
Hadj Batatia,
Thomas Oberlin,
Jean-Yves Tourneret
Abstract:
M/EEG source localization is an open research issue. To solve it, it is important to have good knowledge of several physical parameters to build a reliable head operator. Amongst them, the value of the conductivity of the human skull has remained controversial. This report introduces a novel hierarchical Bayesian framework to estimate the skull conductivity jointly with the brain activity from the…
▽ More
M/EEG source localization is an open research issue. To solve it, it is important to have good knowledge of several physical parameters to build a reliable head operator. Amongst them, the value of the conductivity of the human skull has remained controversial. This report introduces a novel hierarchical Bayesian framework to estimate the skull conductivity jointly with the brain activity from the M/EEG measurements to improve the reconstruction quality. A partially collapsed Gibbs sampler is used to draw samples asymptotically distributed according to the associated posterior. The generated samples are then used to estimate the brain activity and the model hyperparameters jointly in a completely unsupervised framework. We use synthetic and real data to illustrate the improvement of the reconstruction. The performance of our method is also compared with two optimization algorithms introduced by Vallaghé \textit{et al.} and Gutierrez \textit{et al.} respectively, showing that our method is able to provide results of similar or better quality while remaining applicable in a wider array of situations.
△ Less
Submitted 4 January, 2017; v1 submitted 22 September, 2016;
originally announced September 2016.
-
Online Unmixing of Multitemporal Hyperspectral Images accounting for Spectral Variability
Authors:
Pierre-Antoine Thouvenin,
Nicolas Dobigeon,
Jean-Yves Tourneret
Abstract:
Hyperspectral unmixing is aimed at identifying the reference spectral signatures composing an hyperspectral image and their relative abundance fractions in each pixel. In practice, the identified signatures may vary spectrally from an image to another due to varying acquisition conditions, thus inducing possibly significant estimation errors. Against this background, hyperspectral unmixing of seve…
▽ More
Hyperspectral unmixing is aimed at identifying the reference spectral signatures composing an hyperspectral image and their relative abundance fractions in each pixel. In practice, the identified signatures may vary spectrally from an image to another due to varying acquisition conditions, thus inducing possibly significant estimation errors. Against this background, hyperspectral unmixing of several images acquired over the same area is of considerable interest. Indeed, such an analysis enables the endmembers of the scene to be tracked and the corresponding endmember variability to be characterized. Sequential endmember estimation from a set of hyperspectral images is expected to provide improved performance when compared to methods analyzing the images independently. However, the significant size of hyperspectral data precludes the use of batch procedures to jointly estimate the mixture parameters of a sequence of hyperspectral images. Provided that each elementary component is present in at least one image of the sequence, we propose to perform an online hyperspectral unmixing accounting for temporal endmember variability. The online hyperspectral unmixing is formulated as a two-stage stochastic program, which can be solved using a stochastic approximation. The performance of the proposed method is evaluated on synthetic and real data. A comparison with independent unmixing algorithms finally illustrates the interest of the proposed strategy.
△ Less
Submitted 6 June, 2016; v1 submitted 20 October, 2015;
originally announced October 2015.
-
Bayesian Structured Sparsity Priors for EEG Source Localization Technical Report
Authors:
Facundo Costa,
Hadj Batatia,
Thomas Oberlin,
Jean-Yves Tourneret
Abstract:
This report introduces a new hierarchical Bayesian model for the EEG source localization problem. This model promotes structured sparsity to search for focal brain activity. This sparsity is obtained via a multivariate Bernoulli Laplacian prior assigned to the brain activity approximating an $\ell_{20}$ pseudo norm regularization in a Bayesian framework. A partially collapsed Gibbs sampler is used…
▽ More
This report introduces a new hierarchical Bayesian model for the EEG source localization problem. This model promotes structured sparsity to search for focal brain activity. This sparsity is obtained via a multivariate Bernoulli Laplacian prior assigned to the brain activity approximating an $\ell_{20}$ pseudo norm regularization in a Bayesian framework. A partially collapsed Gibbs sampler is used to draw samples asymptotically distributed according to the posterior associated with the proposed Bayesian model. The generated samples are used to estimate the brain activity and the model hyperparameters jointly in an unsupervised framework. Two different kinds of Metropolis-Hastings moves are introduced to accelerate the convergence of the Gibbs sampler. The first move is based on multiple dipole shifts within each MCMC chain whereas the second one exploits proposals associated with different MCMC chains. We use both synthetic and real data to compare the performance of the proposed method with the weighted $\ell_{21}$ mixed norm regularization and a method based on a multiple sparse prior, showing that our algorithm presents advantages in several scenarios.
△ Less
Submitted 15 September, 2015;
originally announced September 2015.
-
Hyperspectral pansharpening: a review
Authors:
Laetitia Loncan,
Luis B. Almeida,
José M. Bioucas-Dias,
Xavier Briottet,
Jocelyn Chanussot,
Nicolas Dobigeon,
Sophie Fabre,
Wenzhi Liao,
Giorgio A. Licciardi,
Miguel Simões,
Jean-Yves Tourneret,
Miguel A. Veganzones,
Gemine Vivone,
Qi Wei,
Naoto Yokoya
Abstract:
Pansharpening aims at fusing a panchromatic image with a multispectral one, to generate an image with the high spatial resolution of the former and the high spectral resolution of the latter. In the last decade, many algorithms have been presented in the literature for pansharpening using multispectral data. With the increasing availability of hyperspectral systems, these methods are now being ada…
▽ More
Pansharpening aims at fusing a panchromatic image with a multispectral one, to generate an image with the high spatial resolution of the former and the high spectral resolution of the latter. In the last decade, many algorithms have been presented in the literature for pansharpening using multispectral data. With the increasing availability of hyperspectral systems, these methods are now being adapted to hyperspectral images. In this work, we compare new pansharpening techniques designed for hyperspectral data with some of the state of the art methods for multispectral pansharpening, which have been adapted for hyperspectral data. Eleven methods from different classes (component substitution, multiresolution analysis, hybrid, Bayesian and matrix factorization) are analyzed. These methods are applied to three datasets and their effectiveness and robustness are evaluated with widely used performance indicators. In addition, all the pansharpening techniques considered in this paper have been implemented in a MATLAB toolbox that is made available to the community.
△ Less
Submitted 17 April, 2015;
originally announced April 2015.
-
Bayesian Estimation of Smooth Altimetric Parameters: Application to Conventional and Delay/Doppler Altimetry
Authors:
Abderrahim Halimi,
Corinne Mailhes,
Jean-Yves Tourneret,
Hichem Snoussi
Abstract:
This paper proposes a new Bayesian strategy for the smooth estimation of altimetric parameters. The altimetric signal is assumed to be corrupted by a thermal and speckle noise distributed according to an independent and non identically Gaussian distribution. We introduce a prior enforcing a smooth temporal evolution of the altimetric parameters which improves their physical interpretation. The pos…
▽ More
This paper proposes a new Bayesian strategy for the smooth estimation of altimetric parameters. The altimetric signal is assumed to be corrupted by a thermal and speckle noise distributed according to an independent and non identically Gaussian distribution. We introduce a prior enforcing a smooth temporal evolution of the altimetric parameters which improves their physical interpretation. The posterior distribution of the resulting model is optimized using a gradient descent algorithm which allows us to compute the maximum a posteriori estimator of the unknown model parameters. This algorithm presents a low computational cost which is suitable for real time applications. The proposed Bayesian strategy and the corresponding estimation algorithm are validated on both synthetic and real data associated with conventional and delay/Doppler altimetry. The analysis of real Jason-2 and Cryosat-2 waveforms shows an improvement in parameter estimation when compared to the state-of-the-art estimation algorithms.
△ Less
Submitted 17 February, 2015;
originally announced February 2015.
-
Hyperspectral unmixing with spectral variability using a perturbed linear mixing model
Authors:
Pierre-Antoine Thouvenin,
Nicolas Dobigeon,
Jean-Yves Tourneret
Abstract:
Given a mixed hyperspectral data set, linear unmixing aims at estimating the reference spectral signatures composing the data - referred to as endmembers - their abundance fractions and their number. In practice, the identified endmembers can vary spectrally within a given image and can thus be construed as variable instances of reference endmembers. Ignoring this variability induces estimation er…
▽ More
Given a mixed hyperspectral data set, linear unmixing aims at estimating the reference spectral signatures composing the data - referred to as endmembers - their abundance fractions and their number. In practice, the identified endmembers can vary spectrally within a given image and can thus be construed as variable instances of reference endmembers. Ignoring this variability induces estimation errors that are propagated into the unmixing procedure. To address this issue, endmember variability estimation consists of estimating the reference spectral signatures from which the estimated endmembers have been derived as well as their variability with respect to these references. This paper introduces a new linear mixing model that explicitly accounts for spatial and spectral endmember variabilities. The parameters of this model can be estimated using an optimization algorithm based on the alternating direction method of multipliers. The performance of the proposed unmixing method is evaluated on synthetic and real data. A comparison with state-of-the-art algorithms designed to model and estimate endmember variability allows the interest of the proposed unmixing solution to be appreciated.
△ Less
Submitted 20 October, 2015; v1 submitted 4 February, 2015;
originally announced February 2015.
-
Estimating the Intrinsic Dimension of Hyperspectral Images Using an Eigen-Gap Approach
Authors:
A. Halimi,
P. Honeine,
M. Kharouf,
C. Richard,
J. -Y. Tourneret
Abstract:
Linear mixture models are commonly used to represent hyperspectral datacube as a linear combinations of endmember spectra. However, determining of the number of endmembers for images embedded in noise is a crucial task. This paper proposes a fully automatic approach for estimating the number of endmembers in hyperspectral images. The estimation is based on recent results of random matrix theory re…
▽ More
Linear mixture models are commonly used to represent hyperspectral datacube as a linear combinations of endmember spectra. However, determining of the number of endmembers for images embedded in noise is a crucial task. This paper proposes a fully automatic approach for estimating the number of endmembers in hyperspectral images. The estimation is based on recent results of random matrix theory related to the so-called spiked population model. More precisely, we study the gap between successive eigenvalues of the sample covariance matrix constructed from high dimensional noisy samples. The resulting estimation strategy is unsupervised and robust to correlated noise. This strategy is validated on both synthetic and real images. The experimental results are very promising and show the accuracy of this algorithm with respect to state-of-the-art algorithms.
△ Less
Submitted 22 January, 2015;
originally announced January 2015.
-
Bayesian estimation of the multifractality parameter for image texture using a Whittle approximation
Authors:
Sébastien Combrexelle,
Herwig Wendt,
Nicolas Dobigeon,
Jean-Yves Tourneret,
Steve McLaughlin,
Patrice Abry
Abstract:
Texture characterization is a central element in many image processing applications. Multifractal analysis is a useful signal and image processing tool, yet, the accurate estimation of multifractal parameters for image texture remains a challenge. This is due in the main to the fact that current estimation procedures consist of performing linear regressions across frequency scales of the two-dimen…
▽ More
Texture characterization is a central element in many image processing applications. Multifractal analysis is a useful signal and image processing tool, yet, the accurate estimation of multifractal parameters for image texture remains a challenge. This is due in the main to the fact that current estimation procedures consist of performing linear regressions across frequency scales of the two-dimensional (2D) dyadic wavelet transform, for which only a few such scales are computable for images. The strongly non-Gaussian nature of multifractal processes, combined with their complicated dependence structure, makes it difficult to develop suitable models for parameter estimation. Here, we propose a Bayesian procedure that addresses the difficulties in the estimation of the multifractality parameter. The originality of the procedure is threefold: The construction of a generic semi-parametric statistical model for the logarithm of wavelet leaders; the formulation of Bayesian estimators that are associated with this model and the set of parameter values admitted by multifractal theory; the exploitation of a suitable Whittle approximation within the Bayesian model which enables the otherwise infeasible evaluation of the posterior distribution associated with the model. Performance is assessed numerically for several 2D multifractal processes, for several image sizes and a large range of process parameters. The procedure yields significant benefits over current benchmark estimators in terms of estimation performance and ability to discriminate between the two most commonly used classes of multifractal process models. The gains in performance are particularly pronounced for small image sizes, notably enabling for the first time the analysis of image patches as small as 64x64 pixels.
△ Less
Submitted 9 April, 2015; v1 submitted 17 October, 2014;
originally announced October 2014.
-
Unsupervised Unmixing of Hyperspectral Images Accounting for Endmember Variability
Authors:
Abderrahim Halimi,
Nicolas Dobigeon,
Jean-Yves Tourneret
Abstract:
This paper presents an unsupervised Bayesian algorithm for hyperspectral image unmixing accounting for endmember variability. The pixels are modeled by a linear combination of endmembers weighted by their corresponding abundances. However, the endmembers are assumed random to take into account their variability in the image. An additive noise is also considered in the proposed model generalizing t…
▽ More
This paper presents an unsupervised Bayesian algorithm for hyperspectral image unmixing accounting for endmember variability. The pixels are modeled by a linear combination of endmembers weighted by their corresponding abundances. However, the endmembers are assumed random to take into account their variability in the image. An additive noise is also considered in the proposed model generalizing the normal compositional model. The proposed algorithm exploits the whole image to provide spectral and spatial information. It estimates both the mean and the covariance matrix of each endmember in the image. This allows the behavior of each material to be analyzed and its variability to be quantified in the scene. A spatial segmentation is also obtained based on the estimated abundances. In order to estimate the parameters associated with the proposed Bayesian model, we propose to use a Hamiltonian Monte Carlo algorithm. The performance of the resulting unmixing strategy is evaluated via simulations conducted on both synthetic and real data.
△ Less
Submitted 19 June, 2014;
originally announced June 2014.
-
A Hamiltonian Monte Carlo Method for Non-Smooth Energy Sampling
Authors:
Lotfi Chaari,
Jean-Yves Tourneret,
Caroline Chaux,
Hadj Batatia
Abstract:
Efficient sampling from high-dimensional distributions is a challenging issue which is encountered in many large data recovery problems involving Markov chain Monte Carlo schemes. In this context, sampling using Hamiltonian dynamics is one of the recent techniques that have been proposed to exploit the target distribution geometry. Such schemes have clearly been shown to be efficient for multi-dim…
▽ More
Efficient sampling from high-dimensional distributions is a challenging issue which is encountered in many large data recovery problems involving Markov chain Monte Carlo schemes. In this context, sampling using Hamiltonian dynamics is one of the recent techniques that have been proposed to exploit the target distribution geometry. Such schemes have clearly been shown to be efficient for multi-dimensional sampling, but are rather adapted to the exponential families of distributions with smooth energy function. In this paper, we address the problem of using Hamiltonian dynamics to sample from probability distributions having non-differentiable energy functions such as $\ell_1$. Such distributions are being more and more used in sparse signal and image recovery applications. The proposed technique uses a modified leapfrog transform involving a proximal step. The resulting non-smooth Hamiltonian Monte Carlo (ns-HMC) method is tested and validated on a number of experiments. Results show its ability to accurately sample according to various multivariate target distributions. The proposed technique is illustrated on synthetic examples and is applied to an image denoising problem.
△ Less
Submitted 30 January, 2015; v1 submitted 16 January, 2014;
originally announced January 2014.
-
Joint Bayesian estimation of close subspaces from noisy measurements
Authors:
Olivier Besson,
Nicolas Dobigeon,
Jean-Yves Tourneret
Abstract:
In this letter, we consider two sets of observations defined as subspace signals embedded in noise and we wish to analyze the distance between these two subspaces. The latter entails evaluating the angles between the subspaces, an issue reminiscent of the well-known Procrustes problem. A Bayesian approach is investigated where the subspaces of interest are considered as random with a joint prior d…
▽ More
In this letter, we consider two sets of observations defined as subspace signals embedded in noise and we wish to analyze the distance between these two subspaces. The latter entails evaluating the angles between the subspaces, an issue reminiscent of the well-known Procrustes problem. A Bayesian approach is investigated where the subspaces of interest are considered as random with a joint prior distribution (namely a Bingham distribution), which allows the closeness of the two subspaces to be adjusted. Within this framework, the minimum mean-square distance estimator of both subspaces is formulated and implemented via a Gibbs sampler. A simpler scheme based on alternative maximum a posteriori estimation is also presented. The new schemes are shown to provide more accurate estimates of the angles between the subspaces, compared to singular value decomposition based independent estimation of the two subspaces.
△ Less
Submitted 1 October, 2013;
originally announced October 2013.
-
Bayesian Fusion of Multi-Band Images
Authors:
Qi Wei,
Nicolas Dobigeon,
Jean-Yves Tourneret
Abstract:
In this paper, a Bayesian fusion technique for remotely sensed multi-band images is presented. The observed images are related to the high spectral and high spatial resolution image to be recovered through physical degradations, e.g., spatial and spectral blurring and/or subsampling defined by the sensor characteristics. The fusion problem is formulated within a Bayesian estimation framework. An a…
▽ More
In this paper, a Bayesian fusion technique for remotely sensed multi-band images is presented. The observed images are related to the high spectral and high spatial resolution image to be recovered through physical degradations, e.g., spatial and spectral blurring and/or subsampling defined by the sensor characteristics. The fusion problem is formulated within a Bayesian estimation framework. An appropriate prior distribution exploiting geometrical consideration is introduced. To compute the Bayesian estimator of the scene of interest from its posterior distribution, a Markov chain Monte Carlo algorithm is designed to generate samples asymptotically distributed according to the target distribution. To efficiently sample from this high-dimension distribution, a Hamiltonian Monte Carlo step is introduced in the Gibbs sampling strategy. The efficiency of the proposed fusion method is evaluated with respect to several state-of-the-art fusion techniques. In particular, low spatial resolution hyperspectral and multispectral images are fused to produce a high spatial resolution hyperspectral image.
△ Less
Submitted 26 August, 2014; v1 submitted 23 July, 2013;
originally announced July 2013.
-
Residual component analysis of hyperspectral images -- Application to joint nonlinear unmixing and nonlinearity detection
Authors:
Yoann Altmann,
Nicolas Dobigeon,
Steve McLaughlin,
Jean-Yves Tourneret
Abstract:
This paper presents a nonlinear mixing model for joint hyperspectral image unmixing and nonlinearity detection. The proposed model assumes that the pixel reflectances are linear combinations of known pure spectral components corrupted by an additional nonlinear term, affecting the endmembers and contaminated by an additive Gaussian noise. A Markov random field is considered for nonlinearity detect…
▽ More
This paper presents a nonlinear mixing model for joint hyperspectral image unmixing and nonlinearity detection. The proposed model assumes that the pixel reflectances are linear combinations of known pure spectral components corrupted by an additional nonlinear term, affecting the endmembers and contaminated by an additive Gaussian noise. A Markov random field is considered for nonlinearity detection based on the spatial structure of the nonlinear terms. The observed image is segmented into regions where nonlinear terms, if present, share similar statistical properties. A Bayesian algorithm is proposed to estimate the parameters involved in the model yielding a joint nonlinear unmixing and nonlinearity detection algorithm. The performance of the proposed strategy is first evaluated on synthetic data. Simulations conducted with real data show the accuracy of the proposed unmixing and nonlinearity detection strategy for the analysis of hyperspectral images.
△ Less
Submitted 22 July, 2013;
originally announced July 2013.
-
Unsupervised Post-Nonlinear Unmixing of Hyperspectral Images Using a Hamiltonian Monte Carlo Algorithm
Authors:
Yoann Altmann,
Nicolas Dobigeon,
Jean-Yves Tourneret
Abstract:
This paper presents a nonlinear mixing model for hyperspectral image unmixing. The proposed model assumes that the pixel reflectances are post-nonlinear functions of unknown pure spectral components contaminated by an additive white Gaussian noise. These nonlinear functions are approximated using polynomials leading to a polynomial post-nonlinear mixing model. A Bayesian algorithm is proposed to e…
▽ More
This paper presents a nonlinear mixing model for hyperspectral image unmixing. The proposed model assumes that the pixel reflectances are post-nonlinear functions of unknown pure spectral components contaminated by an additive white Gaussian noise. These nonlinear functions are approximated using polynomials leading to a polynomial post-nonlinear mixing model. A Bayesian algorithm is proposed to estimate the parameters involved in the model yielding an unsupervised nonlinear unmixing algorithm. Due to the large number of parameters to be estimated, an efficient Hamiltonian Monte Carlo algorithm is investigated. The classical leapfrog steps of this algorithm are modified to handle the parameter constraints. The performance of the unmixing strategy, including convergence and parameter tuning, is first evaluated on synthetic data. Simulations conducted with real data finally show the accuracy of the proposed unmixing strategy for the analysis of hyperspectral images.
△ Less
Submitted 9 April, 2013;
originally announced April 2013.
-
Nonlinear unmixing of hyperspectral images: models and algorithms
Authors:
Nicolas Dobigeon,
Jean-Yves Tourneret,
Cédric Richard,
José C. M. Bermudez,
Stephen McLaughlin,
Alfred O. Hero
Abstract:
When considering the problem of unmixing hyperspectral images, most of the literature in the geoscience and image processing areas relies on the widely used linear mixing model (LMM). However, the LMM may be not valid and other nonlinear models need to be considered, for instance, when there are multi-scattering effects or intimate interactions. Consequently, over the last few years, several signi…
▽ More
When considering the problem of unmixing hyperspectral images, most of the literature in the geoscience and image processing areas relies on the widely used linear mixing model (LMM). However, the LMM may be not valid and other nonlinear models need to be considered, for instance, when there are multi-scattering effects or intimate interactions. Consequently, over the last few years, several significant contributions have been proposed to overcome the limitations inherent in the LMM. In this paper, we present an overview of recent advances in nonlinear unmixing modeling.
△ Less
Submitted 18 July, 2013; v1 submitted 6 April, 2013;
originally announced April 2013.
-
Parameter Estimation For Multivariate Generalized Gaussian Distributions
Authors:
F. Pascal,
L. Bombrun,
J. Y. Tourneret,
Y. Berthoumieu
Abstract:
Due to its heavy-tailed and fully parametric form, the multivariate generalized Gaussian distribution (MGGD) has been receiving much attention for modeling extreme events in signal and image processing applications. Considering the estimation issue of the MGGD parameters, the main contribution of this paper is to prove that the maximum likelihood estimator (MLE) of the scatter matrix exists and is…
▽ More
Due to its heavy-tailed and fully parametric form, the multivariate generalized Gaussian distribution (MGGD) has been receiving much attention for modeling extreme events in signal and image processing applications. Considering the estimation issue of the MGGD parameters, the main contribution of this paper is to prove that the maximum likelihood estimator (MLE) of the scatter matrix exists and is unique up to a scalar factor, for a given shape parameter β\in(0,1). Moreover, an estimation algorithm based on a Newton-Raphson recursion is proposed for computing the MLE of MGGD parameters. Various experiments conducted on synthetic and real data are presented to illustrate the theoretical derivations in terms of number of iterations and number of samples for different values of the shape parameter. The main conclusion of this work is that the parameters of MGGDs can be estimated using the maximum likelihood principle with good performance.
△ Less
Submitted 24 February, 2017; v1 submitted 26 February, 2013;
originally announced February 2013.
-
Nonlinear spectral unmixing of hyperspectral images using Gaussian processes
Authors:
Yoann Altmann,
Nicolas Dobigeon,
Steve McLaughlin,
Jean-Yves Tourneret
Abstract:
This paper presents an unsupervised algorithm for nonlinear unmixing of hyperspectral images. The proposed model assumes that the pixel reflectances result from a nonlinear function of the abundance vectors associated with the pure spectral components. We assume that the spectral signatures of the pure components and the nonlinear function are unknown. The first step of the proposed method consist…
▽ More
This paper presents an unsupervised algorithm for nonlinear unmixing of hyperspectral images. The proposed model assumes that the pixel reflectances result from a nonlinear function of the abundance vectors associated with the pure spectral components. We assume that the spectral signatures of the pure components and the nonlinear function are unknown. The first step of the proposed method consists of the Bayesian estimation of the abundance vectors for all the image pixels and the nonlinear function relating the abundance vectors to the observations. The endmembers are subsequently estimated using Gaussian process regression. The performance of the unmixing strategy is evaluated with simulations conducted on synthetic and real data.
△ Less
Submitted 23 July, 2012;
originally announced July 2012.
-
Estimating the granularity coefficient of a Potts-Markov random field within an MCMC algorithm
Authors:
Marcelo Pereyra,
Nicolas Dobigeon,
Hadj Batatia,
Jean-Yves Tourneret
Abstract:
This paper addresses the problem of estimating the Potts parameter B jointly with the unknown parameters of a Bayesian model within a Markov chain Monte Carlo (MCMC) algorithm. Standard MCMC methods cannot be applied to this problem because performing inference on B requires computing the intractable normalizing constant of the Potts model. In the proposed MCMC method the estimation of B is conduc…
▽ More
This paper addresses the problem of estimating the Potts parameter B jointly with the unknown parameters of a Bayesian model within a Markov chain Monte Carlo (MCMC) algorithm. Standard MCMC methods cannot be applied to this problem because performing inference on B requires computing the intractable normalizing constant of the Potts model. In the proposed MCMC method the estimation of B is conducted using a likelihood-free Metropolis-Hastings algorithm. Experimental results obtained for synthetic data show that estimating B jointly with the other unknown parameters leads to estimation results that are as good as those obtained with the actual value of B. On the other hand, assuming that the value of B is known can degrade estimation performance significantly if this value is incorrect. To illustrate the interest of this method, the proposed algorithm is successfully applied to real bidimensional SAR and tridimensional ultrasound images.
△ Less
Submitted 23 July, 2012;
originally announced July 2012.
-
Computing the Cramer-Rao bound of Markov random field parameters: Application to the Ising and the Potts models
Authors:
Marcelo Pereyra,
Nicolas Dobigeon,
Hadj Batatia,
Jean-Yves Tourneret
Abstract:
This report considers the problem of computing the Cramer-Rao bound for the parameters of a Markov random field. Computation of the exact bound is not feasible for most fields of interest because their likelihoods are intractable and have intractable derivatives. We show here how it is possible to formulate the computation of the bound as a statistical inference problem that can be solve approxima…
▽ More
This report considers the problem of computing the Cramer-Rao bound for the parameters of a Markov random field. Computation of the exact bound is not feasible for most fields of interest because their likelihoods are intractable and have intractable derivatives. We show here how it is possible to formulate the computation of the bound as a statistical inference problem that can be solve approximately, but with arbitrarily high accuracy, by using a Monte Carlo method. The proposed methodology is successfully applied on the Ising and the Potts models.% where it is used to assess the performance of three state-of-the art estimators of the parameter of these Markov random fields.
△ Less
Submitted 17 September, 2013; v1 submitted 18 June, 2012;
originally announced June 2012.
-
Minimum mean square distance estimation of a subspace
Authors:
Olivier Besson,
Nicolas Dobigeon,
Jean-Yves Tourneret
Abstract:
We consider the problem of subspace estimation in a Bayesian setting. Since we are operating in the Grassmann manifold, the usual approach which consists of minimizing the mean square error (MSE) between the true subspace $U$ and its estimate $\hat{U}$ may not be adequate as the MSE is not the natural metric in the Grassmann manifold. As an alternative, we propose to carry out subspace estimation…
▽ More
We consider the problem of subspace estimation in a Bayesian setting. Since we are operating in the Grassmann manifold, the usual approach which consists of minimizing the mean square error (MSE) between the true subspace $U$ and its estimate $\hat{U}$ may not be adequate as the MSE is not the natural metric in the Grassmann manifold. As an alternative, we propose to carry out subspace estimation by minimizing the mean square distance (MSD) between $U$ and its estimate, where the considered distance is a natural metric in the Grassmann manifold, viz. the distance between the projection matrices. We show that the resulting estimator is no longer the posterior mean of $U$ but entails computing the principal eigenvectors of the posterior mean of $U U^{T}$. Derivation of the MMSD estimator is carried out in a few illustrative examples including a linear Gaussian model for the data and a Bingham or von Mises Fisher prior distribution for $U$. In all scenarios, posterior distributions are derived and the MMSD estimator is obtained either analytically or implemented via a Markov chain Monte Carlo simulation method. The method is shown to provide accurate estimates even when the number of samples is lower than the dimension of $U$. An application to hyperspectral imagery is finally investigated.
△ Less
Submitted 18 January, 2011;
originally announced January 2011.
-
Enhancing hyperspectral image unmixing with spatial correlations
Authors:
Olivier Eches,
Nicolas Dobigeon,
Jean-Yves Tourneret
Abstract:
This paper describes a new algorithm for hyperspectral image unmixing. Most of the unmixing algorithms proposed in the literature do not take into account the possible spatial correlations between the pixels. In this work, a Bayesian model is introduced to exploit these correlations. The image to be unmixed is assumed to be partitioned into regions (or classes) where the statistical properties of…
▽ More
This paper describes a new algorithm for hyperspectral image unmixing. Most of the unmixing algorithms proposed in the literature do not take into account the possible spatial correlations between the pixels. In this work, a Bayesian model is introduced to exploit these correlations. The image to be unmixed is assumed to be partitioned into regions (or classes) where the statistical properties of the abundance coefficients are homogeneous. A Markov random field is then proposed to model the spatial dependency of the pixels within any class. Conditionally upon a given class, each pixel is modeled by using the classical linear mixing model with additive white Gaussian noise. This strategy is investigated the well known linear mixing model. For this model, the posterior distributions of the unknown parameters and hyperparameters allow ones to infer the parameters of interest. These parameters include the abundances for each pixel, the means and variances of the abundances for each class, as well as a classification map indicating the classes of all pixels in the image. To overcome the complexity of the posterior distribution of interest, we consider Markov chain Monte Carlo methods that generate samples distributed according to the posterior of interest. The generated samples are then used for parameter and hyperparameter estimation. The accuracy of the proposed algorithms is illustrated on synthetic and real data.
△ Less
Submitted 4 September, 2012; v1 submitted 4 February, 2010;
originally announced February 2010.
-
A Hierarchical Bayesian Model for Frame Representation
Authors:
L. Chaâri,
J. -C. Pesquet,
J. -Y. Tourneret,
Ph. Ciuciu,
A. Benazza-Benyahia
Abstract:
In many signal processing problems, it may be fruitful to represent the signal under study in a frame. If a probabilistic approach is adopted, it becomes then necessary to estimate the hyper-parameters characterizing the probability distribution of the frame coefficients. This problem is difficult since in general the frame synthesis operator is not bijective. Consequently, the frame coefficient…
▽ More
In many signal processing problems, it may be fruitful to represent the signal under study in a frame. If a probabilistic approach is adopted, it becomes then necessary to estimate the hyper-parameters characterizing the probability distribution of the frame coefficients. This problem is difficult since in general the frame synthesis operator is not bijective. Consequently, the frame coefficients are not directly observable. This paper introduces a hierarchical Bayesian model for frame representation. The posterior distribution of the frame coefficients and model hyper-parameters is derived. Hybrid Markov Chain Monte Carlo algorithms are subsequently proposed to sample from this posterior distribution. The generated samples are then exploited to estimate the hyper-parameters and the frame coefficients of the target signal. Validation experiments show that the proposed algorithms provide an accurate estimation of the frame coefficients and hyper-parameters. Application to practical problems of image denoising show the impact of the resulting Bayesian estimation on the recovered signal quality.
△ Less
Submitted 15 November, 2009;
originally announced November 2009.
-
Bayesian orthogonal component analysis for sparse representation
Authors:
Nicolas Dobigeon,
Jean-Yves Tourneret
Abstract:
This paper addresses the problem of identifying a lower dimensional space where observed data can be sparsely represented. This under-complete dictionary learning task can be formulated as a blind separation problem of sparse sources linearly mixed with an unknown orthogonal mixing matrix. This issue is formulated in a Bayesian framework. First, the unknown sparse sources are modeled as Bernoull…
▽ More
This paper addresses the problem of identifying a lower dimensional space where observed data can be sparsely represented. This under-complete dictionary learning task can be formulated as a blind separation problem of sparse sources linearly mixed with an unknown orthogonal mixing matrix. This issue is formulated in a Bayesian framework. First, the unknown sparse sources are modeled as Bernoulli-Gaussian processes. To promote sparsity, a weighted mixture of an atom at zero and a Gaussian distribution is proposed as prior distribution for the unobserved sources. A non-informative prior distribution defined on an appropriate Stiefel manifold is elected for the mixing matrix. The Bayesian inference on the unknown parameters is conducted using a Markov chain Monte Carlo (MCMC) method. A partially collapsed Gibbs sampler is designed to generate samples asymptotically distributed according to the joint posterior distribution of the unknown model parameters and hyperparameters. These samples are then used to approximate the joint maximum a posteriori estimator of the sources and mixing matrix. Simulations conducted on synthetic data are reported to illustrate the performance of the method for recovering sparse representations. An application to sparse coding on under-complete dictionary is finally investigated.
△ Less
Submitted 4 January, 2010; v1 submitted 31 August, 2009;
originally announced August 2009.
-
Bayesian separation of spectral sources under non-negativity and full additivity constraints
Authors:
Nicolas Dobigeon,
Said Moussaoui,
Jean-Yves Tourneret,
Cedric Carteret
Abstract:
This paper addresses the problem of separating spectral sources which are linearly mixed with unknown proportions. The main difficulty of the problem is to ensure the full additivity (sum-to-one) of the mixing coefficients and non-negativity of sources and mixing coefficients. A Bayesian estimation approach based on Gamma priors was recently proposed to handle the non-negativity constraints in a…
▽ More
This paper addresses the problem of separating spectral sources which are linearly mixed with unknown proportions. The main difficulty of the problem is to ensure the full additivity (sum-to-one) of the mixing coefficients and non-negativity of sources and mixing coefficients. A Bayesian estimation approach based on Gamma priors was recently proposed to handle the non-negativity constraints in a linear mixture model. However, incorporating the full additivity constraint requires further developments. This paper studies a new hierarchical Bayesian model appropriate to the non-negativity and sum-to-one constraints associated to the regressors and regression coefficients of linear mixtures. The estimation of the unknown parameters of this model is performed using samples generated using an appropriate Gibbs sampler. The performance of the proposed algorithm is evaluated through simulation results conducted on synthetic mixture models. The proposed approach is also applied to the processing of multicomponent chemical mixtures resulting from Raman spectroscopy.
△ Less
Submitted 23 September, 2009; v1 submitted 25 June, 2009;
originally announced June 2009.
-
Hierarchical Bayesian sparse image reconstruction with application to MRFM
Authors:
Nicolas Dobigeon,
Alfred O. Hero,
Jean-Yves Tourneret
Abstract:
This paper presents a hierarchical Bayesian model to reconstruct sparse images when the observations are obtained from linear transformations and corrupted by an additive white Gaussian noise. Our hierarchical Bayes model is well suited to such naturally sparse image applications as it seamlessly accounts for properties such as sparsity and positivity of the image via appropriate Bayes priors. W…
▽ More
This paper presents a hierarchical Bayesian model to reconstruct sparse images when the observations are obtained from linear transformations and corrupted by an additive white Gaussian noise. Our hierarchical Bayes model is well suited to such naturally sparse image applications as it seamlessly accounts for properties such as sparsity and positivity of the image via appropriate Bayes priors. We propose a prior that is based on a weighted mixture of a positive exponential distribution and a mass at zero. The prior has hyperparameters that are tuned automatically by marginalization over the hierarchical Bayesian model. To overcome the complexity of the posterior distribution, a Gibbs sampling strategy is proposed. The Gibbs samples can be used to estimate the image to be recovered, e.g. by maximizing the estimated posterior distribution. In our fully Bayesian approach the posteriors of all the parameters are available. Thus our algorithm provides more information than other previously proposed sparse reconstruction methods that only give a point estimate. The performance of our hierarchical Bayesian sparse reconstruction method is illustrated on synthetic and real data collected from a tobacco virus sample using a prototype MRFM instrument.
△ Less
Submitted 23 September, 2009; v1 submitted 22 September, 2008;
originally announced September 2008.