-
Decision from Suboptimal Classifiers: Excess Risk Pre- and Post-Calibration
Authors:
Alexandre Perez-Lebel,
Gael Varoquaux,
Sanmi Koyejo,
Matthieu Doutreligne,
Marine Le Morvan
Abstract:
Probabilistic classifiers are central for making informed decisions under uncertainty. Based on the maximum expected utility principle, optimal decision rules can be derived using the posterior class probabilities and misclassification costs. Yet, in practice only learned approximations of the oracle posterior probabilities are available. In this work, we quantify the excess risk (a.k.a. regret) i…
▽ More
Probabilistic classifiers are central for making informed decisions under uncertainty. Based on the maximum expected utility principle, optimal decision rules can be derived using the posterior class probabilities and misclassification costs. Yet, in practice only learned approximations of the oracle posterior probabilities are available. In this work, we quantify the excess risk (a.k.a. regret) incurred using approximate posterior probabilities in batch binary decision-making. We provide analytical expressions for miscalibration-induced regret ($R^{\mathrm{CL}}$), as well as tight and informative upper and lower bounds on the regret of calibrated classifiers ($R^{\mathrm{GL}}$). These expressions allow us to identify regimes where recalibration alone addresses most of the regret, and regimes where the regret is dominated by the grouping loss, which calls for post-training beyond recalibration. Crucially, both $R^{\mathrm{CL}}$ and $R^{\mathrm{GL}}$ can be estimated in practice using a calibration curve and a recent grouping loss estimator. On NLP experiments, we show that these quantities identify when the expected gain of more advanced post-training is worth the operational cost. Finally, we highlight the potential of multicalibration approaches as efficient alternatives to costlier fine-tuning approaches.
△ Less
Submitted 23 March, 2025;
originally announced March 2025.
-
TabICL: A Tabular Foundation Model for In-Context Learning on Large Data
Authors:
Jingang Qu,
David Holzmüller,
Gaël Varoquaux,
Marine Le Morvan
Abstract:
The long-standing dominance of gradient-boosted decision trees on tabular data is currently challenged by tabular foundation models using In-Context Learning (ICL): setting the training data as context for the test data and predicting in a single forward pass without parameter updates. While TabPFNv2 foundation model excels on tables with up to 10K samples, its alternating column- and row-wise att…
▽ More
The long-standing dominance of gradient-boosted decision trees on tabular data is currently challenged by tabular foundation models using In-Context Learning (ICL): setting the training data as context for the test data and predicting in a single forward pass without parameter updates. While TabPFNv2 foundation model excels on tables with up to 10K samples, its alternating column- and row-wise attentions make handling large training sets computationally prohibitive. So, can ICL be effectively scaled and deliver a benefit for larger tables? We introduce TabICL, a tabular foundation model for classification, pretrained on synthetic datasets with up to 60K samples and capable of handling 500K samples on affordable resources. This is enabled by a novel two-stage architecture: a column-then-row attention mechanism to build fixed-dimensional embeddings of rows, followed by a transformer for efficient ICL. Across 200 classification datasets from the TALENT benchmark, TabICL is on par with TabPFNv2 while being systematically faster (up to 10 times), and significantly outperforms all other approaches. On 53 datasets with over 10K samples, TabICL surpasses both TabPFNv2 and CatBoost, demonstrating the potential of ICL for large data. Pretraining code, inference code, and pre-trained models are available at https://github.com/soda-inria/tabicl.
△ Less
Submitted 24 May, 2025; v1 submitted 8 February, 2025;
originally announced February 2025.
-
The R-Vessel-X Project
Authors:
Abir Affane,
Mohamed Amine Chetoui,
Jonas Lamy,
Guillaume Lienemann,
Raphaël Peron,
P. Beaurepaire,
Guillaume Dollé,
Marie-Ange Lèbre,
Benoit Magnin,
Odyssée Merveille,
Mathilde Morvan,
Phuc Ngo,
Thibault Pelletier,
Hugo Rositi,
Stéphanie Salmon,
Julien Finet,
Bertrand Kerautret,
Nicolas Passat,
Antoine Vacavant
Abstract:
1) Objectives: This technical report presents a synthetic summary and the principal outcomes of the project R-Vessel-X ("Robust vascular network extraction and understanding within hepatic biomedical images") funded by the French Agence Nationale de la Recherche, and developed between 2019 and 2023. 2) Material and methods: We used datasets and tools publicly available such as IRCAD, Bullitt or Va…
▽ More
1) Objectives: This technical report presents a synthetic summary and the principal outcomes of the project R-Vessel-X ("Robust vascular network extraction and understanding within hepatic biomedical images") funded by the French Agence Nationale de la Recherche, and developed between 2019 and 2023. 2) Material and methods: We used datasets and tools publicly available such as IRCAD, Bullitt or VascuSynth toobtain real or synthetic angiographic images. The main contributions lie in the field of 3D angiographic image analysis: filtering, segmentation, modeling and simulation, with a specific focus on the liver. 3) Results: We paid a particular attention to open-source software diffusion of the developed methods, by means of 3D Slicer plugins for the liver anatomy segmentation (SlicerRVXLiverSegmentation) and vesselness filtering (Slicer-RVXVesselnessFilters), and an online demo for the generation of synthetic and realistic vessels in 2D and 3D (OpenCCO). 4) Conclusion: The R-Vessel-X project provided extensive research outcomes, covering various topics related to 3D angiographic image analysis, such as filtering, segmentation, modeling and simulation. We also developed open-source and free softwares so that the research communities in biomedical engineering can use these results in their future research.
△ Less
Submitted 17 January, 2025;
originally announced January 2025.
-
Imputation for prediction: beware of diminishing returns
Authors:
Marine Le Morvan,
Gaël Varoquaux
Abstract:
Missing values are prevalent across various fields, posing challenges for training and deploying predictive models. In this context, imputation is a common practice, driven by the hope that accurate imputations will enhance predictions. However, recent theoretical and empirical studies indicate that simple constant imputation can be consistent and competitive. This empirical study aims at clarifyi…
▽ More
Missing values are prevalent across various fields, posing challenges for training and deploying predictive models. In this context, imputation is a common practice, driven by the hope that accurate imputations will enhance predictions. However, recent theoretical and empirical studies indicate that simple constant imputation can be consistent and competitive. This empirical study aims at clarifying if and when investing in advanced imputation methods yields significantly better predictions. Relating imputation and predictive accuracies across combinations of imputation and predictive models on 19 datasets, we show that imputation accuracy matters less i) when using expressive models, ii) when incorporating missingness indicators as complementary inputs, iii) matters much more for generated linear outcomes than for real-data outcomes. Interestingly, we also show that the use of the missingness indicator is beneficial to the prediction performance, even in MCAR scenarios. Overall, on real-data with powerful models, improving imputation only has a minor effect on prediction performance. Thus, investing in better imputations for improved predictions often offers limited benefits.
△ Less
Submitted 20 February, 2025; v1 submitted 29 July, 2024;
originally announced July 2024.
-
Correcting Exoplanet Transmission Spectra for Stellar Activity with an Optimised Retrieval Framework
Authors:
Alexandra Thompson,
Alfredo Biagini,
Gianluca Cracchiolo,
Antonino Petralia,
Quentin Changeat,
Arianna Saba,
Giuseppe Morello,
Mario Morvan,
Giuseppina Micela,
Giovanna Tinetti
Abstract:
The chromatic contamination that arises from photospheric heterogeneities e.g. spots and faculae on the host star presents a significant noise source for exoplanet transmission spectra. If this contamination is not corrected for, it can introduce substantial bias in our analysis of the planetary atmosphere. We utilise two stellar models of differing complexity, StARPA and ASteRA, to explore the bi…
▽ More
The chromatic contamination that arises from photospheric heterogeneities e.g. spots and faculae on the host star presents a significant noise source for exoplanet transmission spectra. If this contamination is not corrected for, it can introduce substantial bias in our analysis of the planetary atmosphere. We utilise two stellar models of differing complexity, StARPA and ASteRA, to explore the biases introduced by stellar contamination in retrieval under differing degrees of stellar activity. We use the retrieval framework TauREx3 and a grid of 27 synthetic, spot-contaminated transmission spectra to investigate potential biases and to determine how complex our stellar models must be in order to accurately extract the planetary parameters from transmission spectra. The input observation is generated using the more complex model (StARPA), in which the spot latitude is an additional, fixable parameter. This observation is then fed into a combined stellar-planetary retrieval which contains a simplified stellar model (ASteRA). Our results confirm that the inclusion of stellar activity parameters in retrieval minimises bias under all activity regimes considered. ASteRA performs very well under low to moderate activity conditions, retrieving the planetary parameters with a high degree of accuracy. For the most active cases, characterised by larger, higher temperature contrast spots, some minor residual bias remains due to ASteRA neglecting the interplay between the spot and the limb darkening effect. As a result of this, we find larger errors in retrieved planetary parameters for central spots (0 degrees) and those found close to the limb (60 degrees) than those at intermediate latitudes (30 degrees).
△ Less
Submitted 26 October, 2023; v1 submitted 9 February, 2023;
originally announced February 2023.
-
Beyond calibration: estimating the grouping loss of modern neural networks
Authors:
Alexandre Perez-Lebel,
Marine Le Morvan,
Gaël Varoquaux
Abstract:
The ability to ensure that a classifier gives reliable confidence scores is essential to ensure informed decision-making. To this end, recent work has focused on miscalibration, i.e., the over or under confidence of model scores. Yet calibration is not enough: even a perfectly calibrated classifier with the best possible accuracy can have confidence scores that are far from the true posterior prob…
▽ More
The ability to ensure that a classifier gives reliable confidence scores is essential to ensure informed decision-making. To this end, recent work has focused on miscalibration, i.e., the over or under confidence of model scores. Yet calibration is not enough: even a perfectly calibrated classifier with the best possible accuracy can have confidence scores that are far from the true posterior probabilities. This is due to the grouping loss, created by samples with the same confidence scores but different true posterior probabilities. Proper scoring rule theory shows that given the calibration loss, the missing piece to characterize individual errors is the grouping loss. While there are many estimators of the calibration loss, none exists for the grouping loss in standard settings. Here, we propose an estimator to approximate the grouping loss. We show that modern neural network architectures in vision and NLP exhibit grouping loss, notably in distribution shifts settings, which highlights the importance of pre-production validation.
△ Less
Submitted 27 April, 2023; v1 submitted 28 October, 2022;
originally announced October 2022.
-
ExoClock Project III: 450 new exoplanet ephemerides from ground and space observations
Authors:
A. Kokori,
A. Tsiaras,
B. Edwards,
A. Jones,
G. Pantelidou,
G. Tinetti,
L. Bewersdorff,
A. Iliadou,
Y. Jongen,
G. Lekkas,
A. Nastasi,
E. Poultourtzidis,
C. Sidiropoulos,
F. Walter,
A. Wünsche,
R. Abraham,
V. K. Agnihotri,
R. Albanesi,
E. Arce-Mansego,
D. Arnot,
M. Audejean,
C. Aumasson,
M. Bachschmidt,
G. Baj,
P. R. Barroy
, et al. (192 additional authors not shown)
Abstract:
The ExoClock project has been created with the aim of increasing the efficiency of the Ariel mission. It will achieve this by continuously monitoring and updating the ephemerides of Ariel candidates over an extended period, in order to produce a consistent catalogue of reliable and precise ephemerides. This work presents a homogenous catalogue of updated ephemerides for 450 planets, generated by t…
▽ More
The ExoClock project has been created with the aim of increasing the efficiency of the Ariel mission. It will achieve this by continuously monitoring and updating the ephemerides of Ariel candidates over an extended period, in order to produce a consistent catalogue of reliable and precise ephemerides. This work presents a homogenous catalogue of updated ephemerides for 450 planets, generated by the integration of $\sim$18000 data points from multiple sources. These sources include observations from ground-based telescopes (ExoClock network and ETD), mid-time values from the literature and light-curves from space telescopes (Kepler/K2 and TESS). With all the above, we manage to collect observations for half of the post-discovery years (median), with data that have a median uncertainty less than one minute. In comparison with literature, the ephemerides generated by the project are more precise and less biased. More than 40\% of the initial literature ephemerides had to be updated to reach the goals of the project, as they were either of low precision or drifting. Moreover, the integrated approach of the project enables both the monitoring of the majority of the Ariel candidates (95\%), and also the identification of missing data. The dedicated ExoClock network effectively supports this task by contributing additional observations when a gap in the data is identified. These results highlight the need for continuous monitoring to increase the observing coverage of the candidate planets. Finally, the extended observing coverage of planets allows us to detect trends (TTVs - Transit Timing Variations) for a sample of 19 planets. All products, data, and codes used in this work are open and accessible to the wider scientific community.
△ Less
Submitted 20 September, 2022;
originally announced September 2022.
-
Don't Pay Attention to the Noise: Learning Self-supervised Representations of Light Curves with a Denoising Time Series Transformer
Authors:
Mario Morvan,
Nikolaos Nikolaou,
Kai Hou Yip,
Ingo Waldmann
Abstract:
Astrophysical light curves are particularly challenging data objects due to the intensity and variety of noise contaminating them. Yet, despite the astronomical volumes of light curves available, the majority of algorithms used to process them are still operating on a per-sample basis. To remedy this, we propose a simple Transformer model -- called Denoising Time Series Transformer (DTST) -- and s…
▽ More
Astrophysical light curves are particularly challenging data objects due to the intensity and variety of noise contaminating them. Yet, despite the astronomical volumes of light curves available, the majority of algorithms used to process them are still operating on a per-sample basis. To remedy this, we propose a simple Transformer model -- called Denoising Time Series Transformer (DTST) -- and show that it excels at removing the noise and outliers in datasets of time series when trained with a masked objective, even when no clean targets are available. Moreover, the use of self-attention enables rich and illustrative queries into the learned representations. We present experiments on real stellar light curves from the Transiting Exoplanet Space Satellite (TESS), showing advantages of our approach compared to traditional denoising techniques.
△ Less
Submitted 6 July, 2022;
originally announced July 2022.
-
ESA-Ariel Data Challenge NeurIPS 2022: Inferring Physical Properties of Exoplanets From Next-Generation Telescopes
Authors:
Kai Hou Yip,
Ingo P. Waldmann,
Quentin Changeat,
Mario Morvan,
Ahmed F. Al-Refaie,
Billy Edwards,
Nikolaos Nikolaou,
Angelos Tsiaras,
Catarina Alves de Oliveira,
Pierre-Olivier Lagage,
Clare Jenner,
James Y-K. Cho,
Jeyan Thiyagalingam,
Giovanna Tinetti
Abstract:
The study of extra-solar planets, or simply, exoplanets, planets outside our own Solar System, is fundamentally a grand quest to understand our place in the Universe. Discoveries in the last two decades have re-defined our understanding of planets, and helped us comprehend the uniqueness of our very own Earth. In recent years the focus has shifted from planet detection to planet characterisation,…
▽ More
The study of extra-solar planets, or simply, exoplanets, planets outside our own Solar System, is fundamentally a grand quest to understand our place in the Universe. Discoveries in the last two decades have re-defined our understanding of planets, and helped us comprehend the uniqueness of our very own Earth. In recent years the focus has shifted from planet detection to planet characterisation, where key planetary properties are inferred from telescope observations using Monte Carlo-based methods. However, the efficiency of sampling-based methodologies is put under strain by the high-resolution observational data from next generation telescopes, such as the James Webb Space Telescope and the Ariel Space Mission. We are delighted to announce the acceptance of the Ariel ML Data Challenge 2022 as part of the NeurIPS competition track. The goal of this challenge is to identify a reliable and scalable method to perform planetary characterisation. Depending on the chosen track, participants are tasked to provide either quartile estimates or the approximate distribution of key planetary properties. To this end, a synthetic spectroscopic dataset has been generated from the official simulators for the ESA Ariel Space Mission. The aims of the competition are three-fold. 1) To offer a challenging application for comparing and advancing conditional density estimation methods. 2) To provide a valuable contribution towards reliable and efficient analysis of spectroscopic data, enabling astronomers to build a better picture of planetary demographics, and 3) To promote the interaction between ML and exoplanetary science. The competition is open from 15th June and will run until early October, participants of all skill levels are more than welcomed!
△ Less
Submitted 29 June, 2022;
originally announced June 2022.
-
Benchmarking missing-values approaches for predictive models on health databases
Authors:
Alexandre Perez-Lebel,
Gaël Varoquaux,
Marine Le Morvan,
Julie Josse,
Jean-Baptiste Poline
Abstract:
BACKGROUND: As databases grow larger, it becomes harder to fully control their collection, and they frequently come with missing values: incomplete observations. These large databases are well suited to train machine-learning models, for instance for forecasting or to extract biomarkers in biomedical settings. Such predictive approaches can use discriminative -- rather than generative -- modeling,…
▽ More
BACKGROUND: As databases grow larger, it becomes harder to fully control their collection, and they frequently come with missing values: incomplete observations. These large databases are well suited to train machine-learning models, for instance for forecasting or to extract biomarkers in biomedical settings. Such predictive approaches can use discriminative -- rather than generative -- modeling, and thus open the door to new missing-values strategies. Yet existing empirical evaluations of strategies to handle missing values have focused on inferential statistics. RESULTS: Here we conduct a systematic benchmark of missing-values strategies in predictive models with a focus on large health databases: four electronic health record datasets, a population brain imaging one, a health survey and two intensive care ones. Using gradient-boosted trees, we compare native support for missing values with simple and state-of-the-art imputation prior to learning. We investigate prediction accuracy and computational time. For prediction after imputation, we find that adding an indicator to express which values have been imputed is important, suggesting that the data are missing not at random. Elaborate missing values imputation can improve prediction compared to simple strategies but requires longer computational time on large data. Learning trees that model missing values-with missing incorporated attribute-leads to robust, fast, and well-performing predictive modeling. CONCLUSIONS: Native support for missing values in supervised machine learning predicts better than state-of-the-art imputation with much less computational cost. When using imputation, it is important to add indicator columns expressing which values have been imputed.
△ Less
Submitted 17 February, 2022;
originally announced February 2022.
-
ExoClock project II: A large-scale integrated study with 180 updated exoplanet ephemerides
Authors:
A. Kokori,
A. Tsiaras,
B. Edwards,
M. Rocchetto,
G. Tinetti,
L. Bewersdorff,
Y. Jongen,
G. Lekkas,
G. Pantelidou,
E. Poultourtzidis,
A. Wünsche,
C. Aggelis,
V. K. Agnihotri,
C. Arena,
M. Bachschmidt,
D. Bennett,
P. Benni,
K. Bernacki,
E. Besson,
L. Betti,
A. Biagini,
P. Brandebourg,
M. Bretton,
S. M. Brincat,
M. Caló
, et al. (80 additional authors not shown)
Abstract:
The ExoClock project is an inclusive, integrated, and interactive platform that was developed to monitor the ephemerides of the Ariel targets to increase the mission efficiency. The project makes the best use of all available resources, i.e., observations from ground telescopes, mid-time values from the literature and finally, observations from space instruments. Currently, the ExoClock network in…
▽ More
The ExoClock project is an inclusive, integrated, and interactive platform that was developed to monitor the ephemerides of the Ariel targets to increase the mission efficiency. The project makes the best use of all available resources, i.e., observations from ground telescopes, mid-time values from the literature and finally, observations from space instruments. Currently, the ExoClock network includes 280 participants with telescopes capable of observing 85\% of the currently known Ariel candidate targets. This work includes the results of $\sim$1600 observations obtained up to the 31st of December 2020 from the ExoClock network. These data in combination with $\sim$2350 mid-time values collected from the literature are used to update the ephemerides of 180 planets. The analysis shows that 40\% of the updated ephemerides will have an impact on future scheduling as either they have a significantly improved precision, or they have revealed biases in the old ephemerides. With the new observations, the observing coverage and rate for half of the planets in the sample has been doubled or more. Finally, from a population perspective, we identify that the differences in the 2028 predictions between the old and the new ephemerides have an STD that is double what is expected from gaussian uncertainties. These findings have implications for planning future observations, where we will need to account for drifts potentially greater than the prediction uncertainties. The updated ephemerides are open and accessible to the wider exoplanet community both from our Open Science Framework (OSF) repository and our website.
△ Less
Submitted 26 October, 2021;
originally announced October 2021.
-
The transmission spectrum of WASP-17 b from the optical to the near-infrared wavelengths: combining STIS, WFC3 and IRAC datasets
Authors:
Arianna Saba,
Angelos Tsiaras,
Mario Morvan,
Alexandra Thompson,
Quentin Changeat,
Billy Edwards,
Andrew Jolly,
Ingo Waldmann,
Giovanna Tinetti
Abstract:
We present the transmission spectrum of the inflated hot-Jupiter WASP-17 b, observed with the STIS and WFC3 instruments aboard the Hubble Space Telescope, allowing for a continuous wavelength coverage from ~0.4 to ~1.7 um. Observations taken with IRAC channel 1 and 2 on the Spitzer Space Telescope are also included, adding photometric measurements at 3.6 and 4.5 um. HST spectral data was analysed…
▽ More
We present the transmission spectrum of the inflated hot-Jupiter WASP-17 b, observed with the STIS and WFC3 instruments aboard the Hubble Space Telescope, allowing for a continuous wavelength coverage from ~0.4 to ~1.7 um. Observations taken with IRAC channel 1 and 2 on the Spitzer Space Telescope are also included, adding photometric measurements at 3.6 and 4.5 um. HST spectral data was analysed with Iraclis, a pipeline specialised in the reduction of STIS and WFC3 transit and eclipse observations. Spitzer photometric observations were reduced with the TLCD-LSTM method, utilising recurrent neural networks. The outcome of our reduction produces incompatible results between STIS visit 1 and visit 2, which leads us to consider two scenarios for G430L. Additionally, by modelling the WFC3 data alone, we can extract atmospheric information without having to deal with the contrasting STIS datasets. We run separate retrievals on the three spectral scenarios with the aid of TauREx 3, a fully Bayesian retrieval framework. We find that, independently of the data considered, the exoplanet atmosphere displays strong water signatures and potentially, the presence of aluminium oxide (AlO) and titanium hydride (TiH). A retrieval that includes an extreme photospheric activity of the host star is the preferred model, but we recognise that such a scenario is unlikely for an F6-type star. Due to the incompleteness of all STIS spectral lightcurves, only further observations with this instrument would allow us to properly constrain the atmospheric limb of WASP-17 b, before JWST or Ariel will come online.
△ Less
Submitted 2 May, 2022; v1 submitted 31 August, 2021;
originally announced August 2021.
-
What's a good imputation to predict with missing values?
Authors:
Marine Le Morvan,
Julie Josse,
Erwan Scornet,
Gaël Varoquaux
Abstract:
How to learn a good predictor on data with missing values? Most efforts focus on first imputing as well as possible and second learning on the completed data to predict the outcome. Yet, this widespread practice has no theoretical grounding. Here we show that for almost all imputation functions, an impute-then-regress procedure with a powerful learner is Bayes optimal. This result holds for all mi…
▽ More
How to learn a good predictor on data with missing values? Most efforts focus on first imputing as well as possible and second learning on the completed data to predict the outcome. Yet, this widespread practice has no theoretical grounding. Here we show that for almost all imputation functions, an impute-then-regress procedure with a powerful learner is Bayes optimal. This result holds for all missing-values mechanisms, in contrast with the classic statistical results that require missing-at-random settings to use imputation in probabilistic modeling. Moreover, it implies that perfect conditional imputation is not needed for good prediction asymptotically. In fact, we show that on perfectly imputed data the best regression function will generally be discontinuous, which makes it hard to learn. Crafting instead the imputation so as to leave the regression function unchanged simply shifts the problem to learning discontinuous imputations. Rather, we suggest that it is easier to learn imputation and regression jointly. We propose such a procedure, adapting NeuMiss, a neural network capturing the conditional links across observed and unobserved variables whatever the missing-value pattern. Experiments confirm that joint imputation and regression through NeuMiss is better than various two step procedures in our experiments with finite number of samples.
△ Less
Submitted 30 November, 2021; v1 submitted 1 June, 2021;
originally announced June 2021.
-
Ariel: Enabling planetary science across light-years
Authors:
Giovanna Tinetti,
Paul Eccleston,
Carole Haswell,
Pierre-Olivier Lagage,
Jérémy Leconte,
Theresa Lüftinger,
Giusi Micela,
Michel Min,
Göran Pilbratt,
Ludovic Puig,
Mark Swain,
Leonardo Testi,
Diego Turrini,
Bart Vandenbussche,
Maria Rosa Zapatero Osorio,
Anna Aret,
Jean-Philippe Beaulieu,
Lars Buchhave,
Martin Ferus,
Matt Griffin,
Manuel Guedel,
Paul Hartogh,
Pedro Machado,
Giuseppe Malaguti,
Enric Pallé
, et al. (293 additional authors not shown)
Abstract:
Ariel, the Atmospheric Remote-sensing Infrared Exoplanet Large-survey, was adopted as the fourth medium-class mission in ESA's Cosmic Vision programme to be launched in 2029. During its 4-year mission, Ariel will study what exoplanets are made of, how they formed and how they evolve, by surveying a diverse sample of about 1000 extrasolar planets, simultaneously in visible and infrared wavelengths.…
▽ More
Ariel, the Atmospheric Remote-sensing Infrared Exoplanet Large-survey, was adopted as the fourth medium-class mission in ESA's Cosmic Vision programme to be launched in 2029. During its 4-year mission, Ariel will study what exoplanets are made of, how they formed and how they evolve, by surveying a diverse sample of about 1000 extrasolar planets, simultaneously in visible and infrared wavelengths. It is the first mission dedicated to measuring the chemical composition and thermal structures of hundreds of transiting exoplanets, enabling planetary science far beyond the boundaries of the Solar System. The payload consists of an off-axis Cassegrain telescope (primary mirror 1100 mm x 730 mm ellipse) and two separate instruments (FGS and AIRS) covering simultaneously 0.5-7.8 micron spectral range. The satellite is best placed into an L2 orbit to maximise the thermal stability and the field of regard. The payload module is passively cooled via a series of V-Groove radiators; the detectors for the AIRS are the only items that require active cooling via an active Ne JT cooler. The Ariel payload is developed by a consortium of more than 50 institutes from 16 ESA countries, which include the UK, France, Italy, Belgium, Poland, Spain, Austria, Denmark, Ireland, Portugal, Czech Republic, Hungary, the Netherlands, Sweden, Norway, Estonia, and a NASA contribution.
△ Less
Submitted 10 April, 2021;
originally announced April 2021.
-
ARES V: No Evidence For Molecular Absorption in the HST WFC3 Spectrum of GJ 1132 b
Authors:
Lorenzo V. Mugnai,
Darius Modirrousta-Galian,
Billy Edwards,
Quentin Changeat,
Jeroen Bouwman,
Giuseppe Morello,
Ahmed Al-Refaie,
Robin Baeyens,
Michelle Fabienne Bieger,
Doriann Blain,
Amélie Gressier,
Gloria Guilluy,
Yassin Jaziri,
Flavien Kiefer,
Mario Morvan,
William Pluriel,
Mathilde Poveda,
Nour Skaf,
Niall Whiteford,
Sam Wright,
Kai Hou Yip,
Tiziano Zingales,
Benjamin Charnay,
Pierre Drossart,
Jérémy Leconte
, et al. (3 additional authors not shown)
Abstract:
We present a study on the spatially scanned spectroscopic observations of the transit of GJ 1132 b, a warm ($\sim$500 K) Super-Earth (1.13 R$_\oplus$) that was obtained with the G141 grism (1.125 - 1.650 $μ$m) of the Wide Field Camera 3 (WFC3) onboard the Hubble Space Telescope. We used the publicly available Iraclis pipeline to extract the planetary transmission spectra from the five visits and p…
▽ More
We present a study on the spatially scanned spectroscopic observations of the transit of GJ 1132 b, a warm ($\sim$500 K) Super-Earth (1.13 R$_\oplus$) that was obtained with the G141 grism (1.125 - 1.650 $μ$m) of the Wide Field Camera 3 (WFC3) onboard the Hubble Space Telescope. We used the publicly available Iraclis pipeline to extract the planetary transmission spectra from the five visits and produce a precise transmission spectrum. We analysed the spectrum using the TauREx3 atmospheric retrieval code with which we show that the measurements do not contain molecular signatures in the investigated wavelength range and are best-fit with a flat-line model. Our results suggest that the planet does not have a clear primordial, hydrogen-dominated atmosphere. Instead, GJ 1132 b could have a cloudy hydrogen-dominated envelope, a very enriched secondary atmosphere, be airless, or have a tenuous atmosphere that has not been detected. Due to the narrow wavelength coverage of WFC3, these scenarios cannot be distinguished yet but the James Webb Space Telescope may be capable of detecting atmospheric features, although several observations may be required to provide useful constraints.
△ Less
Submitted 3 May, 2021; v1 submitted 5 April, 2021;
originally announced April 2021.
-
ExoClock Project: An open platform for monitoring the ephemerides of Ariel targets with contributions from the public
Authors:
Anastasia Kokori,
Angelos Tsiaras,
Billy Edwards,
Marco Rocchetto,
Giovanna Tinetti,
Anaël Wünsche,
Nikolaos Paschalis,
Vikrant Kumar Agnihotri,
Matthieu Bachschmidt,
Marc Bretton,
Hamish Caines,
Mauro Caló,
Roland Casali,
Martin Crow,
Simon Dawes,
Marc Deldem,
Dimitrios Deligeorgopoulos,
Roger Dymock,
Phil Evans,
Carmelo Falco,
Stephane Ferratfiat,
Martin Fowler,
Stephen Futcher,
Pere Guerra,
Francois Hurter
, et al. (24 additional authors not shown)
Abstract:
The Ariel mission will observe spectroscopically around 1000 exoplanets to further characterise their atmospheres. For the mission to be as efficient as possible, a good knowledge of the planets' ephemerides is needed before its launch in 2028. While ephemerides for some planets are being refined on a per-case basis, an organised effort to collectively verify or update them when necessary does not…
▽ More
The Ariel mission will observe spectroscopically around 1000 exoplanets to further characterise their atmospheres. For the mission to be as efficient as possible, a good knowledge of the planets' ephemerides is needed before its launch in 2028. While ephemerides for some planets are being refined on a per-case basis, an organised effort to collectively verify or update them when necessary does not exist. In this study, we introduce the ExoClock project, an open, integrated and interactive platform with the purpose of producing a confirmed list of ephemerides for the planets that will be observed by Ariel. The project has been developed in a manner to make the best use of all available resources: observations reported in the literature, observations from space instruments and, mainly, observations from ground-based telescopes, including both professional and amateur observatories. To facilitate inexperienced observers and at the same time achieve homogeneity in the results, we created data collection and validation protocols, educational material and easy to use interfaces, open to everyone. ExoClock was launched in September 2019 and now counts over 140 participants from more than 15 countries around the world. In this release, we report the results of observations obtained until the 15h of April 2020 for 119 Ariel candidate targets. In total, 632 observations were used to either verify or update the ephemerides of 83 planets. Additionally, we developed the Exoplanet Characterisation Catalogue (ECC), a catalogue built in a consistent way to assist the ephemeris refinement process. So far, the collaborative open framework of the ExoClock project has proven to be highly efficient in coordinating scientific efforts involving diverse audiences. Therefore, we believe that it is a paradigm that can be applied in the future for other research purposes, too.
△ Less
Submitted 14 December, 2020;
originally announced December 2020.
-
Peeking inside the Black Box: Interpreting Deep Learning Models for Exoplanet Atmospheric Retrievals
Authors:
Kai Hou Yip,
Quentin Changeat,
Nikolaos Nikolaou,
Mario Morvan,
Billy Edwards,
Ingo P. Waldmann,
Giovanna Tinetti
Abstract:
Deep learning algorithms are growing in popularity in the field of exoplanetary science due to their ability to model highly non-linear relations and solve interesting problems in a data-driven manner. Several works have attempted to perform fast retrievals of atmospheric parameters with the use of machine learning algorithms like deep neural networks (DNNs). Yet, despite their high predictive pow…
▽ More
Deep learning algorithms are growing in popularity in the field of exoplanetary science due to their ability to model highly non-linear relations and solve interesting problems in a data-driven manner. Several works have attempted to perform fast retrievals of atmospheric parameters with the use of machine learning algorithms like deep neural networks (DNNs). Yet, despite their high predictive power, DNNs are also infamous for being 'black boxes'. It is their apparent lack of explainability that makes the astrophysics community reluctant to adopt them. What are their predictions based on? How confident should we be in them? When are they wrong and how wrong can they be? In this work, we present a number of general evaluation methodologies that can be applied to any trained model and answer questions like these. In particular, we train three different popular DNN architectures to retrieve atmospheric parameters from exoplanet spectra and show that all three achieve good predictive performance. We then present an extensive analysis of the predictions of DNNs, which can inform us - among other things - of the credibility limits for atmospheric parameters for a given instrument and model. Finally, we perform a perturbation-based sensitivity analysis to identify to which features of the spectrum the outcome of the retrieval is most sensitive. We conclude that for different molecules, the wavelength ranges to which the DNN's predictions are most sensitive, indeed coincide with their characteristic absorption regions. The methodologies presented in this work help to improve the evaluation of DNNs and to grant interpretability to their predictions.
△ Less
Submitted 23 July, 2021; v1 submitted 23 November, 2020;
originally announced November 2020.
-
Hubble WFC3 Spectroscopy of the Habitable-zone Super-Earth LHS 1140 b
Authors:
Billy Edwards,
Quentin Changeat,
Mayuko Mori,
Lara O. Anisman,
Mario Morvan,
Kai Hou Yip,
Angelos Tsiaras,
Ahmed Al-Refaie,
Ingo Waldmann,
Giovanna Tinetti
Abstract:
Atmospheric characterisation of temperate, rocky planets is the holy grail of exoplanet studies. These worlds are at the limits of our capabilities with current instrumentation in transmission spectroscopy and challenge our state-of-the-art statistical techniques. Here we present the transmission spectrum of the temperate Super-Earth LHS 1140b using the Hubble Space Telescope (HST). The Wide Field…
▽ More
Atmospheric characterisation of temperate, rocky planets is the holy grail of exoplanet studies. These worlds are at the limits of our capabilities with current instrumentation in transmission spectroscopy and challenge our state-of-the-art statistical techniques. Here we present the transmission spectrum of the temperate Super-Earth LHS 1140b using the Hubble Space Telescope (HST). The Wide Field Camera 3 (WFC3) G141 grism data of this habitable zone (T$_{\rm{eq}}$ = 235 K) Super-Earth (R = 1.7 $R_\oplus$), shows tentative evidence of water. However, the signal-to-noise ratio, and thus the significance of the detection, is low and stellar contamination models can cause modulation over the spectral band probed. We attempt to correct for contamination using these models and find that, while many still lead to evidence for water, some could provide reasonable fits to the data without the need for molecular absorption although most of these cause also features in the visible ground-based data which are nonphysical. Future observations with the James Webb Space Telescope (JWST) would be capable of confirming, or refuting, this atmospheric detection.
△ Less
Submitted 17 November, 2020;
originally announced November 2020.
-
ARES IV: Probing the atmospheres of the two warm small planets HD 106315 c and HD 3167 c with the HST/WFC3 camera
Authors:
Gloria Guilluy,
Amélie Gressier,
Sam Wright,
Alexandre Santerne,
Adam Y. jaziri,
Billy Edwards,
Quentin Changeat,
Darius Modirrousta-Galian,
Nour Skaf,
Ahmed Al-Refaie,
Robin Baeyens,
Michelle Fabienne Bieger,
Doriann Blain,
Flavien Kiefer,
Mario Morvan,
Lorenzo V. Mugnai,
William Pluriel,
Mathilde Poveda,
Tiziano Tsingales,
Niall Whiteford,
Kai Hou Yip,
Benjamin Charnay,
Jérémy Leconte,
Pierre Drossart,
Alessandro Sozzetti
, et al. (5 additional authors not shown)
Abstract:
We present an atmospheric characterization study of two medium sized planets bracketing the radius of Neptune: HD 106315 c (R$_{\rm{P}}$=4.98 $\pm$ 0.23 R$_{\oplus}$) and HD 3167 c (R$_{\rm{P}}$=2.740$_{-0.100}^{+0.106}$ R$_{\oplus}$). We analyse spatially scanned spectroscopic observations obtained with the G141 grism (1.125 - 1.650 $μ$m) of the Wide Field Camera 3 (WFC3) onboard the Hubble Space…
▽ More
We present an atmospheric characterization study of two medium sized planets bracketing the radius of Neptune: HD 106315 c (R$_{\rm{P}}$=4.98 $\pm$ 0.23 R$_{\oplus}$) and HD 3167 c (R$_{\rm{P}}$=2.740$_{-0.100}^{+0.106}$ R$_{\oplus}$). We analyse spatially scanned spectroscopic observations obtained with the G141 grism (1.125 - 1.650 $μ$m) of the Wide Field Camera 3 (WFC3) onboard the Hubble Space Telescope. We use the publicly available Iraclis pipeline and TauREx3 atmospheric retrieval code and we detect water vapor in the atmosphere of both planets with an abundance of $\log_{10}[\mathrm{H_2O}]=-2.1^{+0.7}_{-1.3}$ ($\sim$5.68$σ$) and $\log_{10}[\mathrm{H_2O}]=-4.1^{+0.9}_{-0.9}$ ($\sim$3.17$σ$) for HD 106315 c and HD 3167 c, respectively. The transmission spectrum of HD 106315 c shows also a possible evidence of ammonia absorption ($\log_{10}[\mathrm {NH_3}]=-4.3^{+0.7}_{-2.0}$, $\sim$1.97$σ$ -even if it is not significant-), whilst carbon dioxide absorption features may be present in the atmosphere of HD 3167 c in the $\sim$1.1-1.6~$μ$m wavelength range ($\log_{10}[\mathrm{CO_{2}}]= -2.4^{+0.7}_{-1.0}$, $\sim$3.28$σ$). However the CO$_2$ detection appears significant, it must be considered carefully and put into perspective. Indeed, CO$_2$ presence is not explained by 1D equilibrium chemistry models, and it could be due to possible systematics. The additional contribution of clouds, CO and CH$_4$ are discussed. HD 106315 c and HD 3167 c will be interesting targets for upcoming telescopes such as the James Webb Space Telescope (JWST) and the Atmospheric Remote-Sensing Infrared Exoplanet Large-Survey (Ariel).
△ Less
Submitted 6 November, 2020;
originally announced November 2020.
-
PyLightcurve-torch: a transit modelling package for deep learning applications in PyTorch
Authors:
Mario Morvan,
Angelos Tsiaras,
Nikolaos Nikolaou,
Ingo P. Waldmann
Abstract:
We present a new open source python package, based on PyLightcurve and PyTorch, tailored for efficient computation and automatic differentiation of exoplanetary transits. The classes and functions implemented are fully vectorised, natively GPU-compatible and differentiable with respect to the stellar and planetary parameters. This makes PyLightcurve-torch suitable for traditional forward computati…
▽ More
We present a new open source python package, based on PyLightcurve and PyTorch, tailored for efficient computation and automatic differentiation of exoplanetary transits. The classes and functions implemented are fully vectorised, natively GPU-compatible and differentiable with respect to the stellar and planetary parameters. This makes PyLightcurve-torch suitable for traditional forward computation of transits, but also extends the range of possible applications with inference and optimisation algorithms requiring access to the gradients of the physical model. This endeavour is aimed at fostering the use of deep learning in exoplanets research, motivated by an ever increasing amount of stellar light curves data and various incentives for the improvement of detection and characterisation techniques.
△ Less
Submitted 28 December, 2020; v1 submitted 3 November, 2020;
originally announced November 2020.
-
Lessons Learned from the 1st ARIEL Machine Learning Challenge: Correcting Transiting Exoplanet Light Curves for Stellar Spots
Authors:
Nikolaos Nikolaou,
Ingo P. Waldmann,
Angelos Tsiaras,
Mario Morvan,
Billy Edwards,
Kai Hou Yip,
Giovanna Tinetti,
Subhajit Sarkar,
James M. Dawson,
Vadim Borisov,
Gjergji Kasneci,
Matej Petkovic,
Tomaz Stepisnik,
Tarek Al-Ubaidi,
Rachel Louise Bailey,
Michael Granitzer,
Sahib Julka,
Roman Kern,
Patrick Ofner,
Stefan Wagner,
Lukas Heppe,
Mirko Bunse,
Katharina Morik
Abstract:
The last decade has witnessed a rapid growth of the field of exoplanet discovery and characterisation. However, several big challenges remain, many of which could be addressed using machine learning methodology. For instance, the most prolific method for detecting exoplanets and inferring several of their characteristics, transit photometry, is very sensitive to the presence of stellar spots. The…
▽ More
The last decade has witnessed a rapid growth of the field of exoplanet discovery and characterisation. However, several big challenges remain, many of which could be addressed using machine learning methodology. For instance, the most prolific method for detecting exoplanets and inferring several of their characteristics, transit photometry, is very sensitive to the presence of stellar spots. The current practice in the literature is to identify the effects of spots visually and correct for them manually or discard the affected data. This paper explores a first step towards fully automating the efficient and precise derivation of transit depths from transit light curves in the presence of stellar spots. The methods and results we present were obtained in the context of the 1st Machine Learning Challenge organized for the European Space Agency's upcoming Ariel mission. We first present the problem, the simulated Ariel-like data and outline the Challenge while identifying best practices for organizing similar challenges in the future. Finally, we present the solutions obtained by the top-5 winning teams, provide their code and discuss their implications. Successful solutions either construct highly non-linear (w.r.t. the raw data) models with minimal preprocessing -deep neural networks and ensemble methods- or amount to obtaining meaningful statistics from the light curves, constructing linear models on which yields comparably good predictive performance.
△ Less
Submitted 29 October, 2020;
originally announced October 2020.
-
KELT-11 b: Abundances of water and constraints on carbon-bearing molecules from the Hubble transmission spectrum
Authors:
Quentin Changeat,
Billy Edwards,
Ahmed F. Al-Refaie,
Mario Morvan,
Angelos Tsiaras,
Ingo P. Waldmann,
Giovanna Tinetti
Abstract:
In the past decade, the analysis of exoplanet atmospheric spectra has revealed the presence of water vapour in almost all the planets observed, with the exception of a fraction of overcast planets. Indeed, water vapour presents a large absorption signature in the wavelength coverage of the Hubble Space Telescope's (HST) Wide Field Camera 3 (WFC3), which is the main space-based observatory for atmo…
▽ More
In the past decade, the analysis of exoplanet atmospheric spectra has revealed the presence of water vapour in almost all the planets observed, with the exception of a fraction of overcast planets. Indeed, water vapour presents a large absorption signature in the wavelength coverage of the Hubble Space Telescope's (HST) Wide Field Camera 3 (WFC3), which is the main space-based observatory for atmospheric studies of exoplanets, making its detection very robust. However, while carbon-bearing species such as methane, carbon monoxide and carbon dioxide are also predicted from current chemical models, their direct detection and abundance characterisation has remained a challenge. Here we analyse the transmission spectrum of the puffy, clear hot-Jupiter KELT-11 b from the HST WFC3 camera. We find that the spectrum is consistent with the presence of water vapor and an additional absorption at longer wavelengths than 1.5um, which could well be explained by a mix of carbon bearing molecules. CO2, when included is systematically detected. One of the main difficulties to constrain the abundance of those molecules is their weak signatures across the HST WFC3 wavelength coverage, particularly when compared to those of water. Through a comprehensive retrieval analysis, we attempt to explain the main degeneracies present in this dataset and explore some of the recurrent challenges that are occurring in retrieval studies (e.g: the impact of model selection, the use of free vs self-consistent chemistry and the combination of instrument observations). Our results make this planet an exceptional example of chemical laboratory where to test current physical and chemical models of hot-Jupiters' atmospheres.
△ Less
Submitted 3 October, 2020;
originally announced October 2020.
-
On The Compatibility of Ground-based and Space-based Data: WASP-96 b, An Example
Authors:
Kai Hou Yip,
Quentin Changeat,
Billy Edwards,
Mario Morvan,
Katy L. Chubb,
Angelos Tsiaras,
Ingo P. Waldmann,
Giovanna Tinetti
Abstract:
The study of exoplanetary atmospheres relies on detecting minute changes in the transit depth at different wavelengths. To date, a number of ground and space based instruments have been used to obtain transmission spectra of exoplanets in different spectral band. One common practice is to combine observations from different instruments in order to achieve a broader wavelength coverage. We present…
▽ More
The study of exoplanetary atmospheres relies on detecting minute changes in the transit depth at different wavelengths. To date, a number of ground and space based instruments have been used to obtain transmission spectra of exoplanets in different spectral band. One common practice is to combine observations from different instruments in order to achieve a broader wavelength coverage. We present here two inconsistent observations of WASP-96 b, one by Hubble Space Telescope (HST) and the other by the Very Large Telescope (VLT). We present two key findings in our investigation: 1.) a strong water signature is detected via the HST WFC3 observations. 2.) A notable offset in transit depth (> 1100 ppm) can be seen when the ground-based and space-based observations are combined together. The discrepancy raises the question of whether observations from different instruments could indeed be combined together. We attempt to align the observations by including an additional parameter in our retrieval studies but are unable to definitively ascertain that the aligned observations are indeed compatible. The case of WASP-96 b signals that compatibility of instruments should not be assumed. While wavelength overlaps between instruments can help, it should be noted that combining datasets remains a risky business. The difficulty in combining observations also strengthens the need for next generation instruments which will possess broader spectral coverage.
△ Less
Submitted 19 October, 2020; v1 submitted 22 September, 2020;
originally announced September 2020.
-
Original Research By Young Twinkle Students (ORBYTS): Ephemeris Refinement of Transiting Exoplanets II
Authors:
Billy Edwards,
Lara Anisman,
Quentin Changeat,
Mario Morvan,
Sam Wright,
Kai Hou Yip,
Amiira Abdullahi,
Jesmin Ali,
Clarry Amofa,
Antony Antoniou,
Shahad Arzouni,
Noeka Bradley,
Dayanara Campana,
Nandini Chavda,
Jessy Creswell,
Neliman Gazieva,
Emily Gudgeon-Sidelnikova,
Pratap Guha,
Ella Hayden,
Mohammed Huda,
Hana Hussein,
Ayub Ibrahim,
Chika Ike,
Salma Jama,
Bhavya Joshi
, et al. (38 additional authors not shown)
Abstract:
We report follow-up observations of four transiting exoplanets, TRES-2b, HAT-P-22b, HAT-P-36b and XO-2b, as part of the Original Research By Young Twinkle Students (ORBYTS) programme. These observations were taken using the Las Cumbres Observatory Global Telescope Network's (LCOGT) robotic 0.4 m telescopes and were analysed using the HOlomon Photometric Software (HOPS). Such observations are key f…
▽ More
We report follow-up observations of four transiting exoplanets, TRES-2b, HAT-P-22b, HAT-P-36b and XO-2b, as part of the Original Research By Young Twinkle Students (ORBYTS) programme. These observations were taken using the Las Cumbres Observatory Global Telescope Network's (LCOGT) robotic 0.4 m telescopes and were analysed using the HOlomon Photometric Software (HOPS). Such observations are key for ensuring accurate transit times for upcoming telescopes, such as the James Webb Space Telescope (JWST), Twinkle and Ariel, which may seek to characterise the atmospheres of these planets. The data have been uploaded to ExoClock and a significant portion of this work has been completed by secondary school students in London.
△ Less
Submitted 14 July, 2020;
originally announced July 2020.
-
NeuMiss networks: differentiable programming for supervised learning with missing values
Authors:
Marine Le Morvan,
Julie Josse,
Thomas Moreau,
Erwan Scornet,
Gaël Varoquaux
Abstract:
The presence of missing values makes supervised learning much more challenging. Indeed, previous work has shown that even when the response is a linear function of the complete data, the optimal predictor is a complex function of the observed entries and the missingness indicator. As a result, the computational or sample complexities of consistent approaches depend on the number of missing pattern…
▽ More
The presence of missing values makes supervised learning much more challenging. Indeed, previous work has shown that even when the response is a linear function of the complete data, the optimal predictor is a complex function of the observed entries and the missingness indicator. As a result, the computational or sample complexities of consistent approaches depend on the number of missing patterns, which can be exponential in the number of dimensions. In this work, we derive the analytical form of the optimal predictor under a linearity assumption and various missing data mechanisms including Missing at Random (MAR) and self-masking (Missing Not At Random). Based on a Neumann-series approximation of the optimal predictor, we propose a new principled architecture, named NeuMiss networks. Their originality and strength come from the use of a new type of non-linearity: the multiplication by the missingness indicator. We provide an upper bound on the Bayes risk of NeuMiss networks, and show that they have good predictive accuracy with both a number of parameters and a computational complexity independent of the number of missing data patterns. As a result they scale well to problems with many features, and remain statistically efficient for medium-sized samples. Moreover, we show that, contrary to procedures using EM or imputation, they are robust to the missing data mechanism, including difficult MNAR settings such as self-masking.
△ Less
Submitted 4 November, 2020; v1 submitted 3 July, 2020;
originally announced July 2020.
-
ARES III: Unveiling the Two Faces of KELT-7 b with HST WFC3
Authors:
William Pluriel,
Niall Whiteford,
Billy Edwards,
Quentin Changeat,
Kai Hou Yip,
Robin Baeyens,
Ahmed Al-Refaie,
Michelle Fabienne Bieger,
Dorian Blain,
Amelie Gressier,
Gloria Guilluy,
Adam Yassin Jaziri,
Flavien Kiefer,
Darius Modirrousta-Galian,
Mario Morvan,
Lorenzo V. Mugnai,
Mathilde Poveda,
Nour Skaf,
Tiziano Zingales,
Sam Wright,
Benjamin Charnay,
Pierre Drossart,
Jeremy Leconte,
Angelos Tsiaras,
Olivia Venot
, et al. (2 additional authors not shown)
Abstract:
We present the analysis of the hot-Jupiter KELT-7b using transmission and emission spectroscopy from the Hubble Space Telescope (HST), both taken with the Wide Field Camera 3 (WFC3). Our study uncovers a rich transmission spectrum which is consistent with a cloud-free atmosphere and suggests the presence of H2O and H-. In contrast, the extracted emission spectrum does not contain strong absorption…
▽ More
We present the analysis of the hot-Jupiter KELT-7b using transmission and emission spectroscopy from the Hubble Space Telescope (HST), both taken with the Wide Field Camera 3 (WFC3). Our study uncovers a rich transmission spectrum which is consistent with a cloud-free atmosphere and suggests the presence of H2O and H-. In contrast, the extracted emission spectrum does not contain strong absorption features and, although it is not consistent with a simple blackbody, it can be explained by a varying temperature-pressure profile, collision induced absorption (CIA) and H-. KELT-7 b had also been studied with other space-based instruments and we explore the effects of introducing these additional datasets. Further observations with Hubble, or the next generation of space-based telescopes, are needed to allow for the optical opacity source in transmission to be confirmed and for molecular features to be disentangled in emission.
△ Less
Submitted 17 September, 2020; v1 submitted 25 June, 2020;
originally announced June 2020.
-
ARES II: Characterising the Hot Jupiters WASP-127 b, WASP-79 b and WASP-62 b with HST
Authors:
Nour Skaf,
Michelle Fabienne Bieger,
Billy Edwards,
Quentin Changeat,
Mario Morvan,
Flavien Kiefer,
Doriann Blain,
Tiziano Zingales,
Mathilde Poveda,
Ahmed Al-Refaie,
Robin Baeyens,
Amelie Gressier,
Gloria Guilluy,
Adam Yassin Jaziri,
Darius Modirrousta-Galian,
Lorenzo V. Mugnai,
William Pluriel,
Niall Whiteford,
Sam Wright,
Kai Hou Yip,
Benjamin Charnay,
Jeremy Leconte,
Pierre Drossart,
Angelos Tsiaras,
Olivia Venot
, et al. (2 additional authors not shown)
Abstract:
This paper presents the atmospheric characterisation of three large, gaseous planets: WASP-127b, WASP-79b and WASP-62b. We analysed spectroscopic data obtained with the G141 grism (1.088 - 1.68 $μ$m) of the Wide Field Camera 3 (WFC3) onboard the Hubble Space Telescope (HST) using the Iraclis pipeline and the TauREx3 retrieval code, both of which are publicly available. For WASP-127 b, which is the…
▽ More
This paper presents the atmospheric characterisation of three large, gaseous planets: WASP-127b, WASP-79b and WASP-62b. We analysed spectroscopic data obtained with the G141 grism (1.088 - 1.68 $μ$m) of the Wide Field Camera 3 (WFC3) onboard the Hubble Space Telescope (HST) using the Iraclis pipeline and the TauREx3 retrieval code, both of which are publicly available. For WASP-127 b, which is the least dense planet discovered so far and is located in the short-period Neptune desert, our retrieval results found strong water absorption corresponding to an abundance of log(H$_2$O) = -2.71$^{+0.78}_{-1.05}$, and absorption compatible with an iron hydride abundance of log(FeH)=$-5.25^{+0.88}_{-1.10}$, with an extended cloudy atmosphere. We also detected water vapour in the atmospheres of WASP-79 b and WASP-62 b, with best-fit models indicating the presence of iron hydride, too. We used the Atmospheric Detectability Index (ADI) as well as Bayesian log evidence to quantify the strength of the detection and compared our results to the hot Jupiter population study by Tsiaras et al. 2018. While all the planets studied here are suitable targets for characterisation with upcoming facilities such as the James Webb Space Telescope (JWST) and Ariel, WASP-127 b is of particular interest due to its low density, and a thorough atmospheric study would develop our understanding of planet formation and migration.
△ Less
Submitted 17 September, 2020; v1 submitted 19 May, 2020;
originally announced May 2020.
-
ARES I: WASP-76 b, A Tale of Two HST Spectra
Authors:
Billy Edwards,
Quentin Changeat,
Robin Baeyens,
Angelos Tsiaras,
Ahmed Al-Refaie,
Jake Taylor,
Kai Hou Yip,
Michelle Fabienne Bieger,
Doriann Blain,
Amelie Gressier,
Gloria Guilluy,
Adam Yassin Jaziri,
Flavien Kiefer,
Darius Modirrousta-Galian,
Mario Morvan,
Lorenzo V. Mugnai,
William Pluriel,
Mathilde Poveda,
Nour Skaf,
Niall Whiteford,
Sam Wright,
Tiziano Zingales,
Benjamin Charnay,
Pierre Drossart,
Jeremy Leconte
, et al. (3 additional authors not shown)
Abstract:
We analyse the transmission and emission spectra of the ultra-hot Jupiter WASP-76b, observed with the G141 grism of the Hubble Space Telescope's Wide Field Camera 3 (WFC3). We reduce and fit the raw data for each observation using the open-source software Iraclis before performing a fully Bayesian retrieval using the publicly available analysis suite TauRex 3. Previous studies of the WFC3 transmis…
▽ More
We analyse the transmission and emission spectra of the ultra-hot Jupiter WASP-76b, observed with the G141 grism of the Hubble Space Telescope's Wide Field Camera 3 (WFC3). We reduce and fit the raw data for each observation using the open-source software Iraclis before performing a fully Bayesian retrieval using the publicly available analysis suite TauRex 3. Previous studies of the WFC3 transmission spectra of WASP-76 b found hints of titanium oxide (TiO) and vanadium oxide (VO) or non-grey clouds. Accounting for a fainter stellar companion to WASP-76, we reanalyse this data and show that removing the effects of this background star changes the slope of the spectrum, resulting in these visible absorbers no longer being detected, eliminating the need for a non-grey cloud model to adequately fit the data but maintaining the strong water feature previously seen. However, our analysis of the emission spectrum suggests the presence of TiO and an atmospheric thermal inversion, along with a significant amount of water. Given the brightness of the host star and the size of the atmospheric features, WASP-76 b is an excellent target for further characterisation with HST, or with future facilities, to better understand the nature of its atmosphere, to confirm the presence of TiO and to search for other optical absorbers.
△ Less
Submitted 17 September, 2020; v1 submitted 5 May, 2020;
originally announced May 2020.
-
Linear predictor on linearly-generated data with missing values: non consistency and solutions
Authors:
Marine Le Morvan,
Nicolas Prost,
Julie Josse,
Erwan Scornet,
Gaël Varoquaux
Abstract:
We consider building predictors when the data have missing values. We study the seemingly-simple case where the target to predict is a linear function of the fully-observed data and we show that, in the presence of missing values, the optimal predictor may not be linear. In the particular Gaussian case, it can be written as a linear function of multiway interactions between the observed data and t…
▽ More
We consider building predictors when the data have missing values. We study the seemingly-simple case where the target to predict is a linear function of the fully-observed data and we show that, in the presence of missing values, the optimal predictor may not be linear. In the particular Gaussian case, it can be written as a linear function of multiway interactions between the observed data and the various missing-value indicators. Due to its intrinsic complexity, we study a simple approximation and prove generalization bounds with finite samples, highlighting regimes for which each method performs best. We then show that multilayer perceptrons with ReLU activation functions can be consistent, and can explore good trade-offs between the true model and approximations. Our study highlights the interesting family of models that are beneficial to fit with missing values depending on the amount of data available.
△ Less
Submitted 12 May, 2020; v1 submitted 3 February, 2020;
originally announced February 2020.
-
Detrending Exoplanetary Transit Light Curves with Long Short-Term Memory Networks
Authors:
Mario Morvan,
Nikolaos Nikolaou,
Angelos Tsiaras,
Ingo P. Waldmann
Abstract:
The precise derivation of transit depths from transit light curves is a key component for measuring exoplanet transit spectra, and henceforth for the study of exoplanet atmospheres. However, it is still deeply affected by various kinds of systematic errors and noise. In this paper we propose a new detrending method by reconstructing the stellar flux baseline during transit time. We train a probabi…
▽ More
The precise derivation of transit depths from transit light curves is a key component for measuring exoplanet transit spectra, and henceforth for the study of exoplanet atmospheres. However, it is still deeply affected by various kinds of systematic errors and noise. In this paper we propose a new detrending method by reconstructing the stellar flux baseline during transit time. We train a probabilistic Long Short-Term Memory (LSTM) network to predict the next data point of the light curve during the out-of-transit, and use this model to reconstruct a transit-free light curve - i.e. including only the systematics - during the in-transit. By making no assumption about the instrument, and using only the transit ephemeris, this provides a general way to correct the systematics and perform a subsequent transit fit. The name of the proposed model is TLCD-LSTM, standing for Transit Light Curve Detrending LSTM. Here we present the first results on data from six transit observations of HD 189733b with the IRAC camera on board the Spitzer Space Telescope, and discuss some of its possible further applications.
△ Less
Submitted 10 January, 2020;
originally announced January 2020.
-
Pushing the Limits of Exoplanet Discovery via Direct Imaging with Deep Learning
Authors:
Kai Hou Yip,
Nikolaos Nikolaou,
Piero Coronica,
Angelos Tsiaras,
Billy Edwards,
Quentin Changeat,
Mario Morvan,
Beth Biller,
Sasha Hinkley,
Jeffrey Salmond,
Matthew Archer,
Paul Sumption,
Elodie Choquet,
Remi Soummer,
Laurent Pueyo,
Ingo P. Waldmann
Abstract:
Further advances in exoplanet detection and characterisation require sampling a diverse population of extrasolar planets. One technique to detect these distant worlds is through the direct detection of their thermal emission. The so-called direct imaging technique, is suitable for observing young planets far from their star. These are very low signal-to-noise-ratio (SNR) measurements and limited g…
▽ More
Further advances in exoplanet detection and characterisation require sampling a diverse population of extrasolar planets. One technique to detect these distant worlds is through the direct detection of their thermal emission. The so-called direct imaging technique, is suitable for observing young planets far from their star. These are very low signal-to-noise-ratio (SNR) measurements and limited ground truth hinders the use of supervised learning approaches. In this paper, we combine deep generative and discriminative models to bypass the issues arising when directly training on real data. We use a Generative Adversarial Network to obtain a suitable dataset for training Convolutional Neural Network classifiers to detect and locate planets across a wide range of SNRs. Tested on artificial data, our detectors exhibit good predictive performance and robustness across SNRs. To demonstrate the limits of the detectors, we provide maps of the precision and recall of the model per pixel of the input image. On real data, the models can re-confirm bright source detections.
△ Less
Submitted 28 January, 2020; v1 submitted 12 April, 2019;
originally announced April 2019.
-
A new method for unveiling Open Clusters in Gaia: new nearby Open Clusters confirmed by DR2
Authors:
A. Castro-Ginard,
C. Jordi,
X. Luri,
F. Julbe,
M. Morvan,
L. Balaguer-Núñez,
T. Cantat-Gaudin
Abstract:
The publication of the Gaia Data Release 2 (Gaia DR2) opens a new era in Astronomy. It includes precise astrometric data (positions, proper motions and parallaxes) for more than $1.3$ billion sources, mostly stars. To analyse such a vast amount of new data, the use of data mining techniques and machine learning algorithms are mandatory. The search for Open Clusters, groups of stars that were born…
▽ More
The publication of the Gaia Data Release 2 (Gaia DR2) opens a new era in Astronomy. It includes precise astrometric data (positions, proper motions and parallaxes) for more than $1.3$ billion sources, mostly stars. To analyse such a vast amount of new data, the use of data mining techniques and machine learning algorithms are mandatory. The search for Open Clusters, groups of stars that were born and move together, located in the disk, is a great example for the application of these techniques. Our aim is to develop a method to automatically explore the data space, requiring minimal manual intervention. We explore the performance of a density based clustering algorithm, DBSCAN, to find clusters in the data together with a supervised learning method such as an Artificial Neural Network (ANN) to automatically distinguish between real Open Clusters and statistical clusters. The development and implementation of this method to a $5$-Dimensional space ($l$, $b$, $\varpi$, $μ_{α^*}$, $μ_δ$) to the Tycho-Gaia Astrometric Solution (TGAS) data, and a posterior validation using Gaia DR2 data, lead to the proposal of a set of new nearby Open Clusters. We have developed a method to find OCs in astrometric data, designed to be applied to the full Gaia DR2 archive.
△ Less
Submitted 12 June, 2018; v1 submitted 8 May, 2018;
originally announced May 2018.
-
WHInter: A Working set algorithm for High-dimensional sparse second order Interaction models
Authors:
Marine Le Morvan,
Jean-Philippe Vert
Abstract:
Learning sparse linear models with two-way interactions is desirable in many application domains such as genomics. l1-regularised linear models are popular to estimate sparse models, yet standard implementations fail to address specifically the quadratic explosion of candidate two-way interactions in high dimensions, and typically do not scale to genetic data with hundreds of thousands of features…
▽ More
Learning sparse linear models with two-way interactions is desirable in many application domains such as genomics. l1-regularised linear models are popular to estimate sparse models, yet standard implementations fail to address specifically the quadratic explosion of candidate two-way interactions in high dimensions, and typically do not scale to genetic data with hundreds of thousands of features. Here we present WHInter, a working set algorithm to solve large l1-regularised problems with two-way interactions for binary design matrices. The novelty of WHInter stems from a new bound to efficiently identify working sets while avoiding to scan all features, and on fast computations inspired from solutions to the maximum inner product search problem. We apply WHInter to simulated and real genetic data and show that it is more scalable and two orders of magnitude faster than the state of the art.
△ Less
Submitted 16 February, 2018;
originally announced February 2018.
-
Supervised Quantile Normalisation
Authors:
Marine Le Morvan,
Jean-Philippe Vert
Abstract:
Quantile normalisation is a popular normalisation method for data subject to unwanted variations such as images, speech, or genomic data. It applies a monotonic transformation to the feature values of each sample to ensure that after normalisation, they follow the same target distribution for each sample. Choosing a "good" target distribution remains however largely empirical and heuristic, and is…
▽ More
Quantile normalisation is a popular normalisation method for data subject to unwanted variations such as images, speech, or genomic data. It applies a monotonic transformation to the feature values of each sample to ensure that after normalisation, they follow the same target distribution for each sample. Choosing a "good" target distribution remains however largely empirical and heuristic, and is usually done independently of the subsequent analysis of normalised data. We propose instead to couple the quantile normalisation step with the subsequent analysis, and to optimise the target distribution jointly with the other parameters in the analysis. We illustrate this principle on the problem of estimating a linear model over normalised data, and show that it leads to a particular low-rank matrix regression problem that can be solved efficiently. We illustrate the potential of our method, which we term SUQUAN, on simulated data, images and genomic data, where it outperforms standard quantile normalisation.
△ Less
Submitted 1 June, 2017;
originally announced June 2017.
-
French Roadmap for complex Systems 2008-2009
Authors:
Paul Bourgine,
David Chavalarias,
Edith Perrier,
Frederic Amblard,
Francois Arlabosse,
Pierre Auger,
Jean-Bernard Baillon,
Olivier Barreteau,
Pierre Baudot,
Elisabeth Bouchaud,
Soufian Ben Amor,
Hugues Berry,
Cyrille Bertelle,
Marc Berthod,
Guillaume Beslon,
Giulio Biroli,
Daniel Bonamy,
Daniele Bourcier,
Nicolas Brodu,
Marc Bui,
Yves Burnod,
Bertrand Chapron,
Catherine Christophe,
Bruno Clement,
Jean-Louis Coatrieux
, et al. (56 additional authors not shown)
Abstract:
This second issue of the French Complex Systems Roadmap is the outcome of the Entretiens de Cargese 2008, an interdisciplinary brainstorming session organized over one week in 2008, jointly by RNSC, ISC-PIF and IXXI. It capitalizes on the first roadmap and gathers contributions of more than 70 scientists from major French institutions. The aim of this roadmap is to foster the coordination of the…
▽ More
This second issue of the French Complex Systems Roadmap is the outcome of the Entretiens de Cargese 2008, an interdisciplinary brainstorming session organized over one week in 2008, jointly by RNSC, ISC-PIF and IXXI. It capitalizes on the first roadmap and gathers contributions of more than 70 scientists from major French institutions. The aim of this roadmap is to foster the coordination of the complex systems community on focused topics and questions, as well as to present contributions and challenges in the complex systems sciences and complexity science to the public, political and industrial spheres.
△ Less
Submitted 13 July, 2009;
originally announced July 2009.
-
A Distributed Trust Diffusion Protocol for Ad Hoc Networks
Authors:
Michel Morvan,
Sylvain Sené
Abstract:
In this paper, we propose and evaluate a distributed protocol to manage trust diffusion in ad hoc networks. In this protocol, each node i maintains a \trust value" about an other node j which is computed both as a result of the exchanges with node j itself and as a function of the opinion that other nodes have about j. These two aspects are respectively weighted by a trust index that measures th…
▽ More
In this paper, we propose and evaluate a distributed protocol to manage trust diffusion in ad hoc networks. In this protocol, each node i maintains a \trust value" about an other node j which is computed both as a result of the exchanges with node j itself and as a function of the opinion that other nodes have about j. These two aspects are respectively weighted by a trust index that measures the trust quality the node has in its own experiences and by a trust index representing the trust the node has in the opinions of the other nodes. Simulations have been realized to validate the robustness of this protocol against three kinds of attacks: simple coalitions, Trojan attacks and detonator attacks.
△ Less
Submitted 21 January, 2009;
originally announced January 2009.
-
Coalescing Cellular Automata -- Synchronizing CA by Common Random Source and Varying Asynchronicity
Authors:
Jean-Baptiste Rouquier,
Michel Morvan
Abstract:
We say that a Cellular Automata (CA) is coalescing when its execution on two distinct (random) initial configurations in the same asynchronous mode (the same cells are updated in each configuration at each time step) makes both configurations become identical after a reasonable time.
We prove coalescence for two elementary rules, non coalescence for two other, and show that there exists infini…
▽ More
We say that a Cellular Automata (CA) is coalescing when its execution on two distinct (random) initial configurations in the same asynchronous mode (the same cells are updated in each configuration at each time step) makes both configurations become identical after a reasonable time.
We prove coalescence for two elementary rules, non coalescence for two other, and show that there exists infinitely many coalescing CA. We then conduct an experimental study on all elementary CA and show that some rules exhibit a phase transition, which belongs to the universality class of directed percolation.
△ Less
Submitted 12 December, 2007;
originally announced December 2007.
-
Coalescing Cellular Automata
Authors:
Jean-Baptiste Rouquier,
Michel Morvan
Abstract:
We say that a Cellular Automata (CA) is coalescing when its execution on two distinct (random) initial configurations in the same asynchronous mode (the same cells are updated in each configuration at each time step) makes both configurations become identical after a reasonable time. We prove coalescence for two elementary rules and show that there exists infinitely many coalescing CA. We then c…
▽ More
We say that a Cellular Automata (CA) is coalescing when its execution on two distinct (random) initial configurations in the same asynchronous mode (the same cells are updated in each configuration at each time step) makes both configurations become identical after a reasonable time. We prove coalescence for two elementary rules and show that there exists infinitely many coalescing CA. We then conduct an experimental study on all elementary CA and show that some rules exhibit a phase transition, which belongs to the universality class of directed percolation.
△ Less
Submitted 4 October, 2006;
originally announced October 2006.
-
Stable Oxide Nanoparticle Clusters Obtained by Complexation
Authors:
J. -F. Berret,
A. Sehgal,
M. Morvan,
O. Sandre,
A. Vacher,
M. Airiau
Abstract:
We report on the electrostatic complexation between polyelectrolyte-neutral copolymers and oppositely charged 6 nm-crystalline nanoparticles. For two different dispersions of oxide nanoparticles, the electrostatic complexation gives rise to the formation of stable nanoparticle clusters in the range 20 - 100 nm. It is found that inside the clusters, the particles are pasted together by the polyel…
▽ More
We report on the electrostatic complexation between polyelectrolyte-neutral copolymers and oppositely charged 6 nm-crystalline nanoparticles. For two different dispersions of oxide nanoparticles, the electrostatic complexation gives rise to the formation of stable nanoparticle clusters in the range 20 - 100 nm. It is found that inside the clusters, the particles are pasted together by the polyelectrolyte blocks adsorbed on their surface. Cryo-transmission electronic microscopy allows to visualize the clusters and to determine the probability distributions functions in size and in aggregation number. The comparison between light scattering and cryo-microscopy results suggests the existence of a polymer brush around the clusters.
△ Less
Submitted 8 August, 2006;
originally announced August 2006.
-
Polymer-Nanoparticle Complexes : from Dilute Solution to Solid State
Authors:
Jean-Francois Berret,
Kazuhiko Yokota,
Mikel Morvan,
Ralf Schweins
Abstract:
We report on the formation and the structural properties of supermicellar aggregates also called electrostatic complexes, made from mineral nanoparticles and polyelectrolyte-neutral block copolymers in aqueous solutions. The mineral particles put under scrutiny are ultra-fine and positively charged yttrium hydroxyacetate nanoparticles. Combining light, neutron and x-ray scattering experiments, w…
▽ More
We report on the formation and the structural properties of supermicellar aggregates also called electrostatic complexes, made from mineral nanoparticles and polyelectrolyte-neutral block copolymers in aqueous solutions. The mineral particles put under scrutiny are ultra-fine and positively charged yttrium hydroxyacetate nanoparticles. Combining light, neutron and x-ray scattering experiments, we have characterized the sizes and the aggregation numbers of the organic-inorganic complexes. We have found that the hybrid aggregates have typical sizes in the range 100 nm and exhibit a remarkable colloidal stability with respect to ionic strength and concentration variations. Solid films with thicknesses up to several hundreds of micrometers were cast from solutions, resulting in a bulk polymer matrix in which nanoparticle clusters are dispersed and immobilized. It was found in addition that the structure of the complexes remains practically unchanged during film casting.
△ Less
Submitted 7 July, 2006;
originally announced July 2006.
-
Precipitation-Redispersion of Cerium Oxide Nanoparticles with Poly(Acrylic Acid) : Towards Stable Dispersions
Authors:
A. Sehgal,
Y. Lalatonne,
J. -F. Berret,
M. Morvan
Abstract:
We exploit a precipitation-redispersion mechanism for complexation of short chain polyelectrolytes with cerium oxide nanoparticles to extend their stability ranges. As synthesized, cerium oxide sols at pH 1.4 consist of monodisperse cationic nanocrystalline particles having a hydrodynamic diameter of 10 nm and a molecular weight 400000 gmol-1. We show that short chain uncharged poly(acrylic acid…
▽ More
We exploit a precipitation-redispersion mechanism for complexation of short chain polyelectrolytes with cerium oxide nanoparticles to extend their stability ranges. As synthesized, cerium oxide sols at pH 1.4 consist of monodisperse cationic nanocrystalline particles having a hydrodynamic diameter of 10 nm and a molecular weight 400000 gmol-1. We show that short chain uncharged poly(acrylic acid) at low pH when added to a cerium oxide sols leads to macroscopic precipitation. As the pH is increased, the solution spontaneously redisperses into a clear solution of single particles with an anionic poly(acrylic acid) corona. The structure and dynamics of cerium oxide nanosols and their hybrid polymer-inorganic complexes in solution are investigated by static and dynamic light scattering, X-ray scattering, and by chemical analysis. Quantitative analysis of the redispersed sol gives rise to an estimate of 40 - 50 polymer chains per particle for stable suspension. This amount represents 20 % of the mass of the polymer-nanoparticle complexes. This complexation adds utility to the otherwise unstable cerium oxide dispersions by extending the range of stability of the sols in terms of pH, ionic strength and concentration.
△ Less
Submitted 21 July, 2005; v1 submitted 11 July, 2005;
originally announced July 2005.
-
Electrostatic Self-assembly : A New Route Towards Nanostructures
Authors:
J. -F. Berret,
P. Herve,
M. Morvan,
K. Yokota,
M. Destarac,
J. Oberdisse,
I. Grillo,
R. Schweins
Abstract:
During the last 3 years, our group has investigated extensively the complexation mechanism between neutral-polyelectrolyte block copolymers with oppositely charged species. These species are surfactant micelles, multivalent counterions and inorganic nanoparticles. In the three cases, we have established the thermodynamical phase diagram of these systems, and found broad regions where supramolecu…
▽ More
During the last 3 years, our group has investigated extensively the complexation mechanism between neutral-polyelectrolyte block copolymers with oppositely charged species. These species are surfactant micelles, multivalent counterions and inorganic nanoparticles. In the three cases, we have established the thermodynamical phase diagram of these systems, and found broad regions where supramolecular aggregates spontaneously form via electrostatic self-assembly. From earlier works, it was suspected that these mixed colloids exhibit a core-shell structure. However, their inner structure was unveiled by us only recently, using a combination of light, neutron and x-ray scattering experiments.
△ Less
Submitted 5 January, 2005;
originally announced January 2005.
-
Interactions between Polymers and Nanoparticles : Formation of Supermicellar Hybrid Aggregates
Authors:
J. -F. Berret,
K. Yokota,
M. Morvan
Abstract:
When polyelectrolyte-neutral block copolymers are mixed in solutions to oppositely charged species (e.g. surfactant micelles, macromolecules, proteins etc), there is the formation of stable supermicellar aggregates combining both components. The resulting colloidal complexes exhibit a core-shell structure and the mechanism yielding to their formation is electrostatic self-assembly. In this contr…
▽ More
When polyelectrolyte-neutral block copolymers are mixed in solutions to oppositely charged species (e.g. surfactant micelles, macromolecules, proteins etc), there is the formation of stable supermicellar aggregates combining both components. The resulting colloidal complexes exhibit a core-shell structure and the mechanism yielding to their formation is electrostatic self-assembly. In this contribution, we report on the structural properties of supermicellar aggregates made from yttrium-based inorganic nanoparticles (radius 2 nm) and polyelectrolyte-neutral block copolymers in aqueous solutions. The yttrium hydroxyacetate particles were chosen as a model system for inorganic colloids, and also for their use in industrial applications as precursors for ceramic and opto-electronic materials. The copolymers placed under scrutiny are the water soluble and asymmetric poly(sodium acrylate)poly(acrylamide) diblocks. Using static and dynamical light scattering experiments, we demonstrate the analogy between surfactant micelles and nanoparticles in the complexation phenomenon with oppositely charged polymers. We also determine the sizes and the aggregation numbers of the hybrid organic-inorganic complexes. Several additional properties are discussed, such as the remarkable stability of the hybrid aggregates and the dependence of their sizes on the mixing conditions.
△ Less
Submitted 26 November, 2004;
originally announced November 2004.
-
Stabilization and Controlled Association of Inorganic Nanoparticles using Block Copolymers
Authors:
K. Yokota,
M. Morvan,
J. -F. Berret,
J. Oberdisse
Abstract:
We report on the structural properties of mixed aggregates made from rare-earth inorganic nanoparticles (radius 20 Angstroms) and polyelectrolyte-neutral block copolymers in aqueous solutions. Using scattering experiments and Monte Carlo simulations, we show that these mixed aggregates have a hierarchical core-shell microstructure. The core is made of densely packed nanoparticles and it is surro…
▽ More
We report on the structural properties of mixed aggregates made from rare-earth inorganic nanoparticles (radius 20 Angstroms) and polyelectrolyte-neutral block copolymers in aqueous solutions. Using scattering experiments and Monte Carlo simulations, we show that these mixed aggregates have a hierarchical core-shell microstructure. The core is made of densely packed nanoparticles and it is surrounded by a corona of neutral chains. This microstructure results from a process of controlled association and confers to the hybrid aggregates a remarkable colloidal stability.
△ Less
Submitted 26 November, 2004;
originally announced November 2004.
-
Perturbing the topology of the Game of Life increases its robustness to asynchrony
Authors:
Nazim A. Fates,
Michel Morvan
Abstract:
An experimental analysis of the asynchronous version of the "Game of Life" is performed to estimate how topology perturbations modify its evolution. We focus on the study of a phase transition from an "inactive-sparse phase" to a "labyrinth phase" and produce experimental data to quantify these changes as a function of the density of the initial configuration, the value of the synchrony rate, an…
▽ More
An experimental analysis of the asynchronous version of the "Game of Life" is performed to estimate how topology perturbations modify its evolution. We focus on the study of a phase transition from an "inactive-sparse phase" to a "labyrinth phase" and produce experimental data to quantify these changes as a function of the density of the initial configuration, the value of the synchrony rate, and the topology missing-link rate. An interpretation of the experimental results is given using the hypothesis that initial "germs" colonize the whole lattice and the validity of this hypothesis is tested.
△ Less
Submitted 17 July, 2004; v1 submitted 27 May, 2004;
originally announced May 2004.
-
An Experimental Study of Robustness to Asynchronism for Elementary Cellular Automata
Authors:
Nazim A. Fates,
Michel Morvan
Abstract:
Cellular Automata (CA) are a class of discrete dynamical systems that have been widely used to model complex systems in which the dynamics is specified at local cell-scale. Classically, CA are run on a regular lattice and with perfect synchronicity. However, these two assumptions have little chance to truthfully represent what happens at the microscopic scale for physical, biological or social s…
▽ More
Cellular Automata (CA) are a class of discrete dynamical systems that have been widely used to model complex systems in which the dynamics is specified at local cell-scale. Classically, CA are run on a regular lattice and with perfect synchronicity. However, these two assumptions have little chance to truthfully represent what happens at the microscopic scale for physical, biological or social systems. One may thus wonder whether CA do keep their behavior when submitted to small perturbations of synchronicity.
This work focuses on the study of one-dimensional (1D) asynchronous CA with two states and nearest-neighbors. We define what we mean by ``the behavior of CA is robust to asynchronism'' using a statistical approach with macroscopic parameters. and we present an experimental protocol aimed at finding which are the robust 1D elementary CA. To conclude, we examine how the results exposed can be used as a guideline for the research of suitable models according to robustness criteria.
△ Less
Submitted 13 February, 2004; v1 submitted 11 February, 2004;
originally announced February 2004.
-
The structure of Chip Firing Games and related models
Authors:
Eric Goles,
Michel Morvan,
Ha Duong Phan
Abstract:
In this paper, we study the dynamics of sand grains falling in sand piles. Usually sand piles are characterized by a decreasing integer partition and grain moves are described in terms of transitions between such partitions. We study here four main transition rules. The more classical one, introduced by Brylawski (1973) induces a lattice structure $L_B (n)$ (called dominance ordering) between de…
▽ More
In this paper, we study the dynamics of sand grains falling in sand piles. Usually sand piles are characterized by a decreasing integer partition and grain moves are described in terms of transitions between such partitions. We study here four main transition rules. The more classical one, introduced by Brylawski (1973) induces a lattice structure $L_B (n)$ (called dominance ordering) between decreasing partitions of a given integer n. We prove that a more restrictive transition rule, called SPM rule, induces a natural partition of L_B (n) in suborders, each one associated to a fixed point for SPM rule. In the second part, we extend the SPM rule in a natural way and obtain a model called Chip Firing Game (Goles and Kiwi, 1993). We prove that this new model has interesting properties: the induced order is a lattice, a natural greedoid can be associated to the model and it also defines a strongly convergent game. In the last section, we generalize the SPM rule in another way and obtain other lattice structure parametrized by some t: L(n,t), which form for -n+2 <= t <= n a decreasing sequence of lattices. For each t, we characterize the fixed point of L(n,t) and give the value of its maximal sized chain's lenght. We also note that L(n,-n+2) is the lattice of all compositions of n.
△ Less
Submitted 31 October, 2000;
originally announced October 2000.
-
Lattice Structure and Convergence of a Game of Cards
Authors:
Eric Goles,
Michel Morvan,
Ha Duong Phan
Abstract:
This paper is devoted to the study of the dynamics of a discrete system related to some self stabilizing protocol on a ring of processors.
This paper is devoted to the study of the dynamics of a discrete system related to some self stabilizing protocol on a ring of processors.
△ Less
Submitted 31 October, 2000;
originally announced October 2000.
-
Structure of some sand pile model
Authors:
M. Latapy,
R. Mantaci,
M. Morvan,
H. D. Phan
Abstract:
SPM (Sand Pile Model) is a simple discrete dynamical system used in physics to represent granular objects. It is deeply related to integer partitions, and many other combinatorics problems, such as tilings or rewriting systems. The evolution of the system started with n stacked grains generates a lattice, denoted by SPM(n). We study here the structure of this lattice. We first explain how it can…
▽ More
SPM (Sand Pile Model) is a simple discrete dynamical system used in physics to represent granular objects. It is deeply related to integer partitions, and many other combinatorics problems, such as tilings or rewriting systems. The evolution of the system started with n stacked grains generates a lattice, denoted by SPM(n). We study here the structure of this lattice. We first explain how it can be constructed, by showing its strong self-similarity property. Then, we define SPM(infini), a natural extension of SPM when one starts with an infinite number of grains. Again, we give an efficient construction algorithm and a coding of this lattice using a self-similar tree. The two approaches give different recursive formulae for the cardinal of SPM(n), where no closed formula have ever been found.
△ Less
Submitted 2 August, 2000;
originally announced August 2000.