-
PainNet: Statistical Relation Network with Episode-Based Training for Pain Estimation
Authors:
Mina Bishay,
Graham Page,
Mohammad Mavadati
Abstract:
Despite the span in estimating pain from facial expressions, limited works have focused on estimating the sequence-level pain, which is reported by patients and used commonly in clinics. In this paper, we introduce a novel Statistical Relation Network, referred to as PainNet, designed for the estimation of the sequence-level pain. PainNet employs two key modules, the embedding and the relation mod…
▽ More
Despite the span in estimating pain from facial expressions, limited works have focused on estimating the sequence-level pain, which is reported by patients and used commonly in clinics. In this paper, we introduce a novel Statistical Relation Network, referred to as PainNet, designed for the estimation of the sequence-level pain. PainNet employs two key modules, the embedding and the relation modules, for comparing pairs of pain videos, and producing relation scores indicating if each pair belongs to the same pain category or not. At the core of the embedding module is a statistical layer mounted on the top of a RNN for extracting compact video-level features. The statistical layer is implemented as part of the deep architecture. Doing so, allows combining multiple training stages used in previous research, into a single end-to-end training stage. PainNet is trained using the episode-based training scheme, which involves comparing a query video with a set of videos representing the different pain categories. Experimental results show the benefit of using the statistical layer and the episode-based training in the proposed model. Furthermore, PainNet outperforms the state-of-the-art results on self-reported pain estimation.
△ Less
Submitted 8 April, 2025;
originally announced April 2025.
-
Monitoring Viewer Attention During Online Ads
Authors:
Mina Bishay,
Graham Page,
Waleed Emad,
Mohammad Mavadati
Abstract:
Nowadays, video ads spread through numerous online platforms, and are being watched by millions of viewers worldwide. Big brands gauge the liking and purchase intent of their new ads, by analyzing the facial responses of viewers recruited online to watch the ads from home or work. Although this approach captures naturalistic responses, it is susceptible to distractions inherent in the participants…
▽ More
Nowadays, video ads spread through numerous online platforms, and are being watched by millions of viewers worldwide. Big brands gauge the liking and purchase intent of their new ads, by analyzing the facial responses of viewers recruited online to watch the ads from home or work. Although this approach captures naturalistic responses, it is susceptible to distractions inherent in the participants' environments, such as a movie playing on TV, a colleague speaking, or mobile notifications. Inattentive participants should get flagged and eliminated to avoid skewing the ad-testing process. In this paper we introduce an architecture for monitoring viewer attention during online ads. Leveraging two behavior analysis toolkits; AFFDEX 2.0 and SmartEye SDK, we extract low-level facial features encompassing facial expressions, head pose, and gaze direction. These features are then combined to extract high-level features that include estimated gaze on the screen plane, yawning, speaking, etc -- this enables the identification of four primary distractors; off-screen gaze, drowsiness, speaking, and unattended screen. Our architecture tailors the gaze settings according to the device type (desktop or mobile). We validate our architecture first on datasets annotated for specific distractors, and then on a real-world ad testing dataset with various distractors. The proposed architecture shows promising results in detecting distraction across both desktop and mobile devices.
△ Less
Submitted 8 April, 2025;
originally announced April 2025.
-
WindsorML: High-Fidelity Computational Fluid Dynamics Dataset For Automotive Aerodynamics
Authors:
Neil Ashton,
Jordan B. Angel,
Aditya S. Ghate,
Gaetan K. W. Kenway,
Man Long Wong,
Cetin Kiris,
Astrid Walle,
Danielle C. Maddix,
Gary Page
Abstract:
This paper presents a new open-source high-fidelity dataset for Machine Learning (ML) containing 355 geometric variants of the Windsor body, to help the development and testing of ML surrogate models for external automotive aerodynamics. Each Computational Fluid Dynamics (CFD) simulation was run with a GPU-native high-fidelity Wall-Modeled Large-Eddy Simulations (WMLES) using a Cartesian immersed-…
▽ More
This paper presents a new open-source high-fidelity dataset for Machine Learning (ML) containing 355 geometric variants of the Windsor body, to help the development and testing of ML surrogate models for external automotive aerodynamics. Each Computational Fluid Dynamics (CFD) simulation was run with a GPU-native high-fidelity Wall-Modeled Large-Eddy Simulations (WMLES) using a Cartesian immersed-boundary method using more than 280M cells to ensure the greatest possible accuracy. The dataset contains geometry variants that exhibits a wide range of flow characteristics that are representative of those observed on road-cars. The dataset itself contains the 3D time-averaged volume & boundary data as well as the geometry and force & moment coefficients. This paper discusses the validation of the underlying CFD methods as well as contents and structure of the dataset. To the authors knowledge, this represents the first, large-scale high-fidelity CFD dataset for the Windsor body with a permissive open-source license (CC-BY-SA).
△ Less
Submitted 16 January, 2025; v1 submitted 27 July, 2024;
originally announced July 2024.
-
Bayesian Inverse Ising Problem with Three-body Interactions
Authors:
Godwin Osabutey,
Robert Richardson,
Garritt L. Page
Abstract:
In this paper, we solve the inverse Ising problem with three-body interaction. Using the mean-field approximation, we find a tractable expansion of the normalizing constant. This facilitates estimation, which is known to be quite challenging for the Ising model. We then develop a novel hybrid MCMC algorithm that integrates Adaptive Metropolis Hastings (AMH), Hamiltonian Monte Carlo (HMC), and the…
▽ More
In this paper, we solve the inverse Ising problem with three-body interaction. Using the mean-field approximation, we find a tractable expansion of the normalizing constant. This facilitates estimation, which is known to be quite challenging for the Ising model. We then develop a novel hybrid MCMC algorithm that integrates Adaptive Metropolis Hastings (AMH), Hamiltonian Monte Carlo (HMC), and the Manifold-Adjusted Langevin Algorithm (MALA), which converges quickly and mixes well. We demonstrate the robustness of our algorithm using data simulated with a structure under which parameter estimation is known to be challenging, such as in the presence of a phase transition and at the critical point of the system.
△ Less
Submitted 8 April, 2024;
originally announced April 2024.
-
"It's Sink or Swim'': Exploring Patients' Challenges and Tool Needs for Self-Management of Postoperative Acute Pain
Authors:
Souleima Zghab,
Gabrielle Pagé,
Mélanie Lussier,
Sylvain Bédard,
Jinghui Cheng
Abstract:
Poorly managed postoperative acute pain can have long-lasting negative impacts and pose a major healthcare issue. There is limited investigation to understand and address the unique needs of patients experiencing acute pain. In this paper, we tackle this gap through an interview study with 14 patients who recently underwent postoperative acute pain to understand their challenges in pain self-manag…
▽ More
Poorly managed postoperative acute pain can have long-lasting negative impacts and pose a major healthcare issue. There is limited investigation to understand and address the unique needs of patients experiencing acute pain. In this paper, we tackle this gap through an interview study with 14 patients who recently underwent postoperative acute pain to understand their challenges in pain self-management and their need for supportive tools. Our analysis identified various factors associated with the major aspects of acute pain self-management. Together, our findings indicated that tools for supporting these patients need to carefully consider information and support delivery to adapt to rapid changes in pain experiences, offer personalized and dynamic assistance that adapts to individual situations in context, and monitor emotion when promoting motivation. Overall, our work provided valuable knowledge to address the less-investigated but highly-needed problem of designing technology for the self-management of acute pain and similar health conditions.
△ Less
Submitted 26 January, 2024;
originally announced January 2024.
-
Informed Random Partition Models with Temporal Dependence
Authors:
Sally Paganin,
Garritt L. Page,
Fernando Andrés Quintana
Abstract:
Model-based clustering is a powerful tool that is often used to discover hidden structure in data by grouping observational units that exhibit similar response values. Recently, clustering methods have been developed that permit incorporating an ``initial'' partition informed by expert opinion. Then, using some similarity criteria, partitions different from the initial one are down weighted, i.e.…
▽ More
Model-based clustering is a powerful tool that is often used to discover hidden structure in data by grouping observational units that exhibit similar response values. Recently, clustering methods have been developed that permit incorporating an ``initial'' partition informed by expert opinion. Then, using some similarity criteria, partitions different from the initial one are down weighted, i.e. they are assigned reduced probabilities. These methods represent an exciting new direction of method development in clustering techniques. We add to this literature a method that very flexibly permits assigning varying levels of uncertainty to any subset of the partition. This is particularly useful in practice as there is rarely clear prior information with regards to the entire partition. Our approach is not based on partition penalties but considers individual allocation probabilities for each unit (e.g., locally weighted prior information). We illustrate the gains in prior specification flexibility via simulation studies and an application to a dataset concerning spatio-temporal evolution of ${\rm PM}_{10}$ measurements in Germany.
△ Less
Submitted 20 June, 2025; v1 submitted 24 November, 2023;
originally announced November 2023.
-
Regression with Variable Dimension Covariates
Authors:
Peter Mueller,
Fernando Andrés Quintana,
Garritt L. Page
Abstract:
Regression is one of the most fundamental statistical inference problems. A broad definition of regression problems is as estimation of the distribution of an outcome using a family of probability models indexed by covariates. Despite the ubiquitous nature of regression problems and the abundance of related methods and results there is a surprising gap in the literature. There are no well establis…
▽ More
Regression is one of the most fundamental statistical inference problems. A broad definition of regression problems is as estimation of the distribution of an outcome using a family of probability models indexed by covariates. Despite the ubiquitous nature of regression problems and the abundance of related methods and results there is a surprising gap in the literature. There are no well established methods for regression with a varying dimension covariate vectors, despite the common occurrence of such problems. In this paper we review some recent related papers proposing varying dimension regression by way of random partitions.
△ Less
Submitted 25 September, 2023;
originally announced September 2023.
-
Informed Bayesian Finite Mixture Models via Asymmetric Dirichlet Priors
Authors:
Garritt L. Page,
Massimo Ventrucci,
Maria Franco-Villoria
Abstract:
Finite mixture models are flexible methods that are commonly used for model-based clustering. A recent focus in the model-based clustering literature is to highlight the difference between the number of components in a mixture model and the number of clusters. The number of clusters is more relevant from a practical stand point, but to date, the focus of prior distribution formulation has been on…
▽ More
Finite mixture models are flexible methods that are commonly used for model-based clustering. A recent focus in the model-based clustering literature is to highlight the difference between the number of components in a mixture model and the number of clusters. The number of clusters is more relevant from a practical stand point, but to date, the focus of prior distribution formulation has been on the number of components. In light of this, we develop a finite mixture methodology that permits eliciting prior information directly on the number of clusters in an intuitive way. This is done by employing an asymmetric Dirichlet distribution as a prior on the weights of a finite mixture. Further, a penalized complexity motivated prior is employed for the Dirichlet shape parameter. We illustrate the ease to which prior information can be elicited via our construction and the flexibility of the resulting induced prior on the number of clusters. We also demonstrate the utility of our approach using numerical experiments and two real world data sets.
△ Less
Submitted 1 August, 2023;
originally announced August 2023.
-
A Projection Approach to Local Regression with Variable-Dimension Covariates
Authors:
Matthew J. Heiner,
Garritt L. Page,
Fernando Andrés Quintana
Abstract:
Incomplete covariate vectors are known to be problematic for estimation and inferences on model parameters, but their impact on prediction performance is less understood. We develop an imputation-free method that builds on a random partition model admitting variable-dimension covariates. Cluster-specific response models further incorporate covariates via linear predictors, facilitating estimation…
▽ More
Incomplete covariate vectors are known to be problematic for estimation and inferences on model parameters, but their impact on prediction performance is less understood. We develop an imputation-free method that builds on a random partition model admitting variable-dimension covariates. Cluster-specific response models further incorporate covariates via linear predictors, facilitating estimation of smooth prediction surfaces with relatively few clusters. We exploit marginalization techniques of Gaussian kernels to analytically project response distributions according to any pattern of missing covariates, yielding a local regression with internally consistent uncertainty propagation that utilizes only one set of coefficients per cluster. Aggressive shrinkage of these coefficients regulates uncertainty due to missing covariates. The method allows in- and out-of-sample prediction for any missingness pattern, even if the pattern in a new subject's incomplete covariate vector was not seen in the training data. We develop an MCMC algorithm for posterior sampling that improves a computationally expensive update for latent cluster allocation. Finally, we demonstrate the model's effectiveness for nonlinear point and density prediction under various circumstances by comparing with other recent methods for regression of variable dimensions on synthetic and real data.
△ Less
Submitted 28 February, 2023; v1 submitted 13 February, 2023;
originally announced February 2023.
-
Automatic Detection of Sentimentality from Facial Expressions
Authors:
Mina Bishay,
Jay Turcot,
Graham Page,
Mohammad Mavadati
Abstract:
Emotion recognition has received considerable attention from the Computer Vision community in the last 20 years. However, most of the research focused on analyzing the six basic emotions (e.g. joy, anger, surprise), with a limited work directed to other affective states. In this paper, we tackle sentimentality (strong feeling of heartwarming or nostalgia), a new emotional state that has few works…
▽ More
Emotion recognition has received considerable attention from the Computer Vision community in the last 20 years. However, most of the research focused on analyzing the six basic emotions (e.g. joy, anger, surprise), with a limited work directed to other affective states. In this paper, we tackle sentimentality (strong feeling of heartwarming or nostalgia), a new emotional state that has few works in the literature, and no guideline defining its facial markers. To this end, we first collect a dataset of 4.9K videos of participants watching some sentimental and non-sentimental ads, and then we label the moments evoking sentimentality in the ads. Second, we use the ad-level labels and the facial Action Units (AUs) activation across different frames for defining some weak frame-level sentimentality labels. Third, we train a Multilayer Perceptron (MLP) using the AUs activation for sentimentality detection. Finally, we define two new ad-level metrics for evaluating our model performance. Quantitative and qualitative results show promising results for sentimentality detection. To the best of our knowledge this is the first work to address the problem of sentimentality detection.
△ Less
Submitted 11 September, 2022;
originally announced September 2022.
-
Nonparametric Bayesian Approach to Treatment Ranking in Network Meta-Analysis with Application to Comparisons of Antidepressants
Authors:
Andrés F. Barrientos,
Garritt L. Page,
Lifeng Lin
Abstract:
Network meta-analysis is a powerful tool to synthesize evidence from independent studies and compare multiple treatments simultaneously. A critical task of performing a network meta-analysis is to offer ranks of all available treatment options for a specific disease outcome. Frequently, the estimated treatment rankings are accompanied by a large amount of uncertainty, suffer from multiplicity issu…
▽ More
Network meta-analysis is a powerful tool to synthesize evidence from independent studies and compare multiple treatments simultaneously. A critical task of performing a network meta-analysis is to offer ranks of all available treatment options for a specific disease outcome. Frequently, the estimated treatment rankings are accompanied by a large amount of uncertainty, suffer from multiplicity issues, and rarely permit ties. These issues make interpreting rankings problematic as they are often treated as absolute metrics. To address these shortcomings, we formulate a ranking strategy that adapts to scenarios with high order uncertainty by producing more conservative results. This improves the interpretability while simultaneously accounting for multiple comparisons. To admit ties between treatment effects, we also develop a Bayesian Nonparametric approach for network meta-analysis. The approach capitalizes on the induced clustering mechanism of Bayesian Nonparametric methods producing a positive probability that two treatment effects are equal. We demonstrate the utility of the procedure through numerical experiments and a network meta-analysis designed to study antidepressant treatments.
△ Less
Submitted 13 July, 2022;
originally announced July 2022.
-
AFFDEX 2.0: A Real-Time Facial Expression Analysis Toolkit
Authors:
Mina Bishay,
Kenneth Preston,
Matthew Strafuss,
Graham Page,
Jay Turcot,
Mohammad Mavadati
Abstract:
In this paper we introduce AFFDEX 2.0 - a toolkit for analyzing facial expressions in the wild, that is, it is intended for users aiming to; a) estimate the 3D head pose, b) detect facial Action Units (AUs), c) recognize basic emotions and 2 new emotional states (sentimentality and confusion), and d) detect high-level expressive metrics like blink and attention. AFFDEX 2.0 models are mainly based…
▽ More
In this paper we introduce AFFDEX 2.0 - a toolkit for analyzing facial expressions in the wild, that is, it is intended for users aiming to; a) estimate the 3D head pose, b) detect facial Action Units (AUs), c) recognize basic emotions and 2 new emotional states (sentimentality and confusion), and d) detect high-level expressive metrics like blink and attention. AFFDEX 2.0 models are mainly based on Deep Learning, and are trained using a large-scale naturalistic dataset consisting of thousands of participants from different demographic groups. AFFDEX 2.0 is an enhanced version of our previous toolkit [1], that is capable of tracking efficiently faces at more challenging conditions, detecting more accurately facial expressions, and recognizing new emotional states (sentimentality and confusion). AFFDEX 2.0 can process multiple faces in real time, and is working across the Windows and Linux platforms.
△ Less
Submitted 2 November, 2022; v1 submitted 24 February, 2022;
originally announced February 2022.
-
Using Joint Random Partition Models for Flexible Change Point Analysis in Multivariate Processes
Authors:
José J. Quinlan,
Garritt L. Page,
Luis M. Castro
Abstract:
Change point analyses are concerned with identifying positions of an ordered stochastic process that undergo abrupt local changes of some underlying distribution. When multiple processes are observed, it is often the case that information regarding the change point positions is shared across the different processes. This work describes a method that takes advantage of this type of information. Sin…
▽ More
Change point analyses are concerned with identifying positions of an ordered stochastic process that undergo abrupt local changes of some underlying distribution. When multiple processes are observed, it is often the case that information regarding the change point positions is shared across the different processes. This work describes a method that takes advantage of this type of information. Since the number and position of change points can be described through a partition with contiguous clusters, our approach develops a joint model for these types of partitions. We describe computational strategies associated with our approach and illustrate improved performance in detecting change points through a small simulation study. We then apply our method to a financial data set of emerging markets in Latin America and highlight interesting insights discovered due to the correlation between change point locations among these economies.
△ Less
Submitted 19 January, 2022;
originally announced January 2022.
-
Multi-frequency MRE for elasticity quantitation and optimal tissue discrimination: a two-platform liver fibrosis mimicking phantom study
Authors:
Fatiha Andoh,
Jin Long Yue,
Felicia Julea,
Marion Tardieu,
Camille Noûs,
Gwenaël Pagé,
Philippe Garteiser,
Bernard van Beers,
Xavier Maître,
Claire Pellot-barakat,
Van Beers
Abstract:
In the framework of algebraic inversion, Magnetic Resonance Elastography (MRE) repeatability, reproducibility and robustness were evaluated on extracted shear velocities (or elastic moduli). The same excitation system was implemented at two sites equipped with clinical MR scanners of 1.5 T and 3 T. A set of four elastic, isotropic, homogeneous calibrated phantoms of distinct elasticity representin…
▽ More
In the framework of algebraic inversion, Magnetic Resonance Elastography (MRE) repeatability, reproducibility and robustness were evaluated on extracted shear velocities (or elastic moduli). The same excitation system was implemented at two sites equipped with clinical MR scanners of 1.5 T and 3 T. A set of four elastic, isotropic, homogeneous calibrated phantoms of distinct elasticity representing the spectrum of liver fibrosis severity was mechanically characterized. The repeatability of the measurements and the reproducibility between the two platforms were found to be excellent with mean coefficients of variations of 1.62% for the shear velocity mean values and 1.95% for the associated standard deviations. MRE velocities were robust to the amplitude and pattern variations of the displacement field with virtually no difference between outcomes from both magnets at identical excitation frequencies even when the displacement field amplitude was 6 times smaller. However, MRE outcomes were very sensitive to the number of voxels per wavelength, s, of the recorded displacement field, with relative biases reaching 62% and precision losing up to a factor 23.5. For both magnetic field strengths, MRE accuracy and precision were largely degraded outside of established conditions of validity ($6 \lesssim s \lesssim 9$) resulting in estimated shear velocity values not significantly different between phantoms of increasing elasticity. When fulfilling the spatial sampling conditions, either prospectively in the acquisition or retrospectively before the reconstruction, MRE produced quantitative measurements that allowed to unambiguously discriminate, with infinitesimal p-values, between the phantoms mimicking increasing severity of liver fibrosis.
△ Less
Submitted 6 September, 2021;
originally announced September 2021.
-
A spectral adjustment for spatial confounding
Authors:
Yawen Guan,
Garritt L. Page,
Brian J Reich,
Massimo Ventrucci,
Shu Yang
Abstract:
Adjusting for an unmeasured confounder is generally an intractable problem, but in the spatial setting it may be possible under certain conditions. In this paper, we derive necessary conditions on the coherence between the treatment variable of interest and the unmeasured confounder that ensure the causal effect of the treatment is estimable. We specify our model and assumptions in the spectral do…
▽ More
Adjusting for an unmeasured confounder is generally an intractable problem, but in the spatial setting it may be possible under certain conditions. In this paper, we derive necessary conditions on the coherence between the treatment variable of interest and the unmeasured confounder that ensure the causal effect of the treatment is estimable. We specify our model and assumptions in the spectral domain to allow for different degrees of confounding at different spatial resolutions. The key assumption that ensures identifiability is that confounding present at global scales dissipates at local scales. We show that this assumption in the spectral domain is equivalent to adjusting for global-scale confounding in the spatial domain by adding a spatially smoothed version of the treatment variable to the mean of the response variable. Within this general framework, we propose a sequence of confounder adjustment methods that range from parametric adjustments based on the Matern coherence function to more robust semi-parametric methods that use smoothing splines. These ideas are applied to areal and geostatistical data for both simulated and real datasets
△ Less
Submitted 21 December, 2020;
originally announced December 2020.
-
The XMM-Newton serendipitous survey IX. The fourth XMM-Newton serendipitous source catalogue
Authors:
N. A. Webb,
M. Coriat,
I. Traulsen,
J. Ballet,
C. Motch,
F. J. Carrera,
F. Koliopanos,
J. Authier,
I. de la Calle,
M. T. Ceballos,
E. Colomo,
D. Chuard,
M. Freyberg,
T. Garcia,
M. Kolehmainen,
G. Lamer,
D. Lin,
P. Maggi,
L. Michel,
C. G. Page,
M. J. Page,
J. V. Perea-Calderon,
F. -X. Pineau,
P. Rodriguez,
S. R. Rosen
, et al. (6 additional authors not shown)
Abstract:
Sky surveys produce enormous quantities of data on extensive regions of the sky. The easiest way to access this information is through catalogues of standardised data products. {\em XMM-Newton} has been surveying the sky in the X-ray, ultra-violet, and optical bands for 20 years. The {\em XMM-Newton} Survey Science Centre has been producing standardised data products and catalogues to facilitate a…
▽ More
Sky surveys produce enormous quantities of data on extensive regions of the sky. The easiest way to access this information is through catalogues of standardised data products. {\em XMM-Newton} has been surveying the sky in the X-ray, ultra-violet, and optical bands for 20 years. The {\em XMM-Newton} Survey Science Centre has been producing standardised data products and catalogues to facilitate access to the serendipitous X-ray sky. Using improved calibration and enhanced software, we re-reduced all of the 14041 {\em XMM-Newton} X-ray observations, of which 11204 observations contained data with at least one detection and with these we created a new, high quality version of the {\em XMM-Newton} serendipitous source catalogue, 4XMM-DR9. 4XMM-DR9 contains 810795 detections down to a detection significance of 3 $σ$, of which 550124 are unique sources, which cover 1152 degrees$^{2}$ (2.85\%) of the sky. Filtering 4XMM-DR9 to retain only the cleanest sources with at least a 5 $σ$ detection significance leaves 433612 detections. Of these detections, 99.6\% have no pileup. Furthermore, 336 columns of information on each detection are provided, along with images. The quality of the source detection is shown to have improved significantly with respect to previous versions of the catalogues. Spectra and lightcurves are also made available for more than 288000 of the brightest sources (36\% of all detections).
△ Less
Submitted 6 July, 2020;
originally announced July 2020.
-
Clustering and Prediction with Variable Dimension Covariates
Authors:
Garritt L. Page,
Fernando A. Quintana,
Peter Müller
Abstract:
In many applied fields incomplete covariate vectors are commonly encountered. It is well known that this can be problematic when making inference on model parameters, but its impact on prediction performance is less understood. We develop a method based on covariate dependent partition models that seamlessly handles missing covariates while completely avoiding any type of imputation. The method we…
▽ More
In many applied fields incomplete covariate vectors are commonly encountered. It is well known that this can be problematic when making inference on model parameters, but its impact on prediction performance is less understood. We develop a method based on covariate dependent partition models that seamlessly handles missing covariates while completely avoiding any type of imputation. The method we develop allows in-sample predictions as well as out-of-sample prediction, even if the missing pattern in the new subjects' incomplete covariate vector was not seen in the training data. Any data type, including categorical or continuous covariates are permitted. In simulation studies the proposed method compares favorably. We illustrate the method in two application examples.
△ Less
Submitted 12 July, 2020; v1 submitted 30 December, 2019;
originally announced December 2019.
-
Dependent Modeling of Temporal Sequences of Random Partitions
Authors:
Garritt L. Page,
Fernando A. Quintana,
David B. Dahl
Abstract:
We consider the task of modeling a dependent sequence of random partitions. It is well-known that a random measure in Bayesian nonparametrics induces a distribution over random partitions. The community has therefore assumed that the best approach to obtain a dependent sequence of random partitions is through modeling dependent random measures. We argue that this approach is problematic and show t…
▽ More
We consider the task of modeling a dependent sequence of random partitions. It is well-known that a random measure in Bayesian nonparametrics induces a distribution over random partitions. The community has therefore assumed that the best approach to obtain a dependent sequence of random partitions is through modeling dependent random measures. We argue that this approach is problematic and show that the random partition model induced by dependent Bayesian nonparametric priors exhibit counter-intuitive dependence among partitions even though the dependence for the sequence of random probability measures is intuitive. Because of this, we advocate instead to model the sequence of random partitions directly when clustering is of principal interest. To this end, we develop a class of dependent random partition models that explicitly models dependence in a sequence of partitions. We derive conditional and marginal properties of the joint partition model and devise computational strategies when employing the method in Bayesian modeling. In the case of temporal dependence, we demonstrate through simulation how the methodology produces partitions that evolve gently and naturally over time. We further illustrate the utility of the method by applying it to an environmental data set that exhibits spatio-temporal dependence.
△ Less
Submitted 30 July, 2021; v1 submitted 24 December, 2019;
originally announced December 2019.
-
Bayesian inferences on uncertain ranks and orderings: Application to ranking players and lineups
Authors:
Andres F. Barrientos,
Deborshee Sen,
Garritt L Page,
David B Dunson
Abstract:
It is common to be interested in rankings or order relationships among entities. In complex settings where one does not directly measure a univariate statistic upon which to base ranks, such inferences typically rely on statistical models having entity-specific parameters. These can be treated as random effects in hierarchical models characterizing variation among the entities. In this paper, we a…
▽ More
It is common to be interested in rankings or order relationships among entities. In complex settings where one does not directly measure a univariate statistic upon which to base ranks, such inferences typically rely on statistical models having entity-specific parameters. These can be treated as random effects in hierarchical models characterizing variation among the entities. In this paper, we are particularly interested in the problem of ranking basketball players in terms of their contribution to team performance. Using data from the United States National Basketball Association (NBA), we find that many players have similar latent ability levels, making any single estimated ranking highly misleading. The current literature fails to provide summaries of order relationships that adequately account for uncertainty. Motivated by this, we propose a Bayesian strategy for characterizing uncertainty in inferences on order relationships among players and lineups. Our approach adapts to scenarios in which uncertainty in ordering is high by producing more conservative results that improve interpretability. This is achieved through a reward function within a decision theoretic framework. We apply our approach to data from the 2009-10 NBA season.
△ Less
Submitted 11 April, 2022; v1 submitted 10 July, 2019;
originally announced July 2019.
-
Unraveling radial dependency effects in fiber thermal drawing
Authors:
Alexis G. Page,
Mathias Bechert,
François Gallaire,
Fabien Sorin
Abstract:
Fiber-based devices with advanced functionalities are emerging as promising solutions for various applications in flexible electronics and bioengineering. Multimaterial thermal drawing, in particular, has attracted strong interest for its ability to generate fibers with complex architectures. Thus far, however, the understanding of its fluid dynamics has only been applied to single material prefor…
▽ More
Fiber-based devices with advanced functionalities are emerging as promising solutions for various applications in flexible electronics and bioengineering. Multimaterial thermal drawing, in particular, has attracted strong interest for its ability to generate fibers with complex architectures. Thus far, however, the understanding of its fluid dynamics has only been applied to single material preforms for which higher order effects, such as the radial dependency of the axial velocity, could be neglected. With complex multimaterial preforms, such effects must be taken into account, as they can affect the architecture and the functional properties of the resulting fiber device. Here, we propose a versatile model of the thermal drawing of fibers, which takes into account a radially varying axial velocity. Unlike the commonly used cross section averaged approach, our model is capable of predicting radial variations of functional properties caused by the deformation during drawing. This is demonstrated for two effects observed, namely, by unraveling the deformation of initially straight, transversal lines in the preform and the dependence on the draw ratio and radial position of the in-fiber electrical conductivity of polymer nanocomposites, an important class of materials for emerging fiber devices. This work sets a thus far missing theoretical and practical understanding of multimaterial fiber processing to better engineer advanced fibers and textiles for sensing, health care, robotics, or bioengineering applications.
△ Less
Submitted 31 July, 2019; v1 submitted 4 February, 2019;
originally announced March 2019.
-
Recurrence dynamics of particulate transport with reversible blockage: from a single channel to a bundle of coupled channels
Authors:
Chloé Barré,
Gregory Page,
Julian Talbot,
Pascal Viot
Abstract:
We model a particulate flow of constant velocity through confined geometries, ranging from a single channel to a bundle of $N_c$ identical coupled channels, under conditions of reversible blockage. Quantities of interest include the exiting particle flux (or throughput) and the probability that the bundle is open. For a constant entering flux, the bundle evolves through a transient regime to a ste…
▽ More
We model a particulate flow of constant velocity through confined geometries, ranging from a single channel to a bundle of $N_c$ identical coupled channels, under conditions of reversible blockage. Quantities of interest include the exiting particle flux (or throughput) and the probability that the bundle is open. For a constant entering flux, the bundle evolves through a transient regime to a steady state. We present analytic solutions for the stationary properties of a single channel with capacity $N\le 3$ and for a bundle of channels each of capacity $N = 1$. For larger values of $N$ and $N_c$, the system's steady state behavior is explored by numerical simulation. Depending on the deblocking time, the exiting flux either increases monotonically with intensity or displays a maximum at a finite intensity. For large $N$ we observe an abrupt change from a state with few blockages to one in which the bundle is permanently blocked and the exiting flux is due entirely to the release of blocked particles. We also compare the relative efficiency of coupled and uncoupled bundles. For $N=1$ the coupled system is always more efficient, but for $N>1$ the behavior is more complex.
△ Less
Submitted 5 February, 2019;
originally announced February 2019.
-
Discovering Interactions Using Covariate Informed Random Partition Models
Authors:
Garritt L. Page,
Fernando A. Quintana,
Gary L. Rosner
Abstract:
Combination chemotherapy treatment regimens created for patients diagnosed with childhood acute lymphoblastic leukemia have had great success in improving cure rates. Unfortunately, patients prescribed these types of treatment regimens have displayed susceptibility to the onset of osteonecrosis. Some have suggested that this is due to pharmacokinetic interaction between two agents in the treatment…
▽ More
Combination chemotherapy treatment regimens created for patients diagnosed with childhood acute lymphoblastic leukemia have had great success in improving cure rates. Unfortunately, patients prescribed these types of treatment regimens have displayed susceptibility to the onset of osteonecrosis. Some have suggested that this is due to pharmacokinetic interaction between two agents in the treatment regimen (asparaginase and dexamethasone) and other physiological variables. Determining which physiological variables to consider when searching for interactions in scenarios like these, minus a priori guidance, has proved to be a challenging problem, particularly if interactions influence the response distribution in ways beyond shifts in expectation or dispersion only. In this paper we propose an exploratory technique that is able to discover associations between covariates and responses in a very general way. The procedure connects covariates to responses very flexibly through dependent random partition prior distributions, and then employs machine learning techniques to highlight potential associations found in each cluster. We provide a simulation study to show utility and apply the method to data produced from a study dedicated to learning which physiological predictors influence severity of osteonecrosis multiplicatively.
△ Less
Submitted 3 August, 2020; v1 submitted 28 September, 2018;
originally announced October 2018.
-
Stochastic models of multi-channel particulate transport with blockage
Authors:
Chloé Barré,
Gregory Page,
Julian Talbot,
Pascal Viot
Abstract:
Networks of channels conveying particles are often subject to blockages due to the limited carrying capacity of the individual channels. If the channels are coupled, blockage of one causes an increase in the flux entering the remaining open channels leading to a cascade of failures. Once all channels are blocked no additional particle can enter the system. If the blockages are of finite duration,…
▽ More
Networks of channels conveying particles are often subject to blockages due to the limited carrying capacity of the individual channels. If the channels are coupled, blockage of one causes an increase in the flux entering the remaining open channels leading to a cascade of failures. Once all channels are blocked no additional particle can enter the system. If the blockages are of finite duration, however, the system reaches a steady state with an exiting flux that is reduced compared to the incoming one. We propose a stochastic model consisting of $N_c$ channels each with a blocking threshold of $N$ particles. Particles enter the system according to a Poisson process with the entering flux of intensity $Λ$ equally distributed over the open channels. Any particle in an open channel exits at a rate $μ$ and a blocked channel unblocks at a rate $μ^*$. We present a method to obtain the exiting flux in the steady state, and other properties, for arbitrary $N_c$ and $N$ and we present explicit solutions for $N_c=2,3$. We apply these results to compare the efficiency of conveying a particulate stream of intensity $Λ$ using different channel configurations. We compare a single "robust" channel with a large capacity with multiple "fragile" channels with a proportionately reduced capacity. The "robust" channel is more efficient at low intensity, while multiple, "fragile" channels have a higher throughput at large intensity. We also compare $N_c$ coupled channels with $N_c$ independent channels, both with threshold $N=2$. For $N_c=2$ if $μ^*/μ>1/4$, the coupled channels are always more efficient. Otherwise the independent channels are more efficient for sufficiently large $Λ$.
△ Less
Submitted 26 March, 2018;
originally announced March 2018.
-
Optimizing the Throughput of Particulate Streams Subject to Blocking
Authors:
G. Page,
J. Resing,
P. Viot,
J. Talbot
Abstract:
Filtration, flow in narrow channels and traffic flow are examples of processes subject to blocking when the channel conveying the particles becomes too crowded. If the blockage is temporary, which means that after a finite time the channel is flushed and reopened, one expects to observe a maximum throughput for a finite intensity of entering particles. We investigate this phenomenon by introducing…
▽ More
Filtration, flow in narrow channels and traffic flow are examples of processes subject to blocking when the channel conveying the particles becomes too crowded. If the blockage is temporary, which means that after a finite time the channel is flushed and reopened, one expects to observe a maximum throughput for a finite intensity of entering particles. We investigate this phenomenon by introducing a queueing theory inspired, circular Markov model. Particles enter a channel with intensity $λ$ and exit at a rate $μ$. If $N$ particles are present at the same time in the channel, the system becomes blocked and no more particles can enter until the blockage is cleared after an exponentially distributed time with rate $μ^*$. We obtain an exact expression for the steady state throughput (including the exiting blocked particles) for all values of $N$. For $N=2$ we show that the throughput assumes a maximum value for finite $λ$ if $μ^*/μ< 1/4$. The time-dependent throughput either monotonically approaches the steady state value, or reaches a maximum value at finite time. We demonstrate that, in the steady state, this model can be mapped to a previously introduced non-Markovian model with fixed transit and blockage times.
We also examine an irreversible, non-Markovian blockage process with constant transit time exposed to an entering flux of fixed intensity for a finite time and we show that the first and second moments of the number of exiting particles are maximized for a finite intensity.
△ Less
Submitted 12 March, 2018;
originally announced March 2018.
-
Affinity-based measures of medical diagnostic test accuracy
Authors:
Miguel de Carvalho,
Bradley J. Barney,
Garritt L. Page
Abstract:
We propose new summary measures of diagnostic test accuracy which can be used as companions to existing diagnostic accuracy measures. Conceptually, our summary measures are tantamount to the so-called Hellinger affinity and we show that they can be regarded as measures of agreement constructed from similar geometrical principles as Pearson correlation. A covariate-specific version of our summary i…
▽ More
We propose new summary measures of diagnostic test accuracy which can be used as companions to existing diagnostic accuracy measures. Conceptually, our summary measures are tantamount to the so-called Hellinger affinity and we show that they can be regarded as measures of agreement constructed from similar geometrical principles as Pearson correlation. A covariate-specific version of our summary index is developed, which can be used to assess the discrimination performance of a diagnostic test, conditionally on the value of a predictor. Nonparametric Bayes estimators for the proposed indexes are devised, theoretical properties of the corresponding priors are derived, and the performance of our methods is assessed through a simulation study. Data from a prostate cancer diagnosis study are used to illustrate our methods.
△ Less
Submitted 28 December, 2017;
originally announced December 2017.
-
On the geometry of Bayesian inference
Authors:
Miguel de Carvalho,
Garritt L. Page,
Bradley J. Barney
Abstract:
We provide a geometric interpretation to Bayesian inference that allows us to introduce a natural measure of the level of agreement between priors, likelihoods, and posteriors. The starting point for the construction of our geometry is the simple observation that the marginal likelihood can be regarded as an inner product between the prior and the likelihood. A key concept in our geometry is that…
▽ More
We provide a geometric interpretation to Bayesian inference that allows us to introduce a natural measure of the level of agreement between priors, likelihoods, and posteriors. The starting point for the construction of our geometry is the simple observation that the marginal likelihood can be regarded as an inner product between the prior and the likelihood. A key concept in our geometry is that of compatibility, a measure which is based on the same construction principles as Pearson correlation, but which can be used to assess how much the prior agrees with the likelihood, to gauge the sensitivity of the posterior to the prior, and to quantify the coherency of the opinions of two experts. Estimators for all the quantities involved in our geometric setup are discussed, which can be directly computed from the posterior simulation output. Some examples are used to illustrate our methods, including data related to on-the-job drug usage, midge wing length, and prostate cancer.
△ Less
Submitted 23 May, 2018; v1 submitted 31 January, 2017;
originally announced January 2017.
-
Parsimonious Hierarchical Modeling Using Repulsive Distributions
Authors:
J. J. Quinlan,
F. A. Quintana,
G. L. Page
Abstract:
Employing nonparametric methods for density estimation has become routine in Bayesian statistical practice. Models based on discrete nonparametric priors such as Dirichlet Process Mixture (DPM) models are very attractive choices due to their flexibility and tractability. However, a common problem in fitting DPMs or other discrete models to data is that they tend to produce a large number of (somet…
▽ More
Employing nonparametric methods for density estimation has become routine in Bayesian statistical practice. Models based on discrete nonparametric priors such as Dirichlet Process Mixture (DPM) models are very attractive choices due to their flexibility and tractability. However, a common problem in fitting DPMs or other discrete models to data is that they tend to produce a large number of (sometimes) redundant clusters. In this work we propose a method that produces parsimonious mixture models (i.e. mixtures that discourage the creation of redundant clusters), without sacrificing flexibility or model fit. This method is based on the idea of repulsion, that is, that any two mixture components are encouraged to be well separated. We propose a family of d-dimensional probability densities whose coordinates tend to repel each other in a smooth way. The induced probability measure has a close relation with Gibbs measures, graph theory and point processes. We investigate its global properties and explore its use in the context of mixture models for density estimation. Computational techniques are detailed and we illustrate its usefulness with some well-known data sets and a small simulation study.
△ Less
Submitted 29 June, 2017; v1 submitted 16 January, 2017;
originally announced January 2017.
-
Predictions Based on the Clustering of Heterogeneous Functions via Shape and Subject-Specific Covariates
Authors:
Garritt L. Page,
Fernando A. Quintana
Abstract:
We consider a study of players employed by teams who are members of the National Basketball Association where units of observation are functional curves that are realizations of production measurements taken through the course of one's career. The observed functional output displays large amounts of between player heterogeneity in the sense that some individuals produce curves that are fairly smoo…
▽ More
We consider a study of players employed by teams who are members of the National Basketball Association where units of observation are functional curves that are realizations of production measurements taken through the course of one's career. The observed functional output displays large amounts of between player heterogeneity in the sense that some individuals produce curves that are fairly smooth while others are (much) more erratic. We argue that this variability in curve shape is a feature that can be exploited to guide decision making, learn about processes under study and improve prediction. In this paper we develop a methodology that takes advantage of this feature when clustering functional curves. Individual curves are flexibly modeled using Bayesian penalized B-splines while a hierarchical structure allows the clustering to be guided by the smoothness of individual curves. In a sense, the hierarchical structure balances the desire to fit individual curves well while still producing meaningful clusters that are used to guide prediction. We seamlessly incorporate available covariate information to guide the clustering of curves non-parametrically through the use of a product partition model prior for a random partition of individuals. Clustering based on curve smoothness and subject-specific covariate information is particularly important in carrying out the two types of predictions that are of interest, those that complete a partially observed curve from an active player, and those that predict the entire career curve for a player yet to play in the National Basketball Association.
△ Less
Submitted 11 May, 2015;
originally announced May 2015.
-
The XMM-Newton serendipitous survey. VII. The third XMM-Newton serendipitous source catalogue
Authors:
S. R. Rosen,
N. A. Webb,
M. G. Watson,
J. Ballet,
D. Barret,
V. Braito,
F. J. Carrera,
M. T. Ceballos,
M. Coriat,
R. Della Ceca,
G. Denkinson,
P. Esquej,
S. A. Farrell,
M. Freyberg,
F. Grisé,
P. Guillout,
L. Heil,
F. Koliopanos,
D. Law-Green,
G. Lamer,
D. Lin,
R. Martino,
L. Michel,
C. Motch,
A. Nebot Gomez-Moran
, et al. (15 additional authors not shown)
Abstract:
Thanks to the large collecting area (3 x ~1500 cm$^2$ at 1.5 keV) and wide field of view (30' across in full field mode) of the X-ray cameras on board the European Space Agency X-ray observatory XMM-Newton, each individual pointing can result in the detection of hundreds of X-ray sources, most of which are newly discovered. Recently, many improvements in the XMM-Newton data reduction algorithms ha…
▽ More
Thanks to the large collecting area (3 x ~1500 cm$^2$ at 1.5 keV) and wide field of view (30' across in full field mode) of the X-ray cameras on board the European Space Agency X-ray observatory XMM-Newton, each individual pointing can result in the detection of hundreds of X-ray sources, most of which are newly discovered. Recently, many improvements in the XMM-Newton data reduction algorithms have been made. These include enhanced source characterisation and reduced spurious source detections, refined astrometric precision, greater net sensitivity and the extraction of spectra and time series for fainter sources, with better signal-to-noise. Further, almost 50\% more observations are in the public domain compared to 2XMMi-DR3, allowing the XMM-Newton Survey Science Centre (XMM-SSC) to produce a much larger and better quality X-ray source catalogue. The XMM-SSC has developed a pipeline to reduce the XMM-Newton data automatically and using improved calibration a new catalogue version has been produced from XMM-Newton data made public by 2013 Dec. 31 (13 years of data). Manual screening ensures the highest data quality. This catalogue is known as 3XMM. In the latest release, 3XMM-DR5, there are 565962 X-ray detections comprising 396910 unique X-ray sources. For the 133000 brightest sources, spectra and lightcurves are provided. For all detections, the positions on the sky, a measure of the quality of the detection, and an evaluation of the X-ray variability is provided, along with the fluxes and count rates in 7 X-ray energy bands, the total 0.2-12 keV band counts, and four hardness ratios. To identify the detections, a cross correlation with 228 catalogues is also provided for each X-ray detection. 3XMM-DR5 is the largest X-ray source catalogue ever produced. Thanks to the large array of data products, it is an excellent resource in which to find new and extreme objects.
△ Less
Submitted 9 February, 2016; v1 submitted 27 April, 2015;
originally announced April 2015.
-
Spatial Product Partition Models
Authors:
Garritt L. Page,
Fernando A. Quintana
Abstract:
When modeling geostatistical or areal data, spatial structure is commonly accommodated via a covariance function for the former and a neighborhood structure for the latter. In both cases the resulting spatial structure is a consequence of implicit spatial grouping in that observations near in space are assumed to behave similarly. It would be desirable to develop spatial methods that explicitly mo…
▽ More
When modeling geostatistical or areal data, spatial structure is commonly accommodated via a covariance function for the former and a neighborhood structure for the latter. In both cases the resulting spatial structure is a consequence of implicit spatial grouping in that observations near in space are assumed to behave similarly. It would be desirable to develop spatial methods that explicitly model the partitioning of spatial locations providing more control over resulting spatial structures and being able to better balance global vs local spatial dependence. To this end, we extend product partition models to a spatial setting so that the partitioning of locations into spatially dependent clusters is explicitly modeled. We explore the spatial structures that result from employing a spatial product partition model and demonstrate its flexibility in accommodating many types of spatial dependencies. We illustrate the method's utility through simulation studies and an education application.
△ Less
Submitted 17 April, 2015;
originally announced April 2015.
-
Density Estimation and Classification via Bayesian Nonparametric Learning of Affine Subspaces
Authors:
Abhishek Bhattacharya,
Garritt Page,
David Dunson
Abstract:
It is now practically the norm for data to be very high dimensional in areas such as genetics, machine vision, image analysis and many others. When analyzing such data, parametric models are often too inflexible while nonparametric procedures tend to be non-robust because of insufficient data on these high dimensional spaces. It is often the case with high-dimensional data that most of the variabi…
▽ More
It is now practically the norm for data to be very high dimensional in areas such as genetics, machine vision, image analysis and many others. When analyzing such data, parametric models are often too inflexible while nonparametric procedures tend to be non-robust because of insufficient data on these high dimensional spaces. It is often the case with high-dimensional data that most of the variability tends to be along a few directions, or more generally along a much smaller dimensional submanifold of the data space. In this article, we propose a class of models that flexibly learn about this submanifold and its dimension which simultaneously performs dimension reduction. As a result, density estimation is carried out efficiently. When performing classification with a large predictor space, our approach allows the category probabilities to vary nonparametrically with a few features expressed as linear combinations of the predictors. As opposed to many black-box methods for dimensionality reduction, the proposed model is appealing in having clearly interpretable and identifiable parameters. Gibbs sampling methods are developed for posterior computation, and the methods are illustrated in simulated and real data applications.
△ Less
Submitted 28 May, 2011;
originally announced May 2011.
-
How Well Do We Know the Orbits of the Outer Planets?
Authors:
Gary L. Page,
John F. Wallin,
David S. Dixon
Abstract:
This paper deals with the problem of astrometric determination of the orbital elements of the outer planets, in particular by assessing the ability of astrometric observations to detect perturbations of the sort expected from the Pioneer effect or other small perturbations to gravity. We also show that while using simplified models of the dynamics can lead to some insights, one must be careful t…
▽ More
This paper deals with the problem of astrometric determination of the orbital elements of the outer planets, in particular by assessing the ability of astrometric observations to detect perturbations of the sort expected from the Pioneer effect or other small perturbations to gravity. We also show that while using simplified models of the dynamics can lead to some insights, one must be careful to not over-simplify the issues involved lest one be misled by the analysis onto false paths. Specifically, we show that the current ephemeris of Pluto does not preclude the existence of the Pioneer effect. We show that the orbit of Pluto is simply not well enough characterized at present to make such an assertion. A number of misunderstandings related to these topics have now propagated through the literature and have been used as a basis for drawing conclusions about the dynamics of the solar system. Thus, the objective of this paper is to address these issues. Finally, we offer some comments dealing with the complex topic of model selection and comparison.
△ Less
Submitted 30 April, 2009;
originally announced May 2009.
-
High precision X-ray logN-logS distributions: implications for the obscured AGN population
Authors:
S. Mateos,
R. S. Warwick,
F. J. Carrera,
G. C. Stewart,
J. Ebrero,
R. Della Ceca,
A. Caccianiga,
R. Gilli,
M. J. Page,
E. Treister,
J. A. Tedds,
M. G. Watson,
G. Lamer,
R. D. Saxton,
H. Brunner,
C. G. Page
Abstract:
We have constrained the extragalactic source count distributions over a broad range of X-ray fluxes and in various energy bands to test whether the predictions from X-ray background synthesis models agree with the observational constraints provided by our measurements. We have used 1129 XMM-Newton observations at |b|>20 deg covering a sky area of 132.3 deg^2 to compile the largest complete sampl…
▽ More
We have constrained the extragalactic source count distributions over a broad range of X-ray fluxes and in various energy bands to test whether the predictions from X-ray background synthesis models agree with the observational constraints provided by our measurements. We have used 1129 XMM-Newton observations at |b|>20 deg covering a sky area of 132.3 deg^2 to compile the largest complete samples of X-ray objects to date in the 0.5-1 keV, 1-2 keV, 2-4.5 keV, 4.5-10 keV, 0.5-2 keV and 2-10 keV energy bands. Our survey includes in excess of 30,000 sources down to ~10^-15 erg/cm^2/s below 2 keV and down to ~10^{-14} erg/cm^2/s above 2 keV. A break in the source count distributions was detected in all energy bands except the 4.5-10 keV band. An analytical model comprising 2 power-law components cannot adequately describe the curvature seen in the source count distributions. The shape of the logN(>S)-logS is strongly dependent on the energy band with a general steepening apparent as we move to higher energies. This is due to non-AGN populations, comprised mainly of stars and clusters of galaxies, contribute up to 30% of the source population at energies <2 keV and at fluxes >10^{-13} erg/cm^2/s, and these populations of objects have significantly flatter source count distributions than AGN. We find a substantial increase in the relative fraction of hard X-ray sources at higher energies, from >55% below 2 keV to >77% above 2 keV. However the majority of sources detected above 4.5 keV still have significant flux below 2 keV. Comparison with predictions from the synthesis models suggest that the models might be overpredicting the number of faint absorbed AGN, which would call for fine adjustment of some model parameters such as the obscured to unobscured AGN ratio and/or the distribution of column densities at intermediate obscuration.
△ Less
Submitted 11 September, 2008;
originally announced September 2008.
-
The XMM-Newton Serendipitous Survey. V. The Second XMM-Newton Serendipitous Source Catalogue
Authors:
M. G. Watson,
A. C. Schröder,
D. Fyfe,
C. G. Page,
G. Lamer,
S. Mateos,
J. Pye,
M. Sakano,
S. Rosen,
J. Ballet,
X. Barcons,
D. Barret,
T. Boller,
H. Brunner,
M. Brusa,
A. Caccianiga,
F. J. Carrera,
M. Ceballos,
R. Della Ceca,
M. Denby,
G. Denkinson,
S. Dupuy,
S. Farrell,
F. Fraschetti,
M. J. Freyberg
, et al. (25 additional authors not shown)
Abstract:
Aims: Pointed observations with XMM-Newton provide the basis for creating catalogues of X-ray sources detected serendipitously in each field. This paper describes the creation and characteristics of the 2XMM catalogue. Methods: The 2XMM catalogue has been compiled from a new processing of the XMM-Newton EPIC camera data. The main features of the processing pipeline are described in detail. Resul…
▽ More
Aims: Pointed observations with XMM-Newton provide the basis for creating catalogues of X-ray sources detected serendipitously in each field. This paper describes the creation and characteristics of the 2XMM catalogue. Methods: The 2XMM catalogue has been compiled from a new processing of the XMM-Newton EPIC camera data. The main features of the processing pipeline are described in detail. Results: The catalogue, the largest ever made at X-ray wavelengths, contains 246,897 detections drawn from 3491 public XMM-Newton observations over a 7-year interval, which relate to 191,870 unique sources. The catalogue fields cover a sky area of more than 500 sq.deg. The non-overlapping sky area is ~360 sq.deg. (~1% of the sky) as many regions of the sky are observed more than once by XMM-Newton. The catalogue probes a large sky area at the flux limit where the bulk of the objects that contribute to the X-ray background lie and provides a major resource for generating large, well-defined X-ray selected source samples, studying the X-ray source population and identifying rare object types. The main characteristics of the catalogue are presented, including its photometric and astrometric properties .
△ Less
Submitted 21 October, 2008; v1 submitted 7 July, 2008;
originally announced July 2008.
-
Testing Gravity in the Outer Solar System: Results from Trans-Neptunian Objects
Authors:
John F. Wallin,
David S. Dixon,
Gary L. Page
Abstract:
The inverse square law of gravity is poorly probed by experimental tests at distances of ~ 10 AUs. Recent analysis of the trajectory of the Pioneer 10 and 11 spacecraft have shown an unmodeled acceleration directed toward the Sun which was not explained by any obvious spacecraft systematics, and occurred when at distances greater than 20 AUs from the Sun. If this acceleration represents a depart…
▽ More
The inverse square law of gravity is poorly probed by experimental tests at distances of ~ 10 AUs. Recent analysis of the trajectory of the Pioneer 10 and 11 spacecraft have shown an unmodeled acceleration directed toward the Sun which was not explained by any obvious spacecraft systematics, and occurred when at distances greater than 20 AUs from the Sun. If this acceleration represents a departure from Newtonian gravity or is indicative of an additional mass distribution in the outer solar system, it should be detectable in the orbits of Trans-Neptunian Objects (TNOs). To place limits on deviations from Newtonian gravity, we have selected a well observed sample of TNOs found orbiting between 20 and 100 AU from the Sun. By examining their orbits with modified orbital fitting software, we place tight limits on the perturbations of gravity that could exist in this region of the solar system.
△ Less
Submitted 23 May, 2007;
originally announced May 2007.
-
Can Minor Planets be Used to Assess Gravity in the Outer Solar System?
Authors:
Gary L. Page,
David S. Dixon,
John F. Wallin
Abstract:
The twin Pioneer spacecraft have been tracked for over thirty years as they headed out of the solar system. After passing 20 AU from the Sun, both exhibited a systematic error in their trajectories that can be interpreted as a constant acceleration towards the Sun. This Pioneer Effect is most likely explained by spacecraft systematics, but there have been no convincing arguments that that is the…
▽ More
The twin Pioneer spacecraft have been tracked for over thirty years as they headed out of the solar system. After passing 20 AU from the Sun, both exhibited a systematic error in their trajectories that can be interpreted as a constant acceleration towards the Sun. This Pioneer Effect is most likely explained by spacecraft systematics, but there have been no convincing arguments that that is the case. The alternative is that the Pioneer Effect represents a real phenomenon and perhaps new physics. What is lacking is a means of measuring the effect, its variation, its potential anisotropies, and its region of influence. We show that minor planets provide an observational vehicle for investigating the gravitational field in the outer solar system, and that a sustained observation campaign against properly chosen minor planets could confirm or refute the existence of the Pioneer Effect. Additionally, even if the Pioneer Effect does not represent a new physical phenomenon, minor planets can be used to probe the gravitational field in the outer Solar System and since there are very few intermediate range tests of gravity at the multiple AU distance scale, this is a worthwhile endeavor in its own right.
△ Less
Submitted 2 January, 2006; v1 submitted 17 April, 2005;
originally announced April 2005.