Search | arXiv e-print repository

A robust estimation and variable selection approach for sparse partially linear additive models

Abstract: In partially linear additive models the response variable is modelled with a linear component on a subset of covariates and an additive component in which the rest of the covariates enter the model as a sum of univariate unknown functions. This structure is more flexible than the usual full linear or full nonparametric regression models, avoids the 'curse of dimensionality', is easily interpretable and allows the user to include discrete or categorical variables in the linear part. On the other hand, in practice, the user incorporates all the available variables in the model no matter how they would impact on the response variable. For this reason, variable selection plays an important role since including covariates that have a null impact on the responses will reduce the prediction capability of the model. As in other settings, outliers in the data may harm estimations based on strong assumptions, such as normality of the response variable, leading to conclusions that are not representative of the data set. In this work, we propose a family of robust estimators that estimate and select variables from both the linear and the additive part of the model simultaneously. This family considers an adaptive procedure on a general class of penalties in the regularization part of the objective function that defines the estimators. We study the behaviour of the proposal against its least-squares counterpart under simulations and show the advantages of its use on a real data set.

Submitted 18 February, 2025; originally announced February 2025.

arXiv:2402.08325 [pdf, other]

doi 10.1088/1748-0221/19/05/P05010

Multi-Blade detector with VMM3a-ASIC-based readout: installation and commissioning at the reflectometer Amor at PSI

Authors: F. Piscitelli, F. Ghazi Moradi, F. S. Alves, M. J. Christensen, J. Hrivnak, A. Johansson, K. Fissum, C. C. Lai, A. Monera Martinez, D. Pfeiffer, E. Shahu, J. Stahn, P. O. Svensson

Abstract: The Multi-Blade (MB) Boron-10-based neutron detector is the chosen technology for three instruments at the European Spallation Source (ESS): the two ESS reflectometers, ESTIA and FREIA, and the Test Beam Line. A fourth MB detector has been built, installed and commissioned for the user operation of the reflectometer Amor at PSI (Switzerland). Amor can be considered a downscaled version of the ESS reflectometer ESTIA. They are based on the same Selene guide concept, optimized for performing focusing reflectometry on small samples. The experience gained at Amor is invaluable for the future deployment of the MB detector at the ESS. This manuscript describes the MB detector construction and installation at Amor along with the readout electronics chain based on the VMM3a ASIC. The readout chain deployed at Amor is equivalent to that of the ESS, including the readout master module (RMM), event-formation-units (EFUs), Kafka, FileWriter and live visualisation tools.

Submitted 18 March, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

Comments: 16 pages, 12 figures

Journal ref: 2024 JINST 19 P05010

arXiv:2305.05687 [pdf, other]

doi 10.3847/1538-4357/accc89

Coronal Heating as Determined by the Solar Flare Frequency Distribution Obtained by Aggregating Case Studies

Authors: James Paul Mason, Alexandra Werth, Colin G. West, Allison A. Youngblood, Donald L. Woodraska, Courtney Peck, Kevin Lacjak, Florian G. Frick, Moutamen Gabir, Reema A. Alsinan, Thomas Jacobsen, Mohammad Alrubaie, Kayla M. Chizmar, Benjamin P. Lau, Lizbeth Montoya Dominguez, David Price, Dylan R. Butler, Connor J. Biron, Nikita Feoktistov, Kai Dewey, N. E. Loomis, Michal Bodzianowski, Connor Kuybus, Henry Dietrick, Aubrey M. Wolfe, et al. (977 additional authors not shown)

Abstract: Flare frequency distributions represent a key approach to addressing one of the largest problems in solar and stellar physics: determining the mechanism that counter-intuitively heats coronae to temperatures that are orders of magnitude hotter than the corresponding photospheres. It is widely accepted that the magnetic field is responsible for the heating, but there are two competing mechanisms that could explain it: nanoflares or Alfvén waves. To date, neither can be directly observed. Nanoflares are, by definition, extremely small, but their aggregate energy release could represent a substantial heating mechanism, presuming they are sufficiently abundant. One way to test this presumption is via the flare frequency distribution, which describes how often flares of various energies occur. If the slope of the power law fitting the flare frequency distribution is above a critical threshold, $α=2$ as established in prior literature, then there should be a sufficient abundance of nanoflares to explain coronal heating. We performed $>$600 case studies of solar flares, made possible by an unprecedented number of data analysts via three semesters of an undergraduate physics laboratory course. This allowed us to include two crucial, but nontrivial, analysis methods: pre-flare baseline subtraction and computation of the flare energy, which requires determining flare start and stop times. We aggregated the results of these analyses into a statistical study to determine that $α= 1.63 \pm 0.03$. This is below the critical threshold, suggesting that Alfvén waves are an important driver of coronal heating.

Submitted 9 May, 2023; originally announced May 2023.

Comments: 1,002 authors, 14 pages, 4 figures, 3 tables, published by The Astrophysical Journal on 2023-05-09, volume 948, page 71
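
The role of the critical threshold $α=2$ in the entry above can be made explicit with a short, standard calculation. This is a sketch and is not taken from the paper: it assumes an idealized pure power law $dN/dE = A E^{-α}$ between two cutoff energies, and the symbols $A$, $E_{\min}$, $E_{\max}$ and $W$ are introduced here purely for illustration.

```latex
% Total energy W released by flares with energies between E_min and E_max,
% assuming dN/dE = A E^{-\alpha} (A, E_min, E_max, W are illustrative symbols).
\begin{align}
  W &= \int_{E_{\min}}^{E_{\max}} E \,\frac{dN}{dE}\, dE
     = A \int_{E_{\min}}^{E_{\max}} E^{1-\alpha}\, dE
     = \frac{A}{2-\alpha}\left(E_{\max}^{\,2-\alpha} - E_{\min}^{\,2-\alpha}\right),
  \qquad \alpha \neq 2 .
\end{align}
% For \alpha > 2 the exponent 2-\alpha is negative, so the E_min term dominates:
% most of the energy comes from the smallest (nano)flares.
% For \alpha < 2 the exponent is positive, so the E_max term dominates:
% most of the energy comes from the largest flares.
```

Under this idealization, the reported $α = 1.63 \pm 0.03 < 2$ places most of the flare energy budget in the largest events, which is why a slope below the threshold argues against nanoflares being abundant enough on their own.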

arXiv:2203.09148 [pdf, other]

doi 10.1016/j.csl.2021.101329

Prediction of speech intelligibility with DNN-based performance measures

Authors: Angel Mario Castro Martinez, Constantin Spille, Jana Roßbach, Birger Kollmeier, Bernd T. Meyer

Abstract: This paper presents a speech intelligibility model based on automatic speech recognition (ASR), combining phoneme probabilities from deep neural networks (DNN) and a performance measure that estimates the word error rate from these probabilities. This model does not require the clean speech reference nor the word labels during testing as the ASR decoding step, which finds the most likely sequence of words given phoneme posterior probabilities, is omitted. The model is evaluated via the root-mean-squared error between the predicted and observed speech reception thresholds from eight normal-hearing listeners. The recognition task consists of identifying noisy words from a German matrix sentence test. The speech material was mixed with eight noise maskers covering different modulation types, from speech-shaped stationary noise to a single-talker masker. The prediction performance is compared to five established models and an ASR-model using word labels. Two combinations of features and networks were tested. Both include temporal information either at the feature level (amplitude modulation filterbanks and a feed-forward network) or captured by the architecture (mel-spectrograms and a time-delay deep neural network, TDNN). The TDNN model is on par with the DNN while reducing the number of parameters by a factor of 37; this optimization allows parallel streams on dedicated hearing aid hardware as a forward-pass can be computed within the 10 ms of each frame. The proposed model performs almost as well as the label-based model and produces more accurate predictions than the baseline models.

Submitted 17 March, 2022; originally announced March 2022.

Journal ref: Computer Speech & Language, 74, p.101329 (2022)

arXiv:2111.15651 [pdf, other]

Leveraging The Topological Consistencies of Learning in Deep Neural Networks

Authors: Stuart Synakowski, Fabian Benitez-Quiroz, Aleix M. Martinez

Abstract: Recently, methods have been developed to accurately predict the testing performance of a Deep Neural Network (DNN) on a particular task, given statistics of its underlying topological structure. However, further leveraging this newly found insight for practical applications is intractable due to the high computational cost in terms of time and memory. In this work, we define a new class of topological features that accurately characterize the progress of learning while being quick to compute during running time. Additionally, our proposed topological features are readily equipped for backpropagation, meaning that they can be incorporated in end-to-end training. Our newly developed practical topological characterization of DNNs allows for an additional set of applications. We first show we can predict the performance of a DNN without a testing set and without the need for high-performance computing. We also demonstrate our topological characterization of DNNs is effective in estimating task similarity. Lastly, we show we can induce learning in DNNs by actively constraining the DNN's topological structure. This opens up new avenues in constricting the underlying structure of DNNs in a meta-learning framework.

Submitted 30 November, 2021; originally announced November 2021.

arXiv:2107.12987 [pdf, other]

A robust spline approach in partially linear additive models

Authors: Graciela Boente, Alejandra Mercedes Martinez

Abstract: Partially linear additive models generalize linear ones since they model the relation between a response variable and covariates by assuming that some covariates have a linear relation with the response but each of the others enters through unknown univariate smooth functions. The harmful effect of outliers either in the residuals or in the covariates involved in the linear component has been described in the situation of partially linear models, that is, when only one nonparametric component is involved in the model. When dealing with additive components, the problem of providing reliable estimators when atypical data arise is of practical importance, motivating the need for robust procedures. Hence, we propose a family of robust estimators for partially linear additive models by combining $B$-splines with robust linear regression estimators. We obtain consistency results, rates of convergence and asymptotic normality for the linear components, under mild assumptions. A Monte Carlo study is carried out to compare the performance of the robust proposal with its classical counterpart under different models and contamination schemes. The numerical experiments show the advantage of the proposed methodology for finite samples. We also illustrate the usefulness of the proposed approach on a real data set.

Submitted 4 August, 2023; v1 submitted 27 July, 2021; originally announced July 2021.

arXiv:2010.15047 [pdf, other]

Automatic selection of eye tracking variables in visual categorization in adults and infants

Authors: Samuel Rivera, Catherine A. Best, Hyungwook Yim, Dirk B. Walther, Vladimir M. Sloutsky, Aleix M. Martinez

Abstract: Visual categorization and learning of visual categories exhibit early onset; however, the underlying mechanisms of early categorization are not well understood. The main limiting factor for examining these mechanisms is the limited duration of infant cooperation (10-15 minutes), which leaves little room for multiple test trials. With its tight link to visual attention, eye tracking is a promising method for getting access to the mechanisms of category learning. But how should researchers decide which aspects of the rich eye tracking data to focus on? To date, eye tracking variables are generally handpicked, which may lead to biases in the eye tracking data. Here, we propose an automated method for selecting eye tracking variables based on analyses of their usefulness to discriminate learners from non-learners of visual categories. We presented infants and adults with a category learning task and tracked their eye movements. We then extracted an over-complete set of eye tracking variables encompassing durations, probabilities, latencies, and the order of fixations and saccadic eye movements. We compared three statistical techniques for identifying those variables among this large set that are useful for discriminating learners from non-learners: ANOVA ranking, Bayes ranking, and L1 regularized logistic regression. We found remarkable agreement between these methods in identifying a small set of discriminant variables. Moreover, the same eye tracking variables allow us to classify category learners from non-learners among adults and 6- to 8-month-old infants with accuracies above 71%.

Submitted 26 November, 2020; v1 submitted 28 October, 2020; originally announced October 2020.

arXiv:2005.10177 [pdf, other]

doi 10.1364/PRJ.388693

Spectrally-resolved Hong-Ou-Mandel interferometry for Quantum-Optical Coherence Tomography

Authors: Pablo Yepiz-Graciano, Ali Michel Angulo Martinez, Dorilian Lopez-Mago, Hector Cruz-Ramirez, Alfred B. U'Ren

Abstract: In this paper, we revisit the well-known Hong-Ou-Mandel (HOM) effect in which two photons, which meet at a beamsplitter, can interfere destructively, leading to a null in coincidence counts. In a standard HOM measurement, the coincidence counts across the two output ports of the beamsplitter are monitored as the temporal delay between the two photons prior to the beamsplitter is varied, resulting in the well-known HOM dip. We show, both theoretically and experimentally, that by leaving the delay fixed at a particular value while relying on spectrally-resolved coincidence photon-counting, we can reconstruct the HOM dip, which would have been obtained through a standard delay-scanning, non-spectrally-resolved HOM measurement. We show that our numerical reconstruction procedure exhibits a novel dispersion cancellation effect, to all orders. We discuss how our present work can lead to a drastic reduction in the time required to acquire a HOM interferogram, and specifically discuss how this could be of particular importance for the implementation of efficient quantum-optical coherence tomography devices.

Submitted 20 May, 2020; originally announced May 2020.

Comments: received 01/21/2020; accepted 03/24/2020; posted 03/26/2020; Doc. ID 388693

arXiv:2005.09842 [pdf, other]

doi 10.1038/s41598-019-45088-0

Interference effects in quantum-optical coherence tomography using spectrally engineered photon pairs

Authors: Pablo Yepiz Graciano, Ali Michel Angulo Martinez, Dorilian Lopez-Mago, Gustavo Castro-Olvera, Martha Rosete-Aguilar, Jesus Garduño-Mejia, Roberto Ramirez Alarcon, Hector Cruz Ramirez, Alfred B. U'Ren

Abstract: Optical-coherence tomography (OCT) is a technique that employs light in order to measure the internal structure of semi-transparent, e.g. biological, samples. It is based on the interference pattern of low-coherence light. Quantum-OCT (QOCT), instead, employs the correlation properties of entangled photon pairs, for example, generated by the process of spontaneous parametric downconversion (SPDC). The usual QOCT scheme uses photon pairs characterised by a joint-spectral amplitude with strict spectral anti-correlations. It has been shown that, in contrast with its classical counterpart, QOCT provides resolution enhancement and dispersion cancellation. In this paper, we revisit the theory of QOCT and extend the theoretical model so as to include photon pairs with arbitrary spectral correlations. We present experimental results that complement the theory and explain the physical underpinnings appearing in the interference pattern. In our experiment, we utilize a pump for the SPDC process ranging from continuous wave to pulsed in the femtosecond regime, and show that cross-correlation interference effects appearing for each pair of layers may be directly suppressed for a sufficiently large pump bandwidth. Our results provide insights and strategies that could guide practical implementations of QOCT.

Submitted 20 May, 2020; originally announced May 2020.

Journal ref: Sci Rep 9, 8954 (2019)

arXiv:1908.03679 [pdf, other]

Distance Map Loss Penalty Term for Semantic Segmentation

Authors: Francesco Caliva, Claudia Iriondo, Alejandro Morales Martinez, Sharmila Majumdar, Valentina Pedoia

Abstract: Convolutional neural networks for semantic segmentation suffer from low performance at object boundaries. In medical imaging, accurate representation of tissue surfaces and volumes is important for tracking of disease biomarkers such as tissue morphology and shape features. In this work, we propose a novel distance map derived loss penalty term for semantic segmentation. We propose to use distance maps, derived from ground truth masks, to create a penalty term, guiding the network's focus towards hard-to-segment boundary regions. We investigate the effects of this penalizing factor against cross-entropy, Dice, and focal loss, among others, evaluating performance on a 3D MRI bone segmentation task from the publicly available Osteoarthritis Initiative dataset. We observe a significant improvement in the quality of segmentation, with better shape preservation at bone boundaries and areas affected by partial volume. We ultimately aim to use our loss penalty term to improve the extraction of shape biomarkers and derive metrics to quantitatively evaluate the preservation of shape.

Submitted 9 August, 2019; originally announced August 2019.

Comments: Medical Imaging with Deep Learning (MIDL2019) Conference [arXiv:1907.08612], Extended Abstract

Report number: MIDL/2019/ExtendedAbstract/B1eIcvS45V
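
The idea in the entry above, a penalty derived from distance maps of the ground-truth mask that pushes the loss towards boundary pixels, can be illustrated with a small pure-Python sketch. The abstract does not give the actual formula, so everything specific here is an assumption made for illustration: the weight `1 + scale / distance`, the function names, the brute-force distance computation, and the use of a 2D binary mask instead of 3D MRI volumes.

```python
import math

def boundary_distance_map(mask):
    """Euclidean distance from each pixel to the nearest pixel of the other class.

    Brute force, suitable only for tiny illustrative masks.
    """
    h, w = len(mask), len(mask[0])
    coords = [(r, c) for r in range(h) for c in range(w)]
    dist = [[0.0] * w for _ in range(h)]
    for r, c in coords:
        others = [(rr, cc) for rr, cc in coords if mask[rr][cc] != mask[r][c]]
        # A mask with a single class has no boundary: distance is infinite.
        dist[r][c] = (min(math.hypot(r - rr, c - cc) for rr, cc in others)
                      if others else float("inf"))
    return dist

def penalty_weights(mask, scale=1.0):
    """Hypothetical penalty: 1 + scale / distance (an assumption, not the paper's formula).

    Pixels next to the boundary (distance 1) get the largest weight;
    the weight decays towards 1 far from the boundary.
    """
    return [[1.0 + scale / d for d in row] for row in boundary_distance_map(mask)]

def weighted_cross_entropy(mask, probs, weights, eps=1e-12):
    """Mean binary cross-entropy with a per-pixel multiplicative weight."""
    total, n = 0.0, 0
    for mask_row, prob_row, weight_row in zip(mask, probs, weights):
        for y, p, wgt in zip(mask_row, prob_row, weight_row):
            p = min(max(p, eps), 1.0 - eps)  # avoid log(0)
            total -= wgt * (y * math.log(p) + (1 - y) * math.log(1.0 - p))
            n += 1
    return total / n

# Toy ground truth: left half background (0), right half foreground (1).
mask = [[0, 0, 0, 1, 1, 1],
        [0, 0, 0, 1, 1, 1]]
# Toy predictions: uniformly uncertain everywhere.
probs = [[0.5] * 6 for _ in range(2)]

weights = penalty_weights(mask)
plain = weighted_cross_entropy(mask, probs, [[1.0] * 6 for _ in range(2)])
penalized = weighted_cross_entropy(mask, probs, weights)
```

In this toy case the pixels in columns 2 and 3, which touch the class boundary, receive weight 2.0, while the outermost columns receive about 1.33, so the same prediction error costs more near the boundary than far from it.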

arXiv:1808.04399 [pdf, other]

Cross-Cultural and Cultural-Specific Production and Perception of Facial Expressions of Emotion in the Wild

Authors: Ramprakash Srinivasan, Aleix M. Martinez

Abstract: Automatic recognition of emotion from facial expressions is an intense area of research, with a potentially long list of important applications. Yet, the study of emotion requires knowing which facial expressions are used within and across cultures in the wild, not in controlled lab conditions; but such studies do not exist. Which and how many cross-cultural and cultural-specific facial expressions do people commonly use? And, what affect variables does each expression communicate to observers? If we are to design technology that understands the emotion of users, we need answers to these two fundamental questions. In this paper, we present the first large-scale study of the production and visual perception of facial expressions of emotion in the wild. We find that of the 16,384 possible facial configurations that people can theoretically produce, only 35 are successfully used to transmit emotive information across cultures, and only 8 within a smaller number of cultures. Crucially, we find that visual analysis of cross-cultural expressions yields consistent perception of emotion categories and valence, but not arousal. In contrast, visual analysis of cultural-specific expressions yields consistent perception of valence and arousal, but not of emotion categories. Additionally, we find that the number of expressions used to communicate each emotion is also different, e.g., 17 expressions transmit happiness, but only 1 is used to convey disgust.

Submitted 13 August, 2018; originally announced August 2018.

arXiv:1807.09251 [pdf, other]

GANimation: Anatomically-aware Facial Animation from a Single Image

Authors: Albert Pumarola, Antonio Agudo, Aleix M. Martinez, Alberto Sanfeliu, Francesc Moreno-Noguer

Abstract: Recent advances in Generative Adversarial Networks (GANs) have shown impressive results for the task of facial expression synthesis. The most successful architecture is StarGAN, which conditions the GAN's generation process with images of a specific domain, namely a set of images of persons sharing the same expression. While effective, this approach can only generate a discrete number of expressions, determined by the content of the dataset. To address this limitation, in this paper, we introduce a novel GAN conditioning scheme based on Action Units (AU) annotations, which describe, in a continuous manifold, the anatomical facial movements defining a human expression. Our approach allows controlling the magnitude of activation of each AU and combining several of them. Additionally, we propose a fully unsupervised strategy to train the model, which only requires images annotated with their activated AUs, and exploit attention mechanisms that make our network robust to changing backgrounds and lighting conditions. Extensive evaluation shows that our approach goes beyond competing conditional generators both in the capability to synthesize a much wider range of expressions ruled by anatomically feasible muscle movements, and in the capacity of dealing with images in the wild.

Submitted 28 August, 2018; v1 submitted 24 July, 2018; originally announced July 2018.

Comments: Accepted as oral at ECCV 2018. Code available at https://github.com/albertpumarola/GANimation. Added minor updates

arXiv:1704.01427 [pdf, ps, other]

doi 10.1016/j.knosys.2018.09.019

AMIDST: a Java Toolbox for Scalable Probabilistic Machine Learning

Authors: Andrés R. Masegosa, Ana M. Martínez, Darío Ramos-López, Rafael Cabañas, Antonio Salmerón, Thomas D. Nielsen, Helge Langseth, Anders L. Madsen

Abstract: The AMIDST Toolbox is software for scalable probabilistic machine learning with a special focus on (massive) streaming data. The toolbox supports a flexible modeling language based on probabilistic graphical models with latent variables and temporal dependencies. The specified models can be learnt from large data sets using parallel or distributed implementations of Bayesian learning algorithms for either streaming or batch data. These algorithms are based on a flexible variational message passing scheme, which supports discrete and continuous variables from a wide range of probability distributions. AMIDST also leverages existing functionality and algorithms by interfacing to software tools such as Flink, Spark, MOA, Weka, R and HUGIN. AMIDST is an open source toolbox written in Java and available at http://www.amidsttoolbox.com under the Apache Software License version 2.0.

Submitted 4 April, 2017; originally announced April 2017.

ACM Class: I.2.6

arXiv:1703.01210 [pdf, other]

EmotioNet Challenge: Recognition of facial expressions of emotion in the wild

Authors: C. Fabian Benitez-Quiroz, Ramprakash Srinivasan, Qianli Feng, Yan Wang, Aleix M. Martinez

Abstract: This paper details the methodology and results of the EmotioNet challenge. This challenge is the first to test the ability of computer vision algorithms in the automatic analysis of a large number of images of facial expressions of emotion in the wild. The challenge was divided into two tracks. The first track tested the ability of current computer vision algorithms in the automatic detection of action units (AUs). Specifically, we tested the detection of 11 AUs. The second track tested the algorithms' ability to recognize emotion categories in images of facial expressions. Specifically, we tested the recognition of 16 basic and compound emotion categories. The results of the challenge suggest that current computer vision and machine learning algorithms are unable to reliably solve these two tasks. The limitations of current algorithms are more apparent when trying to recognize emotion. We also show that current algorithms are not affected by mild resolution changes, small occluders, gender or age, but that 3D pose is a major limiting factor on performance. We provide an in-depth discussion of the points that need special attention moving forward.

Submitted 3 March, 2017; originally announced March 2017.

arXiv:1702.04333 [pdf, other]

doi 10.1016/j.csl.2017.02.006

On the Relevance of Auditory-Based Gabor Features for Deep Learning in Automatic Speech Recognition

Authors: Angel Mario Castro Martinez, Sri Harish Mallidi, Bernd T. Meyer

Abstract: Previous studies support the idea of merging auditory-based Gabor features with deep learning architectures to achieve robust automatic speech recognition; however, the cause behind the gain of such a combination is still unknown. We believe these representations provide the deep learning decoder with more discriminable cues. Our aim with this paper is to validate this hypothesis by performing experiments with three different recognition tasks (Aurora 4, CHiME 2 and CHiME 3) and assess the discriminability of the information encoded by Gabor filterbank features. Additionally, to identify the contribution of low, medium and high temporal modulation frequencies, subsets of the Gabor filterbank were used as features (dubbed LTM, MTM and HTM, respectively). With temporal modulation frequencies between 16 and 25 Hz, HTM consistently outperformed the remaining ones in every condition, highlighting the robustness of these representations against channel distortions, low signal-to-noise ratios and acoustically challenging real-life scenarios with relative improvements from 11 to 56% against a Mel-filterbank-DNN baseline. To explain the results, a measure of similarity between phoneme classes from DNN activations is proposed and linked to their acoustic properties. We find this measure to be consistent with the observed error rates and highlight specific differences on phoneme level to pinpoint the benefit of the proposed features.

Submitted 14 February, 2017; originally announced February 2017.

Comments: accepted to Computer Speech & Language

arXiv:1608.05119 [pdf, other]

doi 10.1038/s41598-017-03185-y

Improving randomness characterization through Bayesian model selection

Authors: Rafael Díaz Hernández Rojas, Aldo Solís, Alí M. Angulo Martínez, Alfred B. U'Ren, Jorge G. Hirsch, Matteo Marsili, Isaac Pérez Castillo

Abstract: Nowadays random number generation plays an essential role in technology with important applications in areas ranging from cryptography, which lies at the core of current communication protocols, to Monte Carlo methods, and other probabilistic algorithms. In this context, a crucial scientific endeavour is to develop effective methods that allow the characterization of random number generators. However, commonly employed methods either lack formality (e.g. the NIST test suite), or are inapplicable in principle (e.g. the characterization derived from the Algorithmic Theory of Information (ATI)). In this letter we present a novel method based on Bayesian model selection, which is both rigorous and effective, for characterizing randomness in a bit sequence. We derive analytic expressions for a model's likelihood which is then used to compute its posterior probability distribution. Our method proves to be more rigorous than NIST's suite and the Borel-Normality criterion and its implementation is straightforward. We have applied our method to an experimental device based on the process of spontaneous parametric downconversion, implemented in our laboratory, to confirm that it behaves as a genuine quantum random number generator (QRNG). As our approach relies on Bayesian inference, which entails model generalizability, our scheme transcends individual sequence analysis, leading to a characterization of the source of the random sequences itself.

Submitted 12 June, 2017; v1 submitted 17 August, 2016; originally announced August 2016.

Comments: 25 pages

Journal ref: Scientific Reports 7, 3096 (2017)

arXiv:1604.07990 [pdf, other]

doi 10.1109/MCI.2016.2532267

Probabilistic Graphical Models on Multi-Core CPUs using Java 8

Authors: Andres R. Masegosa, Ana M. Martinez, Hanen Borchani

Abstract: In this paper, we discuss software design issues related to the development of parallel computational intelligence algorithms on multi-core CPUs, using the new Java 8 functional programming features. In particular, we focus on probabilistic graphical models (PGMs) and present the parallelisation of a collection of algorithms that deal with inference and learning of PGMs from data. Namely, maximum likelihood estimation, importance sampling, and greedy search for solving combinatorial optimisation problems. Through these concrete examples, we tackle the problem of defining efficient data structures for PGMs and parallel processing of same-size batches of data sets using Java 8 features. We also provide straightforward techniques to code parallel algorithms that seamlessly exploit multi-core processors. The experimental analysis, carried out using our open source AMIDST (Analysis of MassIve Data STreams) Java toolbox, shows the merits of the proposed solutions.

Submitted 27 April, 2016; originally announced April 2016.

Comments: Pre-print version of the paper presented in the special issue on Computational Intelligence Software at IEEE Computational Intelligence Magazine journal

Journal ref: IEEE Computational Intelligence Magazine, 11(2), 41-54. 2016

arXiv:1502.05882 [pdf, other]

doi 10.1088/0031-8949/90/7/074034

How random are random numbers generated using photons?

Authors: Aldo Solis, Alí M. Angulo Martinez, Roberto Ramírez Alarcón, Hector Cruz Ramírez, Alfred B. U'Ren, Jorge G. Hirsch

Abstract: Randomness is fundamental in quantum theory, with many philosophical and practical implications. In this paper we discuss the concept of algorithmic randomness, which provides a quantitative method to assess the Borel normality of a given sequence of numbers, a necessary condition for it to be considered random. We use Borel normality as a tool to investigate the randomness of ten sequences of bits generated from the differences between detection times of photon pairs generated by spontaneous parametric downconversion. These sequences are shown to fulfil the randomness criteria without difficulties. As deviations from Borel normality for photon-generated random number sequences have been reported in previous work, a strategy to understand these diverging findings is outlined.

Submitted 20 February, 2015; originally announced February 2015.

Comments: 9 pages, 7 figures. To appear in Physica Scripta as an invited Article

Journal ref: Phys. Scr. 90 (2015) 074034

arXiv:0904.2351 [pdf, ps, other]

doi 10.1088/1742-6596/167/1/012061

Molecular architectures based on pi-conjugated block copolymers for global quantum computation

Authors: Cesar A. Mujica Martinez, Julio C. Arce, John H. Reina, Michael Thorwart

Abstract: We propose a molecular setup for the physical implementation of a barrier global quantum computation scheme based on the electron-doped pi-conjugated copolymer architecture of nine blocks PPP-PDA-PPP-PA-(CCH-acene)-PA-PPP-PDA-PPP (where each block is an oligomer). The physical carriers of information are electrons coupled through the Coulomb interaction, and the building block of the computing architecture is composed of three adjacent qubit systems in a quasi-linear arrangement, each of them allowing qubit storage, but with the central qubit exhibiting a third accessible state of electronic energy far away from that of the qubits' transition energy. The third state is reached from one of the computational states by means of an on-resonance coherent laser field, and acts as a barrier mechanism for the direct control of qubit entanglement. Initial estimations of the spontaneous emission decay rates associated with the energy level structure allow us to compute a damping rate of order 10^{-7} s, which suggests a not so strong coupling to the environment. Our results offer an all-optical, scalable proposal for global quantum computing based on semiconducting pi-conjugated polymers.

Submitted 15 April, 2009; originally announced April 2009.

Comments: To appear in J. Phys.: Conf. Series (2009)

Journal ref: Journal of Physics: Conference Series 167, 012061 (2009)

Showing 1–19 of 19 results for author: Martínez, A M