-
PaccMann$^{RL}$ on SARS-CoV-2: Designing antiviral candidates with conditional generative models
Authors:
Jannis Born,
Matteo Manica,
Joris Cadow,
Greta Markert,
Nil Adell Mill,
Modestas Filipavicius,
María Rodríguez Martínez
Abstract:
With the fast development of COVID-19 into a global pandemic, scientists around the globe are desperately searching for effective antiviral therapeutic agents. Bridging systems biology and drug discovery, we propose a deep learning framework for conditional de novo design of antiviral candidate drugs tailored against given protein targets. First, we train a multimodal ligand--protein binding affinity model to predict affinities of antiviral compounds to target proteins and couple this model with pharmacological toxicity predictors. Exploiting this multi-objective as a reward function of a conditional molecular generator (consisting of two VAEs), we showcase a framework that navigates the chemical space toward regions with more antiviral molecules. Specifically, we explore a challenging setting of generating ligands against unseen protein targets by performing a leave-one-out cross-validation on 41 SARS-CoV-2-related target proteins. Using deep RL, we demonstrate that in 35 out of 41 cases, the generation is biased towards sampling more binding ligands, with an average increase of 83% compared to an unbiased VAE. We present a case study on a potential Envelope-protein inhibitor and perform a synthetic accessibility assessment of the best generated molecules, which resembles a viable roadmap towards a rapid in-vitro evaluation of potential SARS-CoV-2 inhibitors.
Submitted 6 July, 2020; v1 submitted 27 May, 2020;
originally announced May 2020.
-
Galaxy classification: deep learning on the OTELO and COSMOS databases
Authors:
José A. de Diego,
Jakub Nadolny,
Ángel Bongiovanni,
Jordi Cepa,
Mirjana Pović,
Ana María Pérez García,
Carmen P. Padilla Torres,
Maritza A. Lara-López,
Miguel Cerviño,
Ricardo Pérez Martínez,
Emilio J. Alfaro,
Héctor O. Castañeda,
Miriam Fernández-Lorenzo,
Jesús Gallego,
J. Jesús González,
J. Ignacio González-Serrano,
Irene Pintos-Castro,
Miguel Sánchez-Portal,
Bernabé Cedrés,
Mauro González-Otero,
D. Heath Jones,
Joss Bland-Hawthorn
Abstract:
Context. The accurate classification of hundreds of thousands of galaxies observed in modern deep surveys is imperative if we want to understand the universe and its evolution. Aims. Here, we report the use of machine learning techniques to classify early- and late-type galaxies in the OTELO and COSMOS databases using optical and infrared photometry and available shape parameters: either the Sersic index or the concentration index. Methods. We used three classification methods for the OTELO database: 1) u-r color separation, 2) linear discriminant analysis using u-r and a shape parameter classification, and 3) a deep neural network using the r magnitude, several colors, and a shape parameter. We analyzed the performance of each method by sample bootstrapping and tested the performance of our neural network architecture using COSMOS data. Results. The accuracy achieved by the deep neural network is greater than that of the other classification methods, and it can also operate with missing data. Our neural network architecture is able to classify both OTELO and COSMOS datasets regardless of small differences in the photometric bands used in each catalog. Conclusions. In this study we show that the use of deep neural networks is a robust method to mine the cataloged data.
Submitted 14 May, 2020;
originally announced May 2020.
-
The OTELO survey: Nature and mass-metallicity relation for H$α$ emitters at $z\sim\,0.4$
Authors:
Jakub Nadolny,
Maritza A. Lara-López,
Miguel Cerviño,
Ángel Bongiovanni,
Jordi Cepa,
José A. de Diego,
Ana María Pérez García,
Ricardo Pérez Martínez,
Miguel Sánchez-Portal,
Emilio Alfaro,
Héctor O. Castañeda,
Jesús Gallego,
J. Jesús González,
J. Ignacio González-Serrano,
Carmen P. Padilla Torres,
Irene Pintos-Castro,
Mirjana Pović
Abstract:
A sample of low-mass H$α$ emission line sources (ELS) at $z\,\sim\,0.4$ was studied in the context of the mass-metallicity relation (MZR) and its possible evolution. We drew our sample from the OSIRIS Tunable Emission Line Object (OTELO) survey, which exploits the red tunable filter of OSIRIS at the Gran Telescopio Canarias to perform a blind narrow-band spectral scan in a selected field of the Extended Groth Strip. We were able to directly measure emission line fluxes and equivalent widths from the analysis of OTELO pseudo-spectra. This study aims to explore the MZR in the very low-mass regime. Our sample reaches stellar masses ($M_*$) as low as $10^{6.8}\,M_\odot$, where 63\% of the sample have $M_*\,<10^9\,M_\odot$. We also explore the relation of the star formation rate (SFR) and specific SFR (sSFR) with $M_*$ and gas-phase oxygen abundances, as well as the $M_*$-size relation and the morphological classification. The $M_*$ were estimated using synthetic rest-frame colours. Using a $χ^2$ minimization method, we separated the contribution of [N II]$λ$6583 to the H$α$ emission lines. Using the N2 index, we separated active galactic nuclei from star-forming galaxies (SFGs) and estimated the gas metallicity. We studied the morphology of the sampled galaxies qualitatively (visually) and quantitatively (automatically) using high-resolution data from the \textit{Hubble Space Telescope}-ACS. The physical size of the galaxies was derived from the morphological analysis using \texttt{GALAPAGOS2/GALFIT}, where we fit a single-Sérsic 2D model to each source.
Submitted 16 March, 2020;
originally announced March 2020.
-
The OTELO survey. A case study of [O III]4959,5007 emitters at <z> = 0.83
Authors:
Ángel Bongiovanni,
Marina Ramón-Pérez,
Ana María Pérez García,
Miguel Cerviño,
Jordi Cepa,
Jakub Nadolny,
Ricardo Pérez Martínez,
Emilio J. Alfaro,
Héctor Castañeda,
Bernabé Cedrés,
José A. de Diego,
Alessandro Ederoclite,
Mirian Fernández-Lorenzo,
Jesús Gallego,
J. Jesús González,
J. Ignacio González-Serrano,
Maritza A. Lara-López,
Iván Oteo Gómez,
Carmen P. Padilla Torres,
Irene Pintos-Castro,
Mirjana Pović,
Miguel Sánchez-Portal,
D. Heath Jones,
Joss Bland-Hawthorn,
Antonio Cabrera-Lavers
Abstract:
The OTELO survey is a very deep, blind exploration of a selected region of the Extended Groth Strip and is designed for finding emission-line sources (ELSs). The survey design, observations, data reduction, astrometry, and photometry, as well as the correlation with ancillary data used to obtain a final catalogue, including photo-z estimates and a preliminary selection of ELSs, were described in a previous contribution. Here, we aim to determine the main properties and luminosity function (LF) of the [O III] ELS sample of OTELO as a scientific demonstration of its capabilities, advantages, and complementarity with respect to other surveys. The selection and analysis procedures of ELS candidates obtained using tunable filter (TF) pseudo-spectra are described. We performed simulations in the parameter space of the survey to obtain emission-line detection probabilities. Relevant characteristics of [O III] emitters and the LF([O III]), including the main selection biases and uncertainties, are presented. A total of 184 sources were confirmed as [O III] emitters at a mean redshift z=0.83. The minimum detectable line flux and equivalent width (EW) in this ELS sample are $\sim$5 $\times$ 10$^{-19}$ erg s$^{-1}$ cm$^{-2}$ and $\sim$6 Å, respectively. We are able to constrain the faint-end slope ($α= -1.03\pm0.08$) of the observed LF([O III]) at z=0.83. This LF reaches values that are approximately ten times lower than those from other surveys. The vast majority (84\%) of the morphologically classified [O III] ELSs are disc-like sources, and 87\% of this sample is comprised of galaxies with stellar masses of M$_\star$ $<$ 10$^{10}$ M$_{\odot}$.
Submitted 27 February, 2020; v1 submitted 20 February, 2020;
originally announced February 2020.
-
The OTELO survey. III. Demography, morphology, IR luminosity and environment of AGN hosts
Authors:
Marina Ramón-Pérez,
Ángel Bongiovanni,
Ana María Pérez García,
Jordi Cepa,
Jakub Nadolny,
Irene Pintos-Castro,
Maritza A. Lara-López,
Emilio J. Alfaro Navarro,
Héctor O. Castañeda,
Miguel Cerviño,
José Antonio de Diego,
Mirian Fernández-Lorenzo,
Jesús Gallego,
J. Jesús González,
J. Ignacio González-Serrano,
Iván Oteo Gómez,
Ricardo Pérez Martínez,
Mirjana Pović,
Miguel Sánchez-Portal
Abstract:
We take advantage of the capabilities of the OTELO survey to select and study the AGN population in the field. We performed an analysis of the properties of these objects, including their demography, morphology, and IR luminosity. Focusing on the population of H$α$ emitters at $z \sim 0.4$, we also aim to study the environments of AGN and non-AGN galaxies at that redshift. We make use of the multiwavelength catalog of objects in the field compiled by the OTELO survey, unique in terms of minimum line flux and equivalent width. The OTELO pseudo-spectra allow the identification of emission lines and the spectral classification of the sources. We obtained a sample of 72 AGNs in the field of OTELO, selected with four different methods in the optical, X-rays, and mid-infrared bands. We find that using X-rays is the most efficient way to select AGNs. An analysis was performed on the AGN population of OTELO in order to characterize its members. At $z \sim 0.4$, we find that up to 26\% of our H$α$ emitters are AGNs. At that redshift, AGNs are found in identical environments to non-AGNs, although they represent the most clustered group when compared to passive and star-forming galaxies. The majority of our AGNs at any redshift were classified as late-type galaxies, including a 16\% proportion of irregulars. Another 16\% of AGNs show signs of interactions or mergers. Regarding the infrared luminosity, we are able to recover all the luminous infrared galaxies (LIRGs) in the field of OTELO up to $z\sim 1.6$. We find that the proportion of LIRGs and ultra-luminous infrared galaxies (ULIRGs) is higher among the AGN population, and that ULIRGs show a higher fraction of AGNs than LIRGs.
Submitted 13 February, 2020;
originally announced February 2020.
-
The OTELO survey. II. The faint-end of the H$α$ luminosity function at z $\sim$ 0.40
Authors:
Marina Ramón-Pérez,
Ángel Bongiovanni,
Ana María Pérez García,
Jordi Cepa,
Maritza A. Lara-López,
José Antonio de Diego,
Emilio J. Alfaro Navarro,
Héctor O. Castañeda,
Miguel Cerviño,
Mirian Fernández-Lorenzo,
Jesús Gallego,
J. Jesús González,
J. Ignacio González-Serrano,
Jakub Nadolny,
Iván Oteo Gómez,
Ricardo Pérez Martínez,
I. Pintos-Castro,
Mirjana Pović,
Miguel Sánchez-Portal
Abstract:
We take advantage of the capability of the OTELO survey to obtain the H$α$ luminosity function (LF) at ${\rm z}\sim0.40$. Because of the deep coverage of OTELO, we are able to determine the faint end of the LF, and thus better constrain the star formation rate and the number of galaxies at low luminosities. The AGN contribution to this LF is estimated as well. We make use of the multi-wavelength catalogue of objects in the field compiled by the OTELO survey, which is unique in terms of minimum flux and equivalent width. We also take advantage of the pseudo-spectra built for each source, which allow the identification of emission lines and the discrimination of different types of objects. The H$α$ luminosity function at $z\sim0.40$ is obtained, which extends the current faint end by almost 1 dex, reaching minimal luminosities of $\log_{10}L_{\rm lim}=38.5$ erg s$^{-1}$ (or $\sim0.002\, \text{M}_\odot\text{ yr}^{-1}$). The AGN contribution to the total H$α$ luminosity is estimated. We find that no AGN should be expected below a luminosity of $\log_{10}L=38.6$ erg s$^{-1}$. From the sample of non-AGN (presumably, pure SFG) at $z\sim0.40$ we estimated a star formation rate density of $ρ_{\rm SFR}=0.012\pm0.005\ {\rm \text{M}_{\odot}\ yr^{-1}\ Mpc^{-3}}$.
Submitted 6 February, 2020;
originally announced February 2020.
-
The OTELO survey. I. Description, data reduction, and multi-wavelength catalogue
Authors:
Ángel Bongiovanni,
Marina Ramón-Pérez,
Ana María Pérez García,
Jordi Cepa,
Miguel Cerviño,
Jakub Nadolny,
Ricardo Pérez Martínez,
Emilio J. Alfaro Navarro,
Héctor O. Castañeda,
José Antonio de Diego,
Alessandro Ederoclite,
Mirian Fernández-Lorenzo,
Jesús Gallego,
J. Jesús González,
J. Ignacio González-Serrano,
Maritza A. Lara-López,
Iván Oteo Gómez,
Carmen P. Padilla Torres,
Irene Pintos-Castro,
Mirjana Pović,
Miguel Sánchez-Portal,
D. Heath Jones,
Joss Bland-Hawthorn,
Antonio Cabrera-Lavers
Abstract:
The evolution of galaxies through cosmic time is studied observationally by means of extragalactic surveys. The OTELO survey aims to provide the deepest narrow-band survey to date in terms of minimum detectable flux and emission line equivalent width in order to detect the faintest extragalactic emission line systems. In this way, OTELO data will complement other broad-band, narrow-band, and spectroscopic surveys. The red tunable filter of the OSIRIS instrument on the 10.4 m Gran Telescopio Canarias (GTC) is used to scan a spectral window centred at $9175 Å$, which is free from strong sky emission lines, with a sampling interval of $6 Å$ and a bandwidth of $12 Å$ in the most deeply explored Extended Groth Strip region. Careful data reduction using improved techniques for sky ring subtraction, accurate astrometry, photometric calibration, and source extraction enables us to compile the OTELO catalogue. This catalogue is complemented with ancillary data ranging from deep X-ray to far-infrared, including high resolution HST images, which allow us to segregate the different types of targets, derive precise photometric redshifts, and obtain the morphological classification of the extragalactic objects detected. The OTELO multi-wavelength catalogue contains 11237 entries and is 50\% complete at AB magnitude 26.38. Of these sources, 6600 have photometric redshifts with an uncertainty $z_{phot}$ better than $0.2 (1+z_{phot})$. A total of 4336 of these sources correspond to preliminary emission line candidates, which are complemented by 81 candidate stars and 483 sources that qualify as absorption line systems. The OTELO survey products were released to the public in 2019.
Submitted 7 February, 2020; v1 submitted 30 January, 2020;
originally announced January 2020.
-
A Catalog of M-dwarf Flares with ASAS-SN
Authors:
Romy Rodríguez Martínez,
Laura A. Lopez,
Benjamin J. Shappee,
Sarah J. Schmidt,
Tharindu Jayasinghe,
Christopher S. Kochanek,
Katie Auchettl,
Thomas W.-S. Holoien
Abstract:
We analyzed the light curves of 1376 early-to-late, nearby M dwarfs to search for white-light flares using photometry from the All-Sky Automated Survey for Supernovae (ASAS-SN). We identified 480 M dwarfs with at least one potential flare employing a simple statistical algorithm that searches for sudden increases in $V$-band flux. After more detailed evaluation, we identified 62 individual flares on 62 stars. The event amplitudes span $0.12 < ΔV < 2.04$ mag. Using classical-flare models, we place lower limits on the flare energies and obtain $V$-band energies spanning $2.0\times10^{30} \lesssim E_{V} \lesssim 6.9\times10^{35}$ erg. The fraction of flaring stars increases with spectral type, and most flaring stars show moderate to strong H$α$ emission. Additionally, we find that 14 of the 62 flaring stars are rotational variables, and they have shorter rotation periods and stronger H$α$ emission than non-flaring rotational variable M dwarfs.
Submitted 11 December, 2019;
originally announced December 2019.
-
KELT-25b and KELT-26b: A Hot Jupiter and a Substellar Companion Transiting Young A-stars Observed by TESS
Authors:
Romy Rodríguez Martínez,
B. Scott Gaudi,
Joseph E. Rodriguez,
George Zhou,
Jonathan Labadie-Bartz,
Samuel N. Quinn,
Kaloyan Minev Penev,
Thiam-Guan Tan,
David W. Latham,
Leonardo A. Paredes,
John Kielkopf,
Brett C. Addison,
Duncan J. Wright,
Johanna K. Teske,
Steve B. Howell,
David R. Ciardi,
Carl Ziegler,
Keivan G. Stassun,
Marshall C. Johnson,
Jason D. Eastman,
Robert J. Siverd,
Thomas G. Beatty,
Luke G. Bouma,
Joshua Pepper,
Michael B. Lund
, et al. (67 additional authors not shown)
Abstract:
We present the discoveries of KELT-25b (TIC 65412605, TOI-626.01) and KELT-26b (TIC 160708862, TOI-1337.01), two transiting companions orbiting relatively bright, early A-stars. The transit signals were initially detected by the KELT survey, and subsequently confirmed by \textit{TESS} photometry. KELT-25b is on a 4.40-day orbit around the V = 9.66 star CD-24 5016 ($T_{\rm eff} = 8280^{+440}_{-180}$ K, $M_{\star}$ = $2.18^{+0.12}_{-0.11}$ $M_{\odot}$), while KELT-26b is on a 3.34-day orbit around the V = 9.95 star HD 134004 ($T_{\rm eff}$ =$8640^{+500}_{-240}$ K, $M_{\star}$ = $1.93^{+0.14}_{-0.16}$ $M_{\odot}$), which is likely an Am star. We have confirmed the sub-stellar nature of both companions through detailed characterization of each system using ground-based and \textit{TESS} photometry, radial velocity measurements, Doppler Tomography, and high-resolution imaging. For KELT-25, we determine a companion radius of $R_{\rm P}$ = $1.64^{+0.039}_{-0.043}$ $R_{\rm J}$, and a 3-sigma upper limit on the companion's mass of $\sim64~M_{\rm J}$. For KELT-26b, we infer a planetary mass and radius of $M_{\rm P}$ = $1.41^{+0.43}_{-0.51}$ $M_{\rm J}$ and $R_{\rm P}$ = $1.940^{+0.060}_{-0.058}$ $R_{\rm J}$. From Doppler Tomographic observations, we find KELT-26b to reside in a highly misaligned orbit. This conclusion is weakly corroborated by a subtle asymmetry in the transit light curve from the \textit{TESS} data. KELT-25b appears to be in a well-aligned, prograde orbit, and the system is likely a member of a cluster or moving group.
Submitted 2 December, 2019;
originally announced December 2019.
-
DeStress: Deep Learning for Unsupervised Identification of Mental Stress in Firefighters from Heart-rate Variability (HRV) Data
Authors:
Ali Oskooei,
Sophie Mai Chau,
Jonas Weiss,
Arvind Sridhar,
María Rodríguez Martínez,
Bruno Michel
Abstract:
In this work we study various unsupervised methods to identify mental stress in firefighter trainees based on unlabeled heart rate variability data. We collect RR interval time series data from nearly 100 firefighter trainees who participated in a drill. We explore and compare three methods to perform unsupervised stress detection: 1) traditional K-Means clustering with engineered time and frequency domain features, 2) convolutional autoencoders, and 3) long short-term memory (LSTM) autoencoders, both trained on the raw RRI measurements combined with DBSCAN clustering and K-Nearest-Neighbors classification. We demonstrate that K-Means combined with engineered features is unable to capture meaningful structure within the data. On the other hand, convolutional and LSTM autoencoders tend to extract varying structure from the data, pointing to different clusters of different sizes. We attempt to identify the true stressed and normal clusters using the HRV markers of mental stress reported in the literature. We demonstrate that the clusters produced by the convolutional autoencoders consistently and successfully stratify stressed versus normal samples, as validated by several established physiological stress markers such as RMSSD, Max-HR, Mean-HR and LF-HF ratio.
Submitted 18 November, 2019;
originally announced November 2019.
-
Trapping flocking particles with asymmetric obstacles
Authors:
Raul Martinez,
Francisco Alarcon,
Juan Luis Aragones,
Chantal Valeriani
Abstract:
Asymmetric obstacles can be exploited to direct the motion and induce sorting of run-and-tumbling particles. In this work, we show that flocking particles which follow the Vicsek model aligning rules experience collective trapping in the presence of a wall of funnels made of chevrons, concentrating on the opposite side of the wall of funnels from run-and-tumbling particles. Flocking particles can be completely trapped or exhibit a dynamical trapping behaviour; these two regimes open the door to the design of a system with two perpendicular flows of active particles. This systematic study broadens our understanding of the emergence of collective motion of microorganisms in confined environments and can guide the design of new microfluidic devices able to control these collective behaviours.
Submitted 29 November, 2019;
originally announced November 2019.
-
Collective behavior of Vicsek particles without and with obstacles
Authors:
Raul Martinez,
Francisco Alarcon,
Diego Rogel Rodriguez,
Juan Luis Aragones,
Chantal Valeriani
Abstract:
In our work we have studied a two-dimensional suspension of finite-size Vicsek hard-disks, whose time evolution follows event-driven dynamics between subsequent time steps. Having compared its collective behaviour with the one expected for a system of scalar Vicsek point-like particles, we have analysed the effect of considering two possible bouncing rules between the disks: a Vicsek-like rule and a pseudo-elastic one, focusing on the order-disorder transition. Next, we have added to the two-dimensional suspension of hard-disk Vicsek particles disk-like passive obstacles of two types: either fixed in space or moving according to the same event-driven dynamics. We have performed a detailed analysis of the particles' collective behaviour observed for both fixed and moving obstacles. In the fixed obstacles case, we have observed formation of clusters at low noise, in agreement with previous studies. When using moving passive obstacles, we found that the order of active particles is destroyed more effectively as the drag of obstacles increases. In the no-drag limit an interesting result was found: introduction of low-drag passive particles can lead in some cases to a more ordered state of active flocking particles than what they show in bulk.
Submitted 23 November, 2019;
originally announced November 2019.
-
Automatically Neutralizing Subjective Bias in Text
Authors:
Reid Pryzant,
Richard Diehl Martinez,
Nathan Dass,
Sadao Kurohashi,
Dan Jurafsky,
Diyi Yang
Abstract:
Texts like news, encyclopedias, and some social media strive for objectivity. Yet bias in the form of inappropriate subjectivity - introducing attitudes via framing, presupposing truth, and casting doubt - remains ubiquitous. This kind of bias erodes our collective trust and fuels social conflict. To address this issue, we introduce a novel testbed for natural language generation: automatically bringing inappropriately subjective text into a neutral point of view ("neutralizing" biased text). We also offer the first parallel corpus of biased language. The corpus contains 180,000 sentence pairs and originates from Wikipedia edits that removed various framings, presuppositions, and attitudes from biased sentences. Last, we propose two strong encoder-decoder baselines for the task. A straightforward yet opaque CONCURRENT system uses a BERT encoder to identify subjective words as part of the generation process. An interpretable and controllable MODULAR algorithm separates these steps, using (1) a BERT-based classifier to identify problematic words and (2) a novel join embedding through which the classifier can edit the hidden states of the encoder. Large-scale human evaluation across four domains (encyclopedias, news headlines, books, and political speeches) suggests that these algorithms are a first step towards the automatic identification and reduction of bias.
Submitted 12 December, 2019; v1 submitted 21 November, 2019;
originally announced November 2019.
-
From Peccei Quinn symmetry to mass hierarchy problem
Authors:
Y. A. Garnica,
S. F. Mantilla,
R. Martinez,
H. Vargas
Abstract:
We propose a non-universal $\mathrm{U}(1)_{X}$ gauge extension to the Standard Model (SM) and an additional Peccei-Quinn (PQ) global symmetry to study the mass hierarchy and strong CP problem. The scheme allows us to distinguish among fermion families and to generate the fermionic mass spectrum of particles of the SM. The symmetry breaking is performed by two scalar Higgs doublets and two scalar Higgs singlets, one of which contains the axion, which turns out to be a candidate for Cold Dark Matter. The exotic sector is composed of one up-like $T$ and two down-like $J^{1,2}$ heavy quarks, two heavy charged leptons $E,\mathcal{E}$, one additional right-handed neutrino per family $ν_{R}^{e,μ,τ}$, and an invisible axion $a$. In addition, the large energy scale associated with the breaking of the PQ symmetry gives masses to the right-handed neutrinos in such a way that the active neutrinos acquire eV-mass values due to the see-saw mechanism. On the other hand, from the non-linear effective Lagrangian, the flavour-changing couplings of the down quarks and charged leptons with the axion are considered.
Submitted 15 May, 2021; v1 submitted 13 November, 2019;
originally announced November 2019.
-
MonoNet: Towards Interpretable Models by Learning Monotonic Features
Authors:
An-phi Nguyen,
María Rodríguez Martínez
Abstract:
Being able to interpret, or explain, the predictions made by a machine learning model is of fundamental importance. This is especially true when there is interest in deploying data-driven models to make high-stakes decisions, e.g. in healthcare. While recent years have seen an increasing interest in interpretable machine learning research, this field is currently lacking an agreed-upon definition of interpretability, and some researchers have called for a more active conversation towards a rigorous approach to interpretability. Joining this conversation, we claim in this paper that the difficulty of interpreting a complex model stems from the existing interactions among features. We argue that by enforcing monotonicity between features and outputs, we are able to reason about the effect of a single feature on an output independently from other features, and consequently better understand the model. We show how to structurally introduce this constraint in deep learning models by adding new simple layers. We validate our model on benchmark datasets, and compare our results with previously proposed interpretable models.
Submitted 30 September, 2019;
originally announced September 2019.
-
Boundedness of Fatou components of the family f(z)=λsin(z)+a
Authors:
F. R. Martinez,
G. Sienra
Abstract:
In this paper we discuss the boundedness of the Fatou components for the sine family and the extended sine family, mainly when the parameter λ has modulus greater than 1 and the map is post-critically bounded.
Submitted 23 October, 2019; v1 submitted 28 September, 2019;
originally announced September 2019.
-
A $U(1)_{X}$ extension to the SM with three families and Peccei Quinn symmetry
Authors:
Y. A. Garnica,
R. Martinez
Abstract:
We propose a non-universal $U(1)_{X}$ extension to the Standard Model with three families and an additional global anomalous Peccei-Quinn (PQ) symmetry. The breaking of the former allows us to give masses to the exotic fermionic sector, and the latter generates the necessary zeros in the mass matrices to explain the fermionic mass hierarchy. In addition, the large energy scale associated with the spontaneous symmetry breaking (SSB) of the PQ symmetry provides a solution to the strong CP problem and an axion that could be a dark matter candidate. The SSB also generates right-handed neutrino masses, so the active neutrinos acquire eV-scale masses through the see-saw mechanism.
Submitted 25 September, 2019;
originally announced September 2019.
-
PaccMann$^{RL}$: Designing anticancer drugs from transcriptomic data via reinforcement learning
Authors:
Jannis Born,
Matteo Manica,
Ali Oskooei,
Joris Cadow,
Karsten Borgwardt,
María Rodríguez Martínez
Abstract:
With the advent of deep generative models in computational chemistry, in silico anticancer drug design has undergone an unprecedented transformation. While state-of-the-art deep learning approaches have shown potential in generating compounds with desired chemical properties, they disregard the genetic profile and properties of the target disease. Here, we introduce the first generative model capable of tailoring anticancer compounds for a specific biomolecular profile. Using a reinforcement learning (RL) framework, the transcriptomic profiles of cancer cells are used as a context for the generation of candidate molecules. Our molecule generator combines two separately pretrained variational autoencoders (VAEs): the first VAE encodes transcriptomic profiles into a smooth latent space, which in turn is used to condition a second VAE to generate novel molecular structures for the given transcriptomic profile. The generative process is optimized through PaccMann, a previously developed drug sensitivity prediction model, to obtain effective anticancer compounds for the given context (i.e., transcriptomic profile). We demonstrate how the molecule generation can be biased towards compounds with high predicted inhibitory effect against individual cell lines or specific cancer sites. We verify our approach by investigating candidate drugs generated against specific cancer types and find the highest structural similarity to existing compounds with known efficacy against these cancer types. We envision that our approach will transform in silico anticancer drug design by leveraging the biomolecular characteristics of the disease in order to increase success rates in lead compound discovery.
Submitted 16 April, 2020; v1 submitted 29 August, 2019;
originally announced September 2019.
-
A $U(1)_X$ extension to the MSSM with three families
Authors:
J. S. Alvarado,
Carlos E. Diaz,
R. Martinez
Abstract:
We propose a supersymmetric extension of the anomaly-free, three-family nonuniversal $U(1)$ model, with the inclusion of four Higgs doublets and four Higgs singlets. The quark sector is extended by adding three exotic quark singlets, while the lepton sector includes two exotic charged lepton singlets, three right-handed neutrinos and three sterile Majorana neutrinos to obtain the fermionic mass spectrum. By implementing an additional $\mathbb{Z}_2$ symmetry, the Yukawa coupling terms are arranged in such a way that the fermion mass hierarchy is obtained without fine-tuning. The effective mass matrix for SM neutrinos is fitted to current neutrino oscillation data to check the consistency of the model with experimental evidence; the normal-ordering scheme is found to be preferred over the inverted one. The electron and the up, down and strange quarks are massless at tree level, but they acquire masses through one-loop radiative corrections from slepton and Higgsino contributions. We show that the model predicts an SM-like Higgs mass at the electroweak scale, using the VEVs consistent with the symmetry breaking and the fermion masses.
Submitted 18 September, 2019; v1 submitted 6 September, 2019;
originally announced September 2019.
-
Data-Driven Modelling of the Van Allen Belts: The 5DRBM Model for Trapped Electrons
Authors:
Lionel Métrailler,
Guillaume Bélanger,
Peter Kretschmar,
Erik Kuulkers,
Ricardo Pérez Martínez,
Jan-Uwe Ness,
Pedro Rodriguez,
Mauro Casale,
Jorge Fauste,
Timothy Finn,
Celia Sanchez,
Thomas Godard,
Richard Southworth
Abstract:
The magnetosphere sustained by the rotation of the Earth's liquid iron core traps charged particles, mostly electrons and protons, into structures referred to as the Van Allen belts. These radiation belts, in which the density of charged energetic particles can be very destructive for sensitive instrumentation, have to be crossed on every orbit of satellites traveling in elliptical orbits around the Earth, as is the case for ESA's INTEGRAL and XMM-Newton missions. This paper presents the first working version of the 5DRBM-e model, a global, data-driven model of the radiation belts for trapped electrons. The model is based on in-situ measurements of electrons by the radiation monitors on board the INTEGRAL and XMM-Newton satellites along their long elliptical orbits for respectively 16 and 19 years of operations. This model, in its present form, features the integral flux for trapped electrons within energies ranging from 0.7 to 1.75 MeV. Cross-validation of the 5DRBM-e with the well-known AE8min/max and AE9mean models for a low eccentricity GPS orbit shows excellent agreement, and demonstrates that the new model can be used to provide reliable predictions along widely different orbits around Earth for the purpose of designing, planning, and operating satellites with more accurate instrument safety margins. Future work will include extending the model based on electrons of different energies and proton radiation measurement data.
Submitted 25 July, 2019;
originally announced July 2019.
-
Dark matter in Inert Doublet Model with one scalar singlet and $U(1)_X$ gauge symmetry
Authors:
M. A. Arroyo-Ureña,
R. Gaitan,
R. Martinez,
J. H. Montes de Oca Yemha
Abstract:
We study Dark Matter (DM) abundance in the framework of the extension of the Standard Model (SM) with an additional $U(1)_X$ gauge symmetry. One complex singlet is included to break the $U(1)_X$ gauge symmetry, while one of the doublets is considered inert to introduce a DM candidate. The stability of the DM candidate is analyzed with a continuous $U(1)_X$ gauge symmetry as well as a discrete $Z_2$ symmetry. We find allowed regions for the free model parameters which are in agreement with the most up-to-date experimental results reported by the CMS and ATLAS collaborations, the upper limit on the WIMP-nucleon cross section imposed by the XENON1T collaboration and the upper limit on the production cross-section of a $Z^{\prime}$ gauge boson times the branching ratio of the $Z^{\prime}$ boson decaying into $\ell^-\ell^+$. We also obtain allowed regions for the DM candidate mass from the relic density reported by the PLANCK collaboration, including light, intermediate and heavy masses, depending mainly on two parameters of the scalar potential, $λ_{2x}$ and $λ_{345}=λ_3+λ_4+2λ_5$. We find that through $pp\rightarrow χχγ$ production, it may only be possible for a future hadron-hadron Circular Collider (FCC-hh) to detect a DM candidate within the mass range 10-60 GeV.
Submitted 4 August, 2020; v1 submitted 18 July, 2019;
originally announced July 2019.
-
Searching for Wide Companions and Identifying Circum(sub)stellar Disks through PSF-Fitting of Spitzer/IRAC Archival Images
Authors:
Raquel A. Martinez,
Adam L. Kraus
Abstract:
Direct imaging surveys have discovered wide-orbit planetary-mass companions that challenge existing models of both star and planet formation, but their demographics remain poorly sampled. We have developed an automated binary companion point spread function (PSF) fitting pipeline to take advantage of Spitzer's infrared sensitivity to planetary-mass objects and circum(sub)stellar disks, measuring photometry across the four IRAC channels of 3.6 $μ$m, 4.5 $μ$m, 5.8 $μ$m, and 8.0 $μ$m. We present PSF-fitting photometry of archival Spitzer/IRAC images for 11 young, low-mass ($M\sim0.044$-0.88 $M_{\odot}$; M7.5-K3.5) members of three nearby star-forming regions (Chamaeleon, Taurus, and Upper Scorpius; $d\sim$ 150 pc; $τ\sim$ 1-10 Myr) that host confirmed or candidate faint companions at $ρ= 1.68^{\prime\prime}-7.31^{\prime\prime}$. We recover all system primaries, six confirmed, and two candidate low-mass companions in our sample. We also measure non-photospheric $[3.6]-[8.0]$ colors for three of the system primaries, four of the confirmed companions, and one candidate companion, signifying the presence of circumstellar or circum(sub)stellar disks. We furthermore report the confirmation of a $ρ=4.66^{\prime\prime}$ (540 au) companion to [SCH06] J0359+2009, which was previously identified as a candidate via imaging over five years ago but was not studied further. Based on its brightness ($M_{[3.6]}=8.53$ mag), we infer the companion mass to be $M=20\pm5$ $M_\mathrm{Jup}$ given the primary's model-derived age of 10 Myr. Our framework is sensitive to companions with masses less than 10 $M_\mathrm{Jup}$ at separations of $ρ= 300$ au in nearby star-forming regions, opening up a new regime of parameter space that has yet to be studied in detail, discovering planetary-mass companions in their birth environments and revealing their circum(sub)stellar disks.
Submitted 15 July, 2019;
originally announced July 2019.
-
Un Modelo Ontológico para el Gobierno Electrónico
Authors:
Carlos Roberto Brys,
José F. Aldana-Montes,
David Luis La Red Martínez
Abstract:
Decision making often requires information that must be provided in a rich data format. Addressing these new requirements appropriately requires government agencies to orchestrate large amounts of information from different sources and formats and to deliver it efficiently through the devices commonly used by people, such as computers, netbooks, tablets and smartphones. To overcome these problems, a model is proposed for the conceptual representation of the State's organizational units, seen as georeferenced entities of Electronic Government, based on ontologies designed under the principles of Linked Open Data. The model allows the automatic extraction of information by machines, which supports the process of governmental decision making and gives citizens full access to find and process information through mobile technologies.
Submitted 4 July, 2019;
originally announced July 2019.
-
TESS Hunt for Young and Maturing Exoplanets (THYME): A planet in the 45 Myr Tucana-Horologium association
Authors:
Elisabeth R. Newton,
Andrew W. Mann,
Benjamin M. Tofflemire,
Logan Pearce,
Aaron C. Rizzuto,
Andrew Vanderburg,
Raquel A. Martinez,
Jason J. Wang,
Jean-Baptiste Ruffio,
Adam L. Kraus,
Marshall C. Johnson,
Pa Chia Thao,
Mackenna L. Wood,
Rayna Rampalli,
Eric L. Nielsen,
Karen A. Collins,
Diana Dragomir,
Coel Hellier,
D. R. Anderson,
Thomas Barclay,
Carolyn Brown,
Gregory Feiden,
Rhodes Hart,
Giovanni Isopi,
John F. Kielkopf
, et al. (27 additional authors not shown)
Abstract:
Young exoplanets are snapshots of the planetary evolution process. Planets that orbit stars in young associations are particularly important because the age of the planetary system is well constrained. We present the discovery of a transiting planet larger than Neptune but smaller than Saturn in the 45 Myr Tucana-Horologium young moving group. The host star is a visual binary, and our follow-up observations demonstrate that the planet orbits the G6V primary component, DS Tuc A (HD 222259A, TIC 410214986). We first identified transits using photometry from the Transiting Exoplanet Survey Satellite (TESS; alerted as TOI 200.01). We validated the planet and improved the stellar parameters using a suite of new and archival data, including spectra from SOAR/Goodman, SALT/HRS and LCO/NRES; transit photometry from Spitzer; and deep adaptive optics imaging from Gemini/GPI. No additional stellar or planetary signals are seen in the data. We measured the planetary parameters by simultaneously modeling the photometry with a transit model and a Gaussian process to account for stellar variability. We determined that the planetary radius is $5.70\pm0.17$ Earth radii and that the orbital period is 8.1 days. The inclination angles of the host star's spin axis, the planet's orbital axis, and the visual binary's orbital axis are aligned within 15 degrees to within the uncertainties of the relevant data. DS Tuc Ab is bright enough (V=8.5) for detailed characterization using radial velocities and transmission spectroscopy.
Submitted 25 June, 2019;
originally announced June 2019.
-
A system for the 2019 Sentiment, Emotion and Cognitive State Task of DARPAs LORELEI project
Authors:
Victor R Martinez,
Anil Ramakrishna,
Ming-Chang Chiu,
Karan Singla,
Shrikanth Narayanan
Abstract:
During the course of a Humanitarian Assistance-Disaster Relief (HADR) crisis, which can happen anywhere in the world, real-time information is often posted online by the people in need of help; this information can, in turn, be used by the different stakeholders involved in managing the crisis. Automated processing of such posts can considerably improve the effectiveness of such efforts; for example, understanding the aggregated emotion from affected populations in specific areas may help inform decision-makers on how to best allocate resources for an effective disaster response. However, these efforts may be severely limited by the availability of resources for the local language. The ongoing DARPA project Low Resource Languages for Emergent Incidents (LORELEI) aims to further language processing technologies for low resource languages in the context of such a humanitarian crisis. In this work, we describe our submission for the 2019 Sentiment, Emotion and Cognitive state (SEC) pilot task of the LORELEI project. We describe a collection of sentiment analysis systems included in our submission along with the features extracted. Our fielded systems obtained the best results in both English and Spanish language evaluations of the SEC pilot task.
Submitted 1 May, 2019;
originally announced May 2019.
-
The meteorite flux of the last 2 Myr recorded in the Atacama desert
Authors:
A. Drouard,
J. Gattacceca,
A. Hutzler,
P. Rochette,
R. Braucher,
D. Bourlès,
ASTER Team,
M. Gounelle,
A. Morbidelli,
V. Debaille,
M. Van Ginneken,
M. Valenzuela,
Y. Quesnel,
R. Martinez
Abstract:
The evolution of the meteorite flux to the Earth can be studied by determining the terrestrial ages of meteorites collected in hot deserts. We have measured the terrestrial ages of 54 stony meteorites from the El Médano area, in the Atacama Desert, using the cosmogenic nuclide chlorine-36. With an average age of 710 ka, this collection is the oldest collection of non-fossil meteorites at the Earth's surface. This allows both determining the average meteorite flux intensity over the last 2 Myr (222 meteorites larger than 10 g per km$^2$ per Myr) and discussing its possible compositional variability over the Quaternary period. A change in the flux composition, with more abundant H chondrites, occurred between 0.5 and 1 Ma, possibly due to the direct delivery to Earth of a meteoroid swarm from the asteroid belt.
Submitted 29 April, 2019;
originally announced April 2019.
-
Towards Explainable Anticancer Compound Sensitivity Prediction via Multimodal Attention-based Convolutional Encoders
Authors:
Matteo Manica,
Ali Oskooei,
Jannis Born,
Vigneshwari Subramanian,
Julio Sáez-Rodríguez,
María Rodríguez Martínez
Abstract:
In line with recent advances in neural drug design and sensitivity prediction, we propose a novel architecture for interpretable prediction of anticancer compound sensitivity using a multimodal attention-based convolutional encoder. Our model is based on the three key pillars of drug sensitivity: compounds' structure in the form of a SMILES sequence, gene expression profiles of tumors and prior knowledge on intracellular interactions from protein-protein interaction networks. We demonstrate that our multiscale convolutional attention-based (MCA) encoder significantly outperforms a baseline model trained on Morgan fingerprints, a selection of encoders based on SMILES, as well as the previously reported state of the art for multimodal drug sensitivity prediction ($R^2$ = 0.86 and RMSE = 0.89). Moreover, the explainability of our approach is demonstrated by a thorough analysis of the attention weights. We show that the attended genes significantly enrich apoptotic processes and that the drug attention is strongly correlated with a standard chemical structure similarity index. Finally, we report a case study of two receptor tyrosine kinase (RTK) inhibitors acting on a leukemia cell line, showcasing the ability of the model to focus on informative genes and submolecular regions of the two compounds. The demonstrated generalizability and interpretability of our model testify to its potential for in-silico prediction of anticancer compound efficacy on unseen cancer cells, positioning it as a valid solution for the development of personalized therapies as well as for the evaluation of candidate compounds in de novo drug design.
Submitted 14 July, 2019; v1 submitted 25 April, 2019;
originally announced April 2019.
-
edGNN: a Simple and Powerful GNN for Directed Labeled Graphs
Authors:
Guillaume Jaume,
An-phi Nguyen,
María Rodríguez Martínez,
Jean-Philippe Thiran,
Maria Gabrani
Abstract:
The ability of a graph neural network (GNN) to leverage both the graph topology and graph labels is fundamental to building discriminative node and graph embeddings. Building on previous work, we theoretically show that edGNN, our model for directed labeled graphs, is as powerful as the Weisfeiler-Lehman algorithm for graph isomorphism. Our experiments support our theoretical findings, confirming that graph neural networks can be used effectively for inference problems on directed graphs with both node and edge labels. Code available at https://github.com/guillaumejaume/edGNN.
Submitted 4 December, 2019; v1 submitted 18 April, 2019;
originally announced April 2019.
-
CP symmetry violation in the scalar sector of 331 models
Authors:
Camilo A. Rojas,
F. Ochoa,
R. Martinez
Abstract:
To understand CP-violation scenarios in the scalar sector, we consider a 331 model, whose main property is the incorporation of a local SU(3) group symmetry in the electroweak sector, with a particular choice of the free parameter. CP-violation scenarios were obtained by introducing a discrete symmetry in the scalar triplets, which exhibit spontaneous CP violation with just one independent CP phase. The rotations to the mass eigenstates were obtained.
Submitted 11 April, 2019;
originally announced April 2019.
-
The Polarimetric and Helioseismic Imager on Solar Orbiter
Authors:
S. K. Solanki,
J. C. del Toro Iniesta,
J. Woch,
A. Gandorfer,
J. Hirzberger,
A. Alvarez-Herrero,
T. Appourchaux,
V. Martínez Pillet,
I. Pérez-Grande,
E. Sanchis Kilders,
W. Schmidt,
J. M. Gómez Cama,
H. Michalik,
W. Deutsch,
G. Fernandez-Rico,
B. Grauf,
L. Gizon,
K. Heerlein,
M. Kolleck,
A. Lagg,
R. Meller,
R. Müller,
U. Schühle,
J. Staub,
K. Albert
, et al. (99 additional authors not shown)
Abstract:
This paper describes the Polarimetric and Helioseismic Imager on the Solar Orbiter mission (SO/PHI), the first magnetograph and helioseismology instrument to observe the Sun from outside the Sun-Earth line. It is the key instrument meant to address the top-level science question: How does the solar dynamo work and drive connections between the Sun and the heliosphere? SO/PHI will also play an important role in answering the other top-level science questions of Solar Orbiter, as well as hosting the potential of a rich return in further science.
SO/PHI measures the Zeeman effect and the Doppler shift in the Fe I 617.3 nm spectral line. To this end, the instrument carries out narrow-band imaging spectro-polarimetry using a tunable LiNbO$_3$ Fabry-Perot etalon, while the polarisation modulation is done with liquid crystal variable retarders (LCVRs). The line and the nearby continuum are sampled at six wavelength points and the data are recorded by a 2k × 2k CMOS detector. To save valuable telemetry, the raw data are reduced on board, including being inverted under the assumption of a Milne-Eddington atmosphere, although simpler reduction methods are also available on board. SO/PHI is composed of two telescopes; one, the Full Disc Telescope (FDT), covers the full solar disc at all phases of the orbit, while the other, the High Resolution Telescope (HRT), can resolve structures as small as 200 km on the Sun at closest perihelion. The high heat load generated through proximity to the Sun is greatly reduced by the multilayer-coated entrance windows to the two telescopes that allow less than 4% of the total sunlight to enter the instrument, most of it in a narrow wavelength band around the chosen spectral line.
Submitted 26 March, 2019;
originally announced March 2019.
-
Data Assimilation in Large-Prandtl Rayleigh-Bénard Convection from Thermal Measurements
Authors:
A. Farhat,
N. E. Glatt-Holtz,
V. R. Martinez,
S. A. McQuarrie,
J. P. Whitehead
Abstract:
This work applies a continuous data assimilation scheme---a particular framework for reconciling sparse and potentially noisy observations to a mathematical model---to Rayleigh-Bénard convection at infinite or large Prandtl numbers using only the temperature field as observables. These Prandtl numbers are applicable to the earth's mantle and to gases under high pressure. We rigorously identify conditions that guarantee synchronization between the observed system and the model, then confirm the applicability of these results via numerical simulations. Our numerical experiments show that the analytically derived conditions for synchronization are far from sharp; that is, synchronization often occurs even when the conditions of our theorems are not met. We also develop estimates on the convergence of an infinite Prandtl model to a large (but finite) Prandtl number generated set of observations. Numerical simulations in this hybrid setting indicate that the mathematically rigorous results are accurate, but of practical interest only for extremely large Prandtl numbers.
Submitted 4 March, 2019;
originally announced March 2019.
-
A non-universal $U(1)_{X}$ gauge extension to the MSSM
Authors:
J. S. Alvarado,
Carlos E. Diaz,
R. Martinez
Abstract:
We propose a supersymmetric extension of the anomaly-free, three-family nonuniversal $U(1)$ model, with the inclusion of four Higgs doublets and four Higgs singlets. The quark sector is extended by adding three exotic quark singlets, while the lepton sector includes two exotic charged lepton singlets, three right-handed neutrinos and three sterile Majorana neutrinos to obtain the fermionic mass spectrum. By implementing an additional $\mathbb{Z}_2$ symmetry, the Yukawa coupling terms are arranged in such a way that the fermion mass hierarchy is obtained without fine-tuning. The effective mass matrix for SM neutrinos is fitted to current neutrino oscillation data to check the consistency of the model with experimental evidence; the normal-ordering scheme is found to be preferred over the inverted one. The electron and the up, down and strange quarks are massless at tree level, but they acquire masses through one-loop radiative corrections from slepton and Higgsino contributions. We show that the model predicts an SM-like Higgs mass at the electroweak scale, using the VEVs consistent with the symmetry breaking and the fermion masses.
Submitted 29 July, 2019; v1 submitted 22 February, 2019;
originally announced February 2019.
-
Inference of the three-dimensional chromatin structure and its temporal behavior
Authors:
Bianca-Cristina Cristescu,
Zalán Borsos,
John Lygeros,
María Rodríguez Martínez,
Maria Anna Rapsomaniki
Abstract:
Understanding the three-dimensional (3D) structure of the genome is essential for elucidating vital biological processes and their links to human disease. To determine how the genome folds within the nucleus, chromosome conformation capture methods such as HiC have recently been employed. However, computational methods that exploit the resulting high-throughput, high-resolution data are still suffering from important limitations. In this work, we explore the idea of manifold learning for the 3D chromatin structure inference and present a novel method, REcurrent Autoencoders for CHromatin 3D structure prediction (REACH-3D). Our framework employs autoencoders with recurrent neural units to reconstruct the chromatin structure. In comparison to existing methods, REACH-3D makes no transfer function assumption and permits dynamic analysis. Evaluating REACH-3D on synthetic data indicated high agreement with the ground truth. When tested on real experimental HiC data, REACH-3D recovered most faithfully the expected biological properties and obtained the highest correlation coefficient with microscopy measurements. Last, REACH-3D was applied to dynamic HiC data, where it successfully modeled chromatin conformation during the cell cycle.
Submitted 22 November, 2018;
originally announced November 2018.
-
How to Constrain Your M dwarf II: the mass-luminosity-metallicity relation from 0.075 to 0.70$M_\odot$
Authors:
Andrew W. Mann,
Trent Dupuy,
Adam L. Kraus,
Eric Gaidos,
Megan Ansdell,
Michael Ireland,
Aaron C. Rizzuto,
Chao-Ling Hung,
Jason Dittmann,
Samuel Factor,
Gregory Feiden,
Raquel A. Martinez,
Dary Ruiz-Rodriguez,
Pa Chia Thao
Abstract:
The mass-luminosity relation for late-type stars has long been a critical tool for estimating stellar masses. However, there is growing need for both a higher-precision relation and a better understanding of systematic effects (e.g., metallicity). Here we present an empirical relationship between Mks and mass spanning $0.075M_\odot<M<0.70M_\odot$. The relation is derived from 62 nearby binaries, whose orbits we determine using a combination of Keck/NIRC2 imaging, archival adaptive optics data, and literature astrometry. From their orbital parameters, we determine the total mass of each system, with a precision better than 1% in the best cases. We use these total masses, in combination with resolved Ks magnitudes and system parallaxes, to calibrate the mass-Mks relation. The result can be used to determine masses of single stars with a precision of 2-3%, which we confirm by a comparison to dynamical masses from the literature. The precision is limited by scatter around the best-fit relation beyond mass uncertainties, perhaps driven by intrinsic variation in the mass-Mks relation or underestimated measurement errors. We find the effect of [Fe/H] on the mass-Mks relation is likely negligible for metallicities in the Solar neighborhood (0.0+/-2.2% change in mass per dex change in [Fe/H]). This weak effect is consistent with predictions from the Dartmouth Stellar Evolution Database, but inconsistent with those from MESA Isochrones and Stellar Tracks. A sample of binaries with a wider range of abundances will be required to discern the importance of metallicity in extreme populations (e.g., in the Galactic Halo or thick disk).
Submitted 26 January, 2019; v1 submitted 16 November, 2018;
originally announced November 2018.
-
PaccMann: Prediction of anticancer compound sensitivity with multi-modal attention-based neural networks
Authors:
Ali Oskooei,
Jannis Born,
Matteo Manica,
Vigneshwari Subramanian,
Julio Sáez-Rodríguez,
María Rodríguez Martínez
Abstract:
We present a novel approach for the prediction of anticancer compound sensitivity by means of multi-modal attention-based neural networks (PaccMann). In our approach, we integrate three key pillars of drug sensitivity, namely, the molecular structure of compounds, transcriptomic profiles of cancer cells as well as prior knowledge about interactions among proteins within cells. Our models ingest a drug-cell pair consisting of the SMILES encoding of a compound and the gene expression profile of a cancer cell and predict an IC50 sensitivity value. Gene expression profiles are encoded using an attention-based encoding mechanism that assigns high weights to the most informative genes. We present and study three encoders for the SMILES strings of compounds: 1) bidirectional recurrent, 2) convolutional, and 3) attention-based encoders. We compare our devised models against a baseline model that ingests engineered fingerprints to represent the molecular structure. We demonstrate that using our attention-based encoders, we can surpass the baseline model. The use of attention-based encoders enhances interpretability and enables us to identify genes, bonds and atoms that were used by the network to make a prediction.
Submitted 14 July, 2019; v1 submitted 16 November, 2018;
originally announced November 2018.
-
See saw mechanism with Yukawa alignment for neutrinos
Authors:
R. Martinez,
F. Ochoa,
M. Ospina
Abstract:
In the extension of the standard model with one right-handed neutrino and one Higgs triplet, we propose a suppression mechanism, obtaining small masses for the active neutrinos, while mixing angles are predicted with a right-handed neutrino at the TeV scale and Yukawa couplings at the order of $\mathcal{O}(1)$. In this extension, the seesaw formula is proportional to the difference between two Yukawa couplings: the one that governs the interactions of the ordinary matter through the Higgs triplet, and the coupling of the new neutrino through the scalar doublet, so that by aligning both Yukawa couplings, exactly massless active neutrinos are obtained. By perturbing this alignment condition, we obtain neutrino masses proportional to the magnitude and direction of the perturbation in the flavour space. Bimaximal and nearly bimaximal mass structures emerge from specific unalignment forms.
Submitted 11 October, 2018;
originally announced October 2018.
-
Continuous data assimilation with blurred-in-time measurements of the surface quasi-geostrophic equation
Authors:
Michael S. Jolly,
Vincent R. Martinez,
Eric J. Olson,
Edriss S. Titi
Abstract:
An intrinsic property of almost any physical measuring device is that it makes observations which are slightly blurred in time. We consider a nudging-based approach for data assimilation that constructs an approximate solution based on a feedback control mechanism that is designed to account for observations that have been blurred by a moving time average. Analysis of this nudging model in the context of the subcritical surface quasi-geostrophic equation shows, provided the time-averaging window is sufficiently small and the resolution of the observations sufficiently fine, that the approximating solution converges exponentially fast to the observed solution over time. In particular, we demonstrate that observational data with a small blur in time possess no significant obstructions to data assimilation provided that the nudging properly takes the time averaging into account. Two key ingredients in our analysis are additional boundedness properties for the relevant interpolant observation operators and a non-local Gronwall inequality.
Submitted 31 August, 2018;
originally announced September 2018.
-
Network-based Biased Tree Ensembles (NetBiTE) for Drug Sensitivity Prediction and Drug Sensitivity Biomarker Identification in Cancer
Authors:
Ali Oskooei,
Matteo Manica,
Roland Mathis,
Maria Rodriguez Martinez
Abstract:
We present the Network-based Biased Tree Ensembles (NetBiTE) method for drug sensitivity prediction and drug sensitivity biomarker identification in cancer using a combination of prior knowledge and gene expression data. Our devised method consists of a biased tree ensemble that is built according to a probabilistic bias weight distribution. The bias weight distribution is obtained by assigning high weights to the drug targets and propagating the assigned weights over a protein-protein interaction network such as STRING. The propagation of weights defines neighborhoods of influence around the drug targets and as such simulates the spread of perturbations within the cell following drug administration. Using a synthetic dataset, we showcase how application of biased tree ensembles (BiTE) results in significant accuracy gains at a much lower computational cost compared to the unbiased random forests (RF) algorithm. We then apply NetBiTE to the Genomics of Drug Sensitivity in Cancer (GDSC) dataset and demonstrate that NetBiTE outperforms RF in predicting IC50 drug sensitivity only for drugs that target membrane receptor pathways (MRPs): RTK, EGFR and IGFR signaling pathways. We propose, based on the NetBiTE results, that for drugs that inhibit MRPs, the expression of target genes prior to drug administration is a biomarker for IC50 drug sensitivity following drug administration. We further verify and reinforce this proposition through control studies on PI3K/MTOR signaling pathway inhibitors, a drug category that does not target MRPs, and through assignment of dummy targets to MRP-inhibiting drugs and investigating the variation in NetBiTE accuracy.
Submitted 26 April, 2019; v1 submitted 18 August, 2018;
originally announced August 2018.
-
Characterization of Low Mass K2 Planet Hosts Using Near-Infrared Spectroscopy
Authors:
Romy Rodríguez Martínez,
Sarah Ballard,
Andrew Mayo,
Andrew Vanderburg,
Benjamin T. Montet,
Jessie L. Christiansen
Abstract:
We present moderate resolution near-infrared spectra in $H, J$ and $K$ band of M dwarf hosts to candidate transiting exoplanets discovered by NASA's K2 mission. We employ known empirical relationships between spectral features and physical stellar properties to measure the effective temperature, radius, metallicity, and luminosity of our sample. Out of an initial sample of 56 late-type stars in K2, we identify 35 objects as M dwarfs. For that sub-sample, we derive temperatures ranging from $2,870$ to $4,187$ K, radii of $0.09-0.83$ $R_{\odot}$, luminosities of $-2.67<\log L/L_{\odot}<-0.67$ and [Fe/H] metallicities between $-0.49$ and $0.83$ dex. We then employ the stellar properties derived from spectra, in tandem with the K2 lightcurves, to characterize their planets. We report 33 exoplanet candidates with orbital periods ranging from 0.19 to 21.16 days, and median radii and equilibrium temperatures of 2.3 $R_{\oplus}$ and 986 K, respectively. Using planet mass-radius relationships from the literature, we identify 7 exoplanets as potentially rocky, although we conclude that probably none reside in the habitable zone of their parent stars.
Submitted 10 August, 2018;
originally announced August 2018.
-
Resource Orchestration of 5G Transport Networks for Vertical Industries
Authors:
K. Antevski,
J. Martín-Pérez,
Nuria Molner,
C. F. Chiasserini,
F. Malandrino,
P. Frangoudis,
A. Ksentini,
X. Li,
J. SalvatLozano,
R. Martínez,
I. Pascual,
J. Mangues-Bafalluy,
J. Baranda,
B. Martini,
M. Gharbaoui
Abstract:
The future 5G transport networks are envisioned to support a variety of vertical services through network slicing and efficient orchestration over multiple administrative domains. In this paper, we propose an orchestrator architecture to support vertical services to meet their diverse resource and service requirements. We then present a system model for resource orchestration of transport networks as well as low-complexity algorithms that aim at minimizing service deployment cost and/or service latency. Importantly, the proposed model can work with any level of abstractions exposed by the underlying network or the federated domains depending on their representation of resources.
Submitted 27 July, 2018;
originally announced July 2018.
-
Grapevine: A Wine Prediction Algorithm Using Multi-dimensional Clustering Methods
Authors:
Richard Diehl Martinez,
Geoffrey Angus,
Rooz Mahdavian
Abstract:
We present a method for a wine recommendation system that employs multidimensional clustering and unsupervised learning methods. Our algorithm first performs clustering on a large corpus of wine reviews. It then uses the resulting wine clusters as an approximation of the most common flavor palates, recommending a user a wine by optimizing over a price-quality ratio within clusters that they demonstrated a preference for.
Submitted 29 June, 2018;
originally announced July 2018.
-
Using General Adversarial Networks for Marketing: A Case Study of Airbnb
Authors:
Richard Diehl Martinez,
John Kaleialoha Kamalu
Abstract:
In this paper, we examine the use case of general adversarial networks (GANs) in the field of marketing. In particular, we analyze how GAN models can replicate text patterns from successful product listings on Airbnb, a peer-to-peer online market for short-term apartment rentals. To do so, we define the Diehl-Martinez-Kamalu (DMK) loss function as a new class of functions that forces the model's generated output to include a set of user-defined keywords. This allows the general adversarial network to recommend a way of rewording the phrasing of a listing description to increase the likelihood that it is booked. Although we tailor our analysis to Airbnb data, we believe this framework establishes a more general model for how generative algorithms can be used to produce text samples for the purposes of marketing.
Submitted 29 June, 2018;
originally announced June 2018.
-
Ignition: An End-to-End Supervised Model for Training Simulated Self-Driving Vehicles
Authors:
Rooz Mahdavian,
Richard Diehl Martinez
Abstract:
We introduce Ignition: an end-to-end neural network architecture for training unconstrained self-driving vehicles in simulated environments. The model is a ResNet-18 variant, which is fed in images from the front of a simulated F1 car, and outputs optimal labels for steering, throttle, braking. Importantly, we never explicitly train the model to detect road features like the outline of a track or distance to other cars; instead, we illustrate that these latent features can be automatically encapsulated by the network.
Submitted 29 June, 2018;
originally announced June 2018.
-
Theory of Machine Networks: A Case Study
Authors:
Rooz Mahdavian,
Richard Diehl Martinez
Abstract:
We propose a simplification of the Theory-of-Mind Network architecture, which focuses on modeling complex, deterministic machines as a proxy for modeling nondeterministic, conscious entities. We then validate this architecture in the context of understanding engines, which, we argue, meet the required internal and external complexity to yield meaningful abstractions.
Submitted 26 June, 2018;
originally announced June 2018.
-
Beyond $\mathcal{R}(D^{(*)})$ with the general 2HDM-III for $b\to cτν$
Authors:
R. Martinez,
C. F. Sierra,
German Valencia
Abstract:
We review the parameter regions allowed by measurements of $\mathcal{R}(D^{(*)})$ and by a theoretical limit on ${\cal B}(B_{c}\toτν)$ in terms of generic scalar and pseudoscalar new physics couplings, $g_s$ and $g_p$. We then use these regions as constraints to predict the ranges for additional observables in $b\to cτν$ including the differential decay distributions $dΓ/dq^{2}$; the ratios $\mathcal{R}(J/ψ)$ and $\mathcal{R}(Λ_{c})$; and the tau-lepton polarisation in $B\to D^{(\star)}τν$, with emphasis on the CP violating normal polarisation. Finally we map the allowed regions in $g_s$ and $g_p$ into the parameters of four versions of the Yukawa couplings of the general 2HDM-III model. We find that the model is still viable but could be ruled out by a confirmation of a large $\mathcal{R}(J/ψ)$.
Submitted 24 August, 2018; v1 submitted 10 May, 2018;
originally announced May 2018.
-
PIMKL: Pathway Induced Multiple Kernel Learning
Authors:
Matteo Manica,
Joris Cadow,
Roland Mathis,
María Rodríguez Martínez
Abstract:
Reliable identification of molecular biomarkers is essential for accurate patient stratification. While state-of-the-art machine learning approaches for sample classification continue to push boundaries in terms of performance, most of these methods are not able to integrate different data types and lack generalization power, limiting their application in a clinical setting. Furthermore, many methods behave as black boxes, and we have very little understanding about the mechanisms that lead to the prediction. While opaqueness concerning machine behaviour might not be a problem in deterministic domains, in health care, providing explanations about the molecular factors and phenotypes that are driving the classification is crucial to build trust in the performance of the predictive system. We propose Pathway Induced Multiple Kernel Learning (PIMKL), a novel methodology to reliably classify samples that can also help gain insights into the molecular mechanisms that underlie the classification. PIMKL exploits prior knowledge in the form of a molecular interaction network and annotated gene sets, by optimizing a mixture of pathway-induced kernels using a Multiple Kernel Learning (MKL) algorithm, an approach that has demonstrated excellent performance in different machine learning applications. After optimizing the combination of kernels for prediction of a specific phenotype, the model provides a stable molecular signature that can be interpreted in the light of the ingested prior knowledge and that can be used in transfer learning tasks.
Submitted 5 July, 2018; v1 submitted 29 March, 2018;
originally announced March 2018.
-
End-to-end 5G services via an SDN/NFV-based multi-tenant network and cloud testbed
Authors:
Raul Muñoz,
Josep Mangues-Bafalluy,
Nikolaos Bartzoudis,
Ricard Vilalta,
Ricardo Martínez,
Ramon Casellas,
Nicola Baldo,
José Núñez-Martínez,
Manuel Requena-Esteso,
Oriol Font-Bach,
Marco Miozzo,
Pol Henarejos,
Ana Pérez-Neira,
Miquel Payaró
Abstract:
5G has a main requirement of highly flexible, ultralow latency and ultra-high bandwidth virtualized infrastructure in order to deliver end-to-end services. This requirement can be met by efficiently integrating all network segments (radio access, aggregation and core) with heterogeneous wireless and optical technologies (5G, mmWave, LTE/LTE-A, Wi-Fi, Ethernet, MPLS, WDM, software-defined optical transmission, etc.), and massive computing and storage cloud services (offered in edge/core data centers). This paper introduces the preliminary architecture aiming at integrating three consolidated and standalone experimental infrastructures at CTTC, in order to deploy the required end-to-end top-to-bottom converged infrastructure pointed out above for testing and developing advanced 5G services.
Submitted 20 March, 2018;
originally announced March 2018.
-
The CTTC 5G end-to-end experimental platform: Integrating heterogeneous wireless/optical networks, distributed cloud, and IoT devices
Authors:
Raul Muñóz,
Josep Mangues,
Ricard Vilalta,
Christos Verikoukis,
Jesús Alonso-Zarate,
Nikolaos Bartzoudis,
Apostolos Georgiadis,
Miquel Payaró,
Ana Pérez-Neira,
Ramon Casellas,
Ricardo Martínez,
José Núñez-Martínez,
Manuel Requena-Esteso,
David Pubill,
Oriol Font-Bach,
Pol Henarejos,
Jordi Serra,
Francisco Vazquez-Gallego
Abstract:
The Internet of Things (IoT) will facilitate a wide variety of applications in different domains, such as smart cities, smart grids, industrial automation (Industry 4.0), smart driving, assistance of the elderly, and home automation. Billions of heterogeneous smart devices with different application requirements will be connected to the networks and will generate huge aggregated volumes of data that will be processed in distributed cloud infrastructures. On the other hand, there is also a general trend to deploy functions as software (SW) instances in cloud infrastructures [e.g., network function virtualization (NFV) or mobile edge computing (MEC)]. Thus, the next generation of mobile networks, the fifth-generation (5G), will need not only to develop new radio interfaces or waveforms to cope with the expected traffic growth but also to integrate heterogeneous networks from end to end (E2E) with distributed cloud resources to deliver E2E IoT and mobile services. This article presents the E2E 5G platform that is being developed by the Centre Tecnològic de Telecomunicacions de Catalunya (CTTC), the first known platform capable of reproducing such an ambitious scenario.
Submitted 20 March, 2018;
originally announced March 2018.
-
Machine learning-assisted virtual patching of web applications
Authors:
Gustavo Betarte,
Eduardo Giménez,
Rodrigo Martínez,
Álvaro Pardo
Abstract:
Web applications are constantly exposed to attacks that exploit their vulnerabilities. In this work we investigate the application of machine learning techniques to leverage a Web Application Firewall (WAF), a technology that is used to detect and prevent attacks. We propose a combined approach of machine learning models, based on one-class classification and n-gram analysis, to enhance the detection and accuracy capabilities of MODSECURITY, an open source and widely used WAF. The results are promising and outperform MODSECURITY when configured with the OWASP Core Rule Set, the baseline configuration setting of a widely deployed, rule-based WAF technology. The proposed solution, combining both approaches, allows us to deploy a WAF when no training data for the application is available (using one-class classification), and an improved one using n-grams when training data is available.
Submitted 14 March, 2018;
originally announced March 2018.
-
Approximate Bayesian Computation in controlled branching processes: the role of summary statistics
Authors:
M. González,
R. Martínez,
C. Minuesa,
I. del Puerto
Abstract:
Controlled branching processes are stochastic growth population models in which the number of individuals with reproductive capacity in each generation is controlled by a random control function. The purpose of this work is to examine the Approximate Bayesian Computation (ABC) methods and to propose appropriate summary statistics for them in the context of these processes. This methodology makes it possible to approximate the posterior distribution of the parameters of interest satisfactorily without explicit likelihood calculations and under a minimal set of assumptions. In particular, the tolerance rejection algorithm, the sequential Monte Carlo ABC algorithm, and a post-sampling correction method based on local-linear regression are provided. The accuracy of the proposed methods is illustrated and compared with a "likelihood free" Markov chain Monte Carlo technique by way of a simulated example developed with the statistical software R.
Submitted 1 July, 2019; v1 submitted 12 March, 2018;
originally announced March 2018.