Skip to main content

Showing 1–36 of 36 results for author: Aman

Searching in archive stat. Search in all archives.
.
  1. arXiv:2411.17826  [pdf, other

    cs.RO cs.LG stat.ML

    Rate-Informed Discovery via Bayesian Adaptive Multifidelity Sampling

    Authors: Aman Sinha, Payam Nikdel, Supratik Paul, Shimon Whiteson

    Abstract: Ensuring the safety of autonomous vehicles (AVs) requires both accurate estimation of their performance and efficient discovery of potential failure cases. This paper introduces Bayesian adaptive multifidelity sampling (BAMS), which leverages the power of adaptive Bayesian sampling to achieve efficient discovery while simultaneously estimating the rate of adverse events. BAMS prioritizes explorati… ▽ More

    Submitted 26 November, 2024; originally announced November 2024.

    Comments: Published at CoRL 2024: https://openreview.net/forum?id=bftFwjSJxk

  2. arXiv:2411.16370  [pdf, other

    cs.CV cs.AI cs.LG eess.IV stat.ML

    A Review of Bayesian Uncertainty Quantification in Deep Probabilistic Image Segmentation

    Authors: M. M. A. Valiuddin, R. J. G. van Sloun, C. G. A. Viviers, P. H. N. de With, F. van der Sommen

    Abstract: Advancements in image segmentation play an integral role within the broad scope of Deep Learning-based Computer Vision. Furthermore, their widespread applicability in critical real-world tasks has resulted in challenges related to the reliability of such algorithms. Hence, uncertainty quantification has been extensively studied within this context, enabling the expression of model ignorance (epist… ▽ More

    Submitted 12 March, 2025; v1 submitted 25 November, 2024; originally announced November 2024.

    Comments: 20 pages, revised

  3. arXiv:2411.07305  [pdf, other

    astro-ph.GA astro-ph.IM stat.ML

    PICZL: Image-based Photometric Redshifts for AGN

    Authors: William Roster, Mara Salvato, Sven Krippendorf, Aman Saxena, Raphael Shirley, Johannes Buchner, Julien Wolf, Tom Dwelly, Franz E. Bauer, James Aird, Claudio Ricci, Roberto J. Assef, Scott F. Anderson, Xin Liu, Andrea Merloni, Jochen Weller, Kirpal Nandra

    Abstract: Computing photo-z for AGN is challenging, primarily due to the interplay of relative emissions associated with the SMBH and its host galaxy. SED fitting methods, effective in pencil-beam surveys, face limitations in all-sky surveys with fewer bands available, lacking the ability to capture the AGN contribution to the SED accurately. This limitation affects the many 10s of millions of AGN clearly s… ▽ More

    Submitted 13 November, 2024; v1 submitted 11 November, 2024; originally announced November 2024.

    Comments: Accepted for publication in Astronomy & Astrophysics. 24 pages, 21 figures

    Journal ref: A&A 692, A260 (2024)

  4. arXiv:2411.00999  [pdf, other

    cs.LG stat.ML

    Normalization Layer Per-Example Gradients are Sufficient to Predict Gradient Noise Scale in Transformers

    Authors: Gavia Gray, Aman Tiwari, Shane Bergsma, Joel Hestness

    Abstract: Per-example gradient norms are a vital ingredient for estimating gradient noise scale (GNS) with minimal variance. Observing the tensor contractions required to compute them, we propose a method with minimal FLOPs in 3D or greater tensor regimes by simultaneously computing the norms while computing the parameter gradients. Using this method we are able to observe the GNS of different layers at hig… ▽ More

    Submitted 1 November, 2024; originally announced November 2024.

    Comments: 23 pages, 16 figures, to be published in the proceedings of the 2024 Conference on Neural Information Processing Systems (NeurIPS), code is available at: https://github.com/CerebrasResearch/nanoGNS

    ACM Class: I.2.6

  5. arXiv:2407.11249  [pdf, other

    cs.LG cs.AI q-bio.NC stat.ML

    Disentangling Representations through Multi-task Learning

    Authors: Pantelis Vafidis, Aman Bhargava, Antonio Rangel

    Abstract: Intelligent perception and interaction with the world hinges on internal representations that capture its underlying structure (''disentangled'' or ''abstract'' representations). Disentangled representations serve as world models, isolating latent factors of variation in the world along approximately orthogonal directions, thus facilitating feature-based generalization. We provide experimental and… ▽ More

    Submitted 2 March, 2025; v1 submitted 15 July, 2024; originally announced July 2024.

    Comments: 43 pages, 17 figures

    Journal ref: International Conference on Learning Representations, 2025 https://openreview.net/forum?id=yVGGtsOgc7

  6. arXiv:2405.10490  [pdf

    stat.ME cs.AI cs.IR cs.LG math.OC

    Neural Optimization with Adaptive Heuristics for Intelligent Marketing System

    Authors: Changshuai Wei, Benjamin Zelditch, Joyce Chen, Andre Assuncao Silva T Ribeiro, Jingyi Kenneth Tay, Borja Ocejo Elizondo, Keerthi Selvaraj, Aman Gupta, Licurgo Benemann De Almeida

    Abstract: Computational marketing has become increasingly important in today's digital world, facing challenges such as massive heterogeneous data, multi-channel customer journeys, and limited marketing budgets. In this paper, we propose a general framework for marketing AI systems, the Neural Optimization with Adaptive Heuristics (NOAH) framework. NOAH is the first general framework for marketing optimizat… ▽ More

    Submitted 25 June, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

    Comments: KDD 2024

    ACM Class: G.3; G.1.6; I.2

  7. arXiv:2401.12924  [pdf, other

    stat.ML cs.LG stat.ME

    Performance Analysis of Support Vector Machine (SVM) on Challenging Datasets for Forest Fire Detection

    Authors: Ankan Kar, Nirjhar Nath, Utpalraj Kemprai, Aman

    Abstract: This article delves into the analysis of performance and utilization of Support Vector Machines (SVMs) for the critical task of forest fire detection using image datasets. With the increasing threat of forest fires to ecosystems and human settlements, the need for rapid and accurate detection systems is of utmost importance. SVMs, renowned for their strong classification capabilities, exhibit prof… ▽ More

    Submitted 7 March, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

    Comments: 19 pages, 8 figures

    Journal ref: Int. J. Communications, Network and System Sciences, 17, 11-29 (2024)

  8. arXiv:2309.01885  [pdf, other

    stat.ML cs.CL cs.LG

    QuantEase: Optimization-based Quantization for Language Models

    Authors: Kayhan Behdin, Ayan Acharya, Aman Gupta, Qingquan Song, Siyu Zhu, Sathiya Keerthi, Rahul Mazumder

    Abstract: With the rising popularity of Large Language Models (LLMs), there has been an increasing interest in compression techniques that enable their efficient deployment. This study focuses on the Post-Training Quantization (PTQ) of LLMs. Drawing from recent advances, our work introduces QuantEase, a layer-wise quantization framework where individual layers undergo separate quantization. The problem is f… ▽ More

    Submitted 1 December, 2023; v1 submitted 4 September, 2023; originally announced September 2023.

  9. arXiv:2307.07014  [pdf, other

    cs.LG cs.AI stat.ML

    Leveraging Factored Action Spaces for Off-Policy Evaluation

    Authors: Aaman Rebello, Shengpu Tang, Jenna Wiens, Sonali Parbhoo

    Abstract: Off-policy evaluation (OPE) aims to estimate the benefit of following a counterfactual sequence of actions, given data collected from executed sequences. However, existing OPE estimators often exhibit high bias and high variance in problems involving large, combinatorial action spaces. We investigate how to mitigate this issue using factored action spaces i.e. expressing each action as a combinati… ▽ More

    Submitted 13 July, 2023; originally announced July 2023.

    Comments: Main paper: 8 pages, 7 figures. Appendix: 30 pages, 17 figures. Accepted at ICML 2023 Workshop on Counterfactuals in Minds and Machines, Honolulu, Hawaii, USA. Camera ready version

    MSC Class: 62D20 (Primary) 62M05; 60J10; 62D05; 62P10 (Secondary) ACM Class: I.2.6; I.2.8; G.3; J.3

  10. arXiv:2305.05532  [pdf, other

    eess.SP cs.AI cs.LG stat.AP stat.ML

    An ensemble of convolution-based methods for fault detection using vibration signals

    Authors: Xian Yeow Lee, Aman Kumar, Lasitha Vidyaratne, Aniruddha Rajendra Rao, Ahmed Farahat, Chetan Gupta

    Abstract: This paper focuses on solving a fault detection problem using multivariate time series of vibration signals collected from planetary gearboxes in a test rig. Various traditional machine learning and deep learning methods have been proposed for multivariate time-series classification, including distance-based, functional data-oriented, feature-driven, and convolution kernel-based methods. Recent st… ▽ More

    Submitted 4 May, 2023; originally announced May 2023.

    Comments: 12 Pages, 9 Figures, 2 Tables. Accepted at ICPHM 2023

    Journal ref: 2023 IEEE International Conference on Prognostics and Health Management (ICPHM)

  11. arXiv:2302.09693  [pdf, other

    stat.ML cs.LG

    mSAM: Micro-Batch-Averaged Sharpness-Aware Minimization

    Authors: Kayhan Behdin, Qingquan Song, Aman Gupta, Sathiya Keerthi, Ayan Acharya, Borja Ocejo, Gregory Dexter, Rajiv Khanna, David Durfee, Rahul Mazumder

    Abstract: Modern deep learning models are over-parameterized, where different optima can result in widely varying generalization performance. The Sharpness-Aware Minimization (SAM) technique modifies the fundamental loss function that steers gradient descent methods toward flatter minima, which are believed to exhibit enhanced generalization prowess. Our study delves into a specific variant of SAM known as… ▽ More

    Submitted 30 September, 2023; v1 submitted 19 February, 2023; originally announced February 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2212.04343

  12. arXiv:2206.04615  [pdf, other

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza , et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur… ▽ More

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  13. arXiv:2202.04837  [pdf, other

    stat.ML cs.LG

    Heterogeneous Calibration: A post-hoc model-agnostic framework for improved generalization

    Authors: David Durfee, Aman Gupta, Kinjal Basu

    Abstract: We introduce the notion of heterogeneous calibration that applies a post-hoc model-agnostic transformation to model outputs for improving AUC performance on binary classification tasks. We consider overconfident models, whose performance is significantly better on training vs test data and give intuition onto why they might under-utilize moderately effective simple patterns in the data. We refer t… ▽ More

    Submitted 10 February, 2022; originally announced February 2022.

  14. arXiv:2111.05267  [pdf, other

    cs.SI cs.LG math.PR stat.ML

    Community detection using low-dimensional network embedding algorithms

    Authors: Aman Barot, Shankar Bhamidi, Souvik Dhara

    Abstract: With the increasing relevance of large networks in important areas such as the study of contact networks for spread of disease, or social networks for their impact on geopolitics, it has become necessary to study machine learning tools that are scalable to very large networks, often containing millions of nodes. One major class of such scalable algorithms is known as network representation learnin… ▽ More

    Submitted 4 November, 2021; originally announced November 2021.

  15. arXiv:2108.05839  [pdf, ps, other

    cs.LG cs.AI cs.CV stat.ML

    Logit Attenuating Weight Normalization

    Authors: Aman Gupta, Rohan Ramanath, Jun Shi, Anika Ramachandran, Sirou Zhou, Mingzhou Zhou, S. Sathiya Keerthi

    Abstract: Over-parameterized deep networks trained using gradient-based optimizers are a popular choice for solving classification and ranking problems. Without appropriately tuned $\ell_2$ regularization or weight decay, such networks have the tendency to make output scores (logits) and network weights large, causing training loss to become too small and the network to lose its adaptivity (ability to move… ▽ More

    Submitted 12 August, 2021; originally announced August 2021.

    Comments: 23 pages

  16. arXiv:2103.15859  [pdf, other

    stat.AP

    U.S. Power Resilience for 2002--2019

    Authors: Aman Ankit, Zhanlin Liu, Scott B. Miles, Youngjun Choe

    Abstract: Prolonged power outages debilitate the economy and threaten public health. Existing research is generally limited in its scope to a single event, an outage cause, or a region. Here, we provide one of the most comprehensive analyses of U.S. power outages for 2002--2019. We categorized all outage data collected under U.S. federal mandates into four outage causes and computed industry-standard reliab… ▽ More

    Submitted 20 July, 2021; v1 submitted 29 March, 2021; originally announced March 2021.

  17. arXiv:2008.10581  [pdf, other

    cs.LG stat.ML

    Neural Bridge Sampling for Evaluating Safety-Critical Autonomous Systems

    Authors: Aman Sinha, Matthew O'Kelly, Russ Tedrake, John Duchi

    Abstract: Learning-based methodologies increasingly find applications in safety-critical domains like autonomous driving and medical robotics. Due to the rare nature of dangerous events, real-world testing is prohibitively expensive and unscalable. In this work, we employ a probabilistic approach to safety evaluation in simulation, where we are concerned with computing the probability of dangerous events. W… ▽ More

    Submitted 8 August, 2021; v1 submitted 24 August, 2020; originally announced August 2020.

    Comments: NeurIPS 2020

  18. Using LSTM for the Prediction of Disruption in ADITYA Tokamak

    Authors: Aman Agarwal, Aditya Mishra, Priyanka Sharma, Swati Jain, Sutapa Ranjan, Ranjana Manchanda

    Abstract: Major disruptions in tokamak pose a serious threat to the vessel and its surrounding pieces of equipment. The ability of the systems to detect any behavior that can lead to disruption can help in alerting the system beforehand and prevent its harmful effects. Many machine learning techniques have already been in use at large tokamaks like JET and ASDEX, but are not suitable for ADITYA, which is co… ▽ More

    Submitted 13 July, 2020; originally announced July 2020.

    Comments: 7 pages, 4 figures

    Journal ref: Plasma Physics and Controlled Fusion, Volume 63, Number 11, 2021

  19. arXiv:2006.14707  [pdf, other

    cs.LG q-bio.QM stat.ML

    Machine-Learning Driven Drug Repurposing for COVID-19

    Authors: Semih Cantürk, Aman Singh, Patrick St-Amant, Jason Behrmann

    Abstract: The integration of machine learning methods into bioinformatics provides particular benefits in identifying how therapeutics effective in one context might have utility in an unknown clinical context or against a novel pathology. We aim to discover the underlying associations between viral proteins and antiviral therapeutics that are effective against them by employing neural network models. Using… ▽ More

    Submitted 25 June, 2020; originally announced June 2020.

    Comments: Submitted to NeurIPS 2020. 11 pages, 3 figures, 5 tables, 12 pages of appendices

    MSC Class: 68T07 (Primary); 68T10 (Secondary) ACM Class: I.2.6

  20. arXiv:2005.08033  [pdf, other

    cs.LG stat.ML

    Towards classification parity across cohorts

    Authors: Aarsh Patel, Rahul Gupta, Mukund Harakere, Satyapriya Krishna, Aman Alok, Peng Liu

    Abstract: Recently, there has been a lot of interest in ensuring algorithmic fairness in machine learning where the central question is how to prevent sensitive information (e.g. knowledge about the ethnic group of an individual) from adding "unfair" bias to a learning algorithm (Feldman et al. (2015), Zemel et al. (2013)). This has led to several debiasing algorithms on word embeddings (Qian et al. (2019)… ▽ More

    Submitted 16 May, 2020; originally announced May 2020.

    Comments: Published in ML-IRL ICLR 2020 workshop

  21. arXiv:2003.03900  [pdf, other

    cs.LG cs.MA cs.RO stat.ML

    FormulaZero: Distributionally Robust Online Adaptation via Offline Population Synthesis

    Authors: Aman Sinha, Matthew O'Kelly, Hongrui Zheng, Rahul Mangharam, John Duchi, Russ Tedrake

    Abstract: Balancing performance and safety is crucial to deploying autonomous vehicles in multi-agent environments. In particular, autonomous racing is a domain that penalizes safe but conservative policies, highlighting the need for robust, adaptive strategies. Current approaches either make simplifying assumptions about other agents or lack robust mechanisms for online adaptation. This work makes algorith… ▽ More

    Submitted 22 August, 2020; v1 submitted 8 March, 2020; originally announced March 2020.

    Comments: ICML 2020: https://icml.cc/virtual/2020/poster/6277

  22. arXiv:1912.03618  [pdf, other

    cs.LG cs.RO stat.ML

    Efficient Black-box Assessment of Autonomous Vehicle Safety

    Authors: Justin Norden, Matthew O'Kelly, Aman Sinha

    Abstract: While autonomous vehicle (AV) technology has shown substantial progress, we still lack tools for rigorous and scalable testing. Real-world testing, the $\textit{de-facto}$ evaluation method, is dangerous to the public. Moreover, due to the rare nature of failures, billions of miles of driving are needed to statistically validate performance claims. Thus, the industry has largely turned to simulati… ▽ More

    Submitted 5 June, 2020; v1 submitted 8 December, 2019; originally announced December 2019.

  23. arXiv:1911.06857  [pdf, other

    econ.EM stat.AP

    Semiparametric Estimation of Correlated Random Coefficient Models without Instrumental Variables

    Authors: Samuele Centorrino, Aman Ullah, Jing Xue

    Abstract: We study a linear random coefficient model where slope parameters may be correlated with some continuous covariates. Such a model specification may occur in empirical research, for instance, when quantifying the effect of a continuous treatment observed at two time periods. We show one can carry identification and estimation without instruments. We propose a semiparametric estimator of average par… ▽ More

    Submitted 15 November, 2019; originally announced November 2019.

  24. arXiv:1908.09899  [pdf, other

    cs.LG stat.ML

    SynGAN: Towards Generating Synthetic Network Attacks using GANs

    Authors: Jeremy Charlier, Aman Singh, Gaston Ormazabal, Radu State, Henning Schulzrinne

    Abstract: The rapid digital transformation without security considerations has resulted in the rise of global-scale cyberattacks. The first line of defense against these attacks are Network Intrusion Detection Systems (NIDS). Once deployed, however, these systems work as blackboxes with a high rate of false positives with no measurable effectiveness. There is a need to continuously test and improve these sy… ▽ More

    Submitted 26 August, 2019; originally announced August 2019.

  25. arXiv:1907.07804  [pdf, other

    cs.LG cs.CL cs.CV cs.NE stat.ML

    OmniNet: A unified architecture for multi-modal multi-task learning

    Authors: Subhojeet Pramanik, Priyanka Agrawal, Aman Hussain

    Abstract: Transformer is a popularly used neural network architecture, especially for language understanding. We introduce an extended and unified architecture that can be used for tasks involving a variety of modalities like image, text, videos, etc. We propose a spatio-temporal cache mechanism that enables learning spatial dimension of the input in addition to the hidden states corresponding to the tempor… ▽ More

    Submitted 3 July, 2020; v1 submitted 17 July, 2019; originally announced July 2019.

    Comments: Source code available at: https://github.com/subho406/OmniNet

  26. arXiv:1906.08858  [pdf, other

    cs.LG stat.ML

    One-vs-All Models for Asynchronous Training: An Empirical Analysis

    Authors: Rahul Gupta, Aman Alok, Shankar Ananthakrishnan

    Abstract: Any given classification problem can be modeled using multi-class or One-vs-All (OVA) architecture. An OVA system consists of as many OVA models as the number of classes, providing the advantage of asynchrony, where each OVA model can be re-trained independent of other models. This is particularly advantageous in settings where scalable model training is a consideration (for instance in an industr… ▽ More

    Submitted 20 June, 2019; originally announced June 2019.

    Comments: 5 pages, Accepted to Interspeech 2019

  27. arXiv:1812.00528  [pdf, ps, other

    cs.LG q-bio.PE stat.ML

    Modeling disease progression in longitudinal EHR data using continuous-time hidden Markov models

    Authors: Aman Verma, Guido Powell, Yu Luo, David Stephens, David L. Buckeridge

    Abstract: Modeling disease progression in healthcare administrative databases is complicated by the fact that patients are observed only at irregular intervals when they seek healthcare services. In a longitudinal cohort of 76,888 patients with chronic obstructive pulmonary disease (COPD), we used a continuous-time hidden Markov model with a generalized linear model to model healthcare utilization events. W… ▽ More

    Submitted 2 December, 2018; originally announced December 2018.

    Comments: Machine Learning for Health (ML4H) Workshop at NeurIPS 2018 arXiv:1811.07216

    Report number: ML4H/2018/145

  28. arXiv:1812.00293  [pdf, other

    cs.LG stat.ML

    In-silico Risk Analysis of Personalized Artificial Pancreas Controllers via Rare-event Simulation

    Authors: Matthew O'Kelly, Aman Sinha, Justin Norden, Hongseok Namkoong

    Abstract: Modern treatments for Type 1 diabetes (T1D) use devices known as artificial pancreata (APs), which combine an insulin pump with a continuous glucose monitor (CGM) operating in a closed-loop manner to control blood glucose levels. In practice, poor performance of APs (frequent hyper- or hypoglycemic events) is common enough at a population level that many T1D patients modify the algorithms on exist… ▽ More

    Submitted 1 December, 2018; originally announced December 2018.

    Comments: Machine Learning for Health (ML4H) Workshop at NeurIPS 2018 arXiv:1811.07216

  29. arXiv:1811.08687  [pdf, other

    cs.LG cs.AI stat.ML

    Surrogate-assisted parallel tempering for Bayesian neural learning

    Authors: Rohitash Chandra, Konark Jain, Arpit Kapoor, Ashray Aman

    Abstract: Due to the need for robust uncertainty quantification, Bayesian neural learning has gained attention in the era of deep learning and big data. Markov Chain Monte-Carlo (MCMC) methods typically implement Bayesian inference which faces several challenges given a large number of parameters, complex and multimodal posterior distributions, and computational complexity of large neural network models. Pa… ▽ More

    Submitted 14 May, 2020; v1 submitted 21 November, 2018; originally announced November 2018.

    Comments: Engineering Applications of Artificial Intelligence

  30. arXiv:1811.02659  [pdf, other

    cs.CV cs.LG stat.ML

    Machine Learning Algorithms for Classification of Microcirculation Images from Septic and Non-Septic Patients

    Authors: Perikumar Javia, Aman Rana, Nathan Shapiro, Pratik Shah

    Abstract: Sepsis is a life-threatening disease and one of the major causes of death in hospitals. Imaging of microcirculatory dysfunction is a promising approach for automated diagnosis of sepsis. We report a machine learning classifier capable of distinguishing non-septic and septic images from dark field microcirculation videos of patients. The classifier achieves an accuracy of 89.45%. The area under the… ▽ More

    Submitted 20 February, 2019; v1 submitted 24 October, 2018; originally announced November 2018.

    Comments: Accepted for publication at 2018 IEEE International Conference on Machine Learning and Applications (IEEE ICMLA)

  31. arXiv:1811.02642  [pdf, other

    cs.CV cs.LG stat.ML

    Computational Histological Staining and Destaining of Prostate Core Biopsy RGB Images with Generative Adversarial Neural Networks

    Authors: Aman Rana, Gregory Yauney, Alarice Lowe, Pratik Shah

    Abstract: Histopathology tissue samples are widely available in two states: paraffin-embedded unstained and non-paraffin-embedded stained whole slide RGB images (WSRI). Hematoxylin and eosin stain (H&E) is one of the principal stains in histology but suffers from several shortcomings related to tissue preparation, staining protocols, slowness and human error. We report two novel approaches for training mach… ▽ More

    Submitted 20 February, 2019; v1 submitted 26 October, 2018; originally announced November 2018.

    Comments: Accepted for publication at 2018 IEEE International Conference on Machine Learning and Applications (ICMLA)

  32. arXiv:1811.00145  [pdf, ps, other

    cs.LG cs.RO stat.ML

    Scalable End-to-End Autonomous Vehicle Testing via Rare-event Simulation

    Authors: Matthew O'Kelly, Aman Sinha, Hongseok Namkoong, John Duchi, Russ Tedrake

    Abstract: While recent developments in autonomous vehicle (AV) technology highlight substantial progress, we lack tools for rigorous and scalable testing. Real-world testing, the $\textit{de facto}$ evaluation environment, places the public in danger, and, due to the rare nature of accidents, will require billions of miles in order to statistically validate performance claims. We implement a simulation fram… ▽ More

    Submitted 12 January, 2019; v1 submitted 31 October, 2018; originally announced November 2018.

    Comments: NeurIPS 2018

  33. arXiv:1810.10664  [pdf, other

    cs.LG q-bio.QM stat.ML

    Automated Process Incorporating Machine Learning Segmentation and Correlation of Oral Diseases with Systemic Health

    Authors: Gregory Yauney, Aman Rana, Lawrence C. Wong, Perikumar Javia, Ali Muftu, Pratik Shah

    Abstract: Imaging fluorescent disease biomarkers in tissues and skin is a non-invasive method to screen for health conditions. We report an automated process that combines intraoral fluorescent porphyrin biomarker imaging, clinical examinations and machine learning for correlation of systemic health conditions with periodontal disease. 1215 intraoral fluorescent images, from 284 consenting adults aged 18-90… ▽ More

    Submitted 24 October, 2018; originally announced October 2018.

    Comments: Submitted to IEEE Journal of Biomedical and Health Informatics, 2018

  34. arXiv:1806.03555  [pdf, other

    cs.LG cs.IR stat.ML

    Consistent Position Bias Estimation without Online Interventions for Learning-to-Rank

    Authors: Aman Agarwal, Ivan Zaitsev, Thorsten Joachims

    Abstract: Presentation bias is one of the key challenges when learning from implicit feedback in search engines, as it confounds the relevance signal with uninformative signals due to position in the ranking, saliency, and other presentation factors. While it was recently shown how counterfactual learning-to-rank (LTR) approaches \cite{Joachims/etal/17a} can provably overcome presentation bias if observatio… ▽ More

    Submitted 9 June, 2018; originally announced June 2018.

  35. arXiv:1710.10571  [pdf, ps, other

    stat.ML cs.LG

    Certifying Some Distributional Robustness with Principled Adversarial Training

    Authors: Aman Sinha, Hongseok Namkoong, Riccardo Volpi, John Duchi

    Abstract: Neural networks are vulnerable to adversarial examples and researchers have proposed many heuristic attack and defense mechanisms. We address this problem through the principled lens of distributionally robust optimization, which guarantees performance under adversarial input perturbations. By considering a Lagrangian penalty formulation of perturbing the underlying data distribution in a Wasserst… ▽ More

    Submitted 1 May, 2020; v1 submitted 29 October, 2017; originally announced October 2017.

    Comments: ICLR 2018: https://openreview.net/forum?id=Hk6kPgZA-

  36. arXiv:1010.3812  [pdf, ps, other

    cs.DS cs.CG math.DG stat.ML

    Random Projection Trees Revisited

    Authors: Aman Dhesi, Purushottam Kar

    Abstract: The Random Projection Tree structures proposed in [Freund-Dasgupta STOC08] are space partitioning data structures that automatically adapt to various notions of intrinsic dimensionality of data. We prove new results for both the RPTreeMax and the RPTreeMean data structures. Our result for RPTreeMax gives a near-optimal bound on the number of levels required by this data structure to reduce the siz… ▽ More

    Submitted 20 October, 2010; v1 submitted 19 October, 2010; originally announced October 2010.

    Comments: Accepted for publication at NIPS 2010. This version corrects an incorrect usage of the term Assouad dimension - acknowledgments : James Lee