Skip to main content

Showing 1–10 of 10 results for author: Verma, R

Searching in archive stat. Search in all archives.
.
  1. arXiv:2403.02683  [pdf, other

    cs.LG stat.ML

    Learning to Defer to a Population: A Meta-Learning Approach

    Authors: Dharmesh Tailor, Aditya Patra, Rajeev Verma, Putra Manggala, Eric Nalisnick

    Abstract: The learning to defer (L2D) framework allows autonomous systems to be safe and robust by allocating difficult decisions to a human expert. All existing work on L2D assumes that each expert is well-identified, and if any expert were to change, the system should be re-trained. In this work, we alleviate this constraint, formulating an L2D system that can cope with never-before-seen experts at test-t… ▽ More

    Submitted 13 May, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

    Comments: Accepted at the 27th International Conference on Artificial Intelligence and Statistics (AISTATS 2024)

  2. arXiv:2210.16955  [pdf, other

    stat.ML cs.LG

    Learning to Defer to Multiple Experts: Consistent Surrogate Losses, Confidence Calibration, and Conformal Ensembles

    Authors: Rajeev Verma, Daniel Barrejón, Eric Nalisnick

    Abstract: We study the statistical properties of learning to defer (L2D) to multiple experts. In particular, we address the open problems of deriving a consistent surrogate loss, confidence calibration, and principled ensembling of experts. Firstly, we derive two consistent surrogates -- one based on a softmax parameterization, the other on a one-vs-all (OvA) parameterization -- that are analogous to the si… ▽ More

    Submitted 23 February, 2023; v1 submitted 30 October, 2022; originally announced October 2022.

    Comments: First two authors contributed equally. Accepted at the International Conference on Artificial Intelligence and Statistics (AISTATS), 2023

  3. arXiv:2210.07521  [pdf, other

    stat.CO stat.AP

    Reliability-Based Robust Design Optimization Method for Engineering Systems with Uncertainty Quantification

    Authors: Richa Verma, Dinesh Kumar, Kazuma Kobayashi, Syed Alam

    Abstract: Robust optimization is a method for optimization under uncertainties in engineering systems and designs for applications ranging from aeronautics to nuclear. In a robust design process, parameter variability (or uncertainty) is incorporated into the engineering systems' optimization process to assure the systems' quality and reliability. This chapter focuses on a robust optimization approach for d… ▽ More

    Submitted 14 October, 2022; originally announced October 2022.

    Journal ref: Handbook of Smart Energy Systems, 2022

  4. arXiv:2210.00074  [pdf

    cs.LG stat.AP

    Leveraging Industry 4.0 -- Deep Learning, Surrogate Model and Transfer Learning with Uncertainty Quantification Incorporated into Digital Twin for Nuclear System

    Authors: M. Rahman, Abid Khan, Sayeed Anowar, Md Al-Imran, Richa Verma, Dinesh Kumar, Kazuma Kobayashi, Syed Alam

    Abstract: Industry 4.0 targets the conversion of the traditional industries into intelligent ones through technological revolution. This revolution is only possible through innovation, optimization, interconnection, and rapid decision-making capability. Numerical models are believed to be the key components of Industry 4.0, facilitating quick decision-making through simulations instead of costly experiments… ▽ More

    Submitted 30 September, 2022; originally announced October 2022.

  5. arXiv:2209.12146  [pdf

    eess.SY cs.LG stat.ML

    Machine Learning and Artificial Intelligence-Driven Multi-Scale Modeling for High Burnup Accident-Tolerant Fuels for Light Water-Based SMR Applications

    Authors: Md. Shamim Hassan, Abid Hossain Khan, Richa Verma, Dinesh Kumar, Kazuma Kobayashi, Shoaib Usman, Syed Alam

    Abstract: The concept of small modular reactor has changed the outlook for tackling future energy crises. This new reactor technology is very promising considering its lower investment requirements, modularity, design simplicity, and enhanced safety features. The application of artificial intelligence-driven multi-scale modeling (neutronics, thermal hydraulics, fuel performance, etc.) incorporating Digital… ▽ More

    Submitted 25 September, 2022; originally announced September 2022.

    Journal ref: Handbook of Smart Energy Systems, 2022

  6. arXiv:2202.03673  [pdf, other

    cs.LG stat.ML

    Calibrated Learning to Defer with One-vs-All Classifiers

    Authors: Rajeev Verma, Eric Nalisnick

    Abstract: The learning to defer (L2D) framework has the potential to make AI systems safer. For a given input, the system can defer the decision to a human if the human is more likely than the model to take the correct action. We study the calibration of L2D systems, investigating if the probabilities they output are sound. We find that Mozannar & Sontag's (2020) multiclass framework is not calibrated with… ▽ More

    Submitted 18 June, 2022; v1 submitted 8 February, 2022; originally announced February 2022.

    Comments: Accepted at the International Conference on Machine Learning (ICML), 2022

  7. arXiv:2004.09846  [pdf, other

    cs.LG cs.AI stat.ML

    SIBRE: Self Improvement Based REwards for Adaptive Feedback in Reinforcement Learning

    Authors: Somjit Nath, Richa Verma, Abhik Ray, Harshad Khadilkar

    Abstract: We propose a generic reward shaping approach for improving the rate of convergence in reinforcement learning (RL), called Self Improvement Based REwards, or SIBRE. The approach is designed for use in conjunction with any existing RL algorithm, and consists of rewarding improvement over the agent's own past performance. We prove that SIBRE converges in expectation under the same conditions as the o… ▽ More

    Submitted 21 December, 2020; v1 submitted 21 April, 2020; originally announced April 2020.

    Comments: 7 pages, 10 figures

  8. arXiv:2004.04871  [pdf, other

    eess.IV cs.CV cs.LG q-bio.QM stat.AP

    MRQy: An Open-Source Tool for Quality Control of MR Imaging Data

    Authors: Amir Reza Sadri, Andrew Janowczyk, Ren Zou, Ruchika Verma, Niha Beig, Jacob Antunes, Anant Madabhushi, Pallavi Tiwari, Satish E. Viswanath

    Abstract: We sought to develop a quantitative tool to quickly determine relative differences in MRI volumes both within and between large MR imaging cohorts (such as available in The Cancer Imaging Archive (TCIA)), in order to help determine the generalizability of radiomics and machine learning schemes to unseen datasets. The tool is intended to help quantify presence of (a) site- or scanner-specific varia… ▽ More

    Submitted 17 August, 2020; v1 submitted 9 April, 2020; originally announced April 2020.

    Comments: 28 pages, 7 figures. Submitted to Medical Physics

  9. arXiv:1912.03789   

    cs.LG stat.ML

    Feature Engineering Combined with 1 D Convolutional Neural Network for Improved Mortality Prediction

    Authors: Saumil Maheshwari, Rohit Verma, Anupam Shukla, Ritu Tiwari, Rishu Garg

    Abstract: The intensive care units (ICUs) are responsible for generating a wealth of useful data in the form of Electronic Health Record (EHR). This data allows for the development of a prediction tool with perfect knowledge backing. We aimed to build a mortality prediction model on 2012 Physionet Challenge mortality prediction database of 4000 patients admitted in ICU. The challenges in the dataset, such a… ▽ More

    Submitted 27 July, 2020; v1 submitted 8 December, 2019; originally announced December 2019.

    Comments: Being a short term project, this paper is not exhaustive

  10. arXiv:1911.04947  [pdf, other

    cs.LG stat.ML

    Accelerating Training in Pommerman with Imitation and Reinforcement Learning

    Authors: Hardik Meisheri, Omkar Shelke, Richa Verma, Harshad Khadilkar

    Abstract: The Pommerman simulation was recently developed to mimic the classic Japanese game Bomberman, and focuses on competitive gameplay in a multi-agent setting. We focus on the 2$\times$2 team version of Pommerman, developed for a competition at NeurIPS 2018. Our methodology involves training an agent initially through imitation learning on a noisy expert policy, followed by a proximal-policy optimizat… ▽ More

    Submitted 13 November, 2019; v1 submitted 12 November, 2019; originally announced November 2019.

    Comments: Presented at Deep Reinforcement Learning workshop, NeurIPS-2019