Skip to main content

Showing 1–50 of 59 results for author: Disha

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.12103  [pdf, other

    cs.AI cs.CY cs.LG

    The Amazon Nova Family of Models: Technical Report and Model Card

    Authors: Amazon AGI, Aaron Langford, Aayush Shah, Abhanshu Gupta, Abhimanyu Bhatter, Abhinav Goyal, Abhinav Mathur, Abhinav Mohanty, Abhishek Kumar, Abhishek Sethi, Abi Komma, Abner Pena, Achin Jain, Adam Kunysz, Adam Opyrchal, Adarsh Singh, Aditya Rawal, Adok Achar Budihal Prasad, Adrià de Gispert, Agnika Kumar, Aishwarya Aryamane, Ajay Nair, Akilan M, Akshaya Iyengar, Akshaya Vishnu Kudlu Shanbhogue , et al. (761 additional authors not shown)

    Abstract: We present Amazon Nova, a new generation of state-of-the-art foundation models that deliver frontier intelligence and industry-leading price performance. Amazon Nova Pro is a highly-capable multimodal model with the best combination of accuracy, speed, and cost for a wide range of tasks. Amazon Nova Lite is a low-cost multimodal model that is lightning fast for processing images, video, documents… ▽ More

    Submitted 17 March, 2025; originally announced June 2025.

    Comments: 48 pages, 10 figures

    Report number: 20250317

  2. arXiv:2506.04131  [pdf, ps, other

    cs.CL cs.AI cs.LG

    CLAIM: An Intent-Driven Multi-Agent Framework for Analyzing Manipulation in Courtroom Dialogues

    Authors: Disha Sheshanarayana, Tanishka Magar, Ayushi Mittal, Neelam Chaplot

    Abstract: Courtrooms are places where lives are determined and fates are sealed, yet they are not impervious to manipulation. Strategic use of manipulation in legal jargon can sway the opinions of judges and affect the decisions. Despite the growing advancements in NLP, its application in detecting and analyzing manipulation within the legal domain remains largely unexplored. Our work addresses this gap by… ▽ More

    Submitted 4 June, 2025; originally announced June 2025.

    Comments: Accepted to SICon 2025 ACL

  3. arXiv:2505.18179  [pdf, ps, other

    cs.LG cs.AI

    GAIA: A Foundation Model for Operational Atmospheric Dynamics

    Authors: Ata Akbari Asanjan, Olivia Alexander, Tom Berg, Clara Zhang, Matt Yang, Jad Makki, Disha Shidham, Srija Chakraborty, William Bender, Stephen Peng, Arun Ravindran, Olivier Raiman, David Potere, David Bell

    Abstract: We present the GAIA (Geospatial Artificial Intelligence for Atmospheres) Foundation Model, a novel model that combines masked autoencoders (MAE) and self-DIstillation with NO labels (DINO) for analyzing global atmospheric patterns in satellite imagery. By integrating these complementary self-supervised learning approaches, our model simultaneously captures both local features and global dependenci… ▽ More

    Submitted 15 May, 2025; originally announced May 2025.

    Comments: 14 pages, 7 figures

  4. arXiv:2504.04247  [pdf, other

    stat.ML cs.LG math.NA

    Randomised Postiterations for Calibrated BayesCG

    Authors: Niall Vyas, Disha Hegde, Jon Cockayne

    Abstract: The Bayesian conjugate gradient method offers probabilistic solutions to linear systems but suffers from poor calibration, limiting its utility in uncertainty quantification tasks. Recent approaches leveraging postiterations to construct priors have improved computational properties but failed to correct calibration issues. In this work, we propose a novel randomised postiteration strategy that en… ▽ More

    Submitted 5 April, 2025; originally announced April 2025.

  5. arXiv:2503.17265  [pdf, other

    stat.ML cs.LG math.NA

    Learning to Solve Related Linear Systems

    Authors: Disha Hegde, Jon Cockayne

    Abstract: Solving multiple parametrised related systems is an essential component of many numerical tasks. Borrowing strength from the solved systems and learning will make this process faster. In this work, we propose a novel probabilistic linear solver over the parameter space. This leverages information from the solved linear systems in a regression setting to provide an efficient posterior mean and cova… ▽ More

    Submitted 21 March, 2025; originally announced March 2025.

  6. arXiv:2503.13399  [pdf, other

    cs.CV cs.AI cs.CL cs.LG q-bio.CB

    MicroVQA: A Multimodal Reasoning Benchmark for Microscopy-Based Scientific Research

    Authors: James Burgess, Jeffrey J Nirschl, Laura Bravo-Sánchez, Alejandro Lozano, Sanket Rajan Gupte, Jesus G. Galaz-Montoya, Yuhui Zhang, Yuchang Su, Disha Bhowmik, Zachary Coman, Sarina M. Hasan, Alexandra Johannesson, William D. Leineweber, Malvika G Nair, Ridhi Yarlagadda, Connor Zuraski, Wah Chiu, Sarah Cohen, Jan N. Hansen, Manuel D Leonetti, Chad Liu, Emma Lundberg, Serena Yeung-Levy

    Abstract: Scientific research demands sophisticated reasoning over multimodal data, a challenge especially prevalent in biology. Despite recent advances in multimodal large language models (MLLMs) for AI-assisted research, existing multimodal reasoning benchmarks only target up to college-level difficulty, while research-level benchmarks emphasize lower-level perception, falling short of the complex multimo… ▽ More

    Submitted 17 March, 2025; originally announced March 2025.

    Comments: CVPR 2025 (Conference on Computer Vision and Pattern Recognition) Project page at https://jmhb0.github.io/microvqa Benchmark at https://huggingface.co/datasets/jmhb/microvqa

  7. arXiv:2502.19187  [pdf, other

    cs.CL

    BIG-Bench Extra Hard

    Authors: Mehran Kazemi, Bahare Fatemi, Hritik Bansal, John Palowitch, Chrysovalantis Anastasiou, Sanket Vaibhav Mehta, Lalit K. Jain, Virginia Aglietti, Disha Jindal, Peter Chen, Nishanth Dikkala, Gladys Tyen, Xin Liu, Uri Shalit, Silvia Chiappa, Kate Olszewska, Yi Tay, Vinh Q. Tran, Quoc V. Le, Orhan Firat

    Abstract: Large language models (LLMs) are increasingly deployed in everyday applications, demanding robust general reasoning capabilities and diverse reasoning skillset. However, current LLM reasoning benchmarks predominantly focus on mathematical and coding abilities, leaving a gap in evaluating broader reasoning proficiencies. One particular exception is the BIG-Bench dataset, which has served as a cruci… ▽ More

    Submitted 6 May, 2025; v1 submitted 26 February, 2025; originally announced February 2025.

  8. arXiv:2502.13893  [pdf, other

    cs.SD eess.AS

    Audio-Based Classification of Insect Species Using Machine Learning Models: Cicada, Beetle, Termite, and Cricket

    Authors: Manas V Shetty, Yoga Disha Sendhil Kumar

    Abstract: This project addresses the challenge of classifying insect species: Cicada, Beetle, Termite, and Cricket using sound recordings. Accurate species identification is crucial for ecological monitoring and pest management. We employ machine learning models such as XGBoost, Random Forest, and K Nearest Neighbors (KNN) to analyze audio features, including Mel Frequency Cepstral Coefficients (MFCC). The… ▽ More

    Submitted 19 February, 2025; originally announced February 2025.

  9. arXiv:2502.10920  [pdf, other

    cs.CV cs.AI

    Do Deepfake Detectors Work in Reality?

    Authors: Simiao Ren, Hengwei Xu, Tsang Ng, Kidus Zewde, Shengkai Jiang, Ramini Desai, Disha Patil, Ning-Yau Cheng, Yining Zhou, Ragavi Muthukrishnan

    Abstract: Deepfakes, particularly those involving faceswap-based manipulations, have sparked significant societal concern due to their increasing realism and potential for misuse. Despite rapid advancements in generative models, detection methods have not kept pace, creating a critical gap in defense strategies. This disparity is further amplified by the disconnect between academic research and real-world a… ▽ More

    Submitted 15 February, 2025; originally announced February 2025.

  10. arXiv:2501.18542  [pdf

    cs.AI

    Semantic Web and Creative AI -- A Technical Report from ISWS 2023

    Authors: Raia Abu Ahmad, Reham Alharbi, Roberto Barile, Martin Böckling, Francisco Bolanos, Sara Bonfitto, Oleksandra Bruns, Irene Celino, Yashrajsinh Chudasama, Martin Critelli, Claudia d'Amato, Giada D'Ippolito, Ioannis Dasoulas, Stefano De Giorgis, Vincenzo De Leo, Chiara Di Bonaventura, Marco Di Panfilo, Daniil Dobriy, John Domingue, Xuemin Duan, Michel Dumontier, Sefika Efeoglu, Ruben Eschauzier, Fakih Ginwa, Nicolas Ferranti , et al. (52 additional authors not shown)

    Abstract: The International Semantic Web Research School (ISWS) is a week-long intensive program designed to immerse participants in the field. This document reports a collaborative effort performed by ten teams of students, each guided by a senior researcher as their mentor, attending ISWS 2023. Each team provided a different perspective to the topic of creative AI, substantiated by a set of research quest… ▽ More

    Submitted 30 January, 2025; originally announced January 2025.

    Comments: Technical Report

  11. arXiv:2412.04166  [pdf, other

    cs.LG math.NA

    An In-Depth Examination of Risk Assessment in Multi-Class Classification Algorithms

    Authors: Disha Ghandwani, Neeraj Sarna, Yuanyuan Li, Yang Lin

    Abstract: Advanced classification algorithms are being increasingly used in safety-critical applications like health-care, engineering, etc. In such applications, miss-classifications made by ML algorithms can result in substantial financial or health-related losses. To better anticipate and prepare for such losses, the algorithm user seeks an estimate for the probability that the algorithm miss-classifies… ▽ More

    Submitted 5 December, 2024; originally announced December 2024.

  12. arXiv:2412.02732  [pdf, other

    cs.CV

    Prithvi-EO-2.0: A Versatile Multi-Temporal Foundation Model for Earth Observation Applications

    Authors: Daniela Szwarcman, Sujit Roy, Paolo Fraccaro, Þorsteinn Elí Gíslason, Benedikt Blumenstiel, Rinki Ghosal, Pedro Henrique de Oliveira, Joao Lucas de Sousa Almeida, Rocco Sedona, Yanghui Kang, Srija Chakraborty, Sizhe Wang, Carlos Gomes, Ankur Kumar, Myscon Truong, Denys Godwin, Hyunho Lee, Chia-Yu Hsu, Ata Akbari Asanjan, Besart Mujeci, Disha Shidham, Trevor Keenan, Paulo Arevalo, Wenwen Li, Hamed Alemohammad , et al. (10 additional authors not shown)

    Abstract: This technical report presents Prithvi-EO-2.0, a new geospatial foundation model that offers significant improvements over its predecessor, Prithvi-EO-1.0. Trained on 4.2M global time series samples from NASA's Harmonized Landsat and Sentinel-2 data archive at 30m resolution, the new 300M and 600M parameter models incorporate temporal and location embeddings for enhanced performance across various… ▽ More

    Submitted 3 February, 2025; v1 submitted 3 December, 2024; originally announced December 2024.

  13. arXiv:2410.08796  [pdf, other

    stat.ML cs.LG math.NA

    Calibrated Computation-Aware Gaussian Processes

    Authors: Disha Hegde, Mohamed Adil, Jon Cockayne

    Abstract: Gaussian processes are notorious for scaling cubically with the size of the training set, preventing application to very large regression problems. Computation-aware Gaussian processes (CAGPs) tackle this scaling issue by exploiting probabilistic linear solvers to reduce complexity, widening the posterior with additional computational uncertainty due to reduced computation. However, the most commo… ▽ More

    Submitted 21 March, 2025; v1 submitted 11 October, 2024; originally announced October 2024.

    Comments: Accepted at the 28th International Conference on Artificial Intelligence and Statistics (AISTATS), 2025

  14. arXiv:2409.12917  [pdf, other

    cs.LG

    Training Language Models to Self-Correct via Reinforcement Learning

    Authors: Aviral Kumar, Vincent Zhuang, Rishabh Agarwal, Yi Su, John D Co-Reyes, Avi Singh, Kate Baumli, Shariq Iqbal, Colton Bishop, Rebecca Roelofs, Lei M Zhang, Kay McKinney, Disha Shrivastava, Cosmin Paduraru, George Tucker, Doina Precup, Feryal Behbahani, Aleksandra Faust

    Abstract: Self-correction is a highly desirable capability of large language models (LLMs), yet it has consistently been found to be largely ineffective in modern LLMs. Current methods for training self-correction typically depend on either multiple models, a more advanced model, or additional forms of supervision. To address these shortcomings, we develop a multi-turn online reinforcement learning (RL) app… ▽ More

    Submitted 4 October, 2024; v1 submitted 19 September, 2024; originally announced September 2024.

  15. arXiv:2409.03118  [pdf, other

    cond-mat.stat-mech cond-mat.dis-nn cs.LG physics.chem-ph

    Generative artificial intelligence for computational chemistry: a roadmap to predicting emergent phenomena

    Authors: Pratyush Tiwary, Lukas Herron, Richard John, Suemin Lee, Disha Sanwal, Ruiyu Wang

    Abstract: The recent surge in Generative Artificial Intelligence (AI) has introduced exciting possibilities for computational chemistry. Generative AI methods have made significant progress in sampling molecular structures across chemical species, developing force fields, and speeding up simulations. This Perspective offers a structured overview, beginning with the fundamental theoretical concepts in both G… ▽ More

    Submitted 4 September, 2024; originally announced September 2024.

  16. arXiv:2408.05896  [pdf, ps, other

    stat.ME cs.IR

    Scalable recommender system based on factor analysis

    Authors: Disha Ghandwani, Trevor Hastie

    Abstract: Recommender systems have become crucial in the modern digital landscape, where personalized content, products, and services are essential for enhancing user experience. This paper explores statistical models for recommender systems, focusing on crossed random effects models and factor analysis. We extend the crossed random effects model to include random slopes, enabling the capture of varying cov… ▽ More

    Submitted 11 August, 2024; originally announced August 2024.

  17. arXiv:2407.21090  [pdf, other

    cs.LG

    Learning Optimal Signal Temporal Logic Decision Trees for Classification: A Max-Flow MILP Formulation

    Authors: Kaier Liang, Gustavo A. Cardona, Disha Kamale, Cristian-Ioan Vasile

    Abstract: This paper presents a novel framework for inferring timed temporal logic properties from data. The dataset comprises pairs of finite-time system traces and corresponding labels, denoting whether the traces demonstrate specific desired behaviors, e.g. whether the ship follows a safe route or not. Our proposed approach leverages decision-tree-based methods to infer Signal Temporal Logic classifiers… ▽ More

    Submitted 14 August, 2024; v1 submitted 30 July, 2024; originally announced July 2024.

  18. arXiv:2407.14030  [pdf, other

    cs.CL cs.AI cs.IR cs.LG

    HeCiX: Integrating Knowledge Graphs and Large Language Models for Biomedical Research

    Authors: Prerana Sanjay Kulkarni, Muskaan Jain, Disha Sheshanarayana, Srinivasan Parthiban

    Abstract: Despite advancements in drug development strategies, 90% of clinical trials fail. This suggests overlooked aspects in target validation and drug optimization. In order to address this, we introduce HeCiX-KG, Hetionet-Clinicaltrials neXus Knowledge Graph, a novel fusion of data from ClinicalTrials.gov and Hetionet in a single knowledge graph. HeCiX-KG combines data on previously conducted clinical… ▽ More

    Submitted 19 July, 2024; originally announced July 2024.

    Comments: 8 pages, 3 figures, under review

  19. arXiv:2406.17102  [pdf, other

    cs.LG cs.CY

    Achieving Fairness Across Local and Global Models in Federated Learning

    Authors: Disha Makhija, Xing Han, Joydeep Ghosh, Yejin Kim

    Abstract: Achieving fairness across diverse clients in Federated Learning (FL) remains a significant challenge due to the heterogeneity of the data and the inaccessibility of sensitive attributes from clients' private datasets. This study addresses this issue by introducing \texttt{EquiFL}, a novel approach designed to enhance both local and global fairness in federated learning environments. \texttt{EquiFL… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  20. arXiv:2406.01848  [pdf, other

    cs.RO

    Optimal Control Synthesis with Relaxed Global Temporal Logic Specifications for Homogeneous Multi-robot Teams

    Authors: Disha Kamale, Cristian-Ioan Vasile

    Abstract: In this work, we address the problem of control synthesis for a homogeneous team of robots given a global temporal logic specification and formal user preferences for relaxation in case of infeasibility. The relaxation preferences are represented as a Weighted Finite-state Edit System and are used to compute a relaxed specification automaton that captures all allowable relaxations of the mission s… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  21. A Design Space for Intelligent and Interactive Writing Assistants

    Authors: Mina Lee, Katy Ilonka Gero, John Joon Young Chung, Simon Buckingham Shum, Vipul Raheja, Hua Shen, Subhashini Venugopalan, Thiemo Wambsganss, David Zhou, Emad A. Alghamdi, Tal August, Avinash Bhat, Madiha Zahrah Choksi, Senjuti Dutta, Jin L. C. Guo, Md Naimul Hoque, Yewon Kim, Simon Knight, Seyed Parsa Neshaei, Agnia Sergeyuk, Antonette Shibani, Disha Shrivastava, Lila Shroff, Jessi Stark, Sarah Sterman , et al. (11 additional authors not shown)

    Abstract: In our era of rapid technological advancement, the research landscape for writing assistants has become increasingly fragmented across various research communities. We seek to address this challenge by proposing a design space as a structured way to examine and explore the multidimensional space of intelligent and interactive writing assistants. Through a large community collaboration, we explore… ▽ More

    Submitted 26 March, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

    Comments: Published as a conference paper at CHI 2024

  22. arXiv:2403.05530  [pdf, other

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1112 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More

    Submitted 16 December, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  23. arXiv:2402.17705  [pdf, other

    cs.LG

    Federated Learning for Estimating Heterogeneous Treatment Effects

    Authors: Disha Makhija, Joydeep Ghosh, Yejin Kim

    Abstract: Machine learning methods for estimating heterogeneous treatment effects (HTE) facilitate large-scale personalized decision-making across various domains such as healthcare, policy making, education, and more. Current machine learning approaches for HTE require access to substantial amounts of data per treatment, and the high costs associated with interventions makes centrally collecting so much da… ▽ More

    Submitted 24 June, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

  24. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1326 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 9 May, 2025; v1 submitted 18 December, 2023; originally announced December 2023.

  25. arXiv:2311.09564  [pdf, other

    cs.CL cs.AI

    LongBoX: Evaluating Transformers on Long-Sequence Clinical Tasks

    Authors: Mihir Parmar, Aakanksha Naik, Himanshu Gupta, Disha Agrawal, Chitta Baral

    Abstract: Many large language models (LLMs) for medicine have largely been evaluated on short texts, and their ability to handle longer sequences such as a complete electronic health record (EHR) has not been systematically explored. Assessing these models on long sequences is crucial since prior work in the general domain has demonstrated performance degradation of LLMs on longer texts. Motivated by this,… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

    Comments: 8 pages

  26. Drone-Enabled Load Management for Solar Small Cell Networks in Next-Gen Communications Optimization for Solar Small Cells

    Authors: Daksh Dave, Dhruv Khut, Sahil Nawale, Pushkar Aggrawal, Disha Rastogi, Kailas Devadkar

    Abstract: In recent years, the cellular industry has witnessed a major evolution in communication technologies. It is evident that the Next Generation of cellular networks(NGN) will play a pivotal role in the acceptance of emerging IoT applications supporting high data rates, better Quality of Service(QoS), and reduced latency. However, the deployment of NGN will introduce a power overhead on the communicat… ▽ More

    Submitted 5 November, 2023; originally announced November 2023.

    Comments: 5 pages, 3 figures, 1 table, 1 algorithm

  27. arXiv:2309.07347  [pdf, other

    cs.RO

    Energy-Constrained Active Exploration Under Incremental-Resolution Symbolic Perception

    Authors: Disha Kamale, Sofie Haesaert, Cristian-Ioan Vasile

    Abstract: In this work, we consider the problem of autonomous exploration in search of targets while respecting a fixed energy budget. The robot is equipped with an incremental-resolution symbolic perception module wherein the perception of targets in the environment improves as the robot's distance from targets decreases. We assume no prior information about the total number of targets, their locations as… ▽ More

    Submitted 13 September, 2023; originally announced September 2023.

  28. arXiv:2306.10998  [pdf, other

    cs.LG cs.AI cs.PL cs.SE

    RepoFusion: Training Code Models to Understand Your Repository

    Authors: Disha Shrivastava, Denis Kocetkov, Harm de Vries, Dzmitry Bahdanau, Torsten Scholak

    Abstract: Despite the huge success of Large Language Models (LLMs) in coding assistants like GitHub Copilot, these models struggle to understand the context present in the repository (e.g., imports, parent classes, files with similar names, etc.), thereby producing inaccurate code completions. This effect is more pronounced when using these assistants for repositories that the model has not seen during trai… ▽ More

    Submitted 19 June, 2023; originally announced June 2023.

  29. arXiv:2306.07959  [pdf, other

    cs.LG

    Privacy Preserving Bayesian Federated Learning in Heterogeneous Settings

    Authors: Disha Makhija, Joydeep Ghosh, Nhat Ho

    Abstract: In several practical applications of federated learning (FL), the clients are highly heterogeneous in terms of both their data and compute resources, and therefore enforcing the same model architecture for each client is very limiting. Moreover, the need for uncertainty quantification and data privacy constraints are often particularly amplified for clients that have limited local data. This paper… ▽ More

    Submitted 13 June, 2023; originally announced June 2023.

  30. arXiv:2304.02993  [pdf, other

    cs.RO cs.CL cs.HC

    Natural Language Robot Programming: NLP integrated with autonomous robotic grasping

    Authors: Muhammad Arshad Khan, Max Kenney, Jack Painter, Disha Kamale, Riza Batista-Navarro, Amir Ghalamzan-E

    Abstract: In this paper, we present a grammar-based natural language framework for robot programming, specifically for pick-and-place tasks. Our approach uses a custom dictionary of action words, designed to store together words that share meaning, allowing for easy expansion of the vocabulary by adding more action words from a lexical database. We validate our Natural Language Robot Programming (NLRP) fram… ▽ More

    Submitted 6 April, 2023; originally announced April 2023.

    Comments: submitted to IROS 2023

  31. arXiv:2304.02822  [pdf, other

    cs.HC cs.CL cs.LG

    Approach Intelligent Writing Assistants Usability with Seven Stages of Action

    Authors: Avinash Bhat, Disha Shrivastava, Jin L. C. Guo

    Abstract: Despite the potential of Large Language Models (LLMs) as writing assistants, they are plagued by issues like coherence and fluency of the model output, trustworthiness, ownership of the generated content, and predictability of model performance, thereby limiting their usability. In this position paper, we propose to adopt Norman's seven stages of action as a framework to approach the interaction d… ▽ More

    Submitted 5 April, 2023; originally announced April 2023.

    Comments: The Second Workshop on Intelligent and Interactive Writing Assistants co-located with The ACM CHI Conference on Human Factors in Computing Systems (CHI 2023)

  32. arXiv:2303.09416  [pdf, other

    cs.RO cs.CV

    Symbolic Perception Risk in Autonomous Driving

    Authors: Guangyi Liu, Disha Kamale, Cristian-Ioan Vasile, Nader Motee

    Abstract: We develop a novel framework to assess the risk of misperception in a traffic sign classification task in the presence of exogenous noise. We consider the problem in an autonomous driving setting, where visual input quality gradually improves due to improved resolution, and less noise since the distance to traffic signs decreases. Using the estimated perception statistics obtained using the standa… ▽ More

    Submitted 16 March, 2023; originally announced March 2023.

    Comments: Accepted at 2023 American Control Conference

  33. arXiv:2303.07881  [pdf, ps, other

    cs.IT

    An Algorithm to find the Generators of Multidimensional Cyclic Codes over a Finite Chain Ring

    Authors: Disha, Sucheta Dutt

    Abstract: The aim of this paper is to determine the algebraic structure of multidimensional cyclic codes over a finite chain ring $\mathfrak{R}$. An algorithm to find the generator polynomials of $n$ dimensional ($n$D) cyclic codes of length $m_{1}m_{2}\dots m_{n}$ over $\mathfrak{R}$ has been developed using the generator polynomials of cyclic codes over $\mathfrak{R}$. Additionally, the generators of $n$D… ▽ More

    Submitted 14 March, 2023; originally announced March 2023.

  34. arXiv:2209.09818  [pdf, other

    cs.RO

    Cautious Planning with Incremental Symbolic Perception: Designing Verified Reactive Driving Maneuvers

    Authors: Disha Kamale, Sofie Haesaert, Cristian-Ioan Vasile

    Abstract: This work presents a step towards utilizing incrementally-improving symbolic perception knowledge of the robot's surroundings for provably correct reactive control synthesis applied to an autonomous driving problem. Combining abstract models of motion control and information gathering, we show that assume-guarantee specifications (a subclass of Linear Temporal Logic) can be used to define and reso… ▽ More

    Submitted 20 September, 2022; originally announced September 2022.

  35. arXiv:2209.03177  [pdf, other

    eess.IV cs.CV cs.LG

    Morphology-preserving Autoregressive 3D Generative Modelling of the Brain

    Authors: Petru-Daniel Tudosiu, Walter Hugo Lopez Pinaya, Mark S. Graham, Pedro Borges, Virginia Fernandez, Dai Yang, Jeremy Appleyard, Guido Novati, Disha Mehra, Mike Vella, Parashkev Nachev, Sebastien Ourselin, Jorge Cardoso

    Abstract: Human anatomy, morphology, and associated diseases can be studied using medical imaging data. However, access to medical imaging data is restricted by governance and privacy concerns, data ownership, and the cost of acquisition, thus limiting our ability to understand the human body. A possible solution to this issue is the creation of a model able to learn and then generate synthetic images of th… ▽ More

    Submitted 7 September, 2022; originally announced September 2022.

    Comments: 13 pages, 3 figures, 2 tables, accepted at SASHIMI MICCAI 2022

    MSC Class: 68T99 (Primary) 92C55 (Secondary) ACM Class: I.2.1; J.3

  36. arXiv:2207.11321  [pdf, other

    cs.SI cs.LG

    A flexible PageRank-based graph embedding framework closely related to spectral eigenvector embeddings

    Authors: Disha Shur, Yufan Huang, David F. Gleich

    Abstract: We study a simple embedding technique based on a matrix of personalized PageRank vectors seeded on a random set of nodes. We show that the embedding produced by the element-wise logarithm of this matrix (1) are related to the spectral embedding for a class of graphs where spectral embeddings are significant, and hence useful representation of the data, (2) can be done for the entire network or a s… ▽ More

    Submitted 22 July, 2022; originally announced July 2022.

  37. arXiv:2206.12839  [pdf, other

    cs.LG cs.AI cs.PL cs.SE

    Repository-Level Prompt Generation for Large Language Models of Code

    Authors: Disha Shrivastava, Hugo Larochelle, Daniel Tarlow

    Abstract: With the success of large language models (LLMs) of code and their use as code assistants (e.g. Codex used in GitHub Copilot), techniques for introducing domain-specific knowledge in the prompt design process become important. In this work, we propose a framework called Repo-Level Prompt Generator that learns to generate example-specific prompts using prompt proposals. The prompt proposals take co… ▽ More

    Submitted 5 June, 2023; v1 submitted 26 June, 2022; originally announced June 2022.

    Comments: ICML 2023 (Camera-Ready version)

    Journal ref: ICML, 2023

  38. arXiv:2205.12493  [pdf, other

    cs.LG cs.DC

    Federated Self-supervised Learning for Heterogeneous Clients

    Authors: Disha Makhija, Nhat Ho, Joydeep Ghosh

    Abstract: Federated Learning has become an important learning paradigm due to its privacy and computational benefits. As the field advances, two key challenges that still remain to be addressed are: (1) system heterogeneity - variability in the compute and/or data resources present on each client, and (2) lack of labeled data in certain federated settings. Several recent developments have tried to overcome… ▽ More

    Submitted 31 May, 2022; v1 submitted 25 May, 2022; originally announced May 2022.

  39. arXiv:2202.07757  [pdf, other

    cs.LG

    Architecture Agnostic Federated Learning for Neural Networks

    Authors: Disha Makhija, Xing Han, Nhat Ho, Joydeep Ghosh

    Abstract: With growing concerns regarding data privacy and rapid increase in data volume, Federated Learning(FL) has become an important learning paradigm. However, jointly learning a deep neural network model in a FL setting proves to be a non-trivial task because of the complexities associated with the neural networks, such as varied architectures across clients, permutation invariance of the neurons, and… ▽ More

    Submitted 7 July, 2022; v1 submitted 15 February, 2022; originally announced February 2022.

  40. Auto Tuning of Hadoop and Spark parameters

    Authors: Tanuja Patanshetti, Ashish Anil Pawar, Disha Patel, Sanket Thakare

    Abstract: Data of the order of terabytes, petabytes, or beyond is known as Big Data. This data cannot be processed using the traditional database software, and hence there comes the need for Big Data Platforms. By combining the capabilities and features of various big data applications and utilities, Big Data Platforms form a single solution. It is a platform that helps to develop, deploy and manage the big… ▽ More

    Submitted 3 November, 2021; originally announced November 2021.

    Comments: 12 Pages, 9 Figures, 12 Tables, Published with International Journal of Engineering Trends and Technology (IJETT)

    Journal ref: International Journal of Engineering Trends and Technology 69.11(2021):22-33

  41. arXiv:2107.13650  [pdf, other

    cs.RO

    Automata-based Optimal Planning with Relaxed Specifications

    Authors: Disha Kamale, Eleni Karyofylli, Cristian-Ioan Vasile

    Abstract: In this paper, we introduce an automata-based framework for planning with relaxed specifications. User relaxation preferences are represented as weighted finite state edit systems that capture permissible operations on the specification, substitution and deletion of tasks, with complex constraints on ordering and grouping. We propose a three-way product automaton construction method that allows us… ▽ More

    Submitted 28 July, 2021; originally announced July 2021.

    Comments: To be presented at International Conference on Intelligent Robots and Systems (IROS 2021)

  42. arXiv:2106.07175  [pdf, other

    cs.LG cs.AI cs.PL cs.SE

    Learning to Combine Per-Example Solutions for Neural Program Synthesis

    Authors: Disha Shrivastava, Hugo Larochelle, Daniel Tarlow

    Abstract: The goal of program synthesis from examples is to find a computer program that is consistent with a given set of input-output examples. Most learning-based approaches try to find a program that satisfies all examples at once. Our work, by contrast, considers an approach that breaks the problem into two stages: (a) find programs that satisfy only one example, and (b) leverage these per-example solu… ▽ More

    Submitted 1 November, 2021; v1 submitted 14 June, 2021; originally announced June 2021.

    Comments: NeurIPS 2021 (camera-ready version)

  43. arXiv:2106.01051  [pdf, other

    cs.CL

    Minimax and Neyman-Pearson Meta-Learning for Outlier Languages

    Authors: Edoardo Maria Ponti, Rahul Aralikatte, Disha Shrivastava, Siva Reddy, Anders Søgaard

    Abstract: Model-agnostic meta-learning (MAML) has been recently put forth as a strategy to learn resource-poor languages in a sample-efficient fashion. Nevertheless, the properties of these languages are often not well represented by those available during training. Hence, we argue that the i.i.d. assumption ingrained in MAML makes it ill-suited for cross-lingual NLP. In fact, under a decision-theoretic fra… ▽ More

    Submitted 2 June, 2021; originally announced June 2021.

    Comments: Findings of ACL 2021

  44. arXiv:2011.08575  [pdf, other

    cs.LG cs.CY

    Audience Creation for Consumables -- Simple and Scalable Precision Merchandising for a Growing Marketplace

    Authors: Shreyas S, Harsh Maheshwari, Avijit Saha, Samik Datta, Shashank Jain, Disha Makhija, Anuj Nagpal, Sneha Shukla, Suyash S

    Abstract: Consumable categories, such as grocery and fast-moving consumer goods, are quintessential to the growth of e-commerce marketplaces in developing countries. In this work, we present the design and implementation of a precision merchandising system, which creates audience sets from over 10 million consumers and is deployed at Flipkart Supermart, one of the largest online grocery stores in India. We… ▽ More

    Submitted 17 November, 2020; originally announced November 2020.

    Comments: 10 pages

  45. arXiv:2009.03468  [pdf

    cs.AR

    Quad-Core RSA Processor with Countermeasure Against Power Analysis Attacks

    Authors: Javad Bagherzadeh, Vishishtha Bothra, Disha Gujar, Sugandha Gupta, Jinal Shah

    Abstract: Rivest-Shamir-Adleman (RSA) cryptosystem uses modular multiplication for encryption and decryption. So, performance of RSA can be drastically improved by optimizing modular multiplication. This paper proposes a new parallel, high-radix Montgomery multiplier for 1024 bits multi-core RSA processor. Each computation step operates in radix 4. The computation speed is increased by more than 4 times. We… ▽ More

    Submitted 7 September, 2020; originally announced September 2020.

  46. arXiv:2003.11768   

    cs.LG cs.AI cs.SE stat.ML

    On-the-Fly Adaptation of Source Code Models using Meta-Learning

    Authors: Disha Shrivastava, Hugo Larochelle, Daniel Tarlow

    Abstract: The ability to adapt to unseen, local contexts is an important challenge that successful models of source code must overcome. One of the most popular approaches for the adaptation of such models is dynamic evaluation. With dynamic evaluation, when running a model on an unseen file, the model is updated immediately after having observed each token in that file. In this work, we propose instead to f… ▽ More

    Submitted 19 September, 2020; v1 submitted 26 March, 2020; originally announced March 2020.

    Comments: This paper has been withdrawn because we found a bug in the FOMAML implementation that invalidates some of the key claims in the paper

  47. arXiv:2002.00336  [pdf, other

    cs.CV cs.LG cs.RO eess.IV

    3D Object Detection on Point Clouds using Local Ground-aware and Adaptive Representation of scenes' surface

    Authors: Arun CS Kumar, Disha Ahuja, Ashwath Aithal

    Abstract: A novel, adaptive ground-aware, and cost-effective 3D Object Detection pipeline is proposed. The ground surface representation introduced in this paper, in comparison to its uni-planar counterparts (methods that model the surface of a whole 3D scene using single plane), is far more accurate while being ~10x faster. The novelty of the ground representation lies both in the way in which the ground s… ▽ More

    Submitted 26 June, 2020; v1 submitted 2 February, 2020; originally announced February 2020.

  48. arXiv:1906.03574  [pdf, other

    cs.LG cs.AI stat.ML

    Transfer Learning by Modeling a Distribution over Policies

    Authors: Disha Shrivastava, Eeshan Gunesh Dhekane, Riashat Islam

    Abstract: Exploration and adaptation to new tasks in a transfer learning setup is a central challenge in reinforcement learning. In this work, we build on the idea of modeling a distribution over policies in a Bayesian deep reinforcement learning setup to propose a transfer strategy. Recent works have shown to induce diversity in the learned policies by maximizing the entropy of a distribution of policies (… ▽ More

    Submitted 9 June, 2019; originally announced June 2019.

    Comments: Accepted at the ICML 2019 workshop on Multi-Task and Lifelong Reinforcement Learning

  49. arXiv:1809.06247  [pdf

    cs.CV cs.LG stat.ML

    Left Ventricle Segmentation and Volume Estimation on Cardiac MRI using Deep Learning

    Authors: Ehab Abdelmaguid, Jolene Huang, Sanjay Kenchareddy, Disha Singla, Laura Wilke, Mai H. Nguyen, Ilkay Altintas

    Abstract: In the United States, heart disease is the leading cause of death for both men and women, accounting for 610,000 deaths each year [1]. Physicians use Magnetic Resonance Imaging (MRI) scans to take images of the heart in order to non-invasively estimate its structural and functional parameters for cardiovascular diagnosis and disease management. The end-systolic volume (ESV) and end-diastolic volum… ▽ More

    Submitted 21 November, 2018; v1 submitted 14 September, 2018; originally announced September 2018.

    Comments: 42 pages

  50. arXiv:1809.00414  [pdf, ps, other

    cs.IR cs.AI cs.CL cs.LG

    Hypernyms Through Intra-Article Organization in Wikipedia

    Authors: Disha Shrivastava, Sreyash Kenkre, Santosh Penubothula

    Abstract: We introduce a new measure for unsupervised hypernym detection and directionality. The motivation is to keep the measure computationally light and portatable across languages. We show that the relative physical location of words in explanatory articles captures the directionality property. Further, the phrases in section titles of articles about the word, capture the semantic similarity needed for… ▽ More

    Submitted 2 September, 2018; originally announced September 2018.