Showing 1–50 of 110 results for author: Rao, P

Searching in archive cs.
  1. arXiv:2505.23997

    cs.HC cs.CY

    Fitting the Message to the Moment: Designing Calendar-Aware Stress Messaging with Large Language Models

    Authors: Pranav Rao, Maryam Taj, Alex Mariakakis, Joseph Jay Williams, Ananya Bhattacharjee

    Abstract: Existing stress-management tools fail to account for the timing and contextual specificity of students' daily lives, often providing static or misaligned support. Digital calendars contain rich, personal indicators of upcoming responsibilities, yet this data is rarely leveraged for adaptive wellbeing interventions. In this short paper, we explore how large language models (LLMs) might use digital…

    Submitted 29 May, 2025; originally announced May 2025.

  2. arXiv:2505.03087

    cs.RO

    Fabrication and Characterization of Additively Manufactured Stretchable Strain Sensors Towards the Shape Sensing of Continuum Robots

    Authors: Daniel C. Moyer, Wenpeng Wang, Logan S. Karschner, Loris Fichera, Pratap M. Rao

    Abstract: This letter describes the manufacturing and experimental characterization of novel stretchable strain sensors for continuum robots. The overarching goal of this research is to provide a new solution for the shape sensing of these devices. The sensors are fabricated via direct ink writing, an extrusion-based additive manufacturing technique. Electrically conductive material (i.e., the ink)…

    Submitted 5 May, 2025; originally announced May 2025.

    Comments: Author's manuscript. Accepted for publication in IEEE Robotics and Automation Letters

  3. arXiv:2504.08952

    cs.SE cs.HC

    RiskRAG: A Data-Driven Solution for Improved AI Model Risk Reporting

    Authors: Pooja S. B. Rao, Sanja Šćepanović, Ke Zhou, Edyta Paulina Bogucka, Daniele Quercia

    Abstract: Risk reporting is essential for documenting AI models, yet only 14% of model cards mention risks, of which 96% copy content from a small set of cards, leading to a lack of actionable insights. Existing proposals for improving model cards do not resolve these issues. To address this, we introduce RiskRAG, a Retrieval Augmented Generation based risk reporting solution guided by five design re…

    Submitted 11 April, 2025; originally announced April 2025.

  4. arXiv:2504.06306

    q-bio.QM cs.AI

    Predicting Survivability of Cancer Patients with Metastatic Patterns Using Explainable AI

    Authors: Polycarp Nalela, Deepthi Rao, Praveen Rao

    Abstract: Cancer remains a leading global health challenge and a major cause of mortality. This study leverages machine learning (ML) to predict the survivability of cancer patients with metastatic patterns using the comprehensive MSK-MET dataset, which includes genomic and clinical data from 25,775 patients across 27 cancer types. We evaluated five ML models-XGBoost, Naïve Bayes, Decision Tree, Logistic Re…

    Submitted 7 April, 2025; originally announced April 2025.

  5. arXiv:2503.21796

    cs.NE cs.LG q-bio.NC

    Meta-Representational Predictive Coding: Biomimetic Self-Supervised Learning

    Authors: Alexander Ororbia, Karl Friston, Rajesh P. N. Rao

    Abstract: Self-supervised learning has become an increasingly important paradigm in the domain of machine intelligence. Furthermore, evidence for self-supervised adaptation, such as contrastive formulations, has emerged in recent computational neuroscience and brain-inspired research. Nevertheless, current work on self-supervised learning relies on biologically implausible credit assignment -- in the form o…

    Submitted 22 March, 2025; originally announced March 2025.

  6. arXiv:2502.12360

    cs.CV cs.AI cs.LG

    Detecting Systematic Weaknesses in Vision Models along Predefined Human-Understandable Dimensions

    Authors: Sujan Sai Gannamaneni, Rohil Prakash Rao, Michael Mock, Maram Akila, Stefan Wrobel

    Abstract: Slice discovery methods (SDMs) are prominent algorithms for finding systematic weaknesses in DNNs. They identify top-k semantically coherent slices/subsets of data where a DNN-under-test has low performance. For being directly useful, slices should be aligned with human-understandable and relevant dimensions, which, for example, are defined by safety and domain experts as part of the operational d…

    Submitted 6 March, 2025; v1 submitted 17 February, 2025; originally announced February 2025.

  7. arXiv:2501.12911

    cs.CR cs.DC cs.LG

    A Selective Homomorphic Encryption Approach for Faster Privacy-Preserving Federated Learning

    Authors: Abdulkadir Korkmaz, Praveen Rao

    Abstract: Federated learning (FL) has come forward as a critical approach for privacy-preserving machine learning in healthcare, allowing collaborative model training across decentralized medical datasets without exchanging clients' data. However, current security implementations for these systems face a fundamental trade-off: rigorous cryptographic protections like fully homomorphic encryption (FHE) impose…

    Submitted 5 June, 2025; v1 submitted 22 January, 2025; originally announced January 2025.

    Comments: 18 pages, 18 figures

    ACM Class: C.2.0; C.2.4; I.2.6

  8. arXiv:2411.10406

    quant-ph cond-mat.dis-nn cs.AI cs.DC

    How to Build a Quantum Supercomputer: Scaling from Hundreds to Millions of Qubits

    Authors: Masoud Mohseni, Artur Scherer, K. Grace Johnson, Oded Wertheim, Matthew Otten, Navid Anjum Aadit, Yuri Alexeev, Kirk M. Bresniker, Kerem Y. Camsari, Barbara Chapman, Soumitra Chatterjee, Gebremedhin A. Dagnew, Aniello Esposito, Farah Fahim, Marco Fiorentino, Archit Gajjar, Abdullah Khalid, Xiangzhou Kong, Bohdan Kulchytskyy, Elica Kyoseva, Ruoyu Li, P. Aaron Lott, Igor L. Markov, Robert F. McDermott, Giacomo Pedretti, et al. (16 additional authors not shown)

    Abstract: In the span of four decades, quantum computation has evolved from an intellectual curiosity to a potentially realizable technology. Today, small-scale demonstrations have become possible for quantum algorithmic primitives on hundreds of physical qubits and proof-of-principle error-correction on a single logical qubit. Nevertheless, despite significant progress and excitement, the path toward a ful…

    Submitted 31 January, 2025; v1 submitted 15 November, 2024; originally announced November 2024.

    Comments: 76 pages, 46 figures. General revision, added figures, added references, added appendices

  9. arXiv:2410.15998

    cs.CL cs.AI cs.LG

    1024m at SMM4H 2024: Tasks 3, 5 & 6 -- Ensembles of Transformers and Large Language Models for Medical Text Classification

    Authors: Ram Mohan Rao Kadiyala, M. V. P. Chandra Sekhara Rao

    Abstract: Social media is a great source of data from users reporting information regarding their health and how various things have had an effect on them. This paper presents various approaches using Transformers and Large Language Models and their ensembles, their performance along with advantages and drawbacks for various tasks of SMM4H'24 - Classifying texts on impact of nature and outdoor spaces on…

    Submitted 21 October, 2024; originally announced October 2024.

    Comments: Short paper, ACL 2024

  10. arXiv:2410.14357

    quant-ph cs.DC hep-ph physics.chem-ph

    Efficient charge-preserving excited state preparation with variational quantum algorithms

    Authors: Zohim Chandani, Kazuki Ikeda, Zhong-Bo Kang, Dmitri E. Kharzeev, Alexander McCaskey, Andrea Palermo, C. R. Ramakrishnan, Pooja Rao, Ranjani G. Sundaram, Kwangmin Yu

    Abstract: Determining the spectrum and wave functions of excited states of a system is crucial in quantum physics and chemistry. Low-depth quantum algorithms, such as the Variational Quantum Eigensolver (VQE) and its variants, can be used to determine the ground-state energy. However, current approaches to computing excited states require numerous controlled unitaries, making the application of the original…

    Submitted 18 October, 2024; originally announced October 2024.

    Comments: 20 pages, 6 figures, 1 table

  11. arXiv:2409.16732

    cs.HC

    Perfectly to a Tee: Understanding User Perceptions of Personalized LLM-Enhanced Narrative Interventions

    Authors: Ananya Bhattacharjee, Sarah Yi Xu, Pranav Rao, Yuchen Zeng, Jonah Meyerhoff, Syed Ishtiaque Ahmed, David C Mohr, Michael Liut, Alex Mariakakis, Rachel Kornfield, Joseph Jay Williams

    Abstract: Stories about overcoming personal struggles can effectively illustrate the application of psychological theories in real life, yet they may fail to resonate with individuals' experiences. In this work, we employ large language models (LLMs) to create tailored narratives that acknowledge and address unique challenging thoughts and situations faced by individuals. Our study, involving 346 young adul…

    Submitted 12 May, 2025; v1 submitted 25 September, 2024; originally announced September 2024.

  12. arXiv:2408.15922

    cs.CV

    DiffAge3D: Diffusion-based 3D-aware Face Aging

    Authors: Junaid Wahid, Fangneng Zhan, Pramod Rao, Christian Theobalt

    Abstract: Face aging is the process of converting an individual's appearance to a younger or older version of themselves. Existing face aging techniques have been limited to 2D settings, which often weaken their applications as there is a growing demand for 3D face modeling. Moreover, existing aging methods struggle to perform faithful aging, maintain identity, and retain the fine details of the input image…

    Submitted 28 August, 2024; originally announced August 2024.

  13. arXiv:2407.20879

    cs.AI q-bio.QM

    A Scalable Tool For Analyzing Genomic Variants Of Humans Using Knowledge Graphs and Machine Learning

    Authors: Shivika Prasanna, Ajay Kumar, Deepthi Rao, Eduardo Simoes, Praveen Rao

    Abstract: The integration of knowledge graphs and graph machine learning (GML) in genomic data analysis offers several opportunities for understanding complex genetic relationships, especially at the RNA level. We present a comprehensive approach for leveraging these technologies to analyze genomic variants, specifically in the context of RNA sequencing (RNA-seq) data from COVID-19 patient samples. The prop…

    Submitted 30 July, 2024; originally announced July 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2312.04423

  14. arXiv:2407.12964

    cs.RO cs.AI

    Learning Long-Horizon Predictions for Quadrotor Dynamics

    Authors: Pratyaksh Prabhav Rao, Alessandro Saviolo, Tommaso Castiglione Ferrari, Giuseppe Loianno

    Abstract: Accurate modeling of system dynamics is crucial for achieving high-performance planning and control of robotic systems. Although existing data-driven approaches represent a promising approach for modeling dynamics, their accuracy is limited to a short prediction horizon, overlooking the impact of compounding prediction errors over longer prediction horizons. Strategies to mitigate these cumulative…

    Submitted 17 July, 2024; originally announced July 2024.

    Comments: 8 pages, 5 figures, 3 tables. Paper accepted by IROS 2024

  15. Lite2Relight: 3D-aware Single Image Portrait Relighting

    Authors: Pramod Rao, Gereon Fox, Abhimitra Meka, Mallikarjun B R, Fangneng Zhan, Tim Weyrich, Bernd Bickel, Hanspeter Pfister, Wojciech Matusik, Mohamed Elgharib, Christian Theobalt

    Abstract: Achieving photorealistic 3D view synthesis and relighting of human portraits is pivotal for advancing AR/VR applications. Existing methodologies in portrait relighting demonstrate substantial limitations in terms of generalization and 3D consistency, coupled with inaccuracies in physically realistic lighting and identity preservation. Furthermore, personalization from a single view is difficult to…

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: Accepted at SIGGRAPH '24: ACM SIGGRAPH 2024 Conference Papers

  16. arXiv:2407.08767

    quant-ph cs.ET cs.RO

    A Quantum Computing Approach for Multi-robot Coverage Path Planning

    Authors: Poojith U Rao, Florian Speelman, Balwinder Sodhi, Sachin Kinge

    Abstract: This paper tackles the multi-vehicle Coverage Path Planning (CPP) problem, crucial for applications like search and rescue or environmental monitoring. Due to its NP-hard nature, finding optimal solutions becomes infeasible with larger problem sizes. This motivates the development of heuristic approaches that enhance efficiency even marginally. We propose a novel approach for exploring paths in a…

    Submitted 11 July, 2024; originally announced July 2024.

  17. arXiv:2406.05812

    cs.CL cs.AI

    Seventeenth-Century Spanish American Notary Records for Fine-Tuning Spanish Large Language Models

    Authors: Shraboni Sarker, Ahmad Tamim Hamad, Hulayyil Alshammari, Viviana Grieco, Praveen Rao

    Abstract: Large language models have gained tremendous popularity in domains such as e-commerce, finance, healthcare, and education. Fine-tuning is a common approach to customize an LLM on a domain-specific dataset for a desired downstream task. In this paper, we present a valuable resource for fine-tuning LLMs developed for the Spanish language to perform a variety of tasks such as classification, masked l…

    Submitted 9 June, 2024; originally announced June 2024.

  18. arXiv:2405.19426

    cs.CL cs.SD eess.AS

    Deep Learning for Assessment of Oral Reading Fluency

    Authors: Mithilesh Vaidya, Binaya Kumar Sahoo, Preeti Rao

    Abstract: Reading fluency assessment is a critical component of literacy programmes, serving to guide and monitor early education interventions. Given the resource intensive nature of the exercise when conducted by teachers, the development of automatic tools that can operate on audio recordings of oral reading is attractive as an objective and highly scalable solution. Multiple complex aspects such as accu…

    Submitted 1 June, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

  19. arXiv:2405.09572

    eess.SP cs.AI

    Deep Neural Operator Enabled Digital Twin Modeling for Additive Manufacturing

    Authors: Ning Liu, Xuxiao Li, Manoj R. Rajanna, Edward W. Reutzel, Brady Sawyer, Prahalada Rao, Jim Lua, Nam Phan, Yue Yu

    Abstract: A digital twin (DT), with the components of a physics-based model, a data-driven model, and a machine learning (ML) enabled efficient surrogate, behaves as a virtual twin of the real-world physical process. In terms of Laser Powder Bed Fusion (L-PBF) based additive manufacturing (AM), a DT can predict the current and future states of the melt pool and the resulting defects corresponding to the inp…

    Submitted 12 May, 2024; originally announced May 2024.

  20. Crowdsourcing Dermatology Images with Google Search Ads: Creating a Real-World Skin Condition Dataset

    Authors: Abbi Ward, Jimmy Li, Julie Wang, Sriram Lakshminarasimhan, Ashley Carrick, Bilson Campana, Jay Hartford, Pradeep Kumar S, Tiya Tiyasirichokchai, Sunny Virmani, Renee Wong, Yossi Matias, Greg S. Corrado, Dale R. Webster, Dawn Siegel, Steven Lin, Justin Ko, Alan Karthikesalingam, Christopher Semturs, Pooja Rao

    Abstract: Background: Health datasets from clinical sources do not reflect the breadth and diversity of disease in the real world, impacting research, medical education, and artificial intelligence (AI) tool development. Dermatology is a suitable area to develop and test a new and scalable method to create representative health datasets. Methods: We used Google Search advertisements to invite contribution…

    Submitted 28 February, 2024; originally announced February 2024.

    Journal ref: JAMA Network Open (2024)

  21. arXiv:2402.14175

    cs.RO

    Towards Contact-Aided Motion Planning for Tendon-Driven Continuum Robots

    Authors: Priyanka Rao, Oren Salzman, Jessica Burgner-Kahrs

    Abstract: Tendon-driven continuum robots (TDCRs), with their flexible backbones, offer the advantage of being used for navigating complex, cluttered environments. However, to do so, they typically require multiple segments, often leading to complex actuation and control challenges. To this end, we propose a novel approach to navigate cluttered spaces effectively for a single-segment long TDCR which is the s…

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: 8 pages, 5 figures, 2 tables

  22. arXiv:2401.05121

    cs.ET cs.LG

    Photonics for Sustainable Computing

    Authors: Farbin Fayza, Satyavolu Papa Rao, Darius Bunandar, Udit Gupta, Ajay Joshi

    Abstract: Photonic integrated circuits are finding use in a variety of applications including optical transceivers, LIDAR, bio-sensing, photonic quantum computing, and Machine Learning (ML). In particular, with the exponentially increasing sizes of ML models, photonics-based accelerators are getting special attention as a sustainable solution because they can perform ML inferences with multiple orders of ma…

    Submitted 10 January, 2024; originally announced January 2024.

  23. arXiv:2312.17479

    cs.AI cs.CY cs.HC cs.LG

    Culturally-Attuned Moral Machines: Implicit Learning of Human Value Systems by AI through Inverse Reinforcement Learning

    Authors: Nigini Oliveira, Jasmine Li, Koosha Khalvati, Rodolfo Cortes Barragan, Katharina Reinecke, Andrew N. Meltzoff, Rajesh P. N. Rao

    Abstract: Constructing a universal moral code for artificial intelligence (AI) is difficult or even impossible, given that different human cultures have different definitions of morality and different societal norms. We therefore argue that the value system of an AI should be culturally attuned: just as a child raised in a particular culture learns the specific values and norms of that culture, we propose t…

    Submitted 29 December, 2023; originally announced December 2023.

  24. arXiv:2312.04423

    cs.AI cs.DB q-bio.QM

    Scalable Knowledge Graph Construction and Inference on Human Genome Variants

    Authors: Shivika Prasanna, Deepthi Rao, Eduardo Simoes, Praveen Rao

    Abstract: Real-world knowledge can be represented as a graph consisting of entities and relationships between the entities. The need for efficient and scalable solutions arises when dealing with vast genomic data, like RNA-sequencing. Knowledge graphs offer a powerful approach for various tasks in such large-scale genomic data, such as analysis and inference. In this work, variant-level information extracte…

    Submitted 7 December, 2023; originally announced December 2023.

  25. arXiv:2311.17705

    cs.SE

    Q-PAC: Automated Detection of Quantum Bug-Fix Patterns

    Authors: Pranav K. Nayak, Krishn V. Kher, M. Bharat Chandra, M. V. Panduranga Rao, Lei Zhang

    Abstract: Context: Bug-fix pattern detection has been investigated in the past in the context of classical software. However, while quantum software is developing rapidly, the literature still lacks automated methods and tools to identify, analyze, and detect bug-fix patterns. To the best of our knowledge, our work previously published in SEKE'23 was the first to leverage classical techniques to detect bug-…

    Submitted 29 November, 2023; originally announced November 2023.

    Comments: 16 pages, 2 figures

  26. arXiv:2310.04781

    cs.RO

    Unifying Foundation Models with Quadrotor Control for Visual Tracking Beyond Object Categories

    Authors: Alessandro Saviolo, Pratyaksh Rao, Vivek Radhakrishnan, Jiuhong Xiao, Giuseppe Loianno

    Abstract: Visual control enables quadrotors to adaptively navigate using real-time sensory data, bridging perception with action. Yet, challenges persist, including generalization across scenarios, maintaining reliability, and ensuring real-time responsiveness. This paper introduces a perception framework grounded in foundation models for universal object detection and tracking, moving beyond specific train…

    Submitted 8 April, 2024; v1 submitted 7 October, 2023; originally announced October 2023.

  27. arXiv:2308.11809

    q-bio.NC cs.AI cs.NE

    Expressive probabilistic sampling in recurrent neural networks

    Authors: Shirui Chen, Linxing Preston Jiang, Rajesh P. N. Rao, Eric Shea-Brown

    Abstract: In sampling-based Bayesian models of brain function, neural activities are assumed to be samples from probability distributions that the brain uses for probabilistic computation. However, a comprehensive understanding of how mechanistic models of neural dynamics can sample from arbitrary distributions is still lacking. We use tools from functional analysis and stochastic differential equations to…

    Submitted 14 November, 2023; v1 submitted 22 August, 2023; originally announced August 2023.

  28. arXiv:2308.07870

    cs.AI cs.LG cs.NE

    A Survey on Brain-Inspired Deep Learning via Predictive Coding

    Authors: Tommaso Salvatori, Ankur Mali, Christopher L. Buckley, Thomas Lukasiewicz, Rajesh P. N. Rao, Karl Friston, Alexander Ororbia

    Abstract: Artificial intelligence (AI) is rapidly becoming one of the key technologies of this century. The majority of results in AI thus far have been achieved using deep neural networks trained with the error backpropagation learning algorithm. However, the ubiquitous adoption of this approach has highlighted some important limitations such as substantial computational cost, difficulty in quantifying unc…

    Submitted 23 January, 2025; v1 submitted 15 August, 2023; originally announced August 2023.

    Comments: 37 Pages, 9 Figures

  29. arXiv:2306.09384

    eess.AS cs.AI

    MobileASR: A resource-aware on-device learning framework for user voice personalization applications on mobile phones

    Authors: Zitha Sasindran, Harsha Yelchuri, Pooja Rao, T. V. Prabhakar

    Abstract: We describe a comprehensive methodology for developing user-voice personalized automatic speech recognition (ASR) models by effectively training models on mobile phones, allowing user data and models to be stored and used locally. To achieve this, we propose a resource-aware sub-model-based training approach that considers the RAM and battery capabilities of mobile phones. By considering the eval…

    Submitted 9 November, 2023; v1 submitted 15 June, 2023; originally announced June 2023.

    Comments: Accepted in AIMLSystems 2023

  30. arXiv:2306.03494

    eess.IV cs.CV

    Structurally Different Neural Network Blocks for the Segmentation of Atrial and Aortic Perivascular Adipose Tissue in Multi-centre CT Angiography Scans

    Authors: Ikboljon Sobirov, Cheng Xie, Muhammad Siddique, Parijat Patel, Kenneth Chan, Thomas Halborg, Christos P. Kotanidis, Zarqaish Fatima, Henry West, Sheena Thomas, Maria Lyasheva, Donna Alexander, David Adlam, Praveen Rao, Das Indrajeet, Aparna Deshpande, Amrita Bajaj, Jonathan C L Rodrigues, Benjamin J Hudson, Vivek Srivastava, George Krasopoulos, Rana Sayeed, Qiang Zhang, Pete Tomlins, Cheerag Shirodaria, et al. (4 additional authors not shown)

    Abstract: Since the emergence of convolutional neural networks (CNNs) and, later, vision transformers (ViTs), deep learning architectures have predominantly relied on identical block types with varying hyperparameters. We propose a novel block alternation strategy to leverage the complementary strengths of different architectural designs, assembling structurally distinct components similar to Lego blocks. W…

    Submitted 28 May, 2025; v1 submitted 6 June, 2023; originally announced June 2023.

    Comments: 15 pages, 4 figures, 3 tables

  31. arXiv:2305.19444

    cs.HC

    Pixelated Interactions: Exploring Pixel Art for Graphical Primitives on a Tactile Display

    Authors: Tigmanshu Bhatnagar, Vikas Upadhyay, Anchal Sharma, P V Madhusudhan Rao, Mark Miodownik, Nicolai Marquardt, Catherine Holloway

    Abstract: Two-dimensional pin array tactile displays enable access to tactile graphics that are important for the education of students with visual impairments. Due to their prohibitive cost, limited access, and limited research within HCI, the rules to design graphical primitives on these low-resolution tactile displays are unclear. In this paper, eight tactile readers with visual impairments qualitatively…

    Submitted 30 May, 2023; originally announced May 2023.

    Comments: 25 pages, 10 figures. To appear in DIS'23 Designing Interactive Systems Conference, July 10 to 14, 2023, Pittsburgh, PA, USA

  32. arXiv:2304.09982

    cs.CL

    Radar de Parité: An NLP system to measure gender representation in French news stories

    Authors: Valentin-Gabriel Soumah, Prashanth Rao, Philipp Eibl, Maite Taboada

    Abstract: We present the Radar de Parité, an automated Natural Language Processing (NLP) system that measures the proportion of women and men quoted daily in six Canadian French-language media outlets. We outline the system's architecture and detail the challenges we overcame to address French-specific issues, in particular regarding coreference resolution, a new contribution to the NLP literature on French…

    Submitted 19 April, 2023; originally announced April 2023.

    Comments: Full conference paper plus appendix

    Journal ref: The 36th Canadian Conference on Artificial Intelligence. 5-9 June 2023, Montréal

  33. arXiv:2211.16988

    cs.CV

    QuadFormer: Quadruple Transformer for Unsupervised Domain Adaptation in Power Line Segmentation of Aerial Images

    Authors: Pratyaksh Prabhav Rao, Feng Qiao, Weide Zhang, Yiliang Xu, Yong Deng, Guangbin Wu, Qiang Zhang

    Abstract: Accurate segmentation of power lines in aerial images is essential to ensure the flight safety of aerial vehicles. Acquiring high-quality ground truth annotations for training a deep learning model is a laborious process. Therefore, developing algorithms that can leverage knowledge from labelled synthetic data to unlabelled real images is highly demanded. This process is studied in Unsupervised do…

    Submitted 28 November, 2022; originally announced November 2022.

  34. arXiv:2210.13461

    cs.LG cs.AI cs.CV cs.NE q-bio.NC

    Active Predictive Coding: A Unified Neural Framework for Learning Hierarchical World Models for Perception and Planning

    Authors: Rajesh P. N. Rao, Dimitrios C. Gklezakos, Vishwas Sathish

    Abstract: Predictive coding has emerged as a prominent model of how the brain learns through predictions, anticipating the importance accorded to predictive learning in recent AI architectures such as transformers. Here we propose a new framework for predictive coding called active predictive coding which can learn hierarchical world models and solve two radically different open problems in AI: (1) how do w…

    Submitted 23 October, 2022; originally announced October 2022.

    Comments: 15 pages, 10 figures, 2 supplementary figures

  35. arXiv:2210.11478

    q-bio.NC cs.AI

    Neural Co-Processors for Restoring Brain Function: Results from a Cortical Model of Grasping

    Authors: Matthew J. Bryan, Linxing Preston Jiang, Rajesh P N Rao

    Abstract: Objective: A major challenge in designing closed-loop brain-computer interfaces is finding optimal stimulation patterns as a function of ongoing neural activity for different subjects and objectives. Approach: To achieve goal-directed closed-loop neurostimulation, we propose "neural co-processors" which use artificial neural networks and deep learning to learn optimal closed-loop stimulation polic…

    Submitted 20 March, 2023; v1 submitted 19 October, 2022; originally announced October 2022.

    Comments: 45 pages, 19 figures. Submitted to the IOP Journal of Neural Engineering

  36. arXiv:2210.04766

    cs.LG physics.chem-ph

    Hierarchical Learning in Euclidean Neural Networks

    Authors: Joshua A. Rackers, Pranav Rao

    Abstract: Equivariant machine learning methods have shown wide success at 3D learning applications in recent years. These models explicitly build in the reflection, translation and rotation symmetries of Euclidean space and have facilitated large advances in accuracy and data efficiency for a range of applications in the physical sciences. An outstanding question for equivariant models is why they achieve s…

    Submitted 10 October, 2022; originally announced October 2022.

    Comments: 9 pages, 3 figures

  37. arXiv:2209.00291

    cs.SD cs.LG cs.MM eess.AS

    Generating Coherent Drum Accompaniment With Fills And Improvisations

    Authors: Rishabh Dahale, Vaibhav Talwadker, Preeti Rao, Prateek Verma

    Abstract: Creating a complex work of art like music necessitates profound creativity. With recent advancements in deep learning and powerful models such as transformers, there has been huge progress in automatic music generation. In an accompaniment generation context, creating a coherent drum pattern with apposite fills and improvisations at proper locations in a song is a challenging task even for an expe…

    Submitted 1 September, 2022; originally announced September 2022.

    Comments: 8 pages, 7 figures, 23rd International Society for Music Information Retrieval Conference (ISMIR 2022), Bengaluru, India

  38. arXiv:2207.10511

    cs.HC cs.AI cs.RO

    A cost effective eye movement tracker based wheel chair control algorithm for people with paraplegia

    Authors: Skanda Upadhyaya, Shravan Bhat, Siddhanth P. Rao, V Ashwin, Krishnan Chemmangat

    Abstract: Spinal cord injuries can often lead to quadriplegia in patients limiting their mobility. Wheelchairs could be a good proposition for patients, but most of them operate either manually or with the help of electric motors operated with a joystick. This, however, requires the use of hands, making it unsuitable for quadriplegic patients. Controlling eye movement, on the other hand, is retained even by…

    Submitted 21 July, 2022; originally announced July 2022.

    Comments: 5 pages, 6 figures

    ACM Class: I.4.8; I.2.9

  39. arXiv:2207.03593

    cs.LG

    Hyper-Universal Policy Approximation: Learning to Generate Actions from a Single Image using Hypernets

    Authors: Dimitrios C. Gklezakos, Rishi Jha, Rajesh P. N. Rao

    Abstract: Inspired by Gibson's notion of object affordances in human vision, we ask the question: how can an agent learn to predict an entire action policy for a novel object or environment given only a single glimpse? To tackle this problem, we introduce the concept of Universal Policy Functions (UPFs) which are state-to-action mappings that generalize not only to new goals but most importantly to novel, u…

    Submitted 7 July, 2022; originally announced July 2022.

  40. arXiv:2206.08462

    cs.CV cs.LG

    Recursive Neural Programs: Variational Learning of Image Grammars and Part-Whole Hierarchies

    Authors: Ares Fisher, Rajesh P. N. Rao

    Abstract: Human vision involves parsing and representing objects and scenes using structured representations based on part-whole hierarchies. Computer vision and machine learning researchers have recently sought to emulate this capability using capsule networks, reference frames and active predictive coding, but a generative model formulation has been lacking. We introduce Recursive Neural Programs (RNPs),…

    Submitted 25 June, 2022; v1 submitted 16 June, 2022; originally announced June 2022.

    Comments: 9 pages, 6 figures. fixed LaTeX typo for algorithm reference

  41. arXiv:2204.06150  [pdf, other]

    quant-ph cs.LG stat.ML

    A quantum generative model for multi-dimensional time series using Hamiltonian learning

    Authors: Haim Horowitz, Pooja Rao, Santosh Kumar Radha

    Abstract: Synthetic data generation has proven to be a promising solution for addressing data availability issues in various domains. Even more challenging is the generation of synthetic time series data, where one has to preserve temporal dynamics, i.e., the generated time series must respect the original relationships between variables across time. Recently proposed techniques such as generative adversari…

    Submitted 12 April, 2022; originally announced April 2022.

  42. arXiv:2204.03166  [pdf]

    eess.AS cs.SD

    Musical Information Extraction from the Singing Voice

    Authors: Preeti Rao

    Abstract: Music information retrieval is currently an active research area that addresses the extraction of musically important information from audio signals, and the applications of such information. The extracted information can be used for search and retrieval of music in recommendation systems, or to aid musicological studies or even in music learning. Sophisticated signal processing techniques are app…

    Submitted 6 April, 2022; originally announced April 2022.

  43. Repairing Brain-Computer Interfaces with Fault-Based Data Acquisition

    Authors: Cailin Winston, Caleb Winston, Chloe N Winston, Claris Winston, Cleah Winston, Rajesh PN Rao, René Just

    Abstract: Brain-computer interfaces (BCIs) decode recorded neural signals from the brain and/or stimulate the brain with encoded neural signals. BCIs span both hardware and software and have a wide range of applications in restorative medicine, from restoring movement through prostheses and robotic limbs to restoring sensation and communication through spellers. BCIs also have applications in diagnostic med…

    Submitted 20 March, 2022; originally announced March 2022.

    Comments: Accepted at International Conference on Software Engineering (ICSE-2022)

  44. arXiv:2203.06583  [pdf]

    cs.SD cs.AI eess.AS

    Bi-Sampling Approach to Classify Music Mood leveraging Raga-Rasa Association in Indian Classical Music

    Authors: Mohan Rao B C, Vinayak Arkachaari, Harsha M N, Sushmitha M N, Gayathri Ramesh K K, Ullas M S, Pathi Mohan Rao, Sudha G, Narayana Darapaneni

    Abstract: The impact of Music on the mood or emotion of the listener is a well-researched area in human psychology and behavioral science. In Indian classical music, ragas are the melodic structure that defines the various styles and forms of the music. Each raga has been found to evoke a specific emotion in the listener. With the advent of advanced capabilities of audio signal processing and the applicatio…

    Submitted 13 March, 2022; originally announced March 2022.

  45. XtraLibD: Detecting Irrelevant Third-Party libraries in Java and Python Applications

    Authors: Ritu Kapur, Poojith U Rao, Agrim Dewan, Balwinder Sodhi

    Abstract: Software development comprises the use of multiple Third-Party Libraries (TPLs). However, the irrelevant libraries present in software application's distributable often lead to excessive consumption of resources such as CPU cycles, memory, and mobile devices' battery usage. Therefore, the identification and removal of unused TPLs present in an application are desirable. We present a rapid, storage…

    Submitted 22 February, 2022; originally announced February 2022.

    Comments: 25 pages, 5 figures, 4 tables, Book Chapter of Springer's Communications in Computer and Information Science, vol 1556. Springer, Cham. Extended version of paper published in Evaluation of Novel Approaches to Software Engineering. ENASE 2021

    Journal ref: Springer's Communications in Computer and Information Science, vol 1556. Springer, Cham. Extended version of paper published in Evaluation of Novel Approaches to Software Engineering. ENASE 2021

  46. arXiv:2202.05206  [pdf, ps, other]

    cs.LG

    Zero Shot Learning for Predicting Energy Usage of Buildings in Sustainable Design

    Authors: Arun Zachariah, Praveen Rao, Brian Corn, Dominique Davison

    Abstract: The 2030 Challenge is aimed at making all new buildings and major renovations carbon neutral by 2030. One of the potential solutions to meet this challenge is through innovative sustainable design strategies. For developing such strategies it is important to understand how the various building factors contribute to energy usage of a building, right at design time. The growth of artificial intellig…

    Submitted 10 February, 2022; originally announced February 2022.

    Comments: To appear in 1st Annual AAAI Workshop on AI to Accelerate Science and Engineering (AI2ASE) @ AAAI 2022

  47. arXiv:2201.08813  [pdf, other]

    cs.CV cs.AI cs.LG

    Active Predictive Coding Networks: A Neural Solution to the Problem of Learning Reference Frames and Part-Whole Hierarchies

    Authors: Dimitrios C. Gklezakos, Rajesh P. N. Rao

    Abstract: We introduce Active Predictive Coding Networks (APCNs), a new class of neural networks that solve a major problem posed by Hinton and others in the fields of artificial intelligence and brain modeling: how can neural networks learn intrinsic reference frames for objects and parse visual scenes into part-whole hierarchies by dynamically allocating nodes in a parse tree? APCNs address this problem b…

    Submitted 14 January, 2022; originally announced January 2022.

  48. arXiv:2112.03871  [pdf, ps, other]

    eess.AS cs.SD

    Training end-to-end speech-to-text models on mobile phones

    Authors: Zitha S, Raghavendra Rao Suresh, Pooja Rao, T. V. Prabhakar

    Abstract: Training state-of-the-art speech-to-text (STT) models on mobile devices is challenging due to their limited resources relative to a server environment. In addition, these models are trained on generic datasets that are not exhaustive in capturing user-specific characteristics. Recently, on-device personalization techniques have been making strides in mitigating the problem. Although many current…

    Submitted 7 December, 2021; originally announced December 2021.

  49. arXiv:2112.00635  [pdf, other]

    eess.AS cs.SD eess.SP

    Predicting lexical skills from oral reading with acoustic measures

    Authors: Charvi Vitthal, Shreeharsha B S, Kamini Sabu, Preeti Rao

    Abstract: Literacy assessment is an important activity for education administrators across the globe. Typically achieved in a school setting by testing a child's oral reading, it is intensive in human resources. While automatic speech recognition (ASR) is a potential solution to the problem, it tends to be computationally expensive for hand-held devices apart from needing language and accent-specific speech…

    Submitted 1 December, 2021; originally announced December 2021.

  50. arXiv:2110.14273  [pdf, other]

    cs.CL cs.SD eess.AS

    Deep Learning For Prominence Detection In Children's Read Speech

    Authors: Mithilesh Vaidya, Kamini Sabu, Preeti Rao

    Abstract: The detection of perceived prominence in speech has attracted approaches ranging from the design of linguistic knowledge-based acoustic features to the automatic feature learning from suprasegmental attributes such as pitch and intensity contours. We present here, in contrast, a system that operates directly on segmented speech waveforms to learn features relevant to prominent word detection for c…

    Submitted 27 October, 2021; originally announced October 2021.

    Comments: Under review at ICASSP 2022. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works