Skip to main content

Showing 1–50 of 59 results for author: Vandana

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.03122  [pdf, ps, other

    cs.CL

    AUTOCIRCUIT-RL: Reinforcement Learning-Driven LLM for Automated Circuit Topology Generation

    Authors: Prashanth Vijayaraghavan, Luyao Shi, Ehsan Degan, Vandana Mukherjee, Xin Zhang

    Abstract: Analog circuit topology synthesis is integral to Electronic Design Automation (EDA), enabling the automated creation of circuit structures tailored to specific design requirements. However, the vast design search space and strict constraint adherence make efficient synthesis challenging. Leveraging the versatility of Large Language Models (LLMs), we propose AUTOCIRCUIT-RL,a novel reinforcement lea… ▽ More

    Submitted 3 June, 2025; originally announced June 2025.

    Comments: 9 Pages (Content), 4 Pages (Appendix), 7 figures, ICML'2025

  2. arXiv:2506.02230  [pdf, ps, other

    eess.AS cs.SD

    Towards Machine Unlearning for Paralinguistic Speech Processing

    Authors: Orchid Chetia Phukan, Girish, Mohd Mujtaba Akhtar, Shubham Singh, Swarup Ranjan Behera, Vandana Rajan, Muskaan Singh, Arun Balaji Buduru, Rajesh Sharma

    Abstract: In this work, we pioneer the study of Machine Unlearning (MU) for Paralinguistic Speech Processing (PSP). We focus on two key PSP tasks: Speech Emotion Recognition (SER) and Depression Detection (DD). To this end, we propose, SISA++, a novel extension to previous state-of-the-art (SOTA) MU method, SISA by merging models trained on different shards with weight-averaging. With such modifications, we… ▽ More

    Submitted 2 June, 2025; originally announced June 2025.

    Comments: Accepted to INTERSPEECH 2025

  3. arXiv:2505.23980  [pdf, other

    cs.CV cs.LG eess.IV

    DeepTopoNet: A Framework for Subglacial Topography Estimation on the Greenland Ice Sheets

    Authors: Bayu Adhi Tama, Mansa Krishna, Homayra Alam, Mostafa Cham, Omar Faruque, Gong Cheng, Jianwu Wang, Mathieu Morlighem, Vandana Janeja

    Abstract: Understanding Greenland's subglacial topography is critical for projecting the future mass loss of the ice sheet and its contribution to global sea-level rise. However, the complex and sparse nature of observational data, particularly information about the bed topography under the ice sheet, significantly increases the uncertainty in model projections. Bed topography is traditionally measured by a… ▽ More

    Submitted 29 May, 2025; originally announced May 2025.

    Comments: Submitted to SIGSPATIAL 2025

  4. arXiv:2505.14971  [pdf, ps, other

    cs.CL cs.CY

    DECASTE: Unveiling Caste Stereotypes in Large Language Models through Multi-Dimensional Bias Analysis

    Authors: Prashanth Vijayaraghavan, Soroush Vosoughi, Lamogha Chiazor, Raya Horesh, Rogerio Abreu de Paula, Ehsan Degan, Vandana Mukherjee

    Abstract: Recent advancements in large language models (LLMs) have revolutionized natural language processing (NLP) and expanded their applications across diverse domains. However, despite their impressive capabilities, LLMs have been shown to reflect and perpetuate harmful societal biases, including those based on ethnicity, gender, and religion. A critical and underexplored issue is the reinforcement of c… ▽ More

    Submitted 4 June, 2025; v1 submitted 20 May, 2025; originally announced May 2025.

    Comments: 7 (content pages) + 2 (reference pages) + 5 (Appendix pages), 5 figures, 6 Tables, IJCAI 2025

  5. arXiv:2503.02112  [pdf, other

    cs.LG astro-ph.IM

    Building Machine Learning Challenges for Anomaly Detection in Science

    Authors: Elizabeth G. Campolongo, Yuan-Tang Chou, Ekaterina Govorkova, Wahid Bhimji, Wei-Lun Chao, Chris Harris, Shih-Chieh Hsu, Hilmar Lapp, Mark S. Neubauer, Josephine Namayanja, Aneesh Subramanian, Philip Harris, Advaith Anand, David E. Carlyn, Subhankar Ghosh, Christopher Lawrence, Eric Moreno, Ryan Raikman, Jiaman Wu, Ziheng Zhang, Bayu Adhi, Mohammad Ahmadi Gharehtoragh, Saúl Alonso Monsalve, Marta Babicz, Furqan Baig , et al. (125 additional authors not shown)

    Abstract: Scientific discoveries are often made by finding a pattern or object that was not predicted by the known rules of science. Oftentimes, these anomalous events or objects that do not conform to the norms are an indication that the rules of science governing the data are incomplete, and something new needs to be present to explain these unexpected outliers. The challenge of finding anomalies can be c… ▽ More

    Submitted 29 March, 2025; v1 submitted 3 March, 2025; originally announced March 2025.

    Comments: 17 pages 6 figures to be submitted to Nature Communications

  6. arXiv:2502.07741  [pdf, other

    cs.LG

    Advancing climate model interpretability: Feature attribution for Arctic melt anomalies

    Authors: Tolulope Ale, Nicole-Jeanne Schlegel, Vandana P. Janeja

    Abstract: The focus of our work is improving the interpretability of anomalies in climate models and advancing our understanding of Arctic melt dynamics. The Arctic and Antarctic ice sheets are experiencing rapid surface melting and increased freshwater runoff, contributing significantly to global sea level rise. Understanding the mechanisms driving snowmelt in these regions is crucial. ERA5, a widely used… ▽ More

    Submitted 11 February, 2025; originally announced February 2025.

    Comments: 9 pages

  7. arXiv:2411.14586  [pdf, other

    cs.SD cs.CY eess.AS

    Listening for Expert Identified Linguistic Features: Assessment of Audio Deepfake Discernment among Undergraduate Students

    Authors: Noshaba N. Bhalli, Nehal Naqvi, Chloe Evered, Christine Mallinson, Vandana P. Janeja

    Abstract: This paper evaluates the impact of training undergraduate students to improve their audio deepfake discernment ability by listening for expert-defined linguistic features. Such features have been shown to improve performance of AI algorithms; here, we ascertain whether this improvement in AI algorithms also translates to improvement of the perceptual awareness and discernment ability of listeners.… ▽ More

    Submitted 21 November, 2024; originally announced November 2024.

  8. arXiv:2411.05969  [pdf

    cs.SD cs.CL eess.AS

    Toward Transdisciplinary Approaches to Audio Deepfake Discernment

    Authors: Vandana P. Janeja, Christine Mallinson

    Abstract: This perspective calls for scholars across disciplines to address the challenge of audio deepfake detection and discernment through an interdisciplinary lens across Artificial Intelligence methods and linguistics. With an avalanche of tools for the generation of realistic-sounding fake speech on one side, the detection of deepfakes is lagging on the other. Particularly hindering audio deepfake det… ▽ More

    Submitted 8 November, 2024; originally announced November 2024.

  9. arXiv:2410.15577  [pdf

    cs.SD eess.AS

    ALDAS: Audio-Linguistic Data Augmentation for Spoofed Audio Detection

    Authors: Zahra Khanjani, Christine Mallinson, James Foulds, Vandana P Janeja

    Abstract: Spoofed audio, i.e. audio that is manipulated or AI-generated deepfake audio, is difficult to detect when only using acoustic features. Some recent innovative work involving AI-spoofed audio detection models augmented with phonetic and phonological features of spoken English, manually annotated by experts, led to improved model performance. While this augmented model produced substantial improveme… ▽ More

    Submitted 20 October, 2024; originally announced October 2024.

  10. arXiv:2409.14312  [pdf, other

    eess.AS cs.SD

    Avengers Assemble: Amalgamation of Non-Semantic Features for Depression Detection

    Authors: Orchid Chetia Phukan, Swarup Ranjan Behera, Shubham Singh, Muskaan Singh, Vandana Rajan, Arun Balaji Buduru, Rajesh Sharma, S. R. Mahadeva Prasanna

    Abstract: In this study, we address the challenge of depression detection from speech, focusing on the potential of non-semantic features (NSFs) to capture subtle markers of depression. While prior research has leveraged various features for this task, NSFs-extracted from pre-trained models (PTMs) designed for non-semantic tasks such as paralinguistic speech processing (TRILLsson), speaker recognition (x-ve… ▽ More

    Submitted 22 September, 2024; originally announced September 2024.

    Comments: Submitted to ICASSP 2025

    MSC Class: 68T45 ACM Class: I.2.7

  11. arXiv:2409.12590  [pdf, other

    cs.LG

    Hybrid Ensemble Deep Graph Temporal Clustering for Spatiotemporal Data

    Authors: Francis Ndikum Nji, Omar Faruque, Mostafa Cham, Janeja Vandana, Jianwu Wang

    Abstract: Classifying subsets based on spatial and temporal features is crucial to the analysis of spatiotemporal data given the inherent spatial and temporal variability. Since no single clustering algorithm ensures optimal results, researchers have increasingly explored the effectiveness of ensemble approaches. Ensemble clustering has attracted much attention due to increased diversity, better generalizat… ▽ More

    Submitted 19 September, 2024; originally announced September 2024.

    Comments: 10 pages

  12. arXiv:2409.06033  [pdf

    cs.SD cs.CL eess.AS

    Investigating Causal Cues: Strengthening Spoofed Audio Detection with Human-Discernible Linguistic Features

    Authors: Zahra Khanjani, Tolulope Ale, Jianwu Wang, Lavon Davis, Christine Mallinson, Vandana P. Janeja

    Abstract: Several types of spoofed audio, such as mimicry, replay attacks, and deepfakes, have created societal challenges to information integrity. Recently, researchers have worked with sociolinguistics experts to label spoofed audio samples with Expert Defined Linguistic Features (EDLFs) that can be discerned by the human ear: pitch, pause, word-initial and word-final release bursts of consonant stops, a… ▽ More

    Submitted 9 September, 2024; originally announced September 2024.

  13. arXiv:2407.10042  [pdf, other

    cs.LG cs.IR

    Harnessing Feature Clustering For Enhanced Anomaly Detection With Variational Autoencoder And Dynamic Threshold

    Authors: Tolulope Ale, Nicole-Jeanne Schlegel, Vandana P. Janeja

    Abstract: We introduce an anomaly detection method for multivariate time series data with the aim of identifying critical periods and features influencing extreme climate events like snowmelt in the Arctic. This method leverages the Variational Autoencoder (VAE) integrated with dynamic thresholding and correlation-based feature clustering. This framework enhances the VAE's ability to identify localized depe… ▽ More

    Submitted 13 July, 2024; originally announced July 2024.

    Comments: This work was presented at the 2024 IEEE International Geoscience and Remote Sensing Symposium, IGARSS 2024, 07-12 July 2024, Athens, Greece

  14. arXiv:2407.09535  [pdf, ps, other

    cs.CV

    Assessing Annotation Accuracy in Ice Sheets Using Quantitative Metrics

    Authors: Bayu Adhi Tama, Vandana Janeja, Sanjay Purushotham

    Abstract: The increasing threat of sea level rise due to climate change necessitates a deeper understanding of ice sheet structures. This study addresses the need for accurate ice sheet data interpretation by introducing a suite of quantitative metrics designed to validate ice sheet annotation techniques. Focusing on both manual and automated methods, including ARESELP and its modified version, MARESELP, we… ▽ More

    Submitted 26 June, 2024; originally announced July 2024.

  15. arXiv:2405.18327  [pdf

    q-bio.QM cs.AI cs.CV cs.LG

    Histopathology Based AI Model Predicts Anti-Angiogenic Therapy Response in Renal Cancer Clinical Trial

    Authors: Jay Jasti, Hua Zhong, Vandana Panwar, Vipul Jarmale, Jeffrey Miyata, Deyssy Carrillo, Alana Christie, Dinesh Rakheja, Zora Modrusan, Edward Ernest Kadel III, Niha Beig, Mahrukh Huseni, James Brugarolas, Payal Kapur, Satwik Rajaram

    Abstract: Predictive biomarkers of treatment response are lacking for metastatic clear cell renal cell carcinoma (ccRCC), a tumor type that is treated with angiogenesis inhibitors, immune checkpoint inhibitors, mTOR inhibitors and a HIF2 inhibitor. The Angioscore, an RNA-based quantification of angiogenesis, is arguably the best candidate to predict anti-angiogenic (AA) response. However, the clinical adopt… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: 19 pages, 4 Figures

  16. arXiv:2405.06130  [pdf

    cs.IR

    Creating Geospatial Trajectories from Human Trafficking Text Corpora

    Authors: Saydeh N. Karabatis, Vandana P. Janeja

    Abstract: Human trafficking is a crime that affects the lives of millions of people across the globe. Traffickers exploit the victims through forced labor, involuntary sex, or organ harvesting. Migrant smuggling could also be seen as a form of human trafficking when the migrant fails to pay the smuggler and is forced into coerced activities. Several news agencies and anti-trafficking organizations have repo… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

  17. arXiv:2405.06129  [pdf

    cs.IR cs.CL

    Narrative to Trajectory (N2T+): Extracting Routes of Life or Death from Human Trafficking Text Corpora

    Authors: Saydeh N. Karabatis, Vandana P. Janeja

    Abstract: Climate change and political unrest in certain regions of the world are imposing extreme hardship on many communities and are forcing millions of vulnerable populations to abandon their homelands and seek refuge in safer lands. As international laws are not fully set to deal with the migration crisis, people are relying on networks of exploiting smugglers to escape the devastation in order to live… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

  18. arXiv:2405.05911  [pdf, other

    eess.SY cs.ET cs.NI

    Small-Scale Testbed for Evaluating C-V2X Applications on 5G Cellular Networks

    Authors: Kaj Munhoz Arfvidsson, Kleio Fragkedaki, Frank J. Jiang, Vandana Narri, Hans-Cristian Lindh, Karl H. Johansson, Jonas Mårtensson

    Abstract: In this work, we present a small-scale testbed for evaluating the real-life performance of cellular V2X (C-V2X) applications on 5G cellular networks. Despite the growing interest and rapid technology development for V2X applications, researchers still struggle to prototype V2X applications with real wireless networks, hardware, and software in the loop in a controlled environment. To help alleviat… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

  19. arXiv:2404.11803  [pdf, other

    cs.CV cs.AI cs.LG cs.RO

    TempBEV: Improving Learned BEV Encoders with Combined Image and BEV Space Temporal Aggregation

    Authors: Thomas Monninger, Vandana Dokkadi, Md Zafar Anwar, Steffen Staab

    Abstract: Autonomous driving requires an accurate representation of the environment. A strategy toward high accuracy is to fuse data from several sensors. Learned Bird's-Eye View (BEV) encoders can achieve this by mapping data from individual sensors into one joint latent space. For cost-efficient camera-only systems, this provides an effective mechanism to fuse data from multiple cameras with different vie… ▽ More

    Submitted 18 September, 2024; v1 submitted 17 April, 2024; originally announced April 2024.

    Comments: Accepted for 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2024)

  20. arXiv:2402.09658  [pdf

    eess.IV cs.CV

    Towards Precision Cardiovascular Analysis in Zebrafish: The ZACAF Paradigm

    Authors: Amir Mohammad Naderi, Jennifer G. Casey, Mao-Hsiang Huang, Rachelle Victorio, David Y. Chiang, Calum MacRae, Hung Cao, Vandana A. Gupta

    Abstract: Quantifying cardiovascular parameters like ejection fraction in zebrafish as a host of biological investigations has been extensively studied. Since current manual monitoring techniques are time-consuming and fallible, several image processing frameworks have been proposed to automate the process. Most of these works rely on supervised deep-learning architectures. However, supervised methods tend… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

  21. arXiv:2401.12085  [pdf, other

    eess.AS cs.SD

    Consistency Based Unsupervised Self-training For ASR Personalisation

    Authors: Jisi Zhang, Vandana Rajan, Haaris Mehmood, David Tuckey, Pablo Peso Parada, Md Asif Jalal, Karthikeyan Saravanan, Gil Ho Lee, Jungin Lee, Seokyeong Jung

    Abstract: On-device Automatic Speech Recognition (ASR) models trained on speech data of a large population might underperform for individuals unseen during training. This is due to a domain shift between user data and the original training data, differed by user's speaking characteristics and environmental acoustic conditions. ASR personalisation is a solution that aims to exploit user data to improve model… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

    Comments: Accepted for IEEE ASRU 2023

  22. arXiv:2308.11590  [pdf, other

    cs.RO

    Vision-Based Intelligent Robot Grasping Using Sparse Neural Network

    Authors: Priya Shukla, Vandana Kushwaha, G C Nandi

    Abstract: In the modern era of Deep Learning, network parameters play a vital role in models efficiency but it has its own limitations like extensive computations and memory requirements, which may not be suitable for real time intelligent robot grasping tasks. Current research focuses on how the model efficiency can be maintained by introducing sparsity but without compromising accuracy of the model in the… ▽ More

    Submitted 22 August, 2023; originally announced August 2023.

  23. arXiv:2303.02648  [pdf, other

    cs.CV cs.LG

    Comparative study of Transformer and LSTM Network with attention mechanism on Image Captioning

    Authors: Pranav Dandwate, Chaitanya Shahane, Vandana Jagtap, Shridevi C. Karande

    Abstract: In a globalized world at the present epoch of generative intelligence, most of the manual labour tasks are automated with increased efficiency. This can support businesses to save time and money. A crucial component of generative intelligence is the integration of vision and language. Consequently, image captioning become an intriguing area of research. There have been multiple attempts by the res… ▽ More

    Submitted 5 March, 2023; originally announced March 2023.

    Comments: 13 pages, 7 figures, 2 tables

  24. arXiv:2302.05224  [pdf, other

    cs.RO eess.SY

    Shared Situational Awareness with V2X Communication and Set-membership Estimation

    Authors: Vandana Narri, Amr Alanwar, Jonas Mårtensson, Christoffer Norén, Karl Henrik Johansson

    Abstract: The ability to perceive and comprehend a traffic situation and to estimate the state of the vehicles and road-users in the surrounding of the ego-vehicle is known as situational awareness. Situational awareness for a heavy-duty autonomous vehicle is a critical part of the automation platform and depends on the ego-vehicle's field-of-view. But when it comes to the urban scenario, the field-of-view… ▽ More

    Submitted 29 May, 2023; v1 submitted 10 February, 2023; originally announced February 2023.

  25. arXiv:2212.05560  [pdf, other

    cs.CV cs.RO

    Context-aware 6D Pose Estimation of Known Objects using RGB-D data

    Authors: Ankit Kumar, Priya Shukla, Vandana Kushwaha, G. C. Nandi

    Abstract: 6D object pose estimation has been a research topic in the field of computer vision and robotics. Many modern world applications like robot grasping, manipulation, autonomous navigation etc, require the correct pose of objects present in a scene to perform their specific task. It becomes even harder when the objects are placed in a cluttered scene and the level of occlusion is high. Prior works ha… ▽ More

    Submitted 11 December, 2022; originally announced December 2022.

  26. arXiv:2212.05335  [pdf

    cs.SD cs.IR eess.AS

    A Comparison of Audio Preprocessing Techniques and Deep Learning Algorithms for Raga Recognition

    Authors: Devayani Hebbar, Vandana Jagtap

    Abstract: Ragas form the foundation for Indian Classical Music. The task of Raga Recognition has gained traction in the Music Information Retrieval community in the recent past, which can be attributed to the nuances of Indian Classical Music that have resulted in a plethora of research problems in Computing. In this work, we used two different digital audio signal processing techniques to preprocess audio… ▽ More

    Submitted 10 December, 2022; originally announced December 2022.

    Comments: 7 pages, 6 figures, 7 tables

  27. arXiv:2202.09821  [pdf, other

    cs.RO

    Generating Quality Grasp Rectangle using Pix2Pix GAN for Intelligent Robot Grasping

    Authors: Vandana Kushwaha, Priya Shukla, G C Nandi

    Abstract: Intelligent robot grasping is a very challenging task due to its inherent complexity and non availability of sufficient labelled data. Since making suitable labelled data available for effective training for any deep learning based model including deep reinforcement learning is so crucial for successful grasp learning, in this paper we propose to solve the problem of generating grasping Poses/Rect… ▽ More

    Submitted 20 February, 2022; originally announced February 2022.

  28. arXiv:2202.09263  [pdf, other

    cs.LG cs.MM

    Is Cross-Attention Preferable to Self-Attention for Multi-Modal Emotion Recognition?

    Authors: Vandana Rajan, Alessio Brutti, Andrea Cavallaro

    Abstract: Humans express their emotions via facial expressions, voice intonation and word choices. To infer the nature of the underlying emotion, recognition models may use a single modality, such as vision, audio, and text, or a combination of modalities. Generally, models that fuse complementary information from multiple modalities outperform their uni-modal counterparts. However, a successful model that… ▽ More

    Submitted 18 February, 2022; originally announced February 2022.

    Comments: Accepted at ICASSP 2022

  29. arXiv:2112.03351  [pdf

    cs.SD cs.AI eess.AS

    Audio Deepfake Perceptions in College Going Populations

    Authors: Gabrielle Watson, Zahra Khanjani, Vandana P. Janeja

    Abstract: Deepfake is content or material that is generated or manipulated using AI methods, to pass off as real. There are four different deepfake types: audio, video, image and text. In this research we focus on audio deepfakes and how people perceive it. There are several audio deepfake generation frameworks, but we chose MelGAN which is a non-autoregressive and fast audio deepfake generating framework,… ▽ More

    Submitted 6 December, 2021; originally announced December 2021.

    Comments: Summary of study findings

  30. arXiv:2112.03001  [pdf, other

    cs.RO cs.AI

    Development of a robust cascaded architecture for intelligent robot grasping using limited labelled data

    Authors: Priya Shukla, Vandana Kushwaha, G. C. Nandi

    Abstract: Grasping objects intelligently is a challenging task even for humans and we spend a considerable amount of time during our childhood to learn how to grasp objects correctly. In the case of robots, we can not afford to spend that much time on making it to learn how to grasp objects effectively. Therefore, in the present research we propose an efficient learning architecture based on VQVAE so that r… ▽ More

    Submitted 6 November, 2021; originally announced December 2021.

    Comments: 12

  31. arXiv:2111.14203  [pdf

    cs.SD cs.AI eess.AS

    How Deep Are the Fakes? Focusing on Audio Deepfake: A Survey

    Authors: Zahra Khanjani, Gabrielle Watson, Vandana P. Janeja

    Abstract: Deepfake is content or material that is synthetically generated or manipulated using artificial intelligence (AI) methods, to be passed off as real and can include audio, video, image, and text synthesis. This survey has been conducted with a different perspective compared to existing survey papers, that mostly focus on just video and image deepfakes. This survey not only evaluates generation and… ▽ More

    Submitted 28 November, 2021; originally announced November 2021.

    Comments: Abbreviated version of a longer survey under review

  32. arXiv:2107.11413  [pdf, other

    cs.LG cs.HC

    An Instance-Dependent Simulation Framework for Learning with Label Noise

    Authors: Keren Gu, Xander Masotto, Vandana Bachani, Balaji Lakshminarayanan, Jack Nikodem, Dong Yin

    Abstract: We propose a simulation framework for generating instance-dependent noisy labels via a pseudo-labeling paradigm. We show that the distribution of the synthetic noisy labels generated with our framework is closer to human labels compared to independent and class-conditional random flipping. Equipped with controllable label noise, we study the negative impact of noisy labels across a few practical s… ▽ More

    Submitted 17 October, 2021; v1 submitted 23 July, 2021; originally announced July 2021.

    Comments: Datasets released at https://github.com/deepmind/deepmind-research/tree/master/noisy_label

  33. arXiv:2107.04140  [pdf, other

    cs.AR

    First-Generation Inference Accelerator Deployment at Facebook

    Authors: Michael Anderson, Benny Chen, Stephen Chen, Summer Deng, Jordan Fix, Michael Gschwind, Aravind Kalaiah, Changkyu Kim, Jaewon Lee, Jason Liang, Haixin Liu, Yinghai Lu, Jack Montgomery, Arun Moorthy, Satish Nadathur, Sam Naghshineh, Avinash Nayak, Jongsoo Park, Chris Petersen, Martin Schatz, Narayanan Sundaram, Bangsheng Tang, Peter Tang, Amy Yang, Jiecao Yu , et al. (90 additional authors not shown)

    Abstract: In this paper, we provide a deep dive into the deployment of inference accelerators at Facebook. Many of our ML workloads have unique characteristics, such as sparse memory accesses, large model sizes, as well as high compute, memory and network bandwidth requirements. We co-designed a high-performance, energy-efficient inference accelerator platform based on these requirements. We describe the in… ▽ More

    Submitted 4 August, 2021; v1 submitted 8 July, 2021; originally announced July 2021.

  34. arXiv:2106.09201  [pdf, other

    cs.CV

    Trilateral Attention Network for Real-time Medical Image Segmentation

    Authors: Ghada Zamzmi, Vandana Sachdev, Sameer Antani

    Abstract: Accurate segmentation of medical images into anatomically meaningful regions is critical for the extraction of quantitative indices or biomarkers. The common pipeline for segmentation comprises regions of interest detection stage and segmentation stage, which are independent of each other and typically performed using separate deep learning networks. The performance of the segmentation stage highl… ▽ More

    Submitted 16 June, 2021; originally announced June 2021.

  35. arXiv:2103.01791  [pdf, other

    eess.SY cs.RO

    Set-Membership Estimation in Shared Situational Awareness for Automated Vehicles in Occluded Scenarios

    Authors: Vandana Narri, Amr Alanwar, Jonas Mårtensson, Christoffer Norén, Laura Dal Col, Karl Henrik Johansson

    Abstract: One of the main challenges in developing autonomous transport systems based on connected and automated vehicles is the comprehension and understanding of the environment around each vehicle. In many situations, the understanding is limited to the information gathered by the sensors mounted on the ego-vehicle, and it might be severely affected by occlusion caused by other vehicles or fixed obstacle… ▽ More

    Submitted 2 March, 2021; originally announced March 2021.

    Comments: This paper is submitted to IEEE Intelligent Vehicles Symposium 2021

  36. arXiv:2011.01631  [pdf, other

    cs.LG cs.MM

    Robust Latent Representations via Cross-Modal Translation and Alignment

    Authors: Vandana Rajan, Alessio Brutti, Andrea Cavallaro

    Abstract: Multi-modal learning relates information across observation modalities of the same physical phenomenon to leverage complementary information. Most multi-modal machine learning methods require that all the modalities used for training are also available for testing. This is a limitation when the signals from some modalities are unavailable or are severely degraded by noise. To address this limitati… ▽ More

    Submitted 8 March, 2021; v1 submitted 3 November, 2020; originally announced November 2020.

    Journal ref: ICASSP 2021

  37. Towards Accuracy and Scalability: Combining Isogeometric Analysis with Deflation to Obtain Scalable Convergence for the Helmholtz Equation

    Authors: Vandana Dwarka, Roel Tielen, Matthias Möller, Kees Vuik

    Abstract: Finding fast yet accurate numerical solutions to the Helmholtz equation remains a challenging task. The pollution error (i.e. the discrepancy between the numerical and analytical wave number k) requires the mesh resolution to be kept fine enough to obtain accurate solutions. A recent study showed that the use of Isogeometric Analysis (IgA) for the spatial discretization significantly reduces the p… ▽ More

    Submitted 20 October, 2020; originally announced October 2020.

  38. arXiv:2009.07386  [pdf, other

    cs.CV

    Creation and Validation of a Chest X-Ray Dataset with Eye-tracking and Report Dictation for AI Development

    Authors: Alexandros Karargyris, Satyananda Kashyap, Ismini Lourentzou, Joy Wu, Arjun Sharma, Matthew Tong, Shafiq Abedin, David Beymer, Vandana Mukherjee, Elizabeth A Krupinski, Mehdi Moradi

    Abstract: We developed a rich dataset of Chest X-Ray (CXR) images to assist investigators in artificial intelligence. The data were collected using an eye tracking system while a radiologist reviewed and reported on 1,083 CXR images. The dataset contains the following aligned data: CXR image, transcribed radiology report text, radiologist's dictation audio and eye gaze coordinates data. We hope this dataset… ▽ More

    Submitted 8 October, 2020; v1 submitted 15 September, 2020; originally announced September 2020.

  39. Receptivity of an AI Cognitive Assistant by the Radiology Community: A Report on Data Collected at RSNA

    Authors: Karina Kanjaria, Anup Pillai, Chaitanya Shivade, Marina Bendersky, Ashutosh Jadhav, Vandana Mukherjee, Tanveer Syeda-Mahmood

    Abstract: Due to advances in machine learning and artificial intelligence (AI), a new role is emerging for machines as intelligent assistants to radiologists in their clinical workflows. But what systematic clinical thought processes are these machines using? Are they similar enough to those of radiologists to be trusted as assistants? A live demonstration of such a technology was conducted at the 2016 Scie… ▽ More

    Submitted 13 September, 2020; originally announced September 2020.

    Journal ref: Proceedings of the 13th International Joint Conference on Biomedical Engineering Systems and Technologies - Volume 5: HEALTHINF, ISBN 978-989-758-398-8, pages 178-186. 2020

  40. arXiv:2009.00685  [pdf, other

    cs.NI

    Distributed Cooperation Under Uncertainty in Drone-Based Wireless Networks: A Bayesian Coalitional Game

    Authors: Vandana Mittal, Setareh Maghsudi, Ekram Hossain

    Abstract: We study the resource sharing problem in a drone-based wireless network. We consider a distributed control setting under uncertainty (i.e. unavailability of full information). In particular, the drones cooperate in serving the users while pooling their spectrum and energy resources in the absence of prior knowledge about different system characteristics such as the amount of available power at the… ▽ More

    Submitted 1 September, 2020; originally announced September 2020.

  41. arXiv:2008.08023  [pdf

    cs.CV

    Multilanguage Number Plate Detection using Convolutional Neural Networks

    Authors: Jatin Gupta, Vandana Saini, Kamaldeep Garg

    Abstract: Object Detection is a popular field of research for recent technologies. In recent years, profound learning performance attracts the researchers to use it in many applications. Number plate (NP) detection and classification is analyzed over decades however, it needs approaches which are more precise and state, language and design independent since cars are now moving from state to another easily.… ▽ More

    Submitted 18 August, 2020; originally announced August 2020.

  42. arXiv:2008.03375  [pdf, other

    cs.CE cs.CV eess.IV

    Distributed optimization for nonrigid nano-tomography

    Authors: Viktor Nikitin, Vincent De Andrade, Azat Slyamov, Benjamin J. Gould, Yuepeng Zhang, Vandana Sampathkumar, Narayanan Kasthuri, Doga Gursoy, Francesco De Carlo

    Abstract: Resolution level and reconstruction quality in nano-computed tomography (nano-CT) are in part limited by the stability of microscopes, because the magnitude of mechanical vibrations during scanning becomes comparable to the imaging resolution, and the ability of the samples to resist beam damage during data acquisition. In such cases, there is no incentive in recovering the sample state at differe… ▽ More

    Submitted 28 February, 2021; v1 submitted 11 July, 2020; originally announced August 2020.

  43. arXiv:1911.02095  [pdf, other

    q-bio.QM cs.DB

    IBM Functional Genomics Platform, A Cloud-Based Platform for Studying Microbial Life at Scale

    Authors: Edward E. Seabolt, Gowri Nayar, Harsha Krishnareddy, Akshay Agarwal, Kristen L. Beck, Ignacio Terrizzano, Eser Kandogan, Mary Roth, Vandana Mukherjee, James H. Kaufman

    Abstract: The rapid growth in biological sequence data is revolutionizing our understanding of genotypic diversity and challenging conventional approaches to informatics. With the increasing availability of genomic data, traditional bioinformatic tools require substantial computational time and the creation of ever-larger indices each time a researcher seeks to gain insight from the data. To address these c… ▽ More

    Submitted 30 March, 2020; v1 submitted 5 November, 2019; originally announced November 2019.

  44. arXiv:1910.09806  [pdf, other

    cs.CV

    A low-power end-to-end hybrid neuromorphic framework for surveillance applications

    Authors: Andres Ussa, Luca Della Vedova, Vandana Reddy Padala, Deepak Singla, Jyotibdha Acharya, Charles Zhang Lei, Garrick Orchard, Arindam Basu, Bharath Ramesh

    Abstract: With the success of deep learning, object recognition systems that can be deployed for real-world applications are becoming commonplace. However, inference that needs to largely take place on the `edge' (not processed on servers), is a highly computational and memory intensive workload, making it intractable for low-power mobile nodes and remote security applications. To address this challenge, th… ▽ More

    Submitted 29 January, 2020; v1 submitted 22 October, 2019; originally announced October 2019.

    Comments: 12 pages, 3 figures, pre-print to BMVC workshops 2018

  45. arXiv:1910.01851  [pdf, other

    cs.CV

    EBBIOT: A Low-complexity Tracking Algorithm for Surveillance in IoVT Using Stationary Neuromorphic Vision Sensors

    Authors: Jyotibdha Acharya, Andres Ussa Caycedo, Vandana Reddy Padala, Rishi Raj Sidhu Singh, Garrick Orchard, Bharath Ramesh, Arindam Basu

    Abstract: In this paper, we present EBBIOT-a novel paradigm for object tracking using stationary neuromorphic vision sensors in low-power sensor nodes for the Internet of Video Things (IoVT). Different from fully event based tracking or fully frame based approaches, we propose a mixed approach where we create event-based binary images (EBBI) that can use memory efficient noise filtering algorithms. We explo… ▽ More

    Submitted 4 October, 2019; originally announced October 2019.

    Comments: 6 pages, 5 figures

  46. arXiv:1907.06234  [pdf, other

    astro-ph.IM cs.DL

    Robust Archives Maximize Scientific Accessibility

    Authors: J. E. G. Peek, Vandana Desai, Richard L. White, Raffaele D'Abrusco, Joseph M. Mazzarella, Carolyn Grant, Jenny L. Novacescu, Elena Scire, Sherry Winkelman

    Abstract: We present a bibliographic analysis of Chandra, Hubble, and Spitzer publications. We find (a) archival data are used in >60% of the publication output and (b) archives for these missions enable a much broader set of institutions and countries to scientifically use data from these missions. Specifically, we find that authors from institutions that have published few papers from a given mission publ… ▽ More

    Submitted 14 July, 2019; originally announced July 2019.

    Comments: White Paper submitted to the NAS call for Astro2020 Decadal Survey APC papers

  47. arXiv:1902.09864  [pdf, ps, other

    cs.NE cs.ET

    Spiking Neural Network based Region Proposal Networks for Neuromorphic Vision Sensors

    Authors: Jyotibdha Acharya, Vandana Padala, Arindam Basu

    Abstract: This paper presents a three layer spiking neural network based region proposal network operating on data generated by neuromorphic vision sensors. The proposed architecture consists of refractory, convolution and clustering layers designed with bio-realistic leaky integrate and fire (LIF) neurons and synapses. The proposed algorithm is tested on traffic scene recordings from a DAVIS sensor setup.… ▽ More

    Submitted 26 February, 2019; originally announced February 2019.

    Comments: Accepted in IEEE ISCAS, 2019

  48. DeepMiner: Discovering Interpretable Representations for Mammogram Classification and Explanation

    Authors: Jimmy Wu, Bolei Zhou, Diondra Peck, Scott Hsieh, Vandana Dialani, Lester Mackey, Genevieve Patterson

    Abstract: We propose DeepMiner, a framework to discover interpretable representations in deep neural networks and to build explanations for medical predictions. By probing convolutional neural networks (CNNs) trained to classify cancer in mammograms, we show that many individual units in the final convolutional layer of a CNN respond strongly to diseased tissue concepts specified by the BI-RADS lexicon. Aft… ▽ More

    Submitted 17 August, 2021; v1 submitted 31 May, 2018; originally announced May 2018.

    Comments: Harvard Data Science Review (HDSR), 2021. Code available at https://github.com/jimmyyhwu/ddsm-visual-primitives

  49. Expert identification of visual primitives used by CNNs during mammogram classification

    Authors: Jimmy Wu, Diondra Peck, Scott Hsieh, Vandana Dialani, Constance D. Lehman, Bolei Zhou, Vasilis Syrgkanis, Lester Mackey, Genevieve Patterson

    Abstract: This work interprets the internal representations of deep neural networks trained for classification of diseased tissue in 2D mammograms. We propose an expert-in-the-loop interpretation method to label the behavior of internal units in convolutional neural networks (CNNs). Expert radiologists identify that the visual patterns detected by the units are correlated with meaningful medical phenomena s… ▽ More

    Submitted 13 March, 2018; originally announced March 2018.

    Journal ref: Medical Imaging 2018: Computer-Aided Diagnosis, Proc. of SPIE Vol. 10575, 105752T

  50. arXiv:1711.10751  [pdf, ps, other

    cs.CR

    UC Secure Issuer-Free Adaptive Oblivious Transfer with Hidden Access Policy

    Authors: Vandana Guleria, Ratna Dutta

    Abstract: Privacy is a major concern in designing any cryptographic primitive when frequent transactions are done electronically. During electronic transactions, people reveal their personal data into several servers and believe that this information does not leak too much about them. The adaptive oblivious transfer with hidden access policy (AOT-HAP) takes measure against such privacy issues. The existing… ▽ More

    Submitted 29 November, 2017; originally announced November 2017.