Skip to main content

Showing 1–50 of 64 results for author: Agarwal, V

.
  1. arXiv:2505.20482  [pdf, ps, other

    cs.CL cs.AI

    Conversation Kernels: A Flexible Mechanism to Learn Relevant Context for Online Conversation Understanding

    Authors: Vibhor Agarwal, Arjoo Gupta, Suparna De, Nishanth Sastry

    Abstract: Understanding online conversations has attracted research attention with the growth of social networks and online discussion forums. Content analysis of posts and replies in online conversations is difficult because each individual utterance is usually short and may implicitly refer to other posts within the same conversation. Thus, understanding individual posts requires capturing the conversatio… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

    Comments: Accepted at International AAAI Conference on Web and Social Media (ICWSM) 2025

  2. arXiv:2412.12827  [pdf, other

    cs.CV

    TabSniper: Towards Accurate Table Detection & Structure Recognition for Bank Statements

    Authors: Abhishek Trivedi, Sourajit Mukherjee, Rajat Kumar Singh, Vani Agarwal, Sriranjani Ramakrishnan, Himanshu S. Bhatt

    Abstract: Extraction of transaction information from bank statements is required to assess one's financial well-being for credit rating and underwriting decisions. Unlike other financial documents such as tax forms or financial statements, extracting the transaction descriptions from bank statements can provide a comprehensive and recent view into the cash flows and spending patterns. With multiple variatio… ▽ More

    Submitted 17 December, 2024; originally announced December 2024.

  3. arXiv:2411.00653  [pdf, other

    cs.LG

    Rethinking Node Representation Interpretation through Relation Coherence

    Authors: Ying-Chun Lin, Jennifer Neville, Cassiano Becker, Purvanshi Metha, Nabiha Asghar, Vipul Agarwal

    Abstract: Understanding node representations in graph-based models is crucial for uncovering biases ,diagnosing errors, and building trust in model decisions. However, previous work on explainable AI for node representations has primarily emphasized explanations (reasons for model predictions) rather than interpretations (mapping representations to understandable concepts). Furthermore, the limited research… ▽ More

    Submitted 1 November, 2024; originally announced November 2024.

  4. arXiv:2410.21837  [pdf, other

    cs.CE cs.SC

    Accelerated Relaxation Engines for Optimizing to Minimum Energy Path

    Authors: Sandra Liz Simon, Nitin Kaistha, Vishal Agarwal

    Abstract: In the last few decades, several novel algorithms have been designed for finding critical points on PES and the minimum energy paths connecting them. This has led to considerably improve our understanding of reaction mechanisms and kinetics of the underlying processes. These methods implicitly rely on computation of energy and forces on the PES, which are usually obtained by computationally demand… ▽ More

    Submitted 29 October, 2024; originally announced October 2024.

  5. arXiv:2409.19492  [pdf, ps, other

    cs.CL cs.AI

    MedHalu: Hallucinations in Responses to Healthcare Queries by Large Language Models

    Authors: Vibhor Agarwal, Yiqiao Jin, Mohit Chandra, Munmun De Choudhury, Srijan Kumar, Nishanth Sastry

    Abstract: The remarkable capabilities of large language models (LLMs) in language understanding and generation have not rendered them immune to hallucinations. LLMs can still generate plausible-sounding but factually incorrect or fabricated information. As LLM-empowered chatbots become popular, laypeople may frequently ask health-related queries and risk falling victim to these LLM hallucinations, resulting… ▽ More

    Submitted 28 September, 2024; originally announced September 2024.

    Comments: 14 pages

  6. arXiv:2409.06703  [pdf, other

    cs.CV

    LEIA: Latent View-invariant Embeddings for Implicit 3D Articulation

    Authors: Archana Swaminathan, Anubhav Gupta, Kamal Gupta, Shishira R. Maiya, Vatsal Agarwal, Abhinav Shrivastava

    Abstract: Neural Radiance Fields (NeRFs) have revolutionized the reconstruction of static scenes and objects in 3D, offering unprecedented quality. However, extending NeRFs to model dynamic objects or object articulations remains a challenging problem. Previous works have tackled this issue by focusing on part-level reconstruction and motion estimation for objects, but they often rely on heuristics regardin… ▽ More

    Submitted 10 September, 2024; originally announced September 2024.

    Comments: Accepted to ECCV 2024. Project Website at https://archana1998.github.io/leia/

  7. arXiv:2408.08333  [pdf, ps, other

    cs.SE cs.AI cs.CL

    CodeMirage: Hallucinations in Code Generated by Large Language Models

    Authors: Vibhor Agarwal, Yulong Pei, Salwa Alamir, Xiaomo Liu

    Abstract: Large Language Models (LLMs) have shown promising potentials in program generation and no-code automation. However, LLMs are prone to generate hallucinations, i.e., they generate text which sounds plausible but is incorrect. Although there has been a recent surge in research on LLM hallucinations for text generation, similar hallucination phenomenon can happen in code generation. Sometimes the gen… ▽ More

    Submitted 14 August, 2024; originally announced August 2024.

    Comments: Accepted at AutoMates @ IJCAI 2024

  8. arXiv:2407.05887  [pdf, other

    cs.CL cs.AI cs.LG

    Generation and De-Identification of Indian Clinical Discharge Summaries using LLMs

    Authors: Sanjeet Singh, Shreya Gupta, Niralee Gupta, Naimish Sharma, Lokesh Srivastava, Vibhu Agarwal, Ashutosh Modi

    Abstract: The consequences of a healthcare data breach can be devastating for the patients, providers, and payers. The average financial impact of a data breach in recent months has been estimated to be close to USD 10 million. This is especially significant for healthcare organizations in India that are managing rapid digitization while still establishing data governance procedures that align with the lett… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: Accepted at BioNLP Workshop at ACL 2024; 21 pages (9 pages main content)

  9. What's in the Flow? Exploiting Temporal Motion Cues for Unsupervised Generic Event Boundary Detection

    Authors: Sourabh Vasant Gothe, Vibhav Agarwal, Sourav Ghosh, Jayesh Rajkumar Vachhani, Pranay Kashyap, Barath Raj Kandur Raja

    Abstract: Generic Event Boundary Detection (GEBD) task aims to recognize generic, taxonomy-free boundaries that segment a video into meaningful events. Current methods typically involve a neural model trained on a large volume of data, demanding substantial computational power and storage space. We explore two pivotal questions pertaining to GEBD: Can non-parametric algorithms outperform unsupervised neural… ▽ More

    Submitted 15 February, 2024; originally announced April 2024.

    Comments: Accepted in WACV-2024. Supplementary at https://openaccess.thecvf.com/content/WACV2024/supplemental/Gothe_Whats_in_the_WACV_2024_supplemental.pdf

    Journal ref: 2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), Waikoloa, HI, USA, 2024, pp. 6926-6935

  10. arXiv:2404.05501  [pdf

    q-bio.NC cs.AI cs.LG

    Data Science In Olfaction

    Authors: Vivek Agarwal, Joshua Harvey, Dmitry Rinberg, Vasant Dhar

    Abstract: Advances in neural sensing technology are making it possible to observe the olfactory process in great detail. In this paper, we conceptualize smell from a Data Science and AI perspective, that relates the properties of odorants to how they are sensed and analyzed in the olfactory system from the nose to the brain. Drawing distinctions to color vision, we argue that smell presents unique measureme… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

    Comments: 20 pages, 10 Figures, 2 Appendix, 1 Table

  11. arXiv:2404.03048  [pdf, other

    cs.CY cs.CL

    Decentralised Moderation for Interoperable Social Networks: A Conversation-based Approach for Pleroma and the Fediverse

    Authors: Vibhor Agarwal, Aravindh Raman, Nishanth Sastry, Ahmed M. Abdelmoniem, Gareth Tyson, Ignacio Castro

    Abstract: The recent development of decentralised and interoperable social networks (such as the "fediverse") creates new challenges for content moderators. This is because millions of posts generated on one server can easily "spread" to another, even if the recipient server has very different moderation policies. An obvious solution would be to leverage moderation tools to automatically tag (and filter) po… ▽ More

    Submitted 16 April, 2024; v1 submitted 3 April, 2024; originally announced April 2024.

    Comments: Accepted at International AAAI Conference on Web and Social Media (ICWSM) 2024. Please cite accordingly!

  12. TrICy: Trigger-guided Data-to-text Generation with Intent aware Attention-Copy

    Authors: Vibhav Agarwal, Sourav Ghosh, Harichandana BSS, Himanshu Arora, Barath Raj Kandur Raja

    Abstract: Data-to-text (D2T) generation is a crucial task in many natural language understanding (NLU) applications and forms the foundation of task-oriented dialog systems. In the context of conversational AI solutions that can work directly with local data on the user's device, architectures utilizing large pre-trained language models (PLMs) are impractical for on-device deployment due to a high memory fo… ▽ More

    Submitted 25 January, 2024; originally announced February 2024.

    Comments: Published in the IEEE/ACM Transactions on Audio, Speech, and Language Processing. (Sourav Ghosh and Vibhav Agarwal contributed equally to this work.)

    Journal ref: IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 32, pp. 1173-1184, 2024

  13. arXiv:2402.01687  [pdf, ps, other

    cs.CY cs.HC cs.LG

    "Which LLM should I use?": Evaluating LLMs for tasks performed by Undergraduate Computer Science Students

    Authors: Vibhor Agarwal, Madhav Krishan Garg, Sahiti Dharmavaram, Dhruv Kumar

    Abstract: This study evaluates the effectiveness of various large language models (LLMs) in performing tasks common among undergraduate computer science students. Although a number of research studies in the computing education community have explored the possibility of using LLMs for a variety of tasks, there is a lack of comprehensive research comparing different LLMs and evaluating which LLMs are most ef… ▽ More

    Submitted 3 April, 2024; v1 submitted 22 January, 2024; originally announced February 2024.

    Comments: Under review

  14. arXiv:2311.17921  [pdf, other

    cs.CV

    Do text-free diffusion models learn discriminative visual representations?

    Authors: Soumik Mukhopadhyay, Matthew Gwilliam, Yosuke Yamaguchi, Vatsal Agarwal, Namitha Padmanabhan, Archana Swaminathan, Tianyi Zhou, Jun Ohya, Abhinav Shrivastava

    Abstract: While many unsupervised learning models focus on one family of tasks, either generative or discriminative, we explore the possibility of a unified representation learner: a model which addresses both families of tasks simultaneously. We identify diffusion models, a state-of-the-art method for generative tasks, as a prime candidate. Such models involve training a U-Net to iteratively predict and re… ▽ More

    Submitted 24 September, 2024; v1 submitted 29 November, 2023; originally announced November 2023.

    Comments: Website: see https://mgwillia.github.io/diffssl/ . Code: see https://github.com/soumik-kanad/diffssl . The first two authors contributed equally. 27 pages, 10 figures, 17 tables. Submission under review. (this article supersedes arXiv:2307.08702)

  15. arXiv:2310.14028  [pdf, other

    cs.CL

    GASCOM: Graph-based Attentive Semantic Context Modeling for Online Conversation Understanding

    Authors: Vibhor Agarwal, Yu Chen, Nishanth Sastry

    Abstract: Online conversation understanding is an important yet challenging NLP problem which has many useful applications (e.g., hate speech detection). However, online conversations typically unfold over a series of posts and replies to those posts, forming a tree structure within which individual posts may refer to semantic context from higher up the tree. Such semantic cross-referencing makes it difficu… ▽ More

    Submitted 21 October, 2023; originally announced October 2023.

  16. arXiv:2310.13985  [pdf, ps, other

    cs.CL

    HateRephrase: Zero- and Few-Shot Reduction of Hate Intensity in Online Posts using Large Language Models

    Authors: Vibhor Agarwal, Yu Chen, Nishanth Sastry

    Abstract: Hate speech has become pervasive in today's digital age. Although there has been considerable research to detect hate speech or generate counter speech to combat hateful views, these approaches still cannot completely eliminate the potential harmful societal consequences of hate speech -- hate speech, even when detected, can often not be taken down or is often not taken down enough; and hate speec… ▽ More

    Submitted 21 October, 2023; originally announced October 2023.

  17. arXiv:2308.14608  [pdf, other

    cs.LG cs.CL cs.CY cs.SI

    AI in the Gray: Exploring Moderation Policies in Dialogic Large Language Models vs. Human Answers in Controversial Topics

    Authors: Vahid Ghafouri, Vibhor Agarwal, Yong Zhang, Nishanth Sastry, Jose Such, Guillermo Suarez-Tangil

    Abstract: The introduction of ChatGPT and the subsequent improvement of Large Language Models (LLMs) have prompted more and more individuals to turn to the use of ChatBots, both for information and assistance with decision-making. However, the information the user is after is often not formulated by these ChatBots objectively enough to be provided with a definite, globally accepted answer. Controversial t… ▽ More

    Submitted 28 August, 2023; originally announced August 2023.

  18. arXiv:2307.08702  [pdf, other

    cs.CV

    Diffusion Models Beat GANs on Image Classification

    Authors: Soumik Mukhopadhyay, Matthew Gwilliam, Vatsal Agarwal, Namitha Padmanabhan, Archana Swaminathan, Srinidhi Hegde, Tianyi Zhou, Abhinav Shrivastava

    Abstract: While many unsupervised learning models focus on one family of tasks, either generative or discriminative, we explore the possibility of a unified representation learner: a model which uses a single pre-training stage to address both families of tasks simultaneously. We identify diffusion models as a prime candidate. Diffusion models have risen to prominence as a state-of-the-art method for image… ▽ More

    Submitted 17 July, 2023; originally announced July 2023.

    Comments: 15 pages, 7 figures, 10 tables, submission under review

  19. arXiv:2306.13995  [pdf, other

    cs.AI cs.LG

    A clustering and graph deep learning-based framework for COVID-19 drug repurposing

    Authors: Chaarvi Bansal, Rohitash Chandra, Vinti Agarwal, P. R. Deepa

    Abstract: Drug repurposing (or repositioning) is the process of finding new therapeutic uses for drugs already approved by drug regulatory authorities (e.g., the Food and Drug Administration (FDA) and Therapeutic Goods Administration (TGA)) for other diseases. This involves analyzing the interactions between different biological entities, such as drug targets (genes/proteins and biological pathways) and dru… ▽ More

    Submitted 24 June, 2023; originally announced June 2023.

  20. arXiv:2304.14507  [pdf

    cs.CV eess.IV

    Suspicious Vehicle Detection Using Licence Plate Detection And Facial Feature Recognition

    Authors: Vrinda Agarwal, Aaron George Pichappa, Manideep Ramisetty, Bala Murugan MS, Manoj kumar Rajagopal

    Abstract: With the increasing need to strengthen vehicle safety and detection, the availability of pre-existing methods of catching criminals and identifying vehicles manually through the various traffic surveillance cameras is not only time-consuming but also inefficient. With the advancement of technology in every field the use of real-time traffic surveillance models will help facilitate an easy approach… ▽ More

    Submitted 18 April, 2023; originally announced April 2023.

    Comments: eight pages and three figures

  21. arXiv:2212.10405  [pdf, other

    cs.CL cs.SI

    AnnoBERT: Effectively Representing Multiple Annotators' Label Choices to Improve Hate Speech Detection

    Authors: Wenjie Yin, Vibhor Agarwal, Aiqi Jiang, Arkaitz Zubiaga, Nishanth Sastry

    Abstract: Supervised approaches generally rely on majority-based labels. However, it is hard to achieve high agreement among annotators in subjective tasks such as hate speech detection. Existing neural network models principally regard labels as categorical variables, while ignoring the semantic information in diverse label texts. In this paper, we propose AnnoBERT, a first-of-its-kind architecture integra… ▽ More

    Submitted 10 January, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: accepted at ICWSM 2023

    Journal ref: 17th International AAAI Conference on Web and Social Media (ICWSM 2023). Please cite accordingly

  22. arXiv:2211.09207  [pdf, other

    cs.CL cs.AI cs.CY

    A Graph-Based Context-Aware Model to Understand Online Conversations

    Authors: Vibhor Agarwal, Anthony P. Young, Sagar Joglekar, Nishanth Sastry

    Abstract: Online forums that allow for participatory engagement between users have been transformative for the public discussion of many important issues. However, such conversations can sometimes escalate into full-blown exchanges of hate and misinformation. Existing approaches in natural language processing (NLP), such as deep learning models for classification tasks, use as inputs only a single comment o… ▽ More

    Submitted 16 November, 2022; originally announced November 2022.

    Comments: 25 pages, 9 figures. arXiv admin note: text overlap with arXiv:2202.08175

    Journal ref: ACM Transactions on the Web 2023

  23. arXiv:2211.08122  [pdf, ps, other

    physics.chem-ph

    The Desorption Rate at Liquid-Solid Interface

    Authors: Krishna Jaiswal, Horia Metiu, Vishal Agarwal

    Abstract: We use a simple generic model to study the desorption of atoms from a solid surface in contact with a liquid, by using a combination of Monte Carlo and molecular dynamics simulations. The behavior of the system depends on two parameters: the strength $ε_{LS}$ of the solid-liquid interaction energy and the strength $ε_{LL}$ of the liquid-liquid interaction energy. The contact with the solid surface… ▽ More

    Submitted 15 November, 2022; originally announced November 2022.

  24. arXiv:2211.06104  [pdf, other

    cs.CV

    Bounding Box Priors for Cell Detection with Point Annotations

    Authors: Hari Om Aggrawal, Dipam Goswami, Vinti Agarwal

    Abstract: The size of an individual cell type, such as a red blood cell, does not vary much among humans. We use this knowledge as a prior for classifying and detecting cells in images with only a few ground truth bounding box annotations, while most of the cells are annotated with points. This setting leads to weakly semi-supervised learning. We propose replacing points with either stochastic (ST) boxes or… ▽ More

    Submitted 11 November, 2022; originally announced November 2022.

  25. arXiv:2208.01439  [pdf, other

    q-bio.OT cs.LG stat.ML

    Unsupervised machine learning framework for discriminating major variants of concern during COVID-19

    Authors: Rohitash Chandra, Chaarvi Bansal, Mingyue Kang, Tom Blau, Vinti Agarwal, Pranjal Singh, Laurence O. W. Wilson, Seshadri Vasan

    Abstract: Due to the high mutation rate of the virus, the COVID-19 pandemic evolved rapidly. Certain variants of the virus, such as Delta and Omicron, emerged with altered viral properties leading to severe transmission and death rates. These variants burdened the medical systems worldwide with a major impact to travel, productivity, and the world economy. Unsupervised machine learning methods have the abil… ▽ More

    Submitted 25 May, 2023; v1 submitted 1 August, 2022; originally announced August 2022.

    Journal ref: PLOS ONE, 2023

  26. arXiv:2206.10225  [pdf, other

    cs.CV cs.HC

    Broken News: Making Newspapers Accessible to Print-Impaired

    Authors: Vishal Agarwal, Tanuja Ganu, Saikat Guha

    Abstract: Accessing daily news content still remains a big challenge for people with print-impairment including blind and low-vision due to opacity of printed content and hindrance from online sources. In this paper, we present our approach for digitization of print newspaper into an accessible file format such as HTML. We use an ensemble of instance segmentation and detection framework for newspaper layout… ▽ More

    Submitted 23 June, 2022; v1 submitted 21 June, 2022; originally announced June 2022.

    Journal ref: Extended Abstract at Accessibility, Vision, and Autonomy Meet (CVPR 2022 Workshop)

  27. arXiv:2205.01404  [pdf, other

    cs.CL cs.AI cs.LG q-bio.NC

    Neural Language Taskonomy: Which NLP Tasks are the most Predictive of fMRI Brain Activity?

    Authors: Subba Reddy Oota, Jashn Arora, Veeral Agarwal, Mounika Marreddy, Manish Gupta, Bapi Raju Surampudi

    Abstract: Several popular Transformer based language models have been found to be successful for text-driven brain encoding. However, existing literature leverages only pretrained text Transformer models and has not explored the efficacy of task-specific learned Transformer representations. In this work, we explore transfer learning from representations learned for ten popular natural language processing ta… ▽ More

    Submitted 3 May, 2022; originally announced May 2022.

    Comments: 18 pages, 18 figures

  28. "Way back then": A Data-driven View of 25+ years of Web Evolution

    Authors: Vibhor Agarwal, Nishanth Sastry

    Abstract: Since the inception of the first web page three decades back, the Web has evolved considerably, from static HTML pages in the beginning to the dynamic web pages of today, from mainly the text-based pages of the 1990s to today's multimedia rich pages, etc. Although much of this is known anecdotally, to our knowledge, there is no quantitative documentation of the extent and timing of these changes.… ▽ More

    Submitted 16 February, 2022; originally announced February 2022.

    Comments: To appear at The ACM Web Conference 2022

  29. GraphNLI: A Graph-based Natural Language Inference Model for Polarity Prediction in Online Debates

    Authors: Vibhor Agarwal, Sagar Joglekar, Anthony P. Young, Nishanth Sastry

    Abstract: Online forums that allow participatory engagement between users have been transformative for public discussion of important issues. However, debates on such forums can sometimes escalate into full blown exchanges of hate or misinformation. An important tool in understanding and tackling such problems is to be able to infer the argumentative relation of whether a reply is supporting or attacking th… ▽ More

    Submitted 16 February, 2022; originally announced February 2022.

    Comments: To appear at The ACM Web Conference 2022

  30. PrivPAS: A real time Privacy-Preserving AI System and applied ethics

    Authors: Harichandana B S S, Vibhav Agarwal, Sourav Ghosh, Gopi Ramena, Sumit Kumar, Barath Raj Kandur Raja

    Abstract: With 3.78 billion social media users worldwide in 2021 (48% of the human population), almost 3 billion images are shared daily. At the same time, a consistent evolution of smartphone cameras has led to a photography explosion with 85% of all new pictures being captured using smartphones. However, lately, there has been an increased discussion of privacy concerns when a person being photographed is… ▽ More

    Submitted 8 February, 2022; v1 submitted 5 February, 2022; originally announced February 2022.

    Comments: Accepted at 16th IEEE International Conference on Semantic Computing (ICSC), January 26-28, 2022 [update: Best Paper candidate at ICSC 2022]

    Journal ref: 2022 IEEE 16th International Conference on Semantic Computing (ICSC), Laguna Hills, CA, USA, 2022, pp. 9-16

  31. arXiv:2112.08984  [pdf, other

    eess.AS cs.SD eess.SP physics.app-ph

    Object-based synthesis of scraping and rolling sounds based on non-linear physical constraints

    Authors: Vinayak Agarwal, Maddie Cusimano, James Traer, Josh McDermott

    Abstract: Sustained contact interactions like scraping and rolling produce a wide variety of sounds. Previous studies have explored ways to synthesize these sounds efficiently and intuitively but could not fully mimic the rich structure of real instances of these sounds. We present a novel source-filter model for realistic synthesis of scraping and rolling sounds with physically and perceptually relevant co… ▽ More

    Submitted 16 December, 2021; originally announced December 2021.

    Journal ref: Proceeding of the 24th International Conference on Digital Audio Effects (DAFx-20in21), 2021

  32. arXiv:2111.10374  [pdf, other

    q-bio.QM cs.CV eess.IV

    Urine Microscopic Image Dataset

    Authors: Dipam Goswami, Hari Om Aggrawal, Rajiv Gupta, Vinti Agarwal

    Abstract: Urinalysis is a standard diagnostic test to detect urinary system related problems. The automation of urinalysis will reduce the overall diagnostic time. Recent studies used urine microscopic datasets for designing deep learning based algorithms to classify and detect urine cells. But these datasets are not publicly available for further research. To alleviate the need for urine datsets, we prepar… ▽ More

    Submitted 19 November, 2021; originally announced November 2021.

    Comments: 7 pages, 1 image

  33. arXiv:2111.00861  [pdf, other

    cs.CV cs.LG

    A Frequency Perspective of Adversarial Robustness

    Authors: Shishira R Maiya, Max Ehrlich, Vatsal Agarwal, Ser-Nam Lim, Tom Goldstein, Abhinav Shrivastava

    Abstract: Adversarial examples pose a unique challenge for deep learning systems. Despite recent advances in both attacks and defenses, there is still a lack of clarity and consensus in the community about the true nature and underlying properties of adversarial examples. A deep understanding of these examples can provide new insights towards the development of more effective attacks and defenses. Driven by… ▽ More

    Submitted 26 October, 2021; originally announced November 2021.

  34. LIDSNet: A Lightweight on-device Intent Detection model using Deep Siamese Network

    Authors: Vibhav Agarwal, Sudeep Deepak Shivnikar, Sourav Ghosh, Himanshu Arora, Yashwant Saini

    Abstract: Intent detection is a crucial task in any Natural Language Understanding (NLU) system and forms the foundation of a task-oriented dialogue system. To build high-quality real-world conversational solutions for edge devices, there is a need for deploying intent detection model on device. This necessitates a light-weight, fast, and accurate model that can perform efficiently in a resource-constrained… ▽ More

    Submitted 6 October, 2021; originally announced October 2021.

    Comments: Accepted for publication in 2021 IEEE 20th International Conference on Machine Learning and Applications (ICMLA)

    Journal ref: 2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA), Pasadena, CA, USA, 2021, pp. 1112-1117

  35. arXiv:2110.08413  [pdf, other

    cs.CL cs.LG

    Invariant Language Modeling

    Authors: Maxime Peyrard, Sarvjeet Singh Ghotra, Martin Josifoski, Vidhan Agarwal, Barun Patra, Dean Carignan, Emre Kiciman, Robert West

    Abstract: Large pretrained language models are critical components of modern NLP pipelines. Yet, they suffer from spurious correlations, poor out-of-domain generalization, and biases. Inspired by recent progress in causal machine learning, in particular the invariant risk minimization (IRM) paradigm, we propose invariant language modeling, a framework for learning invariant representations that generalize b… ▽ More

    Submitted 14 November, 2022; v1 submitted 15 October, 2021; originally announced October 2021.

    Comments: Published at EMNLP 2022

  36. A novel approach for modelling and classifying sit-to-stand kinematics using inertial sensors

    Authors: Maitreyee Wairagkar, Emma Villeneuve, Rachel King, Balazs Janko, Malcolm Burnett, Ann Ashburn, Veena Agarwal, R. Simon Sherratt, William Holderbaum, William Harwin

    Abstract: Sit-to-stand transitions are an important part of activities of daily living and play a key role in functional mobility in humans. The sit-to-stand movement is often affected in older adults due to frailty and in patients with motor impairments such as Parkinson's disease leading to falls. Studying kinematics of sit-to-stand transitions can provide insight in assessment, monitoring and developing… ▽ More

    Submitted 14 July, 2021; originally announced July 2021.

    Comments: 25 pages, 11 figures

  37. arXiv:2105.07135  [pdf

    cs.MM cs.AI cs.SD eess.AS eess.IV

    Analyzing Images for Music Recommendation

    Authors: Anant Baijal, Vivek Agarwal, Danny Hyun

    Abstract: Experiencing images with suitable music can greatly enrich the overall user experience. The proposed image analysis method treats an artwork image differently from a photograph image. Automatic image classification is performed using deep-learning based models. An illustrative analysis showcasing the ability of our deep-models to inherently learn and utilize perceptually relevant features when cla… ▽ More

    Submitted 15 May, 2021; originally announced May 2021.

    Comments: IEEE International Conference on Consumer Electronics (IEEE ICCE 2021)

  38. arXiv:2104.14095  [pdf, ps, other

    cs.AI cs.LG

    Analyzing the Nuances of Transformers' Polynomial Simplification Abilities

    Authors: Vishesh Agarwal, Somak Aditya, Navin Goyal

    Abstract: Symbolic Mathematical tasks such as integration often require multiple well-defined steps and understanding of sub-tasks to reach a solution. To understand Transformers' abilities in such tasks in a fine-grained manner, we deviate from traditional end-to-end settings, and explore a step-wise polynomial simplification task. Polynomials can be written in a simple normal form as a sum of monomials wh… ▽ More

    Submitted 28 April, 2021; originally announced April 2021.

    Comments: 16 pages, 18 Tables, Accepted ICLR 2021 MathAI Workshop

  39. arXiv:2103.16150  [pdf

    cs.CV cs.LG

    FONTNET: On-Device Font Understanding and Prediction Pipeline

    Authors: Rakshith S, Rishabh Khurana, Vibhav Agarwal, Jayesh Rajkumar Vachhani, Guggilla Bhanodai

    Abstract: Fonts are one of the most basic and core design concepts. Numerous use cases can benefit from an in depth understanding of Fonts such as Text Customization which can change text in an image while maintaining the Font attributes like style, color, size. Currently, Text recognition solutions can group recognized text based on line breaks or paragraph breaks, if the Font attributes are known multiple… ▽ More

    Submitted 30 March, 2021; originally announced March 2021.

    Comments: Accepted for publication in IEEE ICASSP 2021: 46th IEEE International Conference on Acoustics, Speech, & Signal Processing

  40. arXiv:2103.13511  [pdf, other

    cs.LG cs.AI cs.CV

    Addressing catastrophic forgetting for medical domain expansion

    Authors: Sharut Gupta, Praveer Singh, Ken Chang, Liangqiong Qu, Mehak Aggarwal, Nishanth Arun, Ashwin Vaswani, Shruti Raghavan, Vibha Agarwal, Mishka Gidwani, Katharina Hoebel, Jay Patel, Charles Lu, Christopher P. Bridge, Daniel L. Rubin, Jayashree Kalpathy-Cramer

    Abstract: Model brittleness is a key concern when deploying deep learning models in real-world medical settings. A model that has high performance at one institution may suffer a significant decline in performance when tested at other institutions. While pooling datasets from multiple institutions and retraining may provide a straightforward solution, it is often infeasible and may compromise patient privac… ▽ More

    Submitted 24 March, 2021; originally announced March 2021.

    Comments: First three authors contributed equally

  41. arXiv:2103.05806  [pdf, other

    physics.optics cond-mat.mtrl-sci

    Optimization of wide-band quasi-omnidirectional 1-D photonic structures

    Authors: Victor Castillo-Gallardo, Luis Eduardo Puente-Díaz, David Ariza-Flores, Héctor Pérez-Aguilar, W. Luis Mochán, Vivechana Agarwal

    Abstract: We have designed, optimized, fabricated and characterized highly reflective quasi-omnidirectional (angular range of $0-60^\circ$) multilayered structures with a wide spectral range. Two techniques, chirping (a continuous change in thicknesses) and stacking of Bragg-type sub-structures, have been used to enhance the reflectance with minimum thickness for a given pair of refractive indices. Numerica… ▽ More

    Submitted 23 March, 2021; v1 submitted 9 March, 2021; originally announced March 2021.

    Comments: 11 páginas

  42. arXiv:2103.04442  [pdf, other

    cs.CY

    Differential Tracking Across Topical Webpages of Indian News Media

    Authors: Yash Vekaria, Vibhor Agarwal, Pushkal Agarwal, Sangeeta Mahapatra, Sakthi Balan Muthiah, Nishanth Sastry, Nicolas Kourtellis

    Abstract: Online user privacy and tracking have been extensively studied in recent years, especially due to privacy and personal data-related legislations in the EU and the USA, such as the General Data Protection Regulation, ePrivacy Regulation, and California Consumer Privacy Act. Research has revealed novel tracking and personal identifiable information leakage methods that first- and third-parties emplo… ▽ More

    Submitted 7 March, 2021; originally announced March 2021.

  43. arXiv:2102.03656  [pdf, other

    cs.CY

    Under the Spotlight: Web Tracking in Indian Partisan News Websites

    Authors: Vibhor Agarwal, Yash Vekaria, Pushkal Agarwal, Sangeeta Mahapatra, Shounak Set, Sakthi Balan Muthiah, Nishanth Sastry, Nicolas Kourtellis

    Abstract: India is experiencing intense political partisanship and sectarian divisions. The paper performs, to the best of our knowledge, the first comprehensive analysis on the Indian online news media with respect to tracking and partisanship. We build a dataset of 103 online, mostly mainstream news websites. With the help of two experts, alongside data from the Media Ownership Monitor of the Reporters wi… ▽ More

    Submitted 8 March, 2021; v1 submitted 6 February, 2021; originally announced February 2021.

  44. Analytical Model for the Current Density in the Electrochemical Synthesis of Porous Silicon Structures with a Lateral Gradient

    Authors: C. A. Ospina-Delacruz, V. Agarwal, W. L. Mochán

    Abstract: Layered optical devices with a lateral gradient can be fabricated through electrochemical synthesis of porous silicon (PS) using a position dependent etching current density $\bm j(\bm r_\|)$. Predicting the local value of $\bm j(\bm r_\|)$ and the corresponding porosity $p(\bm r_\|)$ and etching rate $v(\bm r_\|)$ is desirable for their systematic design. We develop a simple analytical model for… ▽ More

    Submitted 22 January, 2021; originally announced January 2021.

    Comments: 19 pages, 13 figures

    Journal ref: Optical Materials, Volume 113, 110859 (2021)

  45. arXiv:2101.03025  [pdf, other

    cs.CL cs.LG

    EmpLite: A Lightweight Sequence Labeling Model for Emphasis Selection of Short Texts

    Authors: Vibhav Agarwal, Sourav Ghosh, Kranti Chalamalasetti, Bharath Challa, Sonal Kumari, Harshavardhana, Barath Raj Kandur Raja

    Abstract: Word emphasis in textual content aims at conveying the desired intention by changing the size, color, typeface, style (bold, italic, etc.), and other typographical features. The emphasized words are extremely helpful in drawing the readers' attention to specific information that the authors wish to emphasize. However, performing such emphasis using a soft keyboard for social media interactions is… ▽ More

    Submitted 15 December, 2020; originally announced January 2021.

    Comments: Accepted for publication in ICON 2020: 17th International Conference on Natural Language Processing

    Report number: 2020.icon-1.3 (ACL Anthology)

    Journal ref: 17th International Conference on Natural Language Processing (ICON), Patna, India, December 18 - 21, 2020, pages 19-26, ACL Anthology: 2020.icon-1.3

  46. LiteMuL: A Lightweight On-Device Sequence Tagger using Multi-task Learning

    Authors: Sonal Kumari, Vibhav Agarwal, Bharath Challa, Kranti Chalamalasetti, Sourav Ghosh, Harshavardhana, Barath Raj Kandur Raja

    Abstract: Named entity detection and Parts-of-speech tagging are the key tasks for many NLP applications. Although the current state of the art methods achieved near perfection for long, formal, structured text there are hindrances in deploying these models on memory-constrained devices such as mobile phones. Furthermore, the performance of these models is degraded when they encounter short, informal, and c… ▽ More

    Submitted 29 March, 2021; v1 submitted 15 December, 2020; originally announced January 2021.

    Comments: Published in 2021 IEEE 15th International Conference on Semantic Computing (ICSC); Candidate for Best Paper Award

    Journal ref: 2021 IEEE 15th International Conference on Semantic Computing (ICSC), Laguna Hills, CA, USA, 2021, pp. 1-8

  47. arXiv:2011.05910  [pdf, other

    cs.CL cs.AI

    Audrey: A Personalized Open-Domain Conversational Bot

    Authors: Chung Hoon Hong, Yuan Liang, Sagnik Sinha Roy, Arushi Jain, Vihang Agarwal, Ryan Draves, Zhizhuo Zhou, William Chen, Yujian Liu, Martha Miracky, Lily Ge, Nikola Banovic, David Jurgens

    Abstract: Conversational Intelligence requires that a person engage on informational, personal and relational levels. Advances in Natural Language Understanding have helped recent chatbots succeed at dialog on the informational level. However, current techniques still lag for conversing with humans on a personal level and fully relating to them. The University of Michigan's submission to the Alexa Prize Gra… ▽ More

    Submitted 11 November, 2020; originally announced November 2020.

  48. arXiv:2010.10156  [pdf, other

    cs.AI cs.CL

    Extracting Procedural Knowledge from Technical Documents

    Authors: Shivali Agarwal, Shubham Atreja, Vikas Agarwal

    Abstract: Procedures are an important knowledge component of documents that can be leveraged by cognitive assistants for automation, question-answering or driving a conversation. It is a challenging problem to parse big dense documents like product manuals, user guides to automatically understand which parts are talking about procedures and subsequently extract them. Most of the existing research has focuse… ▽ More

    Submitted 20 October, 2020; originally announced October 2020.

  49. arXiv:2005.00943  [pdf, other

    physics.optics cond-mat.mes-hall

    Stable Calculation of Optical Properties of Large Non-Periodic Dissipative Multilayered Systems

    Authors: Luis Eduardo Puente-Díaz, Victor Castillo-Gallardo, Guillermo P. Ortiz, José Samuel Pérez-Huerta, Héctor Pérez-Aguilar, Vivechana Agarwal, W. Luis Mochán

    Abstract: The calculation of the transfer matrix for a large non-periodic multilayered system may become unstable in the presence of absorption. We discuss the origin of this instability and we explore two methods to overcome it: the use of a total matrix to solve for all the fields at all the interfaces simultaneously and an expansion in the Bloch-like modes of a periodic artificially repeated system. We a… ▽ More

    Submitted 2 May, 2020; originally announced May 2020.

    Comments: 20 pages, 7 figures

  50. arXiv:2003.13440  [pdf

    eess.IV cs.CV

    Computer Aided Detection for Pulmonary Embolism Challenge (CAD-PE)

    Authors: Germán González, Daniel Jimenez-Carretero, Sara Rodríguez-López, Carlos Cano-Espinosa, Miguel Cazorla, Tanya Agarwal, Vinit Agarwal, Nima Tajbakhsh, Michael B. Gotway, Jianming Liang, Mojtaba Masoudi, Noushin Eftekhari, Mahdi Saadatmand, Hamid-Reza Pourreza, Patricia Fraga-Rivas, Eduardo Fraile, Frank J. Rybicki, Ara Kassarjian, Raúl San José Estépar, Maria J. Ledesma-Carbayo

    Abstract: Rationale: Computer aided detection (CAD) algorithms for Pulmonary Embolism (PE) algorithms have been shown to increase radiologists' sensitivity with a small increase in specificity. However, CAD for PE has not been adopted into clinical practice, likely because of the high number of false positives current CAD software produces. Objective: To generate a database of annotated computed tomography… ▽ More

    Submitted 30 March, 2020; originally announced March 2020.

    Comments: 8 pages, 3 figures