Showing 1–50 of 121 results for author: Ramakrishnan, N

  1. arXiv:2506.00765  [pdf, ps, other]

    cs.AI

    HouseTS: A Large-Scale, Multimodal Spatiotemporal U.S. Housing Dataset

    Authors: Shengkun Wang, Yanshen Sun, Fanglan Chen, Linhan Wang, Naren Ramakrishnan, Chang-Tien Lu, Yinlin Chen

    Abstract: Accurate house-price forecasting is essential for investors, planners, and researchers. However, reproducible benchmarks with sufficient spatiotemporal depth and contextual richness for long-horizon prediction remain scarce. To address this, we introduce HouseTS, a large-scale, multimodal dataset covering monthly house prices from March 2012 to December 2023 across 6,000 ZIP codes in 30 major U.S.…

    Submitted 31 May, 2025; originally announced June 2025.

  2. arXiv:2505.18485  [pdf, ps, other]

    cs.LG

    The Prompt is Mightier than the Example

    Authors: Shengzhe Xu, Nikhil Muralidhar, Naren Ramakrishnan

    Abstract: Numerous recent prompt optimization approaches, like chain-of-thought, have been demonstrated to significantly improve the quality of content generated by large language models (LLMs). In-context learning (ICL), a recent paradigm where a few representative examples guide content generation, has also led to strong improvements in the quality of LLM-generated content. This idea has been applied…

    Submitted 23 May, 2025; originally announced May 2025.

  3. arXiv:2505.17135  [pdf, ps, other]

    cs.CL

    When can isotropy help adapt LLMs' next word prediction to numerical domains?

    Authors: Rashed Shelim, Shengzhe Xu, Walid Saad, Naren Ramakrishnan

    Abstract: Recent studies have shown that vector representations of contextual embeddings learned by pre-trained large language models (LLMs) are effective in various downstream tasks in numerical domains. Despite their significant benefits, the tendency of LLMs to hallucinate in such domains can have severe consequences in applications such as energy, nature, finance, healthcare, retail and transportation,…

    Submitted 4 June, 2025; v1 submitted 22 May, 2025; originally announced May 2025.

  4. arXiv:2505.01712  [pdf, other]

    cs.AI cs.NI

    World Model-Based Learning for Long-Term Age of Information Minimization in Vehicular Networks

    Authors: Lingyi Wang, Rashed Shelim, Walid Saad, Naren Ramakrishnan

    Abstract: Traditional reinforcement learning (RL)-based learning approaches for wireless networks rely on expensive trial-and-error mechanisms and real-time feedback based on extensive environment interactions, which leads to low data efficiency and short-sighted policies. These limitations become particularly problematic in complex, dynamic networks with high uncertainty and long-term planning requirements…

    Submitted 3 May, 2025; originally announced May 2025.

  5. arXiv:2502.15177  [pdf, other]

    cs.LG cs.CY

    Optimizing Product Provenance Verification using Data Valuation Methods

    Authors: Raquib Bin Yousuf, Hoang Anh Just, Shengzhe Xu, Brian Mayer, Victor Deklerck, Jakub Truszkowski, John C. Simeone, Jade Saunders, Chang-Tien Lu, Ruoxi Jia, Naren Ramakrishnan

    Abstract: Determining and verifying product provenance remains a critical challenge in global supply chains, particularly as geopolitical conflicts and shifting borders create new incentives for misrepresentation of commodities, such as hiding the origin of illegally harvested timber or agriculture grown on illegally cleared land. Stable Isotope Ratio Analysis (SIRA), combined with Gaussian process regressi…

    Submitted 16 March, 2025; v1 submitted 20 February, 2025; originally announced February 2025.

  6. arXiv:2502.14115  [pdf, other]

    cs.LG cs.CE cs.CY

    Chasing the Timber Trail: Machine Learning to Reveal Harvest Location Misrepresentation

    Authors: Shailik Sarkar, Raquib Bin Yousuf, Linhan Wang, Brian Mayer, Thomas Mortier, Victor Deklerck, Jakub Truszkowski, John C. Simeone, Marigold Norman, Jade Saunders, Chang-Tien Lu, Naren Ramakrishnan

    Abstract: Illegal logging poses a significant threat to global biodiversity and climate stability, and depresses international prices for legal wood harvesting and responsible forest products trade, affecting livelihoods and communities across the globe. Stable isotope ratio analysis (SIRA) is rapidly becoming an important tool for determining the harvest location of traded organic products. The spatial patt…

    Submitted 16 March, 2025; v1 submitted 19 February, 2025; originally announced February 2025.

    Comments: 9 pages, 5 figures

    ACM Class: J.m; K.4.1; I.2.0; J.2

  7. arXiv:2502.13019  [pdf, other]

    cs.CL

    Oreo: A Plug-in Context Reconstructor to Enhance Retrieval-Augmented Generation

    Authors: Sha Li, Naren Ramakrishnan

    Abstract: Retrieval-Augmented Generation (RAG) aims to augment the capabilities of Large Language Models (LLMs) by retrieving and incorporating external documents or chunks prior to generation. However, even improved retriever relevance can bring erroneous or contextually distracting information, undermining the effectiveness of RAG in downstream tasks. We introduce a compact, efficient, and pluggable module…

    Submitted 26 April, 2025; v1 submitted 18 February, 2025; originally announced February 2025.

    Comments: 16 pages

  8. arXiv:2502.10976  [pdf, other]

    cs.IR cs.AI cs.CL cs.LG

    QuOTE: Question-Oriented Text Embeddings

    Authors: Andrew Neeser, Kaylen Latimer, Aadyant Khatri, Chris Latimer, Naren Ramakrishnan

    Abstract: We present QuOTE (Question-Oriented Text Embeddings), a novel enhancement to retrieval-augmented generation (RAG) systems, aimed at improving document representation for accurate and nuanced retrieval. Unlike traditional RAG pipelines, which rely on embedding raw text chunks, QuOTE augments chunks with hypothetical questions that the chunk can potentially answer, enriching the representation space…

    Submitted 15 February, 2025; originally announced February 2025.

    ACM Class: H.3

  9. arXiv:2502.07591  [pdf, other]

    cs.LG cs.AI

    DMWM: Dual-Mind World Model with Long-Term Imagination

    Authors: Lingyi Wang, Rashed Shelim, Walid Saad, Naren Ramakrishnan

    Abstract: Imagination in world models is crucial for enabling agents to learn long-horizon policy in a sample-efficient manner. Existing recurrent state-space model (RSSM)-based world models depend on single-step statistical inference to capture the environment dynamics, and, hence, they are unable to perform long-term imagination tasks due to the accumulation of prediction errors. Inspired by the dual-proc…

    Submitted 11 February, 2025; originally announced February 2025.

  10. arXiv:2411.16116  [pdf, other]

    cs.CL cs.AI

    LLM Augmentations to support Analytical Reasoning over Multiple Documents

    Authors: Raquib Bin Yousuf, Nicholas Defelice, Mandar Sharma, Shengzhe Xu, Naren Ramakrishnan

    Abstract: Building on their demonstrated ability to perform a variety of tasks, we investigate the application of large language models (LLMs) to enhance in-depth analytical reasoning within the context of intelligence analysis. Intelligence analysts typically work with massive dossiers to draw connections between seemingly unrelated entities, and uncover adversaries' plans and motives. We explore if and ho…

    Submitted 25 November, 2024; originally announced November 2024.

    Comments: 2024 IEEE International Conference on Big Data (IEEE BigData 2024)

  11. arXiv:2409.17289  [pdf, other]

    cs.HC

    Steering LLM Summarization with Visual Workspaces for Sensemaking

    Authors: Xuxin Tang, Eric Krokos, Can Liu, Kylie Davidson, Kirsten Whitley, Naren Ramakrishnan, Chris North

    Abstract: Large Language Models (LLMs) have been widely applied in summarization due to their speedy and high-quality text generation. Summarization for sensemaking involves information compression and insight extraction. Human guidance in sensemaking tasks can prioritize and cluster relevant information for LLMs. However, users must translate their cognitive thinking into natural language to communicate wi…

    Submitted 25 September, 2024; originally announced September 2024.

    Comments: 11 figures, 7 pages

  12. arXiv:2406.14541  [pdf, other]

    cs.LG

    Why LLMs Are Bad at Synthetic Table Generation (and what to do about it)

    Authors: Shengzhe Xu, Cho-Ting Lee, Mandar Sharma, Raquib Bin Yousuf, Nikhil Muralidhar, Naren Ramakrishnan

    Abstract: Synthetic data generation is integral to ML pipelines, e.g., to augment training data, replace sensitive information, and even to power advanced platforms like DeepSeek. While LLMs fine-tuned for synthetic data generation are gaining traction, synthetic table generation -- a critical data type in business and science -- remains under-explored compared to text and image synthesis. This paper shows…

    Submitted 13 March, 2025; v1 submitted 20 June, 2024; originally announced June 2024.

  13. arXiv:2406.14005  [pdf, other]

    cs.CL cs.AI cs.LG

    Information Guided Regularization for Fine-tuning Language Models

    Authors: Mandar Sharma, Nikhil Muralidhar, Shengzhe Xu, Raquib Bin Yousuf, Naren Ramakrishnan

    Abstract: The pretraining-fine-tuning paradigm has been the de facto strategy for transfer learning in modern language modeling. With the understanding that task adaptation in LMs is often a function of parameters shared across tasks, we argue that a more surgical approach to regularization needs to exist for smoother transfer learning. Towards this end, we investigate how the pretraining loss landscape is…

    Submitted 21 June, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

  14. arXiv:2406.13073  [pdf, other]

    cs.LG cs.CR cs.CV

    Let the Noise Speak: Harnessing Noise for a Unified Defense Against Adversarial and Backdoor Attacks

    Authors: Md Hasan Shahriar, Ning Wang, Naren Ramakrishnan, Y. Thomas Hou, Wenjing Lou

    Abstract: The exponential adoption of machine learning (ML) is propelling the world into a future of distributed and intelligent automation and data-driven solutions. However, the proliferation of malicious data manipulation attacks against ML, namely adversarial and backdoor attacks, jeopardizes its reliability in safety-critical applications. The existing detection methods are attack-specific and built up…

    Submitted 13 April, 2025; v1 submitted 18 June, 2024; originally announced June 2024.

    Comments: 20 pages, 9 figures

  15. arXiv:2406.10453  [pdf, other]

    eess.SY

    Fast Geometric Learning of MIMO Signal Detection over Grassmannian Manifolds

    Authors: Rashed Shelim, Walid Saad, Naren Ramakrishnan

    Abstract: Domain or statistical distribution shifts are a key staple of the wireless communication channel, because of the dynamics of the environment. Deep learning (DL) models for detecting multiple-input multiple-output (MIMO) signals in dynamic communication require large training samples (in the order of hundreds of thousands to millions) and online retraining to adapt to domain shift. Some dynamic net…

    Submitted 3 August, 2024; v1 submitted 14 June, 2024; originally announced June 2024.

  16. Laying Anchors: Semantically Priming Numerals in Language Modeling

    Authors: Mandar Sharma, Rutuja Murlidhar Taware, Pravesh Koirala, Nikhil Muralidhar, Naren Ramakrishnan

    Abstract: Off-the-shelf pre-trained language models have become the de facto standard in NLP pipelines for a multitude of downstream tasks. However, the inability of these models to properly encode numerals limits their performance on tasks requiring numeric comprehension. We introduce strategies to semantically prime numerals in any corpus by generating anchors governed by the distribution of numerals in s…

    Submitted 7 August, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

    Comments: Accepted to the findings of NAACL 2024

  17. arXiv:2402.01748  [pdf, other]

    cs.NI cs.AI cs.CL cs.LG

    Large Multi-Modal Models (LMMs) as Universal Foundation Models for AI-Native Wireless Systems

    Authors: Shengzhe Xu, Christo Kurisummoottil Thomas, Omar Hashash, Nikhil Muralidhar, Walid Saad, Naren Ramakrishnan

    Abstract: Large language models (LLMs) and foundation models have been recently touted as a game-changer for 6G systems. However, recent efforts on LLMs for wireless networks are limited to a direct application of existing language models that were designed for natural language processing (NLP) applications. To address this challenge and create wireless-centric foundation models, this paper presents a compr…

    Submitted 7 February, 2024; v1 submitted 29 January, 2024; originally announced February 2024.

  18. arXiv:2305.08246  [pdf, other]

    cs.CL cs.AI cs.LG

    Learning Non-linguistic Skills without Sacrificing Linguistic Proficiency

    Authors: Mandar Sharma, Nikhil Muralidhar, Naren Ramakrishnan

    Abstract: The field of Math-NLP has witnessed significant growth in recent years, motivated by the desire to expand LLM performance to the learning of non-linguistic notions (numerals, and subsequently, arithmetic reasoning). However, non-linguistic skill injection typically comes at a cost for LLMs: it leads to catastrophic forgetting of core linguistic skills, a consequence that often remains unaddressed…

    Submitted 14 May, 2023; originally announced May 2023.

    Comments: Accepted to ACL 2023's main conference

  19. arXiv:2302.14778  [pdf, other]

    q-bio.PE q-bio.QM

    The self-organization of selfishness: Reinforcement Learning shows how selfish behavior can emerge from agent-environment interaction dynamics

    Authors: Aamir Sahil Chandroth, Nithya Ramakrishnan, Sanjay Chandrasekharan

    Abstract: When biological communities use signaling structures for complex coordination, 'free-riders' emerge. The free-riding agents do not contribute to the community resources (signals), but exploit them. Most models of such 'selfish' behavior consider free-riding as evolving through mutation and selection. Over generations, the mutation -- which is considered to create a stable trait -- spreads through…

    Submitted 28 March, 2023; v1 submitted 28 February, 2023; originally announced February 2023.

    Comments: 9 pages, 16 figs, 1 table. Reinforcement Learning - Parametric Analysis, Social Behavior

  20. Channel Simulation: Finite Blocklengths and Broadcast Channels

    Authors: Michael X. Cao, Navneeth Ramakrishnan, Mario Berta, Marco Tomamichel

    Abstract: We study channel simulation under common randomness assistance in the finite-blocklength regime and identify the smooth channel max-information as a linear program one-shot converse on the minimal simulation cost for fixed error tolerance. We show that this one-shot converse can be achieved exactly using no-signaling-assisted codes, and approximately achieved using common randomness-assisted codes…

    Submitted 5 August, 2024; v1 submitted 22 December, 2022; originally announced December 2022.

    Comments: 38 pages, 6 figures

  21. arXiv:2211.02098  [pdf, other]

    cs.CL cs.AI cs.LG

    Overcoming Barriers to Skill Injection in Language Modeling: Case Study in Arithmetic

    Authors: Mandar Sharma, Nikhil Muralidhar, Naren Ramakrishnan

    Abstract: Through their transfer learning abilities, highly-parameterized large pre-trained language models have dominated the NLP landscape for a multitude of downstream language tasks. Though linguistically proficient, the inability of these models to incorporate the learning of non-linguistic entities (numerals and arithmetic reasoning) limits their usage for tasks that require numeric comprehension or s…

    Submitted 3 November, 2022; originally announced November 2022.

    Comments: NeurIPS 2022: Math-AI Workshop

  22. arXiv:2210.13994  [pdf, other]

    cs.CV

    Minutiae-Guided Fingerprint Embeddings via Vision Transformers

    Authors: Steven A. Grosz, Joshua J. Engelsma, Rajeev Ranjan, Naveen Ramakrishnan, Manoj Aggarwal, Gerard G. Medioni, Anil K. Jain

    Abstract: Minutiae matching has long dominated the field of fingerprint recognition. However, deep networks can be used to extract fixed-length embeddings from fingerprints. To date, the few studies that have explored the use of CNN architectures to extract such embeddings have shown extreme promise. Inspired by these early works, we propose the first use of a Vision Transformer (ViT) to learn a discriminat…

    Submitted 25 October, 2022; v1 submitted 25 October, 2022; originally announced October 2022.

  23. arXiv:2210.02841  [pdf, other]

    cs.CR cs.LG

    Detecting Irregular Network Activity with Adversarial Learning and Expert Feedback

    Authors: Gopikrishna Rathinavel, Nikhil Muralidhar, Timothy O'Shea, Naren Ramakrishnan

    Abstract: Anomaly detection is a ubiquitous and challenging task relevant across many disciplines. With the vital role communication networks play in our daily lives, the security of these networks is imperative for smooth functioning of society. To this end, we propose a novel self-supervised deep learning framework CAAD for anomaly detection in wireless communication systems. Specifically, CAAD employs co…

    Submitted 15 October, 2022; v1 submitted 1 October, 2022; originally announced October 2022.

    Comments: 12 pages, 6 figures

  24. arXiv:2208.02867  [pdf, other]

    math.OC cs.AI cs.NE

    Memetic algorithms for Spatial Partitioning problems

    Authors: Subhodip Biswas, Fanglan Chen, Zhiqian Chen, Chang-Tien Lu, Naren Ramakrishnan

    Abstract: Spatial optimization problems (SOPs) are characterized by spatial relationships governing the decision variables, objectives, and/or constraint functions. In this article, we focus on a specific type of SOP called spatial partitioning, which is a combinatorial problem due to the presence of discrete spatial units. Exact optimization methods do not scale with the size of the problem, especially wit…

    Submitted 4 August, 2022; originally announced August 2022.

    Comments: 32 pages, accepted at ACM Transactions on Spatial Algorithms and Systems: Special issue on the Best Papers from the 2020 ACM SIGSPATIAL Conference

    ACM Class: G.1.6; I.2.8

  25. arXiv:2208.00493  [pdf, other]

    cs.LG cs.AI

    Scrutinizing Shipment Records To Thwart Illegal Timber Trade

    Authors: Debanjan Datta, Sathappan Muthiah, John Simeone, Amelia Meadows, Naren Ramakrishnan

    Abstract: Timber and forest products made from wood, like furniture, are valuable commodities, and like the global trade of many highly-valued natural resources, face challenges of corruption, fraud, and illegal harvesting. These grey and black market activities in the wood and forest products sector are not limited to the countries where the wood was harvested, but extend throughout the global supply chain…

    Submitted 31 July, 2022; originally announced August 2022.

    Comments: Accepted in Proceedings of 6th Outlier Detection and Description Workshop, ACM SigKDD 2021 https://oddworkshop.github.io/assets/papers/7.pdf. arXiv admin note: substantial text overlap with arXiv:2104.01156

  26. arXiv:2207.12571  [pdf, other]

    cs.CL

    Innovations in Neural Data-to-text Generation: A Survey

    Authors: Mandar Sharma, Ajay Gogineni, Naren Ramakrishnan

    Abstract: The neural boom that has sparked natural language processing (NLP) research through the last decade has similarly led to significant innovations in data-to-text generation (DTG). This survey offers a consolidated view into the neural DTG paradigm with a structured examination of the approaches, benchmark datasets, and evaluation protocols. This survey draws boundaries separating DTG from the rest…

    Submitted 1 April, 2024; v1 submitted 25 July, 2022; originally announced July 2022.

    Comments: Accepted to ACM Transactions on Intelligent Systems and Technology 2024

  27. arXiv:2207.04029  [pdf, other]

    cs.IR cs.AI

    Lessons from Deep Learning applied to Scholarly Information Extraction: What Works, What Doesn't, and Future Directions

    Authors: Raquib Bin Yousuf, Subhodip Biswas, Kulendra Kumar Kaushal, James Dunham, Rebecca Gelles, Sathappan Muthiah, Nathan Self, Patrick Butler, Naren Ramakrishnan

    Abstract: Understanding key insights from full-text scholarly articles is essential as it enables us to determine interesting trends, give insight into the research and development, and build knowledge graphs. However, some of the interesting key insights are only available when considering full-text. Although researchers have made significant progress in information extraction from short documents, extract…

    Submitted 8 July, 2022; originally announced July 2022.

    Comments: ACM KDD 2022 Workshop on Data-driven Science of Science

    ACM Class: I.2; I.2.7; H.3

  28. arXiv:2206.14384  [pdf, other]

    cs.LG cs.AI stat.ME

    Framing Algorithmic Recourse for Anomaly Detection

    Authors: Debanjan Datta, Feng Chen, Naren Ramakrishnan

    Abstract: The problem of algorithmic recourse has been explored for supervised machine learning models, to provide more interpretable, transparent and robust outcomes from decision support systems. An unexplored area is that of algorithmic recourse for anomaly detection, specifically for tabular data with only discrete feature values. Here the problem is to present a set of counterfactuals that are deemed n…

    Submitted 28 June, 2022; originally announced June 2022.

    Comments: ACM SigKDD 2022, Research Track

  29. arXiv:2206.03703  [pdf, other]

    cs.AI math.NA stat.AP

    Sampling-based techniques for designing school boundaries

    Authors: Subhodip Biswas, Fanglan Chen, Zhiqian Chen, Chang-Tien Lu, Naren Ramakrishnan

    Abstract: Recently, an increasing number of researchers, especially in the realm of political redistricting, have proposed sampling-based techniques to generate a subset of plans from the vast space of districting plans. These techniques have been increasingly adopted by U.S. courts of law and independent commissions as a tool for identifying partisan gerrymanders. Motivated by these recent developments, we…

    Submitted 8 June, 2022; originally announced June 2022.

    Comments: 11 pages, 4 figures

    ACM Class: I.2.1; I.5.3; I.2.8; G.3; G.1.6

  30. arXiv:2204.02531  [pdf, other]

    cs.CL cs.AI

    Improving Zero-Shot Event Extraction via Sentence Simplification

    Authors: Sneha Mehta, Huzefa Rangwala, Naren Ramakrishnan

    Abstract: The success of sites such as ACLED and Our World in Data has demonstrated the massive utility of extracting events in structured formats from large volumes of textual data in the form of news, social media, blogs and discussion forums. Event extraction can provide a window into ongoing geopolitical crises and yield actionable intelligence. With the proliferation of large pretrained language model…

    Submitted 5 April, 2022; originally announced April 2022.

  31. arXiv:2203.05983  [pdf, other]

    cs.CV

    PseudoProp: Robust Pseudo-Label Generation for Semi-Supervised Object Detection in Autonomous Driving Systems

    Authors: Shu Hu, Chun-Hao Liu, Jayanta Dutta, Ming-Ching Chang, Siwei Lyu, Naveen Ramakrishnan

    Abstract: Semi-supervised object detection methods are widely used in autonomous driving systems, where only a fraction of objects are labeled. To propagate information from the labeled objects to the unlabeled ones, pseudo-labels for unlabeled objects must be generated. Although pseudo-labels have proven to improve the performance of semi-supervised object detection significantly, the applications of image…

    Submitted 16 April, 2022; v1 submitted 11 March, 2022; originally announced March 2022.

    Comments: Accepted by the Workshop on Autonomous Driving (WAD) at CVPR 2022

  32. arXiv:2203.02095  [pdf, other]

    cs.LG cs.AR cs.CR eess.SP

    Contrastive Graph Convolutional Networks for Hardware Trojan Detection in Third Party IP Cores

    Authors: Nikhil Muralidhar, Abdullah Zubair, Nathanael Weidler, Ryan Gerdes, Naren Ramakrishnan

    Abstract: The availability of wide-ranging third-party intellectual property (3PIP) cores enables integrated circuit (IC) designers to focus on designing high-level features in ASICs/SoCs. The massive proliferation of ICs brings with it an increased number of bad actors seeking to exploit those circuits for various nefarious reasons. This is not surprising as integrated circuits affect every aspect of socie…

    Submitted 3 March, 2022; originally announced March 2022.

    Journal ref: IEEE International Symposium on Hardware Oriented Security and Trust (HOST), 2021, pp. 181-191

  33. arXiv:2202.10446  [pdf, other]

    cs.LG physics.soc-ph q-bio.PE stat.AP

    EINNs: Epidemiologically-informed Neural Networks

    Authors: Alexander Rodríguez, Jiaming Cui, Naren Ramakrishnan, Bijaya Adhikari, B. Aditya Prakash

    Abstract: We introduce EINNs, a framework crafted for epidemic forecasting that builds upon the theoretical grounds provided by mechanistic models as well as the data-driven expressibility afforded by AI models, and their capabilities to ingest heterogeneous information. Although neural forecasting models have been successful in multiple tasks, predictions well-correlated with epidemic trends and long-term…

    Submitted 10 January, 2023; v1 submitted 21 February, 2022; originally announced February 2022.

    Comments: Appears in AAAI 2023

  34. Moderate deviation expansion for fully quantum tasks

    Authors: Navneeth Ramakrishnan, Marco Tomamichel, Mario Berta

    Abstract: The moderate deviation regime is concerned with the finite block length trade-off between communication cost and error for information processing tasks in the asymptotic regime, where the communication cost approaches a capacity-like quantity and the error vanishes at the same time. We find exact characterisations of these trade-offs for a variety of fully quantum communication tasks, including qu…

    Submitted 8 October, 2023; v1 submitted 14 December, 2021; originally announced December 2021.

    Comments: 32 pages

    Journal ref: IEEE Transactions on Information Theory 69(8), 5041-5059 (2023)

  35. arXiv:2111.05199  [pdf, other]

    cs.LG

    Deep diffusion-based forecasting of COVID-19 by incorporating network-level mobility information

    Authors: Padmaksha Roy, Shailik Sarkar, Subhodip Biswas, Fanglan Chen, Zhiqian Chen, Naren Ramakrishnan, Chang-Tien Lu

    Abstract: Modeling the spatiotemporal nature of the spread of infectious diseases can provide useful intuition in understanding the time-varying aspect of the disease spread and the underlying complex spatial dependency observed in people's mobility patterns. Besides, the county level multiple related time series information can be leveraged to make a forecast on an individual time series. Adding to this ch…

    Submitted 9 November, 2021; originally announced November 2021.

    Comments: 8 pages

    ACM Class: K.5

    Journal ref: Published as conference paper at ASONAM 2021, Research Track

  36. arXiv:2110.05633  [pdf, other]

    cs.CL cs.AI

    TCube: Domain-Agnostic Neural Time-series Narration

    Authors: Mandar Sharma, John S. Brownstein, Naren Ramakrishnan

    Abstract: The task of generating rich and fluent narratives that aptly describe the characteristics, trends, and anomalies of time-series data is invaluable to the sciences (geology, meteorology, epidemiology) or finance (trades, stocks, or sales and inventory). The efforts for time-series narration hitherto are domain-specific and use predefined templates that offer consistency but lead to mechanical narra…

    Submitted 11 October, 2021; originally announced October 2021.

    Comments: To be published in IEEE ICDM 2021

  37. arXiv:2107.00079  [pdf, other]

    cs.LG

    Using AntiPatterns to avoid MLOps Mistakes

    Authors: Nikhil Muralidhar, Sathappan Muthiah, Patrick Butler, Manish Jain, Yu Yu, Katy Burne, Weipeng Li, David Jones, Prakash Arunachalam, Hays 'Skip' McCormick, Naren Ramakrishnan

    Abstract: We describe lessons learned from developing and deploying machine learning models at scale across the enterprise in a range of financial analytics applications. These lessons are presented in the form of antipatterns. Just as design patterns codify best software engineering practices, antipatterns provide a vocabulary to describe defective practices and methodologies. Here we catalog and document…

    Submitted 30 June, 2021; originally announced July 2021.

  38. arXiv:2104.01156  [pdf, other]

    cs.LG

    Detecting Anomalies Through Contrast in Heterogeneous Data

    Authors: Debanjan Datta, Sathappan Muthiah, Naren Ramakrishnan

    Abstract: Detecting anomalies has been a fundamental approach in detecting potentially fraudulent activities. Tasked with detection of illegal timber trade that threatens ecosystems and economies and is associated with other illegal activities, we formulate our problem as one of anomaly detection. Among other challenges, annotations are unavailable for our large-scale trade data with heterogeneous features (ca…

    Submitted 2 April, 2021; originally announced April 2021.

  39. arXiv:2101.10247  [pdf, other]

    cs.LG

    Incorporating Expert Guidance in Epidemic Forecasting

    Authors: Alexander Rodríguez, Bijaya Adhikari, Naren Ramakrishnan, B. Aditya Prakash

    Abstract: Forecasting influenza-like illnesses (ILI) has rapidly progressed in recent years from an art to a science with a plethora of data-driven methods. While these methods have achieved qualified success, their applicability is limited due to their inability to incorporate expert feedback and guidance systematically into the forecasting framework. We propose a new approach leveraging the Seldonian opti…

    Submitted 24 December, 2020; originally announced January 2021.

    Comments: Appears in SIGKDD 2020 epiDAMIK

  40. arXiv:2012.06453  [pdf, other]

    cs.NE cs.AI

    Better call Surrogates: A hybrid Evolutionary Algorithm for Hyperparameter optimization

    Authors: Subhodip Biswas, Adam D Cobb, Andreea Sistrunk, Naren Ramakrishnan, Brian Jalaian

    Abstract: In this paper, we propose a surrogate-assisted evolutionary algorithm (EA) for hyperparameter optimization of machine learning (ML) models. The proposed STEADE model initially estimates the objective function landscape using Radial Basis Function interpolation, and then transfers the knowledge to an EA technique called Differential Evolution that is used to evolve new solutions guided by a Bayesian…

    Submitted 11 December, 2020; originally announced December 2020.

    Comments: Accepted at the black box optimization challenge at NeurIPS 2020

  41. arXiv:2009.12740  [pdf, other]

    cs.LG cs.CR

    STAN: Synthetic Network Traffic Generation with Generative Neural Models

    Authors: Shengzhe Xu, Manish Marwah, Martin Arlitt, Naren Ramakrishnan

    Abstract: Deep learning models have achieved great success in recent years but progress in some domains like cybersecurity is stymied due to a paucity of realistic datasets. Organizations are reluctant to share such data, even internally, due to privacy reasons. An alternative is to use synthetically generated data but existing methods are limited in their ability to capture complex dependency structures, b…

    Submitted 2 August, 2021; v1 submitted 27 September, 2020; originally announced September 2020.

  42. arXiv:2009.11407  [pdf, other]

    cs.LG stat.AP

    Steering a Historical Disease Forecasting Model Under a Pandemic: Case of Flu and COVID-19

    Authors: Alexander Rodríguez, Nikhil Muralidhar, Bijaya Adhikari, Anika Tabassum, Naren Ramakrishnan, B. Aditya Prakash

    Abstract: Forecasting influenza in a timely manner aids health organizations and policymakers in adequate preparation and decision making. However, effective influenza forecasting still remains a challenge despite increasing research interest. It is even more challenging amidst the COVID pandemic, when the influenza-like illness (ILI) counts are affected by various factors such as symptomatic similarities w…

    Submitted 23 December, 2020; v1 submitted 23 September, 2020; originally announced September 2020.

    Comments: Appears in AAAI-21

  43. arXiv:2009.02649  [pdf, other]

    cs.CL

    Once Upon A Time In Visualization: Understanding the Use of Textual Narratives for Causality

    Authors: Arjun Choudhry, Mandar Sharma, Pramod Chundury, Thomas Kapler, Derek W. S. Gray, Naren Ramakrishnan, Niklas Elmqvist

    Abstract: Causality visualization can help people understand temporal chains of events, such as messages sent in a distributed system, cause and effect in a historical conflict, or the interplay between political actors over time. However, as the scale and complexity of these event sequences grow, even these visualizations can become overwhelming to use. In this paper, we propose the use of textual narrati…

    Submitted 6 September, 2020; originally announced September 2020.

    Comments: 9 pages + 2 references, 8 figures, 2 tables, IEEE VIS 2020 VAST Paper

  44. arXiv:2005.12423  [pdf, other]

    cs.SI cs.CL cs.CY cs.IR physics.soc-ph

    Racism is a Virus: Anti-Asian Hate and Counterspeech in Social Media during the COVID-19 Crisis

    Authors: Bing He, Caleb Ziems, Sandeep Soni, Naren Ramakrishnan, Diyi Yang, Srijan Kumar

    Abstract: The spread of COVID-19 has sparked racism and hate on social media targeted towards Asian communities. However, little is known about how racial hate spreads during a pandemic and the role of counterspeech in mitigating this spread. In this work, we study the evolution and spread of anti-Asian hate speech through the lens of Twitter. We create COVID-HATE, the largest dataset of anti-Asian hate and…

    Submitted 10 November, 2021; v1 submitted 25 May, 2020; originally announced May 2020.

    Comments: ASONAM 2021. The COVID-HATE dataset, annotations, and code are at http://claws.cc.gatech.edu/covid

  45. arXiv:2005.06539  [pdf, other]

    q-bio.GN

    High fidelity epigenetic inheritance: Information theoretic model predicts $k$-threshold filling of histone modifications post replication

    Authors: Nithya Ramakrishnan, Sibi Raj B Pillai, Ranjith Padinhateeri

    Abstract: Beyond the genetic code, there is another layer of information encoded as chemical modifications on histone proteins positioned along the DNA. Maintaining these modifications is crucial for survival and identity of cells. How the information encoded in the histone marks gets inherited, given that only half the parental nucleosomes are transferred to each daughter chromatin, is a puzzle. We address…

    Submitted 13 May, 2020; originally announced May 2020.

    Comments: 11 pages, 6 figures

  46. arXiv:1912.00835  [pdf, other]

    cs.CL cs.LG

    Low Rank Factorization for Compact Multi-Head Self-Attention

    Authors: Sneha Mehta, Huzefa Rangwala, Naren Ramakrishnan

    Abstract: Effective representation learning from text has been an active area of research in the fields of NLP and text mining. Attention mechanisms have been at the forefront in order to learn contextual sentence representations. Current state-of-the-art approaches for many NLP tasks use large pre-trained language models such as BERT, XLNet and so on for learning representations. These models are based on…

    Submitted 9 August, 2020; v1 submitted 26 November, 2019; originally announced December 2019.

    Comments: 9 pages, 5 figures

  47. arXiv:1911.04240  [pdf, other]

    cs.LG physics.comp-ph stat.ML

    Physics-guided Design and Learning of Neural Networks for Predicting Drag Force on Particle Suspensions in Moving Fluids

    Authors: Nikhil Muralidhar, Jie Bu, Ze Cao, Long He, Naren Ramakrishnan, Danesh Tafti, Anuj Karpatne

    Abstract: Physics-based simulations are often used to model and understand complex physical systems and processes in domains like fluid dynamics. Such simulations, although used frequently, have many limitations which could arise either due to the inability to accurately model a physical process owing to incomplete knowledge about certain facets of the process or due to the underlying process being too comp…

    Submitted 5 November, 2019; originally announced November 2019.

    MSC Class: 68T99; 76T20

  48. arXiv:1907.07590  [pdf, other]

    cs.LG stat.ML

    Mitigating Uncertainty in Document Classification

    Authors: Xuchao Zhang, Fanglan Chen, Chang-Tien Lu, Naren Ramakrishnan

    Abstract: The uncertainty measurement of classifiers' predictions is especially important in applications such as medical diagnoses that need to ensure limited human resources can focus on the most uncertain predictions returned by machine learning models. However, few existing uncertainty models attempt to improve overall prediction accuracy where human resources are involved in the text classification tas…

    Submitted 17 July, 2019; originally announced July 2019.

    Comments: Accepted by NAACL19

  49. arXiv:1905.10022  [pdf, other]

    cs.DL cs.LG stat.ML

    Patent Citation Dynamics Modeling via Multi-Attention Recurrent Networks

    Authors: Taoran Ji, Zhiqian Chen, Nathan Self, Kaiqun Fu, Chang-Tien Lu, Naren Ramakrishnan

    Abstract: Modeling and forecasting forward citations to a patent is a central task for the discovery of emerging technologies and for measuring the pulse of inventive progress. Conventional methods for forecasting these forward citations cast the problem as analysis of temporal point processes which rely on the conditional intensity of previously received citations. Recent approaches model the conditional i…

    Submitted 22 May, 2019; originally announced May 2019.

    Journal ref: IJCAI 2019

  50. Computing Quantum Channel Capacities

    Authors: Navneeth Ramakrishnan, Raban Iten, Volkher B. Scholz, Mario Berta

    Abstract: The capacity of noisy quantum channels characterizes the highest rate at which information can be reliably transmitted and it is therefore of practical as well as fundamental importance. Capacities of classical channels are computed using alternating optimization schemes, called Blahut-Arimoto algorithms. In this work, we generalize classical Blahut-Arimoto algorithms to the quantum setting. In pa…

    Submitted 1 July, 2021; v1 submitted 3 May, 2019; originally announced May 2019.

    Comments: v4: 22 pages, 4 figures, new title

    Journal ref: IEEE Transactions on Information Theory 67.2 (2020): 946-960