Showing 1–50 of 95 results for author: Suresh, S

  1. arXiv:2510.01030  [pdf, ps, other]

    cs.AI

    Uncovering the Computational Ingredients of Human-Like Representations in LLMs

    Authors: Zach Studdiford, Timothy T. Rogers, Kushin Mukherjee, Siddharth Suresh

    Abstract: The ability to translate diverse patterns of inputs into structured patterns of behavior has been thought to rest on both humans' and machines' ability to learn robust representations of relevant concepts. The rapid advancement of transformer-based large language models (LLMs) has led to a diversity of computational ingredients -- architectures, fine tuning methods, and training datasets among oth…

    Submitted 1 October, 2025; originally announced October 2025.

    Comments: 9 pages

  2. Robust Visual Localization in Compute-Constrained Environments by Salient Edge Rendering and Weighted Hamming Similarity

    Authors: Tu-Hoa Pham, Philip Bailey, Daniel Posada, Georgios Georgakis, Jorge Enriquez, Surya Suresh, Marco Dolci, Philip Twu

    Abstract: We consider the problem of vision-based 6-DoF object pose estimation in the context of the notional Mars Sample Return campaign, in which a robotic arm would need to localize multiple objects of interest for low-clearance pickup and insertion, under severely constrained hardware. We propose a novel localization algorithm leveraging a custom renderer together with a new template matching metric tai…

    Submitted 29 September, 2025; originally announced September 2025.

    Comments: To appear in IEEE Robotics and Automation Letters

  3. arXiv:2509.16369  [pdf, ps, other]

    cs.IR cs.AI cs.CE

    Enhancing Financial RAG with Agentic AI and Multi-HyDE: A Novel Approach to Knowledge Retrieval and Hallucination Reduction

    Authors: Akshay Govind Srinivasan, Ryan Jacob George, Jayden Koshy Joe, Hrushikesh Kant, Harshith M R, Sachin Sundar, Sudharshan Suresh, Rahul Vimalkanth, Vijayavallabh

    Abstract: Accurate and reliable knowledge retrieval is vital for financial question-answering, where continually updated data sources and complex, high-stakes contexts demand precision. Traditional retrieval systems rely on a single database and retriever, but financial applications require more sophisticated approaches to handle intricate regulatory filings, market analyses, and extensive multi-year report…

    Submitted 19 September, 2025; originally announced September 2025.

    Comments: 14 Pages, 8 Tables, 2 Figures. Accepted and to be published in the proceedings of FinNLP, Empirical Methods in Natural Language Processing 2025

    ACM Class: H.4; H.5; H.3.3

  4. arXiv:2509.10430  [pdf, ps, other]

    quant-ph

    Global vs. Local Discrimination of Locally Implementable Multipartite Unitaries

    Authors: Satyaki Manna, Sneha Suresh, Anandamay Das Bhowmik, Debashis Saha

    Abstract: We study single-shot distinguishability of locally implementable multipartite unitaries under Local Operations and Classical Communication (LOCC) and global operations. As unitary discrimination depends on both the choice of probing states and the measurements on the evolved states, we classify LOCC and global distinguishability into two categories: adaptive strategies, where probing states are ch…

    Submitted 12 September, 2025; originally announced September 2025.

  5. arXiv:2508.06591  [pdf, ps, other]

    cs.LG cond-mat.dis-nn cond-mat.mtrl-sci cond-mat.other cs.AI cs.CL

    Generative Artificial Intelligence Extracts Structure-Function Relationships from Plants for New Materials

    Authors: Rachel K. Luu, Jingyu Deng, Mohammed Shahrudin Ibrahim, Nam-Joon Cho, Ming Dao, Subra Suresh, Markus J. Buehler

    Abstract: Large language models (LLMs) have reshaped the research landscape by enabling new approaches to knowledge retrieval and creative ideation. Yet their application in discipline-specific experimental science, particularly in highly multi-disciplinary domains like materials science, remains limited. We present a first-of-its-kind framework that integrates generative AI with literature from hitherto-un…

    Submitted 8 August, 2025; originally announced August 2025.

  6. arXiv:2507.21476  [pdf, ps, other]

    cs.CL cs.AI

    Which LLMs Get the Joke? Probing Non-STEM Reasoning Abilities with HumorBench

    Authors: Reuben Narad, Siddharth Suresh, Jiayi Chen, Pine S. L. Dysart-Bricken, Bob Mankoff, Robert Nowak, Jifan Zhang, Lalit Jain

    Abstract: We present HumorBench, a benchmark designed to evaluate large language models' (LLMs) ability to reason about and explain sophisticated humor in cartoon captions. As reasoning models increasingly saturate existing benchmarks in mathematics and science, novel and challenging evaluations of model intelligence beyond STEM domains are essential. Reasoning is fundamentally involved in text-based humor…

    Submitted 28 July, 2025; originally announced July 2025.

  7. arXiv:2507.04888  [pdf, ps, other]

    cs.IR

    SimLab: A Platform for Simulation-based Evaluation of Conversational Information Access Systems

    Authors: Nolwenn Bernard, Sharath Chandra Etagi Suresh, Krisztian Balog, ChengXiang Zhai

    Abstract: Progress in conversational information access (CIA) systems has been hindered by the difficulty of evaluating such systems with reproducible experiments. While user simulation offers a promising solution, the lack of infrastructure and tooling to support this evaluation paradigm remains a significant barrier. To address this gap, we introduce SimLab, the first cloud-based platform providing a cent…

    Submitted 24 October, 2025; v1 submitted 7 July, 2025; originally announced July 2025.

  8. arXiv:2506.24072  [pdf, ps, other]

    cs.LO cs.CR

    Protocol insecurity with finitely many sessions and XOR

    Authors: R Ramanujam, Vaishnavi Sundararajan, S P Suresh

    Abstract: We present a different proof of the insecurity problem for XOR, solved by Chevalier, Kuesters, Rusinowitch and Turuani (2005). Our proof uses the notion of typed terms and well-typed proofs, and removes a restriction on the class of protocols to which the [CKRT05] proof applies, by introducing a slightly different (but very natural) notion of protocols, where honest agent sends are derivable fr…

    Submitted 30 June, 2025; originally announced June 2025.

  9. arXiv:2505.19333  [pdf, ps, other]

    cs.AI

    Evaluating Steering Techniques using Human Similarity Judgments

    Authors: Zach Studdiford, Timothy T. Rogers, Siddharth Suresh, Kushin Mukherjee

    Abstract: Current evaluations of Large Language Model (LLM) steering techniques focus on task-specific performance, overlooking how well steered representations align with human cognition. Using a well-established triadic similarity judgment task, we assessed steered LLMs on their ability to flexibly judge similarity between concepts based on size or kind. We found that prompt-based steering methods outperf…

    Submitted 25 May, 2025; originally announced May 2025.

    ACM Class: I.2.7

  10. arXiv:2505.13559  [pdf, ps, other]

    cs.CL cs.LG

    CS-Sum: A Benchmark for Code-Switching Dialogue Summarization and the Limits of Large Language Models

    Authors: Sathya Krishnan Suresh, Tanmay Surana, Lim Zhi Hao, Eng Siong Chng

    Abstract: Code-switching (CS) poses a significant challenge for Large Language Models (LLMs), yet its comprehensibility remains underexplored in LLMs. We introduce CS-Sum to evaluate the comprehensibility of CS by LLMs through CS dialogue to English summarization. CS-Sum is the first benchmark for CS dialogue summarization across Mandarin-English (EN-ZH), Tamil-English (EN-TA), and Malay-English (EN-MS…

    Submitted 19 May, 2025; originally announced May 2025.

    Comments: 17 pages, 5 figures and 11 tables

  11. arXiv:2505.10718  [pdf, other]

    cs.CL cs.AI cs.HC

    AI-enhanced semantic feature norms for 786 concepts

    Authors: Siddharth Suresh, Kushin Mukherjee, Tyler Giallanza, Xizheng Yu, Mia Patil, Jonathan D. Cohen, Timothy T. Rogers

    Abstract: Semantic feature norms have been foundational in the study of human conceptual knowledge, yet traditional methods face trade-offs between concept/feature coverage and verifiability of quality due to the labor-intensive nature of norming studies. Here, we introduce a novel approach that augments a dataset of human-generated feature norms with responses from large language models (LLMs) while verify…

    Submitted 15 May, 2025; originally announced May 2025.

    Comments: 8 pages, 5 figures

  12. arXiv:2505.06950  [pdf, ps, other]

    q-fin.RM q-fin.CP q-fin.ST

    Copula Analysis of Risk: A Multivariate Risk Analysis for VaR and CoVaR using Copulas and DCC-GARCH

    Authors: Aryan Singh, Paul O Reilly, Daim Sharif, Patrick Haughey, Eoghan McCarthy, Sathvika Thorali Suresh, Aakhil Anvar, Adarsh Sajeev Kumar

    Abstract: A multivariate risk analysis for VaR and CVaR using different copula families is performed on historical financial time series fitted with DCC-GARCH models. A theoretical background is provided alongside a comparison of goodness-of-fit across different copula families to estimate the validity and effectiveness of approaches discussed.

    Submitted 11 May, 2025; originally announced May 2025.

    Comments: 15 pages, 12 figures, presented as part of the CS7DS1 - Data Analytics module at Trinity College Dublin, May 2025

    MSC Class: 60G70; 62H05; 91G70 ACM Class: G.3; I.5.1; I.2.6

  13. arXiv:2504.18553  [pdf]

    cond-mat.mtrl-sci physics.app-ph

    Cracking in polymer substrates for flexible devices and its mitigation

    Authors: Anush Ranka, Madhuja Layek, Sayaka Kochiyama, Cristina Lopez-Pernia, Alicia M. Chandler, Conrad A. Kocoj, Erica Magliano, Aldo Di Carlo, Francesca Brunetti, Peijun Guo, Subra Suresh, David C. Paine, Haneesh Kesari, Nitin P. Padture

    Abstract: Mechanical reliability plays an outsized role in determining the durability of flexible electronic devices because of the significant mechanical stresses they can experience during manufacturing and operation. These devices are typically built on sheets comprising stiff thin-film electrodes on compliant polymer substrates, and it is generally assumed that the high-toughness substrates do not crack…

    Submitted 14 April, 2025; originally announced April 2025.

    Comments: 22 pages, 5 main figures, 7 supplementary figures, 2 supplementary tables, 2 supplementary notes

  14. arXiv:2502.20356  [pdf, other]

    cs.CL cs.AI cs.LG

    Bridging the Creativity Understanding Gap: Small-Scale Human Alignment Enables Expert-Level Humor Ranking in LLMs

    Authors: Kuan Lok Zhou, Jiayi Chen, Siddharth Suresh, Reuben Narad, Timothy T. Rogers, Lalit K Jain, Robert D Nowak, Bob Mankoff, Jifan Zhang

    Abstract: Large Language Models (LLMs) have shown significant limitations in understanding creative content, as demonstrated by Hessel et al. (2023)'s influential work on the New Yorker Cartoon Caption Contest (NYCCC). Their study exposed a substantial gap between LLMs and humans in humor comprehension, establishing that understanding and evaluating creative content is a key challenge in AI development. We re…

    Submitted 27 February, 2025; originally announced February 2025.

  15. arXiv:2501.17310  [pdf, ps, other]

    cs.AI cs.HC

    Probing LLM World Models: Enhancing Guesstimation with Wisdom of Crowds Decoding

    Authors: Yun-Shiuan Chuang, Sameer Narendran, Nikunj Harlalka, Alexander Cheung, Sizhe Gao, Siddharth Suresh, Junjie Hu, Timothy T. Rogers

    Abstract: Guesstimation -- the task of making approximate quantitative estimates about objects or events -- is a common real-world skill, yet remains underexplored in large language model (LLM) research. We introduce three guesstimation datasets: MARBLES, FUTURE, and ELECPRED, spanning physical estimation (e.g., how many marbles fit in a cup) to abstract predictions (e.g., the 2024 U.S. presidential electio…

    Submitted 23 September, 2025; v1 submitted 28 January, 2025; originally announced January 2025.

  16. arXiv:2501.14249  [pdf, ps, other]

    cs.LG cs.AI cs.CL

    Humanity's Last Exam

    Authors: Long Phan, Alice Gatti, Ziwen Han, Nathaniel Li, Josephina Hu, Hugh Zhang, Chen Bo Calvin Zhang, Mohamed Shaaban, John Ling, Sean Shi, Michael Choi, Anish Agrawal, Arnav Chopra, Adam Khoja, Ryan Kim, Richard Ren, Jason Hausenloy, Oliver Zhang, Mantas Mazeika, Dmitry Dodonov, Tung Nguyen, Jaeho Lee, Daron Anderson, Mikhail Doroshenko, Alun Cennyth Stokes, et al. (1087 additional authors not shown)

    Abstract: Benchmarks are important tools for tracking the rapid advancements in large language model (LLM) capabilities. However, benchmarks are not keeping pace in difficulty: LLMs now achieve over 90% accuracy on popular benchmarks like MMLU, limiting informed measurement of state-of-the-art LLM capabilities. In response, we introduce Humanity's Last Exam (HLE), a multi-modal benchmark at the frontier of…

    Submitted 25 September, 2025; v1 submitted 24 January, 2025; originally announced January 2025.

    Comments: 29 pages, 6 figures

  17. arXiv:2412.05868  [pdf]

    cs.IR cs.AI cs.CL

    Automated Extraction and Creation of FBS Design Reasoning Knowledge Graphs from Structured Data in Product Catalogues Lacking Contextual Information

    Authors: Vijayalaxmi Sahadevan, Sushil Mario, Yash Jaiswal, Divyanshu Bajpai, Vishal Singh, Hiralal Aggarwal, Suhas Suresh, Manjunath Maigur

    Abstract: Ontology-based knowledge graphs (KG) are desirable for effective knowledge management and reuse in various decision making scenarios, including design. Creating and populating extensive KG based on specific ontological models can be highly labour and time-intensive unless automated processes are developed for knowledge extraction and graph creation. Most research and development on automated extra…

    Submitted 8 December, 2024; originally announced December 2024.

    Comments: 31 pages, with 17 figures and 10 tables

  18. arXiv:2411.16511  [pdf, other]

    cs.RO

    Use-Inspired Mobile Robot to Improve Safety of Building Retrofit Workforce in Constrained Spaces

    Authors: Smruti Suresh, Michael Angelo Carvajal, Nathaniel Hanson, Ethan Holand, Samuel Hibbard, Taskin Padir

    Abstract: The inspection of confined critical infrastructure such as attics or crawlspaces is challenging for human operators due to insufficient task space, limited visibility, and the presence of hazardous materials. This paper introduces a prototype of PARIS (Precision Application Robot for Inaccessible Spaces): a use-inspired teleoperated mobile robot manipulator system that was conceived, developed, an…

    Submitted 25 November, 2024; originally announced November 2024.

    Comments: 6 Pages, 7 Figures. Accepted for publication in the Proceedings of 2024 IEEE International Symposium on Safety, Security, and Rescue Robotics (SSRR)

  19. Single-shot Distinguishability and Anti-distinguishability of Quantum Measurements

    Authors: Satyaki Manna, Sneha Suresh, Manan Singh Kachhawaha, Debashis Saha

    Abstract: Among the surprising features of quantum measurements, the problem of distinguishing and antidistinguishing general quantum measurements is fundamentally appealing. Unlike classical systems, quantum theory offers entangled states and a peculiar state-update rule for the post-measurement state, which gives rise to four distinct scenarios: (i) probing single systems and without access to the Post-measu…

    Submitted 17 August, 2025; v1 submitted 14 October, 2024; originally announced October 2024.

    Comments: 22 pages, 11 figures. Close to the published version

    Journal ref: Phys. Rev. A 111, 022221 (2025)

  20. arXiv:2410.04236  [pdf, other]

    cs.CL cs.AI cs.LG

    Overview of Factify5WQA: Fact Verification through 5W Question-Answering

    Authors: Suryavardan Suresh, Anku Rani, Parth Patwa, Aishwarya Reganti, Vinija Jain, Aman Chadha, Amitava Das, Amit Sheth, Asif Ekbal

    Abstract: Researchers have found that fake news spreads many times faster than real news. This is a major problem, especially in today's world where social media is the key source of news for many among the younger population. Fact verification, thus, becomes an important task and many media sites contribute to the cause. Manual fact verification is a tedious task, given the volume of fake news online. The…

    Submitted 5 October, 2024; originally announced October 2024.

    Comments: Accepted at defactify3@aaai2024

  21. arXiv:2410.01790  [pdf, other]

    cs.RO

    Open Human-Robot Collaboration using Decentralized Inverse Reinforcement Learning

    Authors: Prasanth Sengadu Suresh, Siddarth Jain, Prashant Doshi, Diego Romeres

    Abstract: The growing interest in human-robot collaboration (HRC), where humans and robots cooperate towards shared goals, has seen significant advancements over the past decade. While previous research has addressed various challenges, several key issues remain unresolved. Many domains within HRC involve activities that do not necessarily require human presence throughout the entire task. Existing literatu…

    Submitted 2 October, 2024; originally announced October 2024.

  22. arXiv:2409.19020  [pdf, other]

    cs.CL cs.LG

    DiaSynth: Synthetic Dialogue Generation Framework for Low Resource Dialogue Applications

    Authors: Sathya Krishnan Suresh, Wu Mengjun, Tushar Pranav, Eng Siong Chng

    Abstract: The scarcity of domain-specific dialogue datasets limits the development of dialogue systems across applications. Existing research is constrained by general or niche datasets that lack sufficient scale for training dialogue systems. To address this gap, we introduce DiaSynth - a synthetic dialogue generation framework capable of generating high-quality, contextually rich dialogues across a wide r…

    Submitted 10 February, 2025; v1 submitted 25 September, 2024; originally announced September 2024.

    Comments: 13 pages, 1 figure

    Journal ref: NAACL 2025

  23. arXiv:2409.15027  [pdf, other]

    cs.CL cs.AI

    Generative LLM Powered Conversational AI Application for Personalized Risk Assessment: A Case Study in COVID-19

    Authors: Mohammad Amin Roshani, Xiangyu Zhou, Yao Qiang, Srinivasan Suresh, Steve Hicks, Usha Sethuraman, Dongxiao Zhu

    Abstract: Large language models (LLMs) have shown remarkable capabilities in various natural language tasks and are increasingly being applied in healthcare domains. This work demonstrates a new LLM-powered disease risk assessment approach via streaming human-AI conversation, eliminating the need for programming required by traditional machine learning approaches. In a COVID-19 severity risk assessment case…

    Submitted 23 September, 2024; originally announced September 2024.

  24. arXiv:2409.13171  [pdf, other]

    eess.IV cs.CV cs.LG

    Deep Learning based Optical Image Super-Resolution via Generative Diffusion Models for Layerwise in-situ LPBF Monitoring

    Authors: Francis Ogoke, Sumesh Kalambettu Suresh, Jesse Adamczyk, Dan Bolintineanu, Anthony Garland, Michael Heiden, Amir Barati Farimani

    Abstract: The stochastic formation of defects during Laser Powder Bed Fusion (L-PBF) negatively impacts its adoption for high-precision use cases. Optical monitoring techniques can be used to identify defects based on layer-wise imaging, but these methods are difficult to scale to high resolutions due to cost and memory constraints. Therefore, we implement generative deep learning models to link low-cost, l…

    Submitted 19 September, 2024; originally announced September 2024.

  25. arXiv:2407.17891  [pdf, other]

    astro-ph.GA astro-ph.SR

    Role of NH3 Binding Energy in the Early Evolution of Protostellar Cores

    Authors: S. Kakkenpara Suresh, O. Sipila, P. Caselli, F. Dulieu

    Abstract: NH$_{3}$ (ammonia) plays a critical role in the chemistry of star and planet formation, yet uncertainties in its binding energy (BE) values complicate accurate estimates of its abundances. Recent research suggests a multi-binding energy approach, challenging the previous single-value notion. In this work, we use different values of NH$_{3}$ binding energy to examine its effects on the NH$_{3}$ abun…

    Submitted 25 July, 2024; originally announced July 2024.

    Journal ref: A&A 696, A71 (2025)

  26. arXiv:2406.10522  [pdf, other]

    cs.LG cs.AI cs.CL

    Humor in AI: Massive Scale Crowd-Sourced Preferences and Benchmarks for Cartoon Captioning

    Authors: Jifan Zhang, Lalit Jain, Yang Guo, Jiayi Chen, Kuan Lok Zhou, Siddharth Suresh, Andrew Wagenmaker, Scott Sievert, Timothy Rogers, Kevin Jamieson, Robert Mankoff, Robert Nowak

    Abstract: We present a novel multimodal preference dataset for creative tasks, consisting of over 250 million human ratings on more than 2.2 million captions, collected through crowdsourcing rating data for The New Yorker's weekly cartoon caption contest over the past eight years. This unique dataset supports the development and evaluation of multimodal large language models and preference-based fine-tuning…

    Submitted 18 December, 2024; v1 submitted 15 June, 2024; originally announced June 2024.

  27. Learning from metastable grain boundaries

    Authors: Avanish Mishra, Sumit A. Suresh, Saryu J. Fensin, Nithin Mathew, Edward M. Kober

    Abstract: Grain boundaries (GBs) govern critical properties of polycrystals. Although significant advancements have been made in characterizing minimum energy GBs, real GBs are seldom found in such states, making it challenging to establish structure-property relationships. This diversity of atomic arrangements in metastable states motivates using data-driven methods to establish these relationships. In thi…

    Submitted 31 May, 2024; originally announced June 2024.

  28. arXiv:2405.03029  [pdf, other]

    cs.CE

    Optimal Box Contraction for Solving Linear Systems via Simulated and Quantum Annealing

    Authors: Sanjay Suresh, Krishnan Suresh

    Abstract: Solving linear systems of equations is an important problem in science and engineering. Many quantum algorithms, such as the Harrow-Hassidim-Lloyd (HHL) algorithm (for quantum-gate computers) and the box algorithm (for quantum-annealing machines), have been proposed for solving such systems. The focus of this paper is on improving the efficiency of the box algorithm. The basic principle behind t…

    Submitted 5 May, 2024; originally announced May 2024.

  29. arXiv:2404.14462  [pdf, other]

    cs.LG

    Towards smaller, faster decoder-only transformers: Architectural variants and their implications

    Authors: Sathya Krishnan Suresh, Shunmugapriya P

    Abstract: In recent times, the research on Large Language Models (LLMs) has grown exponentially, predominantly focusing on models underpinned by the transformer architecture, as established by [1], and further developed through the decoder-only variations by [2]. Contemporary efforts in this field primarily aim to enhance model capabilities by scaling up both the architecture and data volumes utilized durin…

    Submitted 8 October, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

    Comments: 10 pages, 6 figures

  30. arXiv:2402.18105  [pdf, ps, other]

    stat.ME

    JEL ratio test for independence between a continuous and a categorical random variable

    Authors: Saparya Suresh, Sudheesh K. Kattumannil

    Abstract: The categorical Gini covariance is a dependence measure between a numerical variable and a categorical variable. The Gini covariance measures dependence by quantifying the difference between the conditional and unconditional distributional functions. The categorical Gini covariance equals zero if and only if the numerical variable and the categorical variable are independent. We propose a non-para…

    Submitted 19 September, 2025; v1 submitted 28 February, 2024; originally announced February 2024.

    Comments: This is the first test developed in this direction

  31. arXiv:2402.05086  [pdf, other]

    physics.optics physics.med-ph

    Hyperspectral acquisition with ScanImage at the single pixel level: Application to time domain coherent Raman imaging

    Authors: Samuel Metais, Sisira Suresh, Paulo Diniz, Siddarth Shivkumar, Randy Bartels, Nicolas Forget, Hervé Rigneault

    Abstract: We present a comprehensive strategy and its practical implementation using the commercial ScanImage software platform to perform hyperspectral point scanning microscopy when a fast time dependent signal varies at each pixel level. In the proposed acquisition scheme the scan along the X axis is slowed down while the data acquisition is maintained at a high pace to enable the rapid acquisition of the…

    Submitted 7 February, 2024; originally announced February 2024.

  32. arXiv:2401.09816  [pdf, other]

    stat.ME

    Jackknife empirical likelihood ratio test for testing the equality of semivariance

    Authors: Saparya Suresh, Sudheesh K. Kattumannil

    Abstract: Semivariance is a measure of the dispersion of all observations that fall above the mean or target value of a random variable and it plays an important role in life-length, actuarial and income studies. In this paper, we develop a new non-parametric test for equality of upper semi-variance. We use the U-statistic theory to derive the test statistic and then study the asymptotic properties of the t…

    Submitted 18 January, 2024; originally announced January 2024.

    MSC Class: 62G10

  33. arXiv:2312.13469  [pdf, other]

    cs.RO cs.CV cs.LG

    Neural feels with neural fields: Visuo-tactile perception for in-hand manipulation

    Authors: Sudharshan Suresh, Haozhi Qi, Tingfan Wu, Taosha Fan, Luis Pineda, Mike Lambeta, Jitendra Malik, Mrinal Kalakrishnan, Roberto Calandra, Michael Kaess, Joseph Ortiz, Mustafa Mukadam

    Abstract: To achieve human-level dexterity, robots must infer spatial awareness from multimodal sensing to reason over contact interactions. During in-hand manipulation of novel objects, such spatial awareness involves estimating the object's pose and shape. The status quo for in-hand perception primarily employs vision, and restricts to tracking a priori known objects. Moreover, visual occlusion of objects…

    Submitted 20 December, 2023; originally announced December 2023.

    Comments: 43 pages, 20 figures, 1 table; https://suddhu.github.io/neural-feels/

  34. arXiv:2311.18619  [pdf, other]

    astro-ph.GA astro-ph.SR

    Experimental study of the binding energy of NH3 on different types of ice and its impact on the snow line of NH3 and H2O

    Authors: S. Kakkenpara Suresh, F. Dulieu, J. Vitorino, P. Caselli

    Abstract: N-bearing molecules (like N2H+ or NH3) are excellent tracers of high-density, low-temperature regions like dense cloud cores and could shed light on snowlines in protoplanetary disks and the chemical evolution of comets. However, uncertainties exist about the grain surface chemistry of these molecules -- which could play an important role in their formation and evolution. This study explores exp…

    Submitted 30 November, 2023; originally announced November 2023.

  35. arXiv:2311.10127  [pdf, other]

    cs.AI cs.HC cs.LG

    Learning interactions to boost human creativity with bandits and GPT-4

    Authors: Ara Vartanian, Xiaoxi Sun, Yun-Shiuan Chuang, Siddharth Suresh, Xiaojin Zhu, Timothy T. Rogers

    Abstract: This paper considers how interactions with AI algorithms can boost human creative thought. We employ a psychological task that demonstrates limits on human creativity, namely semantic feature generation: given a concept name, respondents must list as many of its features as possible. Human participants typically produce only a fraction of the features they know before getting "stuck." In experimen…

    Submitted 16 November, 2023; originally announced November 2023.

  36. arXiv:2311.09947  [pdf]

    cs.LG

    Natural Disaster Analysis using Satellite Imagery and Social-Media Data for Emergency Response Situations

    Authors: Sukeerthi Mandyam, Shanmuga Priya MG, Shalini Suresh, Kavitha Srinivasan

    Abstract: Disaster Management is one of the most promising research areas because of its significant economic, environmental and social repercussions. This research focuses on analyzing different types of data (pre and post satellite images and Twitter data) related to disaster management for in-depth analysis of location-wise emergency requirements. This research has been divided into two stages, namely, s…

    Submitted 16 November, 2023; originally announced November 2023.

  37. arXiv:2311.09665  [pdf, other]

    cs.CL

    The Wisdom of Partisan Crowds: Comparing Collective Intelligence in Humans and LLM-based Agents

    Authors: Yun-Shiuan Chuang, Siddharth Suresh, Nikunj Harlalka, Agam Goyal, Robert Hawkins, Sijia Yang, Dhavan Shah, Junjie Hu, Timothy T. Rogers

    Abstract: Human groups are able to converge on more accurate beliefs through deliberation, even in the presence of polarization and partisan bias -- a phenomenon known as the "wisdom of partisan crowds." Generated agents powered by Large Language Models (LLMs) are increasingly used to simulate human collective behavior, yet few benchmarks exist for evaluating their dynamics against the behavior of human gro…

    Submitted 16 February, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

  38. arXiv:2311.09618  [pdf, other]

    physics.soc-ph cs.CL

    Simulating Opinion Dynamics with Networks of LLM-based Agents

    Authors: Yun-Shiuan Chuang, Agam Goyal, Nikunj Harlalka, Siddharth Suresh, Robert Hawkins, Sijia Yang, Dhavan Shah, Junjie Hu, Timothy T. Rogers

    Abstract: Accurately simulating human opinion dynamics is crucial for understanding a variety of societal phenomena, including polarization and the spread of misinformation. However, the agent-based models (ABMs) commonly used for such simulations often over-simplify human behavior. We propose a new approach to simulating opinion dynamics based on populations of Large Language Models (LLMs). Our findings re…

    Submitted 31 March, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

  39. arXiv:2311.04592  [pdf, other]

    cs.LG cs.CV

    On Characterizing the Evolution of Embedding Space of Neural Networks using Algebraic Topology

    Authors: Suryaka Suresh, Bishshoy Das, Vinayak Abrol, Sumantra Dutta Roy

    Abstract: We study how the topology of feature embedding space changes as it passes through the layers of a well-trained deep neural network (DNN) through Betti numbers. Motivated by existing studies using simplicial complexes on shallow fully connected networks (FCN), we present an extended analysis using Cubical homology instead, with a variety of popular deep architectures and real image datasets. We dem…

    Submitted 9 November, 2023; v1 submitted 8 November, 2023; originally announced November 2023.

  40. arXiv:2310.14340  [pdf, other]

    cs.CL

    Social Commonsense-Guided Search Query Generation for Open-Domain Knowledge-Powered Conversations

    Authors: Revanth Gangi Reddy, Hao Bai, Wentao Yao, Sharath Chandra Etagi Suresh, Heng Ji, ChengXiang Zhai

    Abstract: Open-domain dialog involves generating search queries that help obtain relevant knowledge for holding informative conversations. However, it can be challenging to determine what information to retrieve when the user is passive and does not express a clear need or request. To tackle this issue, we present a novel approach that focuses on generating internet search queries that are guided by social…

    Submitted 22 October, 2023; originally announced October 2023.

    Comments: Accepted in EMNLP 2023 Findings

  41. arXiv:2310.02388  [pdf, other]

    math.NA cs.ET

    Computing a Sparse Approximate Inverse on Quantum Annealing Machines

    Authors: Sanjay Suresh, Krishnan Suresh

    Abstract: Many engineering problems involve solving large linear systems of equations. Conjugate gradient (CG) is one of the most popular iterative methods for solving such systems. However, CG typically requires a good preconditioner to speed up convergence. One such preconditioner is the sparse approximate inverse (SPAI). In this paper, we explore the computation of an SPAI on quantum annealing machines…

    Submitted 3 October, 2023; originally announced October 2023.

    Comments: 16 pages, 8 figures

  42. arXiv:2310.02273  [pdf, other]

    stat.ME math.ST

    A New measure of income inequality

    Authors: Sudheesh K Kattumannil, Saparya Suresh

    Abstract: A new measure of income inequality that captures the heavy tail behavior of the income distribution is proposed. We discuss two different approaches to find the estimators of the proposed measure. We show that these estimators are consistent and have an asymptotically normal distribution. We also obtain a jackknife empirical likelihood (JEL) confidence interval of the income inequality measure. A…

    Submitted 20 August, 2024; v1 submitted 28 September, 2023; originally announced October 2023.

  43. arXiv:2309.09979  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    General In-Hand Object Rotation with Vision and Touch

    Authors: Haozhi Qi, Brent Yi, Sudharshan Suresh, Mike Lambeta, Yi Ma, Roberto Calandra, Jitendra Malik

    Abstract: We introduce RotateIt, a system that enables fingertip-based object rotation along multiple axes by leveraging multimodal sensory inputs. Our system is trained in simulation, where it has access to ground-truth object shapes and physical properties. Then we distill it to operate on realistic yet noisy simulated visuotactile and proprioceptive sensory inputs. These multimodal inputs are fused via a…

    Submitted 28 September, 2023; v1 submitted 18 September, 2023; originally announced September 2023.

    Comments: CoRL 2023; Website: https://haozhi.io/rotateit/

  44. arXiv:2308.13773  [pdf, other]

    cs.LO cs.CR

    Solving the insecurity problem for assertions

    Authors: R Ramanujam, Vaishnavi Sundararajan, S P Suresh

    Abstract: In the symbolic verification of cryptographic protocols, a central problem is deciding whether a protocol admits an execution which leaks a designated secret to the malicious intruder. Rusinowitch & Turuani (2003) show that, when considering finitely many sessions, this "insecurity problem" is NP-complete. Central to their proof strategy is the observation that any execution of a protocol can be…

    Submitted 26 January, 2024; v1 submitted 26 August, 2023; originally announced August 2023.

  45. arXiv:2307.16393  [pdf, other]

    cs.RO

    Modular Self-Lock Origami: design, modeling, and simulation to improve the performance of a rotational joint

    Authors: Samira Zare, Alex Spaeth, Sandya Suresh, and Mircea Teodorescu

    Abstract: Origami structures have been widely explored in robotics due to their many potential advantages. Origami robots can be very compact, as well as cheap and efficient to produce. In particular, they can be constructed in a flat format using modern manufacturing techniques. Rotational motion is essential for robotics, and a variety of origami rotational joints have been proposed in the literature. How…

    Submitted 30 July, 2023; originally announced July 2023.

    Comments: 11 pages, 8 figures

  46. arXiv:2306.15236  [pdf, other]

    cond-mat.stat-mech cond-mat.soft physics.chem-ph physics.optics

    Towards Stirling engine using an optically confined particle subjected to asymmetric temperature profile

    Authors: Gokul Nalupurackal, Muruga Lokesh, Sarangi Suresh, Srestha Roy, Snigdhadev Chakraborty, Jayesh Goswami, Arnab Pal, Basudev Roy

    Abstract: The realization of microscopic heat engines has gained a surge of research interest in statistical physics, soft matter, and biological physics. A typical microscopic heat engine employs a colloidal particle trapped in a confining potential, which is modulated in time to mimic the cycle operations. Here, we use a lanthanide-doped upconverting particle (UCP) suspended in a passive aqueous bath, whi…

    Submitted 27 June, 2023; originally announced June 2023.

    Comments: For published version, see https://iopscience.iop.org/article/10.1088/1367-2630/acd94e/meta

    Journal ref: New J. Phys. 25 063001 (2023)

  47. arXiv:2304.05591  [pdf, other]

    cs.CL cs.AI cs.LG

    Semantic Feature Verification in FLAN-T5

    Authors: Siddharth Suresh, Kushin Mukherjee, Timothy T. Rogers

    Abstract: This study evaluates the potential of a large language model for aiding in generation of semantic feature norms - a critical tool for evaluating conceptual structure in cognitive science. Building from an existing human-generated dataset, we show that machine-verified norms capture aspects of conceptual structure beyond what is expressed in human norms alone, and better explain human judgments of…

    Submitted 11 April, 2023; originally announced April 2023.

    Comments: To appear as a Tiny Paper at ICLR 2023

  48. arXiv:2304.05012  [pdf, other]

    cs.CL cs.AI

    Human-machine cooperation for semantic feature listing

    Authors: Kushin Mukherjee, Siddharth Suresh, Timothy T. Rogers

    Abstract: Semantic feature norms, lists of features that concepts do and do not possess, have played a central role in characterizing human conceptual knowledge, but require extensive human labor. Large language models (LLMs) offer a novel avenue for the automatic generation of such feature lists, but are prone to significant error. Here, we present a new method for combining a learned model of human lexica…

    Submitted 11 April, 2023; originally announced April 2023.

    Comments: To be published in the ICLR TinyPaper track

  49. arXiv:2304.02754  [pdf, other]

    cs.AI cs.CL cs.LG

    Conceptual structure coheres in human cognition but not in large language models

    Authors: Siddharth Suresh, Kushin Mukherjee, Xizheng Yu, Wei-Chun Huang, Lisa Padua, Timothy T Rogers

    Abstract: Neural network models of language have long been used as a tool for developing hypotheses about conceptual representation in the mind and brain. For many years, such use involved extracting vector-space representations of words and using distances among these to predict or understand human behavior in various semantic tasks. Contemporary large language models (LLMs), however, make it possible to i…

    Submitted 10 November, 2023; v1 submitted 5 April, 2023; originally announced April 2023.

  50. arXiv:2304.01396  [pdf, other]

    cs.RO cs.CV

    Lidar based 3D Tracking and State Estimation of Dynamic Objects

    Authors: Patil Shubham Suresh, Gautham Narayan Narasimhan

    Abstract: State estimation of oncoming vehicles: Earlier research has been based on determining states like position, velocity, orientation, angular velocity, etc. of the ego-vehicle. Our approach focuses on estimating the states of non-ego vehicles, which is crucial for motion planning and decision-making. Dynamic Scene Based Localization: Our project will work on dynamic scenes like moving ego (self) and non-e…

    Submitted 3 April, 2023; originally announced April 2023.

    Comments: 6 pages, 12 figures, Carnegie Mellon University work