
Showing 1–50 of 62 results for author: Suresh, S

  1. arXiv:2502.20356  [pdf, other]

    cs.CL cs.AI cs.LG

    Bridging the Creativity Understanding Gap: Small-Scale Human Alignment Enables Expert-Level Humor Ranking in LLMs

    Authors: Kuan Lok Zhou, Jiayi Chen, Siddharth Suresh, Reuben Narad, Timothy T. Rogers, Lalit K Jain, Robert D Nowak, Bob Mankoff, Jifan Zhang

    Abstract: Large Language Models (LLMs) have shown significant limitations in understanding creative content, as demonstrated by Hessel et al. (2023)'s influential work on the New Yorker Cartoon Caption Contest (NYCCC). Their study exposed a substantial gap between LLMs and humans in humor comprehension, establishing that understanding and evaluating creative content is a key challenge in AI development. We re…

    Submitted 27 February, 2025; originally announced February 2025.

  2. arXiv:2501.17310  [pdf, other]

    cs.AI cs.HC

    Probing LLM World Models: Enhancing Guesstimation with Wisdom of Crowds Decoding

    Authors: Yun-Shiuan Chuang, Nikunj Harlalka, Sameer Narendran, Alexander Cheung, Sizhe Gao, Siddharth Suresh, Junjie Hu, Timothy T. Rogers

    Abstract: Guesstimation, the task of making approximate quantity estimates, is a common real-world challenge. However, it has been largely overlooked in large language models (LLMs) and vision language models (VLMs) research. We introduce a novel guesstimation dataset, MARBLES. This dataset requires one to estimate how many items (e.g., marbles) can fit into containers (e.g., a one-cup measuring cup), both…

    Submitted 30 January, 2025; v1 submitted 28 January, 2025; originally announced January 2025.

  3. arXiv:2501.14249  [pdf, other]

    cs.LG cs.AI cs.CL

    Humanity's Last Exam

    Authors: Long Phan, Alice Gatti, Ziwen Han, Nathaniel Li, Josephina Hu, Hugh Zhang, Chen Bo Calvin Zhang, Mohamed Shaaban, John Ling, Sean Shi, Michael Choi, Anish Agrawal, Arnav Chopra, Adam Khoja, Ryan Kim, Richard Ren, Jason Hausenloy, Oliver Zhang, Mantas Mazeika, Dmitry Dodonov, Tung Nguyen, Jaeho Lee, Daron Anderson, Mikhail Doroshenko, Alun Cennyth Stokes, et al. (1084 additional authors not shown)

    Abstract: Benchmarks are important tools for tracking the rapid advancements in large language model (LLM) capabilities. However, benchmarks are not keeping pace in difficulty: LLMs now achieve over 90% accuracy on popular benchmarks like MMLU, limiting informed measurement of state-of-the-art LLM capabilities. In response, we introduce Humanity's Last Exam (HLE), a multi-modal benchmark at the frontier of…

    Submitted 19 April, 2025; v1 submitted 24 January, 2025; originally announced January 2025.

    Comments: 29 pages, 6 figures

  4. arXiv:2412.05868  [pdf]

    cs.IR cs.AI cs.CL

    Automated Extraction and Creation of FBS Design Reasoning Knowledge Graphs from Structured Data in Product Catalogues Lacking Contextual Information

    Authors: Vijayalaxmi Sahadevan, Sushil Mario, Yash Jaiswal, Divyanshu Bajpai, Vishal Singh, Hiralal Aggarwal, Suhas Suresh, Manjunath Maigur

    Abstract: Ontology-based knowledge graphs (KG) are desirable for effective knowledge management and reuse in various decision making scenarios, including design. Creating and populating extensive KG based on specific ontological models can be highly labour and time-intensive unless automated processes are developed for knowledge extraction and graph creation. Most research and development on automated extra…

    Submitted 8 December, 2024; originally announced December 2024.

    Comments: 31 pages, with 17 figures and 10 tables

  5. arXiv:2411.16511  [pdf, other]

    cs.RO

    Use-Inspired Mobile Robot to Improve Safety of Building Retrofit Workforce in Constrained Spaces

    Authors: Smruti Suresh, Michael Angelo Carvajal, Nathaniel Hanson, Ethan Holand, Samuel Hibbard, Taskin Padir

    Abstract: The inspection of confined critical infrastructure such as attics or crawlspaces is challenging for human operators due to insufficient task space, limited visibility, and the presence of hazardous materials. This paper introduces a prototype of PARIS (Precision Application Robot for Inaccessible Spaces): a use-inspired teleoperated mobile robot manipulator system that was conceived, developed, an…

    Submitted 25 November, 2024; originally announced November 2024.

    Comments: 6 Pages, 7 Figures. Accepted for publication in the Proceedings of 2024 IEEE International Symposium on Safety, Security, and Rescue Robotics (SSRR)

  6. arXiv:2410.04236  [pdf, other]

    cs.CL cs.AI cs.LG

    Overview of Factify5WQA: Fact Verification through 5W Question-Answering

    Authors: Suryavardan Suresh, Anku Rani, Parth Patwa, Aishwarya Reganti, Vinija Jain, Aman Chadha, Amitava Das, Amit Sheth, Asif Ekbal

    Abstract: Researchers have found that fake news spreads many times faster than real news. This is a major problem, especially in today's world where social media is the key source of news for many among the younger population. Fact verification, thus, becomes an important task and many media sites contribute to the cause. Manual fact verification is a tedious task, given the volume of fake news online. The…

    Submitted 5 October, 2024; originally announced October 2024.

    Comments: Accepted at defactify3@aaai2024

  7. arXiv:2410.01790  [pdf, other]

    cs.RO

    Open Human-Robot Collaboration using Decentralized Inverse Reinforcement Learning

    Authors: Prasanth Sengadu Suresh, Siddarth Jain, Prashant Doshi, Diego Romeres

    Abstract: The growing interest in human-robot collaboration (HRC), where humans and robots cooperate towards shared goals, has seen significant advancements over the past decade. While previous research has addressed various challenges, several key issues remain unresolved. Many domains within HRC involve activities that do not necessarily require human presence throughout the entire task. Existing literatu…

    Submitted 2 October, 2024; originally announced October 2024.

  8. arXiv:2409.19020  [pdf, other]

    cs.CL cs.LG

    DiaSynth: Synthetic Dialogue Generation Framework for Low Resource Dialogue Applications

    Authors: Sathya Krishnan Suresh, Wu Mengjun, Tushar Pranav, Eng Siong Chng

    Abstract: The scarcity of domain-specific dialogue datasets limits the development of dialogue systems across applications. Existing research is constrained by general or niche datasets that lack sufficient scale for training dialogue systems. To address this gap, we introduce DiaSynth - a synthetic dialogue generation framework capable of generating high-quality, contextually rich dialogues across a wide r…

    Submitted 10 February, 2025; v1 submitted 25 September, 2024; originally announced September 2024.

    Comments: 13 pages, 1 figure

    Journal ref: NAACL 2025

  9. arXiv:2409.15027  [pdf, other]

    cs.CL cs.AI

    Generative LLM Powered Conversational AI Application for Personalized Risk Assessment: A Case Study in COVID-19

    Authors: Mohammad Amin Roshani, Xiangyu Zhou, Yao Qiang, Srinivasan Suresh, Steve Hicks, Usha Sethuraman, Dongxiao Zhu

    Abstract: Large language models (LLMs) have shown remarkable capabilities in various natural language tasks and are increasingly being applied in healthcare domains. This work demonstrates a new LLM-powered disease risk assessment approach via streaming human-AI conversation, eliminating the need for programming required by traditional machine learning approaches. In a COVID-19 severity risk assessment case…

    Submitted 23 September, 2024; originally announced September 2024.

  10. arXiv:2409.13171  [pdf, other]

    eess.IV cs.CV cs.LG

    Deep Learning based Optical Image Super-Resolution via Generative Diffusion Models for Layerwise in-situ LPBF Monitoring

    Authors: Francis Ogoke, Sumesh Kalambettu Suresh, Jesse Adamczyk, Dan Bolintineanu, Anthony Garland, Michael Heiden, Amir Barati Farimani

    Abstract: The stochastic formation of defects during Laser Powder Bed Fusion (L-PBF) negatively impacts its adoption for high-precision use cases. Optical monitoring techniques can be used to identify defects based on layer-wise imaging, but these methods are difficult to scale to high resolutions due to cost and memory constraints. Therefore, we implement generative deep learning models to link low-cost, l…

    Submitted 19 September, 2024; originally announced September 2024.

  11. arXiv:2406.10522  [pdf, other]

    cs.LG cs.AI cs.CL

    Humor in AI: Massive Scale Crowd-Sourced Preferences and Benchmarks for Cartoon Captioning

    Authors: Jifan Zhang, Lalit Jain, Yang Guo, Jiayi Chen, Kuan Lok Zhou, Siddharth Suresh, Andrew Wagenmaker, Scott Sievert, Timothy Rogers, Kevin Jamieson, Robert Mankoff, Robert Nowak

    Abstract: We present a novel multimodal preference dataset for creative tasks, consisting of over 250 million human ratings on more than 2.2 million captions, collected through crowdsourcing rating data for The New Yorker's weekly cartoon caption contest over the past eight years. This unique dataset supports the development and evaluation of multimodal large language models and preference-based fine-tuning…

    Submitted 18 December, 2024; v1 submitted 15 June, 2024; originally announced June 2024.

  12. arXiv:2405.03029  [pdf, other]

    cs.CE

    Optimal Box Contraction for Solving Linear Systems via Simulated and Quantum Annealing

    Authors: Sanjay Suresh, Krishnan Suresh

    Abstract: Solving linear systems of equations is an important problem in science and engineering. Many quantum algorithms, such as the Harrow-Hassidim-Lloyd (HHL) algorithm (for quantum-gate computers) and the box algorithm (for quantum-annealing machines), have been proposed for solving such systems. The focus of this paper is on improving the efficiency of the box algorithm. The basic principle behind t…

    Submitted 5 May, 2024; originally announced May 2024.

  13. arXiv:2404.14462  [pdf, other]

    cs.LG

    Towards smaller, faster decoder-only transformers: Architectural variants and their implications

    Authors: Sathya Krishnan Suresh, Shunmugapriya P

    Abstract: In recent times, the research on Large Language Models (LLMs) has grown exponentially, predominantly focusing on models underpinned by the transformer architecture, as established by [1], and further developed through the decoder-only variations by [2]. Contemporary efforts in this field primarily aim to enhance model capabilities by scaling up both the architecture and data volumes utilized durin…

    Submitted 8 October, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

    Comments: 10 pages, 6 figures

  14. arXiv:2312.13469  [pdf, other]

    cs.RO cs.CV cs.LG

    Neural feels with neural fields: Visuo-tactile perception for in-hand manipulation

    Authors: Sudharshan Suresh, Haozhi Qi, Tingfan Wu, Taosha Fan, Luis Pineda, Mike Lambeta, Jitendra Malik, Mrinal Kalakrishnan, Roberto Calandra, Michael Kaess, Joseph Ortiz, Mustafa Mukadam

    Abstract: To achieve human-level dexterity, robots must infer spatial awareness from multimodal sensing to reason over contact interactions. During in-hand manipulation of novel objects, such spatial awareness involves estimating the object's pose and shape. The status quo for in-hand perception primarily employs vision, and restricts to tracking a priori known objects. Moreover, visual occlusion of objects…

    Submitted 20 December, 2023; originally announced December 2023.

    Comments: 43 pages, 20 figures, 1 table; https://suddhu.github.io/neural-feels/

  15. arXiv:2311.10127  [pdf, other]

    cs.AI cs.HC cs.LG

    Learning interactions to boost human creativity with bandits and GPT-4

    Authors: Ara Vartanian, Xiaoxi Sun, Yun-Shiuan Chuang, Siddharth Suresh, Xiaojin Zhu, Timothy T. Rogers

    Abstract: This paper considers how interactions with AI algorithms can boost human creative thought. We employ a psychological task that demonstrates limits on human creativity, namely semantic feature generation: given a concept name, respondents must list as many of its features as possible. Human participants typically produce only a fraction of the features they know before getting "stuck." In experimen…

    Submitted 16 November, 2023; originally announced November 2023.

  16. arXiv:2311.09947  [pdf]

    cs.LG

    Natural Disaster Analysis using Satellite Imagery and Social-Media Data for Emergency Response Situations

    Authors: Sukeerthi Mandyam, Shanmuga Priya MG, Shalini Suresh, Kavitha Srinivasan

    Abstract: Disaster Management is one of the most promising research areas because of its significant economic, environmental and social repercussions. This research focuses on analyzing different types of data (pre and post satellite images and twitter data) related to disaster management for in-depth analysis of location-wise emergency requirements. This research has been divided into two stages, namely, s…

    Submitted 16 November, 2023; originally announced November 2023.

  17. arXiv:2311.09665  [pdf, other]

    cs.CL

    The Wisdom of Partisan Crowds: Comparing Collective Intelligence in Humans and LLM-based Agents

    Authors: Yun-Shiuan Chuang, Siddharth Suresh, Nikunj Harlalka, Agam Goyal, Robert Hawkins, Sijia Yang, Dhavan Shah, Junjie Hu, Timothy T. Rogers

    Abstract: Human groups are able to converge on more accurate beliefs through deliberation, even in the presence of polarization and partisan bias -- a phenomenon known as the "wisdom of partisan crowds." Generated agents powered by Large Language Models (LLMs) are increasingly used to simulate human collective behavior, yet few benchmarks exist for evaluating their dynamics against the behavior of human gro…

    Submitted 16 February, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

  18. arXiv:2311.09618  [pdf, other]

    physics.soc-ph cs.CL

    Simulating Opinion Dynamics with Networks of LLM-based Agents

    Authors: Yun-Shiuan Chuang, Agam Goyal, Nikunj Harlalka, Siddharth Suresh, Robert Hawkins, Sijia Yang, Dhavan Shah, Junjie Hu, Timothy T. Rogers

    Abstract: Accurately simulating human opinion dynamics is crucial for understanding a variety of societal phenomena, including polarization and the spread of misinformation. However, the agent-based models (ABMs) commonly used for such simulations often over-simplify human behavior. We propose a new approach to simulating opinion dynamics based on populations of Large Language Models (LLMs). Our findings re…

    Submitted 31 March, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

  19. arXiv:2311.04592  [pdf, other]

    cs.LG cs.CV

    On Characterizing the Evolution of Embedding Space of Neural Networks using Algebraic Topology

    Authors: Suryaka Suresh, Bishshoy Das, Vinayak Abrol, Sumantra Dutta Roy

    Abstract: We study how the topology of feature embedding space changes as it passes through the layers of a well-trained deep neural network (DNN) through Betti numbers. Motivated by existing studies using simplicial complexes on shallow fully connected networks (FCN), we present an extended analysis using Cubical homology instead, with a variety of popular deep architectures and real image datasets. We dem…

    Submitted 9 November, 2023; v1 submitted 8 November, 2023; originally announced November 2023.

  20. arXiv:2310.14340  [pdf, other]

    cs.CL

    Social Commonsense-Guided Search Query Generation for Open-Domain Knowledge-Powered Conversations

    Authors: Revanth Gangi Reddy, Hao Bai, Wentao Yao, Sharath Chandra Etagi Suresh, Heng Ji, ChengXiang Zhai

    Abstract: Open-domain dialog involves generating search queries that help obtain relevant knowledge for holding informative conversations. However, it can be challenging to determine what information to retrieve when the user is passive and does not express a clear need or request. To tackle this issue, we present a novel approach that focuses on generating internet search queries that are guided by social…

    Submitted 22 October, 2023; originally announced October 2023.

    Comments: Accepted in EMNLP 2023 Findings

  21. arXiv:2310.02388  [pdf, other]

    math.NA cs.ET

    Computing a Sparse Approximate Inverse on Quantum Annealing Machines

    Authors: Sanjay Suresh, Krishnan Suresh

    Abstract: Many engineering problems involve solving large linear systems of equations. Conjugate gradient (CG) is one of the most popular iterative methods for solving such systems. However, CG typically requires a good preconditioner to speed up convergence. One such preconditioner is the sparse approximate inverse (SPAI). In this paper, we explore the computation of an SPAI on quantum annealing machines…

    Submitted 3 October, 2023; originally announced October 2023.

    Comments: 16 pages, 8 figures

  22. arXiv:2309.09979  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    General In-Hand Object Rotation with Vision and Touch

    Authors: Haozhi Qi, Brent Yi, Sudharshan Suresh, Mike Lambeta, Yi Ma, Roberto Calandra, Jitendra Malik

    Abstract: We introduce RotateIt, a system that enables fingertip-based object rotation along multiple axes by leveraging multimodal sensory inputs. Our system is trained in simulation, where it has access to ground-truth object shapes and physical properties. Then we distill it to operate on realistic yet noisy simulated visuotactile and proprioceptive sensory inputs. These multimodal inputs are fused via a…

    Submitted 28 September, 2023; v1 submitted 18 September, 2023; originally announced September 2023.

    Comments: CoRL 2023; Website: https://haozhi.io/rotateit/

  23. arXiv:2308.13773  [pdf, other]

    cs.LO cs.CR

    Solving the insecurity problem for assertions

    Authors: R Ramanujam, Vaishnavi Sundararajan, S P Suresh

    Abstract: In the symbolic verification of cryptographic protocols, a central problem is deciding whether a protocol admits an execution which leaks a designated secret to the malicious intruder. Rusinowitch & Turuani (2003) show that, when considering finitely many sessions, this "insecurity problem" is NP-complete. Central to their proof strategy is the observation that any execution of a protocol can be…

    Submitted 26 January, 2024; v1 submitted 26 August, 2023; originally announced August 2023.

  24. arXiv:2307.16393  [pdf, other]

    cs.RO

    Modular Self-Lock Origami: design, modeling, and simulation to improve the performance of a rotational joint

    Authors: Samira Zare, Alex Spaeth, Sandya Suresh, Mircea Teodorescu

    Abstract: Origami structures have been widely explored in robotics due to their many potential advantages. Origami robots can be very compact, as well as cheap and efficient to produce. In particular, they can be constructed in a flat format using modern manufacturing techniques. Rotational motion is essential for robotics, and a variety of origami rotational joints have been proposed in the literature. How…

    Submitted 30 July, 2023; originally announced July 2023.

    Comments: 11 pages, 8 figures

  25. arXiv:2304.05591  [pdf, other]

    cs.CL cs.AI cs.LG

    Semantic Feature Verification in FLAN-T5

    Authors: Siddharth Suresh, Kushin Mukherjee, Timothy T. Rogers

    Abstract: This study evaluates the potential of a large language model for aiding in generation of semantic feature norms - a critical tool for evaluating conceptual structure in cognitive science. Building from an existing human-generated dataset, we show that machine-verified norms capture aspects of conceptual structure beyond what is expressed in human norms alone, and better explain human judgments of…

    Submitted 11 April, 2023; originally announced April 2023.

    Comments: To appear as a Tiny Paper at ICLR 2023

  26. arXiv:2304.05012  [pdf, other]

    cs.CL cs.AI

    Human-machine cooperation for semantic feature listing

    Authors: Kushin Mukherjee, Siddharth Suresh, Timothy T. Rogers

    Abstract: Semantic feature norms, lists of features that concepts do and do not possess, have played a central role in characterizing human conceptual knowledge, but require extensive human labor. Large language models (LLMs) offer a novel avenue for the automatic generation of such feature lists, but are prone to significant error. Here, we present a new method for combining a learned model of human lexica…

    Submitted 11 April, 2023; originally announced April 2023.

    Comments: To be published in the ICLR TinyPaper track

  27. arXiv:2304.02754  [pdf, other]

    cs.AI cs.CL cs.LG

    Conceptual structure coheres in human cognition but not in large language models

    Authors: Siddharth Suresh, Kushin Mukherjee, Xizheng Yu, Wei-Chun Huang, Lisa Padua, Timothy T Rogers

    Abstract: Neural network models of language have long been used as a tool for developing hypotheses about conceptual representation in the mind and brain. For many years, such use involved extracting vector-space representations of words and using distances among these to predict or understand human behavior in various semantic tasks. Contemporary large language models (LLMs), however, make it possible to i…

    Submitted 10 November, 2023; v1 submitted 5 April, 2023; originally announced April 2023.

  28. arXiv:2304.01396  [pdf, other]

    cs.RO cs.CV

    Lidar based 3D Tracking and State Estimation of Dynamic Objects

    Authors: Patil Shubham Suresh, Gautham Narayan Narasimhan

    Abstract: State estimation of oncoming vehicles: Earlier research has been based on determining states like position, velocity, orientation, angular velocity, etc. of the ego-vehicle. Our approach focuses on estimating the states of non-ego vehicles, which is crucial for motion planning and decision-making. Dynamic Scene Based Localization: Our project will work on dynamic scenes like moving ego (self) and non-e…

    Submitted 3 April, 2023; originally announced April 2023.

    Comments: 6 pages, 12 figures, Carnegie Mellon University work

  29. arXiv:2210.15120  [pdf, other]

    cs.LG

    Federated Graph Representation Learning using Self-Supervision

    Authors: Susheel Suresh, Danny Godbout, Arko Mukherjee, Mayank Shrivastava, Jennifer Neville, Pan Li

    Abstract: Federated graph representation learning (FedGRL) brings the benefits of distributed training to graph structured data while simultaneously addressing some privacy and compliance concerns related to data curation. However, several interesting real-world graph data characteristics viz. label deficiency and downstream task heterogeneity are not taken into consideration in current FedGRL setups. In th…

    Submitted 26 October, 2022; originally announced October 2022.

    Comments: FedGraph'22 workshop (non archival) version. (https://sites.google.com/view/fedgraph2022/accepted-papers)

  30. arXiv:2210.14210  [pdf, other]

    cs.RO

    MidasTouch: Monte-Carlo inference over distributions across sliding touch

    Authors: Sudharshan Suresh, Zilin Si, Stuart Anderson, Michael Kaess, Mustafa Mukadam

    Abstract: We present MidasTouch, a tactile perception system for online global localization of a vision-based touch sensor sliding on an object surface. This framework takes in posed tactile images over time, and outputs an evolving distribution of sensor pose on the object's surface, without the need for visual priors. Our key insight is to estimate local surface geometry with tactile sensing, learn a comp…

    Submitted 25 October, 2022; originally announced October 2022.

    Comments: Accepted at CoRL 2022 (Oral). Project website: https://suddhu.github.io/midastouch-tactile/

  31. arXiv:2207.13664  [pdf]

    cs.HC

    Generic Approach to Visualization of Time Series Data

    Authors: Sathya Krishnan Suresh, Shunmugapriya P

    Abstract: Time series is a collection of data instances that are ordered according to a time stamp. Stock prices, temperature, etc. are examples of time series data in real life. Time series data are used for forecasting sales and predicting trends. Visualization is the process of visually representing data or the relationship between features of a data either in a two-dimensional plot or a three-dimensional pl…

    Submitted 24 April, 2024; v1 submitted 25 July, 2022; originally announced July 2022.

    Comments: 5 pages. International Conference on Engineering and Advancement in Technology-2022

  32. arXiv:2207.11649  [pdf, other]

    cs.PL cs.FL cs.LG

    OCTAL: Graph Representation Learning for LTL Model Checking

    Authors: Prasita Mukherjee, Haoteng Yin, Susheel Suresh, Tiark Rompf

    Abstract: Model Checking is widely applied in verifying the correctness of complex and concurrent systems against a specification. Pure symbolic approaches, while popular, still suffer from the state space explosion problem that makes them impractical for large scale systems and/or specifications. In this paper, we propose to use graph representation learning (GRL) for solving linear temporal logic (LTL) mod…

    Submitted 26 July, 2022; v1 submitted 23 July, 2022; originally announced July 2022.

    Comments: change the style of bibliography

  33. arXiv:2207.09372  [pdf, other]

    cs.RO cs.AI

    On Decentralizing Federated Reinforcement Learning in Multi-Robot Scenarios

    Authors: Jayprakash S. Nair, Divya D. Kulkarni, Ajitem Joshi, Sruthy Suresh

    Abstract: Federated Learning (FL) allows for collaboratively aggregating learned information across several computing devices and sharing the same amongst them, thereby tackling issues of privacy and the need of huge bandwidth. FL techniques generally use a central server or cloud for aggregating the models received from the devices. Such centralized FL techniques suffer from inherent problems such as failu…

    Submitted 7 September, 2022; v1 submitted 19 July, 2022; originally announced July 2022.

    Comments: Submitted to SEEDA 2022. This arxiv is a preprint and NOT the final version

  34. arXiv:2207.00725  [pdf, other]

    cs.MA

    Metacognitive Decision Making Framework for Multi-UAV Target Search Without Communication

    Authors: J. Senthilnath, K. Harikumar, S. Suresh

    Abstract: This paper presents a new Metacognitive Decision Making (MDM) framework inspired by human-like metacognitive principles. The MDM framework is incorporated in unmanned aerial vehicles (UAVs) deployed for decentralized stochastic search without communication for detecting stationary targets (fixed/sudden pop-up) and dynamic targets. The UAVs are equipped with multiple sensors (varying sensing capabi…

    Submitted 19 August, 2023; v1 submitted 1 July, 2022; originally announced July 2022.

    Comments: 12 pages, 9 figures, 9 tables

  35. arXiv:2205.04876  [pdf, other]

    stat.ML cs.LG

    Turtle Score -- Similarity Based Developer Analyzer

    Authors: Sanjjushri Varshini, Ponshriharini V, Santhosh Kannan, Snekha Suresh, Harshavardhan Ramesh, Rohith Mahadevan, Raja CSP Raman

    Abstract: In day-to-day life, a highly demanding task for IT companies is to find the right candidates who fit the companies' culture. This research aims to comprehend, analyze and automatically produce convincing outcomes to find a candidate who perfectly fits right in the company. Data is examined and collected for each employee who works in the IT domain focusing on their performance measure. This is don…

    Submitted 10 May, 2022; originally announced May 2022.

    Comments: 10 pages, 3 figures

  36. arXiv:2203.03018  [pdf, other]

    cs.RO eess.SY

    RAPTOR: Rapid Aerial Pickup and Transport of Objects by Robots

    Authors: Aurel Appius, Erik Bauer, Marc Blöchlinger, Aashi Kalra, Robin Oberson, Arman Raayatsanati, Pascal Strauch, Sarath Suresh, Marco von Salis, Robert K. Katzschmann

    Abstract: Rapid aerial grasping through robots can lead to many applications that utilize fast and dynamic picking and placing of objects. Rigid grippers traditionally used in aerial manipulators require high precision and specific object geometries for successful grasping. We propose RAPTOR, a quadcopter platform combined with a custom Fin Ray gripper to enable more flexible grasping of objects with differ…

    Submitted 5 August, 2022; v1 submitted 6 March, 2022; originally announced March 2022.

    Comments: 7 pages, 10 figures, accepted to IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2022. Video: https://youtu.be/KHkBlBABsC8

  37. arXiv:2202.04518  [pdf, ps, other]

    cs.CR cs.LO

    Insecurity problem for assertions remains in NP

    Authors: R. Ramanujam, Vaishnavi Sundararajan, S. P. Suresh

    Abstract: In the symbolic verification of cryptographic protocols, a central problem is deciding whether a protocol admits an execution which leaks a designated secret to the malicious intruder. Rusinowitch and Turuani (2003) show that, when considering finitely many sessions and a protocol model where only terms are communicated, this "insecurity problem" is NP-complete. Central to their proof strategy i…

    Submitted 25 January, 2023; v1 submitted 9 February, 2022; originally announced February 2022.

  38. arXiv:2201.09534  [pdf, other]

    cs.LG cs.AI

    PaRT: Parallel Learning Towards Robust and Transparent AI

    Authors: Mahsa Paknezhad, Hamsawardhini Rengarajan, Chenghao Yuan, Sujanya Suresh, Manas Gupta, Savitha Ramasamy, Hwee Kuan Lee

    Abstract: This paper takes a parallel learning approach for robust and transparent AI. A deep neural network is trained in parallel on multiple tasks, where each task is trained only on a subset of the network resources. Each subset consists of network segments, that can be combined and shared across specific tasks. Tasks can share resources with other tasks, while having independent task-related network re…

    Submitted 23 February, 2022; v1 submitted 24 January, 2022; originally announced January 2022.

  39. arXiv:2201.06941  [pdf, other]

    cs.CY cs.AI cs.LG

    Incremental Knowledge Tracing from Multiple Schools

    Authors: Sujanya Suresh, Savitha Ramasamy, P. N. Suganthan, Cheryl Sze Yin Wong

    Abstract: Knowledge tracing is the task of predicting a learner's future performance based on the history of the learner's performance. Current knowledge tracing models are built based on an extensive set of data that are collected from multiple schools. However, it is impossible to pool learner's data from all schools, due to data privacy and PDPA policies. Hence, this paper explores the feasibility of bui…

    Submitted 7 January, 2022; originally announced January 2022.

    Comments: In AAAI22 AI4EDU Workshop

  40. arXiv:2109.09884  [pdf, other]

    cs.RO

    ShapeMap 3-D: Efficient shape mapping through dense touch and vision

    Authors: Sudharshan Suresh, Zilin Si, Joshua G. Mangelson, Wenzhen Yuan, Michael Kaess

    Abstract: Knowledge of 3-D object shape is of great importance to robot manipulation tasks, but may not be readily available in unstructured environments. While vision is often occluded during robot-object interaction, high-resolution tactile sensors can give a dense local perspective of the object. However, tactile sensors have limited sensing area and the shape representation must faithfully approximate n…

    Submitted 10 March, 2022; v1 submitted 20 September, 2021; originally announced September 2021.

    Comments: Camera-ready version for the 2022 IEEE International Conference on Robotics and Automation (ICRA 2022). Modified PDF title

  41. arXiv:2109.07788  [pdf, other]

    cs.RO cs.AI cs.CV

    Marginal MAP Estimation for Inverse RL under Occlusion with Observer Noise

    Authors: Prasanth Sengadu Suresh, Prashant Doshi

    Abstract: We consider the problem of learning the behavioral preferences of an expert engaged in a task from noisy and partially-observable demonstrations. This is motivated by real-world applications such as a line robot learning from observing a human worker, where some observations are occluded by environmental objects that cannot be removed. Furthermore, robotic perception tends to be imperfect and nois…

    Submitted 16 September, 2021; originally announced September 2021.

  42. arXiv:2109.05329  [pdf, other]

    cs.DC

    MODC: Resilience for disaggregated memory architectures using task-based programming

    Authors: Kimberly Keeton, Sharad Singhal, Haris Volos, Yupu Zhang, Ramesh Chandra Chaurasiya, Clarete Riana Crasta, Sherin T George, Nagaraju K N, Mashood Abdulla K, Kavitha Natarajan, Porno Shome, Sanish Suresh

    Abstract: Disaggregated memory architectures provide benefits to applications beyond traditional scale out environments, such as independent scaling of compute and memory resources. They also provide an independent failure model, where computations or the compute nodes they run on may fail independently of the disaggregated memory; thus, data that's resident in the disaggregated memory is unaffected by the…

    Submitted 11 September, 2021; originally announced September 2021.

    Comments: 9 pages, 4 figures

    ACM Class: D.4.1; D.4.5; D.4.7; C.1.4; E.1

    Journal ref: Proceedings of 2nd Workshop on Resource Disaggregation and Serverless (WORDS'21), Co-located with ASPLOS'21, April 2021

  43. arXiv:2109.05112  [pdf, other]

    cs.CL

    Improved Latent Tree Induction with Distant Supervision via Span Constraints

    Authors: Zhiyang Xu, Andrew Drozdov, Jay Yoon Lee, Tim O'Gorman, Subendhu Rongali, Dylan Finkbeiner, Shilpa Suresh, Mohit Iyyer, Andrew McCallum

    Abstract: For over thirty years, researchers have developed and analyzed methods for latent tree induction as an approach for unsupervised syntactic parsing. Nonetheless, modern systems still do not perform well enough compared to their supervised counterparts to have any practical use as structural annotation of text. In this work, we present a technique that uses distant supervision in the form of span co…

    Submitted 1 November, 2021; v1 submitted 10 September, 2021; originally announced September 2021.

    Comments: EMNLP 2021

  44. arXiv:2107.08145  [pdf, other]

    physics.space-ph cs.CE cs.MS

    Refactoring the MPS/University of Chicago Radiative MHD(MURaM) Model for GPU/CPU Performance Portability Using OpenACC Directives

    Authors: Eric Wright, Damien Przybylski, Matthias Rempel, Cena Miller, Supreeth Suresh, Shiquan Su, Richard Loft, Sunita Chandrasekaran

    Abstract: The MURaM (Max Planck University of Chicago Radiative MHD) code is a solar atmosphere radiative MHD model that has been broadly applied to solar phenomena ranging from quiet to active sun, including eruptive events such as flares and coronal mass ejections. The treatment of physics is sufficiently realistic to allow for the synthesis of emission from visible light to extreme UV and X-rays, which i…

    Submitted 16 July, 2021; originally announced July 2021.

  45. Breaking the Limit of Graph Neural Networks by Improving the Assortativity of Graphs with Local Mixing Patterns

    Authors: Susheel Suresh, Vinith Budde, Jennifer Neville, Pan Li, Jianzhu Ma

    Abstract: Graph neural networks (GNNs) have achieved tremendous success on multiple graph-based learning tasks by fusing network structure and node features. Modern GNN models are built upon iterative aggregation of neighbor's/proximity features by message passing. Its prediction performance has been shown to be strongly bounded by assortative mixing in the graph, a key property wherein nodes with similar a…

    Submitted 11 June, 2021; originally announced June 2021.

    Comments: published in KDD 2021; 11 pages

  46. arXiv:2106.05819  [pdf, other]

    cs.LG cs.AI

    Adversarial Graph Augmentation to Improve Graph Contrastive Learning

    Authors: Susheel Suresh, Pan Li, Cong Hao, Jennifer Neville

    Abstract: Self-supervised learning of graph neural networks (GNN) is in great need because of the widespread label scarcity issue in real-world graph/network data. Graph contrastive learning (GCL), by training GNNs to maximize the correspondence between the representations of the same graph in its different augmented forms, may yield robust and transferable GNNs even without using labels. However, GNNs trai…

    Submitted 2 November, 2021; v1 submitted 10 June, 2021; originally announced June 2021.

    Comments: Accepted to NeurIPS 2021

  47. arXiv:2012.03847  [pdf]

    physics.ao-ph cs.LG physics.soc-ph

    Space observation on detoxing the unhealthy air quality during COVID-19 pandemic in India

    Authors: Prabhat Kumar, Rohit Kumar Kasera, S Suresh

    Abstract: The purpose of this study has extremely dedicated to exposing the correlation between coronavirus pandemic and space observation on unhealthy air quality in India. The world has undergone lockdown to break the chain of coronavirus infection. The Air Quality Index (AQI) has started to improve after the commencement of lockdown due to industrial and transportation sectors temporally closed. This stu…

    Submitted 4 November, 2020; originally announced December 2020.

    Comments: 06 pages, 5 figures, 1 table

  48. arXiv:2011.07044  [pdf, other]

    cs.RO

    Tactile SLAM: Real-time inference of shape and pose from planar pushing

    Authors: Sudharshan Suresh, Maria Bauza, Kuan-Ting Yu, Joshua G. Mangelson, Alberto Rodriguez, Michael Kaess

    Abstract: Tactile perception is central to robot manipulation in unstructured environments. However, it requires contact, and a mature implementation must infer object models while also accounting for the motion induced by the interaction. In this work, we present a method to estimate both object shape and pose in real-time from a stream of tactile measurements. This is applied towards tactile exploration o…

    Submitted 26 March, 2021; v1 submitted 13 November, 2020; originally announced November 2020.

    Comments: Camera-ready version to be presented at the 2021 IEEE International Conference on Robotics and Automation (ICRA 2021). For associated video file, see https://youtu.be/wdyagx5MM40

  49. arXiv:2009.10800  [pdf, other]

    cs.AI cs.LG

    A Hybrid Model for Learning Embeddings and Logical Rules Simultaneously from Knowledge Graphs

    Authors: Susheel Suresh, Jennifer Neville

    Abstract: The problem of knowledge graph (KG) reasoning has been widely explored by traditional rule-based systems and more recently by knowledge graph embedding methods. While logical rules can capture deterministic behavior in a KG they are brittle and mining ones that infer facts beyond the known KG is challenging. Probabilistic embedding methods are effective in capturing global soft statistical tendenc…

    Submitted 22 September, 2020; originally announced September 2020.

    Comments: 10 page extended version

  50. arXiv:2009.09151  [pdf, other]

    cs.RO

    Design and Development of a Gecko-Adhesive Gripper for the Astrobee Free-Flying Robot

    Authors: A. Cauligi, T. G. Chen, S. A. Suresh, M. Dille, R. Garcia Ruiz, A. Mora Vargas, M. Pavone, M. Cutkosky

    Abstract: Assistive free-flying robots are a promising platform for supporting and working alongside astronauts in carrying out tasks that require interaction with the environment. However, current free-flying robot platforms are limited by existing manipulation technologies in being able to grasp and manipulate surrounding objects. Instead, gecko-inspired adhesives offer many advantages for an alternate gr…

    Submitted 18 September, 2020; originally announced September 2020.