Skip to main content

Showing 1–18 of 18 results for author: Tata, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2412.18914  [pdf, other

    cs.AI

    PRISM: Efficient Long-Range Reasoning With Short-Context LLMs

    Authors: Dulhan Jayalath, James Bradley Wendt, Nicholas Monath, Sandeep Tata, Beliz Gunel

    Abstract: Long-range tasks demand reasoning over long inputs. Current solutions require large compute budgets, training data, model weight access, or complex task-specific designs. We introduce PRISM, which processes information as a stream of chunks while maintaining a structured in-context memory specified with a typed hierarchical schema. PRISM outperforms baselines on diverse tasks while using at least… ▽ More

    Submitted 12 March, 2025; v1 submitted 25 December, 2024; originally announced December 2024.

    Comments: 28 pages, 7 figures, 5 tables

  2. arXiv:2411.05715  [pdf, other

    cs.SD cs.MM cs.NE eess.AS

    On the Role of Noise in AudioVisual Integration: Evidence from Artificial Neural Networks that Exhibit the McGurk Effect

    Authors: Lukas Grasse, Matthew S. Tata

    Abstract: Humans are able to fuse information from both auditory and visual modalities to help with understanding speech. This is frequently demonstrated through an phenomenon known as the McGurk Effect, during which a listener is presented with incongruent auditory and visual speech that fuse together into the percept of an illusory intermediate phoneme. Building on a recent framework that proposes how to… ▽ More

    Submitted 8 November, 2024; originally announced November 2024.

  3. arXiv:2407.15021  [pdf, other

    cs.CL

    Enhancing Incremental Summarization with Structured Representations

    Authors: EunJeong Hwang, Yichao Zhou, James Bradley Wendt, Beliz Gunel, Nguyen Vo, Jing Xie, Sandeep Tata

    Abstract: Large language models (LLMs) often struggle with processing extensive input contexts, which can lead to redundant, inaccurate, or incoherent summaries. Recent methods have used unstructured memory to incrementally process these contexts, but they still suffer from information overload due to the volume of unstructured data handled. In our study, we introduce structured knowledge representations (… ▽ More

    Submitted 20 July, 2024; originally announced July 2024.

  4. arXiv:2406.05079  [pdf, other

    cs.CL cs.LG

    SUMIE: A Synthetic Benchmark for Incremental Entity Summarization

    Authors: Eunjeong Hwang, Yichao Zhou, Beliz Gunel, James Bradley Wendt, Sandeep Tata

    Abstract: No existing dataset adequately tests how well language models can incrementally update entity summaries - a crucial ability as these models rapidly advance. The Incremental Entity Summarization (IES) task is vital for maintaining accurate, up-to-date knowledge. To address this, we introduce SUMIE, a fully synthetic dataset designed to expose real-world IES challenges. This dataset effectively high… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: 24 figures, 4 tables

  5. arXiv:2404.15565  [pdf, other

    cs.CL

    CASPR: Automated Evaluation Metric for Contrastive Summarization

    Authors: Nirupan Ananthamurugan, Dat Duong, Philip George, Ankita Gupta, Sandeep Tata, Beliz Gunel

    Abstract: Summarizing comparative opinions about entities (e.g., hotels, phones) from a set of source reviews, often referred to as contrastive summarization, can considerably aid users in decision making. However, reliably measuring the contrastiveness of the output summaries without relying on human evaluations remains an open problem. Prior work has proposed token-overlap based metrics, Distinctiveness S… ▽ More

    Submitted 13 May, 2024; v1 submitted 23 April, 2024; originally announced April 2024.

  6. arXiv:2403.19710  [pdf, other

    cs.CL cs.AI cs.IR cs.LG

    STRUM-LLM: Attributed and Structured Contrastive Summarization

    Authors: Beliz Gunel, James B. Wendt, Jing Xie, Yichao Zhou, Nguyen Vo, Zachary Fisher, Sandeep Tata

    Abstract: Users often struggle with decision-making between two options (A vs B), as it usually requires time-consuming research across multiple web pages. We propose STRUM-LLM that addresses this challenge by generating attributed, structured, and helpful contrastive summaries that highlight key differences between the two options. STRUM-LLM identifies helpful contrast: the specific attributes along which… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

  7. arXiv:2304.03932  [pdf, other

    cs.CV cs.AI cs.LG eess.IV

    3D GANs and Latent Space: A comprehensive survey

    Authors: Satya Pratheek Tata, Subhankar Mishra

    Abstract: Generative Adversarial Networks (GANs) have emerged as a significant player in generative modeling by mapping lower-dimensional random noise to higher-dimensional spaces. These networks have been used to generate high-resolution images and 3D objects. The efficient modeling of 3D objects and human faces is crucial in the development process of 3D graphical environments such as games or simulations… ▽ More

    Submitted 8 April, 2023; originally announced April 2023.

  8. arXiv:2212.10047  [pdf, other

    cs.CL

    An Augmentation Strategy for Visually Rich Documents

    Authors: Jing Xie, James B. Wendt, Yichao Zhou, Seth Ebner, Sandeep Tata

    Abstract: Many business workflows require extracting important fields from form-like documents (e.g. bank statements, bills of lading, purchase orders, etc.). Recent techniques for automating this task work well only when trained with large datasets. In this work we propose a novel data augmentation technique to improve performance when training data is scarce, e.g. 10-250 documents. Our technique, which we… ▽ More

    Submitted 22 December, 2022; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: 9 pages, 6 figures, 3 tables

  9. VRDU: A Benchmark for Visually-rich Document Understanding

    Authors: Zilong Wang, Yichao Zhou, Wei Wei, Chen-Yu Lee, Sandeep Tata

    Abstract: Understanding visually-rich business documents to extract structured data and automate business workflows has been receiving attention both in academia and industry. Although recent multi-modal language models have achieved impressive results, we find that existing benchmarks do not reflect the complexity of real documents seen in industry. In this work, we identify the desiderata for a more compr… ▽ More

    Submitted 16 September, 2023; v1 submitted 14 November, 2022; originally announced November 2022.

    Comments: KDD 2023

  10. arXiv:2210.16391  [pdf, other

    cs.CL

    Radically Lower Data-Labeling Costs for Visually Rich Document Extraction Models

    Authors: Yichao Zhou, James B. Wendt, Navneet Potti, Jing Xie, Sandeep Tata

    Abstract: A key bottleneck in building automatic extraction models for visually rich documents like invoices is the cost of acquiring the several thousand high-quality labeled documents that are needed to train a model with acceptable accuracy. We propose Selective Labeling to simplify the labeling task to provide "yes/no" labels for candidate extractions predicted by a model trained on partially labeled do… ▽ More

    Submitted 28 October, 2022; originally announced October 2022.

    Comments: 9 pages, 8 figures, 3 tables

  11. arXiv:2201.02647  [pdf, other

    cs.LG cs.IR

    Data-Efficient Information Extraction from Form-Like Documents

    Authors: Beliz Gunel, Navneet Potti, Sandeep Tata, James B. Wendt, Marc Najork, Jing Xie

    Abstract: Automating information extraction from form-like documents at scale is a pressing need due to its potential impact on automating business workflows across many industries like financial services, insurance, and healthcare. The key challenge is that form-like documents in these business workflows can be laid out in virtually infinitely many ways; hence, a good solution to this problem should genera… ▽ More

    Submitted 7 January, 2022; originally announced January 2022.

    Comments: Published at the 2nd Document Intelligence Workshop @ KDD 2021 (https://document-intelligence.github.io/DI-2021/)

  12. arXiv:2101.02415  [pdf, other

    cs.LG cs.CL

    Simplified DOM Trees for Transferable Attribute Extraction from the Web

    Authors: Yichao Zhou, Ying Sheng, Nguyen Vo, Nick Edmonds, Sandeep Tata

    Abstract: There has been a steady need to precisely extract structured knowledge from the web (i.e. HTML documents). Given a web page, extracting a structured object along with various attributes of interest (e.g. price, publisher, author, and genre for a book) can facilitate a variety of downstream applications such as large-scale knowledge base construction, e-commerce product search, and personalized rec… ▽ More

    Submitted 7 January, 2021; originally announced January 2021.

    Comments: 10 pages, 9 figures

  13. FreeDOM: A Transferable Neural Architecture for Structured Information Extraction on Web Documents

    Authors: Bill Yuchen Lin, Ying Sheng, Nguyen Vo, Sandeep Tata

    Abstract: Extracting structured data from HTML documents is a long-studied problem with a broad range of applications like augmenting knowledge bases, supporting faceted search, and providing domain-specific experiences for key verticals like shopping and movies. Previous approaches have either required a small number of examples for each target site or relied on carefully handcrafted heuristics built over… ▽ More

    Submitted 21 October, 2020; originally announced October 2020.

    Comments: in Proc. of KDD 2020 (Research Track). Figure 5 updated

  14. arXiv:2005.11442  [pdf, other

    cs.LG stat.ML

    Active Learning for Skewed Data Sets

    Authors: Abbas Kazerouni, Qi Zhao, Jing Xie, Sandeep Tata, Marc Najork

    Abstract: Consider a sequential active learning problem where, at each round, an agent selects a batch of unlabeled data points, queries their labels and updates a binary classifier. While there exists a rich body of work on active learning in this general form, in this paper, we focus on problems with two distinguishing characteristics: severe class imbalance (skew) and small amounts of initial training da… ▽ More

    Submitted 22 May, 2020; originally announced May 2020.

  15. arXiv:2002.02807  [pdf, other

    q-bio.NC cs.NE

    Adaptive control for hindlimb locomotion in a simulated mouse through temporal cerebellar learning

    Authors: T. P. Jensen, S. Tata, A. J. Ijspeert, S. Tolu

    Abstract: Human beings and other vertebrates show remarkable performance and efficiency in locomotion, but the functioning of their biological control systems for locomotion is still only partially understood. The basic patterns and timing for locomotion are provided by a central pattern generator (CPG) in the spinal cord. The cerebellum is known to play an important role in adaptive locomotion. Recent stud… ▽ More

    Submitted 17 February, 2020; v1 submitted 7 February, 2020; originally announced February 2020.

    Comments: To be published in NICE '20: Proceedings of the 8th Annual Neuro-inspired Computational Elements Workshop. 8 pages, 13 figures

  16. arXiv:1811.00652  [pdf

    cs.DC

    Modeling IoT-aware Business Processes - A State of the Art Report

    Authors: Nadja Brouns, Samir Tata, Heiko Ludwig, E. Serral Asensio, Paul Grefen

    Abstract: This research report presents an analysis of the state of the art of modeling Internet of Things (IoT)-aware business processes. IOT links the physical world to the digital world. Traditionally, we would find information about events and processes in the physical world in the digital world entered by humans and humans using this information to control the physical world. In the IoT paradigm, the p… ▽ More

    Submitted 1 November, 2018; originally announced November 2018.

    Comments: 42 pages

    Report number: RJ 10540

    Journal ref: IBM Research Report 2018

  17. arXiv:1105.4252  [pdf

    cs.DB cs.DC

    Column-Oriented Storage Techniques for MapReduce

    Authors: Avrilia Floratou, Jignesh Patel, Eugene Shekita, Sandeep Tata

    Abstract: Users of MapReduce often run into performance problems when they scale up their workloads. Many of the problems they encounter can be overcome by applying techniques learned from over three decades of research on parallel DBMSs. However, translating these techniques to a MapReduce implementation such as Hadoop presents unique challenges that can lead to new design choices. This paper describes how… ▽ More

    Submitted 21 May, 2011; originally announced May 2011.

    Comments: VLDB2011

    Report number: Proceedings of the VLDB Endowment (PVLDB), Vol. 4, No. 7, pp. 419-429 (2011)

  18. arXiv:1103.2408  [pdf

    cs.DB cs.DC

    Using Paxos to Build a Scalable, Consistent, and Highly Available Datastore

    Authors: Jun Rao, Eugene J. Shekita, Sandeep Tata

    Abstract: Spinnaker is an experimental datastore that is designed to run on a large cluster of commodity servers in a single datacenter. It features key-based range partitioning, 3-way replication, and a transactional get-put API with the option to choose either strong or timeline consistency on reads. This paper describes Spinnaker's Paxos-based replication protocol. The use of Paxos ensures that a data pa… ▽ More

    Submitted 11 March, 2011; originally announced March 2011.

    Comments: VLDB2011

    Journal ref: Proceedings of the VLDB Endowment (PVLDB), Vol. 4, No. 4, pp. 243-254 (2011)