Showing 1–8 of 8 results for author: D'Arcy, M

Searching in archive cs.
  1. arXiv:2411.14199

    cs.CL cs.AI cs.DL cs.IR cs.LG

    OpenScholar: Synthesizing Scientific Literature with Retrieval-augmented LMs

    Authors: Akari Asai, Jacqueline He, Rulin Shao, Weijia Shi, Amanpreet Singh, Joseph Chee Chang, Kyle Lo, Luca Soldaini, Sergey Feldman, Mike D'arcy, David Wadden, Matt Latzke, Minyang Tian, Pan Ji, Shengyan Liu, Hao Tong, Bohao Wu, Yanyu Xiong, Luke Zettlemoyer, Graham Neubig, Dan Weld, Doug Downey, Wen-tau Yih, Pang Wei Koh, Hannaneh Hajishirzi

    Abstract: Scientific progress depends on researchers' ability to synthesize the growing body of literature. Can large language models (LMs) assist scientists in this task? We introduce OpenScholar, a specialized retrieval-augmented LM that answers scientific queries by identifying relevant passages from 45 million open-access papers and synthesizing citation-backed responses. To evaluate OpenScholar, we dev…

    Submitted 21 November, 2024; originally announced November 2024.

  2. arXiv:2401.04259

    cs.CL

    MARG: Multi-Agent Review Generation for Scientific Papers

    Authors: Mike D'Arcy, Tom Hope, Larry Birnbaum, Doug Downey

    Abstract: We study the ability of LLMs to generate feedback for scientific papers and develop MARG, a feedback generation approach using multiple LLM instances that engage in internal discussion. By distributing paper text across agents, MARG can consume the full text of papers beyond the input length limitations of the base LLM, and by specializing agents and incorporating sub-tasks tailored to different c…

    Submitted 8 January, 2024; originally announced January 2024.

  3. arXiv:2306.12587

    cs.CL

    ARIES: A Corpus of Scientific Paper Edits Made in Response to Peer Reviews

    Authors: Mike D'Arcy, Alexis Ross, Erin Bransom, Bailey Kuehl, Jonathan Bragg, Tom Hope, Doug Downey

    Abstract: We introduce the task of automatically revising scientific papers based on peer feedback and release ARIES, a dataset of review comments and their corresponding paper edits. The data is drawn from real reviewer-author interactions from computer science, and we provide labels linking each reviewer comment to the specific paper edits made by the author in response. We automatically create a high-pre…

    Submitted 5 August, 2024; v1 submitted 21 June, 2023; originally announced June 2023.

    Comments: ACL 2024, 10 pages, 2 figures

  4. arXiv:2211.13308

    cs.CL cs.AI cs.IR cs.LG

    SciRepEval: A Multi-Format Benchmark for Scientific Document Representations

    Authors: Amanpreet Singh, Mike D'Arcy, Arman Cohan, Doug Downey, Sergey Feldman

    Abstract: Learned representations of scientific documents can serve as valuable input features for downstream tasks without further fine-tuning. However, existing benchmarks for evaluating these representations fail to capture the diversity of relevant tasks. In response, we introduce SciRepEval, the first comprehensive benchmark for training and evaluating scientific document representations. It includes 2…

    Submitted 13 November, 2023; v1 submitted 23 November, 2022; originally announced November 2022.

    Comments: 19 pages, 2 figures, 11 tables. Accepted in EMNLP 2023 Main Conference

  5. arXiv:2207.04993

    cs.CL

    Embedding Recycling for Language Models

    Authors: Jon Saad-Falcon, Amanpreet Singh, Luca Soldaini, Mike D'Arcy, Arman Cohan, Doug Downey

    Abstract: Real-world applications of neural language models often involve running many different models over the same corpus. The high computational cost of these runs has led to interest in techniques that can reuse the contextualized embeddings produced in previous runs to speed training and inference of future ones. We refer to this approach as embedding recycling (ER). While multiple ER techniques have…

    Submitted 30 January, 2023; v1 submitted 11 July, 2022; originally announced July 2022.

    Comments: EACL Findings 2023

  6. arXiv:1904.04365

    cs.CL cs.LG

    CODAH: An Adversarially Authored Question-Answer Dataset for Common Sense

    Authors: Michael Chen, Mike D'Arcy, Alisa Liu, Jared Fernandez, Doug Downey

    Abstract: Commonsense reasoning is a critical AI capability, but it is difficult to construct challenging datasets that test common sense. Recent neural question answering systems, based on large pre-trained models of language, have already achieved near-human-level performance on commonsense knowledge benchmarks. These systems do not possess human-level common sense, but are able to exploit limitations of…

    Submitted 26 July, 2019; v1 submitted 8 April, 2019; originally announced April 2019.

    Comments: 8 pages, Appeared in RepEval 2019

  7. arXiv:1803.03719

    cs.RO cs.AI cs.LG stat.ML

    DeepMoTIon: Learning to Navigate Like Humans

    Authors: Mahmoud Hamandi, Mike D'Arcy, Pooyan Fazli

    Abstract: We present a novel human-aware navigation approach, where the robot learns to mimic humans to navigate safely in crowds. The presented model, referred to as DeepMoTIon, is trained with pedestrian surveillance data to predict human velocity in the environment. The robot processes LiDAR scans via the trained network to navigate to the target location. We conduct extensive experiments to assess the c…

    Submitted 1 August, 2019; v1 submitted 9 March, 2018; originally announced March 2018.

    Comments: 7 pages, In Proceedings of the IEEE International Conference on Robot and Human Interactive Communication, RO-MAN 2019

  8. arXiv:1710.06831

    cs.RO

    Setting Up the Beam for Human-Centered Service Tasks

    Authors: Utkarsh Patel, Emre Hatay, Mike D'Arcy, Ghazal Zand, Pooyan Fazli

    Abstract: We introduce the Beam, a collaborative autonomous mobile service robot, based on SuitableTech's Beam telepresence system. We present a set of enhancements to the telepresence system, including autonomy, human awareness, increased computation and sensing capabilities, and integration with the popular Robot Operating System (ROS) framework. Together, our improvements transform the Beam into a low-co…

    Submitted 18 October, 2017; originally announced October 2017.

    Comments: 10 pages