Skip to main content

Showing 1–50 of 159 results for author: Taylor, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2507.05643  [pdf, ps, other

    cs.RO

    A Physics-Based Continuum Model for Versatile, Scalable, and Fast Terramechanics Simulation

    Authors: Huzaifa Unjhawala, Luning Bakke, Harry Zhang, Michael Taylor, Ganesh Arivoli, Radu Serban, Dan Negrut

    Abstract: This paper discusses Chrono's Continuous Representation Model (called herein Chrono::CRM), a general-purpose, scalable, and efficient simulation solution for terramechanics problems. Built on Chrono's Smoothed Particle Hydrodynamics (SPH) framework, Chrono::CRM moves beyond semi-empirical terramechanics approaches, e.g., Bekker-Wong/Janosi-Hanamoto, to provide a physics-based model able to address… ▽ More

    Submitted 7 July, 2025; originally announced July 2025.

    Comments: 32 pages, 21 figures, Submitted to Journal of Terramechanics

  2. arXiv:2506.20054  [pdf, ps, other

    cs.IT math.MG math.PR

    On sharp stable recovery from clipped and folded measurements

    Authors: Pedro Abdalla, Daniel Freeman, João P. G. Ramos, Mitchell A. Taylor

    Abstract: We investigate the stability of vector recovery from random linear measurements which have been either clipped or folded. This is motivated by applications where measurement devices detect inputs outside of their effective range. As examples of our main results, we prove sharp lower bounds on the recovery constant for both the declipping and unfolding problems whenever samples are taken accordin… ▽ More

    Submitted 24 June, 2025; originally announced June 2025.

    Comments: 33 pages, 2 figures

  3. arXiv:2506.13206  [pdf, ps, other

    cs.LG cs.AI cs.CL

    Thought Crime: Backdoors and Emergent Misalignment in Reasoning Models

    Authors: James Chua, Jan Betley, Mia Taylor, Owain Evans

    Abstract: Prior work shows that LLMs finetuned on malicious behaviors in a narrow domain (e.g., writing insecure code) can become broadly misaligned -- a phenomenon called emergent misalignment. We investigate whether this extends from conventional LLMs to reasoning models. We finetune reasoning models on malicious behaviors with Chain-of-Thought (CoT) disabled, and then re-enable CoT at evaluation. Like co… ▽ More

    Submitted 10 July, 2025; v1 submitted 16 June, 2025; originally announced June 2025.

  4. arXiv:2506.11613  [pdf, ps, other

    cs.LG cs.AI

    Model Organisms for Emergent Misalignment

    Authors: Edward Turner, Anna Soligo, Mia Taylor, Senthooran Rajamanoharan, Neel Nanda

    Abstract: Recent work discovered Emergent Misalignment (EM): fine-tuning large language models on narrowly harmful datasets can lead them to become broadly misaligned. A survey of experts prior to publication revealed this was highly unexpected, demonstrating critical gaps in our understanding of model alignment. In this work, we both advance understanding and provide tools for future research. Using new na… ▽ More

    Submitted 13 June, 2025; originally announced June 2025.

  5. arXiv:2505.21682  [pdf, ps, other

    cs.CY cs.HC

    Data and Technology for Equitable Public Administration: Understanding City Government Employees' Challenges and Needs

    Authors: Angie Zhang, Madison Liao, Elizaveta, Kravchenko, Marshanah Taylor, Angela Haddad, Chandra Bhat, S. Craig Watkins, Min Kyung Lee

    Abstract: City governments in the United States are increasingly pressured to adopt emerging technologies. Yet, these systems often risk biased and disparate outcomes. Scholars studying public sector technology design have converged on the need to ground these systems in the goals and organizational contexts of employees using them. We expand our understanding of employees' contexts by focusing on the equit… ▽ More

    Submitted 27 May, 2025; originally announced May 2025.

    Comments: Accepted to ACM CSCW 2025

  6. arXiv:2505.04961  [pdf, ps, other

    cs.GR cs.AI cs.CV cs.RO

    ADD: Physics-Based Motion Imitation with Adversarial Differential Discriminators

    Authors: Ziyu Zhang, Sergey Bashkirov, Dun Yang, Michael Taylor, Xue Bin Peng

    Abstract: Multi-objective optimization problems, which require the simultaneous optimization of multiple terms, are prevalent across numerous applications. Existing multi-objective optimization methods often rely on manually tuned aggregation functions to formulate a joint optimization target. The performance of such hand-tuned methods is heavily dependent on careful weight selection, a time-consuming and l… ▽ More

    Submitted 8 May, 2025; originally announced May 2025.

    Comments: 19 pages, 15 figures

  7. arXiv:2505.03770  [pdf, other

    cs.AI

    Proceedings of 1st Workshop on Advancing Artificial Intelligence through Theory of Mind

    Authors: Mouad Abrini, Omri Abend, Dina Acklin, Henny Admoni, Gregor Aichinger, Nitay Alon, Zahra Ashktorab, Ashish Atreja, Moises Auron, Alexander Aufreiter, Raghav Awasthi, Soumya Banerjee, Joe M. Barnby, Rhea Basappa, Severin Bergsmann, Djallel Bouneffouf, Patrick Callaghan, Marc Cavazza, Thierry Chaminade, Sonia Chernova, Mohamed Chetouan, Moumita Choudhury, Axel Cleeremans, Jacek B. Cywinski, Fabio Cuzzolin , et al. (83 additional authors not shown)

    Abstract: This volume includes a selection of papers presented at the Workshop on Advancing Artificial Intelligence through Theory of Mind held at AAAI 2025 in Philadelphia US on 3rd March 2025. The purpose of this volume is to provide an open access and curated anthology for the ToM and AI research community.

    Submitted 28 April, 2025; originally announced May 2025.

    Comments: workshop proceedings

  8. arXiv:2504.17006  [pdf, other

    cs.AI cs.LG cs.RO

    A Systematic Approach to Design Real-World Human-in-the-Loop Deep Reinforcement Learning: Salient Features, Challenges and Trade-offs

    Authors: Jalal Arabneydi, Saiful Islam, Srijita Das, Sai Krishna Gottipati, William Duguay, Cloderic Mars, Matthew E. Taylor, Matthew Guzdial, Antoine Fagette, Younes Zerouali

    Abstract: With the growing popularity of deep reinforcement learning (DRL), human-in-the-loop (HITL) approach has the potential to revolutionize the way we approach decision-making problems and create new opportunities for human-AI collaboration. In this article, we introduce a novel multi-layered hierarchical HITL DRL algorithm that comprises three types of learning: self learning, imitation learning and t… ▽ More

    Submitted 23 April, 2025; originally announced April 2025.

    Comments: This is a result of the collaboration by JACOBB, AMII(Alberta Machine Intelligence Institute), Thales and AI Redefined (AIR) in 2021-2023

  9. arXiv:2504.00285  [pdf, other

    cs.CL

    Do Large Language Models Exhibit Spontaneous Rational Deception?

    Authors: Samuel M. Taylor, Benjamin K. Bergen

    Abstract: Large Language Models (LLMs) are effective at deceiving, when prompted to do so. But under what conditions do they deceive spontaneously? Models that demonstrate better performance on reasoning tasks are also better at prompted deception. Do they also increasingly deceive spontaneously in situations where it could be considered rational to do so? This study evaluates spontaneous deception produced… ▽ More

    Submitted 31 March, 2025; originally announced April 2025.

  10. arXiv:2503.17298  [pdf, other

    cs.CR

    UAV Resilience Against Stealthy Attacks

    Authors: Arthur Amorim, Max Taylor, Trevor Kann, Gary T. Leavens, William L. Harrison, Lance Joneckis

    Abstract: Unmanned aerial vehicles (UAVs) depend on untrusted software components to automate dangerous or critical missions, making them a desirable target for attacks. Some work has been done to prevent an attacker who has either compromised a ground control station or parts of a UAV's software from sabotaging the vehicle, but not both. We present an architecture running a UAV software stack with runtime… ▽ More

    Submitted 14 April, 2025; v1 submitted 21 March, 2025; originally announced March 2025.

    Comments: To be featured in ICUAS'25 proceedings

  11. arXiv:2503.16131  [pdf, other

    cs.CL

    MKG-Rank: Enhancing Large Language Models with Knowledge Graph for Multilingual Medical Question Answering

    Authors: Feiyang Li, Yingjian Chen, Haoran Liu, Rui Yang, Han Yuan, Yuang Jiang, Tianxiao Li, Edison Marrese Taylor, Hossein Rouhizadeh, Yusuke Iwasawa, Douglas Teodoro, Yutaka Matsuo, Irene Li

    Abstract: Large Language Models (LLMs) have shown remarkable progress in medical question answering (QA), yet their effectiveness remains predominantly limited to English due to imbalanced multilingual training data and scarce medical resources for low-resource languages. To address this critical language gap in medical QA, we propose Multilingual Knowledge Graph-based Retrieval Ranking (MKG-Rank), a knowle… ▽ More

    Submitted 20 March, 2025; v1 submitted 20 March, 2025; originally announced March 2025.

  12. arXiv:2503.05996  [pdf, other

    cs.LG cs.AI

    Towards Improving Reward Design in RL: A Reward Alignment Metric for RL Practitioners

    Authors: Calarina Muslimani, Kerrick Johnstonbaugh, Suyog Chandramouli, Serena Booth, W. Bradley Knox, Matthew E. Taylor

    Abstract: Reinforcement learning agents are fundamentally limited by the quality of the reward functions they learn from, yet reward design is often overlooked under the assumption that a well-defined reward is readily available. However, in practice, designing rewards is difficult, and even when specified, evaluating their correctness is equally problematic: how do we know if a reward function is correctly… ▽ More

    Submitted 7 March, 2025; originally announced March 2025.

  13. arXiv:2503.04754  [pdf

    cs.CY

    GIS as a Job Growth Area for IT Professionals

    Authors: Timur Mirzoev, Anthony Moore, Brianna Pryzbysz, Melissa Taylor, John Centeno

    Abstract: As more companies look to capitalize on the benefits of geospatial data, Geographic Information Systems provide an area for growth in the Information Technology job sector in the United States. Careers in GIS require geography, cartography, and IT skills. As the industry grows, candidates with these types of skills that are in demand and are needed to advance the geospatial industry forward. This… ▽ More

    Submitted 8 February, 2025; originally announced March 2025.

    Journal ref: 2015 World of Computer Science and Information Technology Journal (WCSIT) ISSN: 2221-0741 Vol. 5, No. 6, 98-111

  14. arXiv:2502.16772  [pdf, ps, other

    cs.LG

    Model-Based Exploration in Monitored Markov Decision Processes

    Authors: Alireza Kazemipour, Simone Parisi, Matthew E. Taylor, Michael Bowling

    Abstract: A tenet of reinforcement learning is that the agent always observes rewards. However, this is not true in many realistic settings, e.g., a human observer may not always be available to provide rewards, sensors may be limited or malfunctioning, or rewards may be inaccessible during deployment. Monitored Markov decision processes (Mon-MDPs) have recently been proposed to model such settings. However… ▽ More

    Submitted 24 June, 2025; v1 submitted 23 February, 2025; originally announced February 2025.

  15. arXiv:2502.15214  [pdf, other

    cs.LG cs.AI cs.CL

    The Evolving Landscape of LLM- and VLM-Integrated Reinforcement Learning

    Authors: Sheila Schoepp, Masoud Jafaripour, Yingyue Cao, Tianpei Yang, Fatemeh Abdollahi, Shadan Golestan, Zahin Sufiyan, Osmar R. Zaiane, Matthew E. Taylor

    Abstract: Reinforcement learning (RL) has shown impressive results in sequential decision-making tasks. Meanwhile, Large Language Models (LLMs) and Vision-Language Models (VLMs) have emerged, exhibiting impressive capabilities in multimodal understanding and reasoning. These advances have led to a surge of research integrating LLMs and VLMs into RL. In this survey, we review representative works in which LL… ▽ More

    Submitted 21 February, 2025; originally announced February 2025.

    Comments: 9 pages, 4 figures

  16. arXiv:2502.09799  [pdf, other

    cs.HC cs.AI cs.CY

    Co-designing Large Language Model Tools for Project-Based Learning with K12 Educators

    Authors: Prerna Ravi, John Masla, Gisella Kakoti, Grace Lin, Emma Anderson, Matt Taylor, Anastasia Ostrowski, Cynthia Breazeal, Eric Klopfer, Hal Abelson

    Abstract: The emergence of generative AI, particularly large language models (LLMs), has opened the door for student-centered and active learning methods like project-based learning (PBL). However, PBL poses practical implementation challenges for educators around project design and management, assessment, and balancing student guidance with student autonomy. The following research documents a co-design pro… ▽ More

    Submitted 13 February, 2025; originally announced February 2025.

    Comments: 25 pages

    Journal ref: CHI Conference on Human Factors in Computing Systems (CHI '25), April 26-May 01, 2025, Yokohama, Japan. ACM, New York, NY, USA

  17. arXiv:2501.18874  [pdf, other

    cs.CR cs.PL

    Enforcing MAVLink Safety & Security Properties Via Refined Multiparty Session Types

    Authors: Arthur Amorim, Max Taylor, Trevor Kann, William L. Harrison, Gary T. Leavens, Lance Joneckis

    Abstract: A compromised system component can issue message sequences that are legal while also leading the overall system into unsafe states. Such stealthy attacks are challenging to characterize, because message interfaces in standard languages specify each individual message separately but do not specify safe sequences of messages. We present initial results from ongoing work applying refined multiparty s… ▽ More

    Submitted 14 March, 2025; v1 submitted 30 January, 2025; originally announced January 2025.

    Comments: To be featured in proceedings of The 17th NASA Formal Methods Symposium

  18. arXiv:2501.09870  [pdf, other

    cs.LG

    An LLM-Guided Tutoring System for Social Skills Training

    Authors: Michael Guevarra, Indronil Bhattacharjee, Srijita Das, Christabel Wayllace, Carrie Demmans Epp, Matthew E. Taylor, Alan Tay

    Abstract: Social skills training targets behaviors necessary for success in social interactions. However, traditional classroom training for such skills is often insufficient to teach effective communication -- one-to-one interaction in real-world scenarios is preferred to lecture-style information delivery. This paper introduces a framework that allows instructors to collaborate with large language models… ▽ More

    Submitted 16 January, 2025; originally announced January 2025.

  19. arXiv:2501.09666  [pdf

    cs.DL

    Evaluating the diversity of scientific discourse on twenty-one multilingual Wikipedias using citation analysis

    Authors: Michael Taylor, Roisi Proven, Carlos Areia

    Abstract: INTRODUCTION: Wikipedia is a major source of information, particularly for medical and health content, citing over 4 million scholarly publications. However, the representation of research-based knowledge across different languages on Wikipedia has been under explored. This study analyses the largest database of Wikipedia citations collected to date, examining the uniqueness of content and researc… ▽ More

    Submitted 16 January, 2025; originally announced January 2025.

  20. arXiv:2412.18973  [pdf, other

    quant-ph cond-mat.str-el cs.LG

    Derandomized shallow shadows: Efficient Pauli learning with bounded-depth circuits

    Authors: Katherine Van Kirk, Christian Kokail, Jonathan Kunjummen, Hong-Ye Hu, Yanting Teng, Madelyn Cain, Jacob Taylor, Susanne F. Yelin, Hannes Pichler, Mikhail Lukin

    Abstract: Efficiently estimating large numbers of non-commuting observables is an important subroutine of many quantum science tasks. We present the derandomized shallow shadows (DSS) algorithm for efficiently learning a large set of non-commuting observables, using shallow circuits to rotate into measurement bases. Exploiting tensor network techniques to ensure polynomial scaling of classical resources, ou… ▽ More

    Submitted 25 December, 2024; originally announced December 2024.

    Comments: 10+29 pages, 9 figures

  21. arXiv:2412.14467  [pdf, other

    cs.CR cs.FL cs.LO cs.PL

    Towards Provable Security in Industrial Control Systems Via Dynamic Protocol Attestation

    Authors: Arthur Amorim, Trevor Kann, Max Taylor, Lance Joneckis

    Abstract: Industrial control systems (ICSs) increasingly rely on digital technologies vulnerable to cyber attacks. Cyber attackers can infiltrate ICSs and execute malicious actions. Individually, each action seems innocuous. But taken together, they cause the system to enter an unsafe state. These attacks have resulted in dramatic consequences such as physical damage, economic loss, and environmental catast… ▽ More

    Submitted 18 December, 2024; originally announced December 2024.

    Comments: This paper was accepted into the ICSS'24 workshop

  22. arXiv:2411.18593  [pdf, other

    cs.DC

    CkIO: Parallel File Input for Over-Decomposed Task-Based Systems

    Authors: Mathew Jacob, Maya Taylor, Laxmikant Kale

    Abstract: Parallel input performance issues are often neglected in large scale parallel applications in Computational Science and Engineering. Traditionally, there has been less focus on input performance because either input sizes are small (as in biomolecular simulations) or the time doing input is insignificant compared with the simulation with many timesteps. But newer applications, such as graph algori… ▽ More

    Submitted 27 November, 2024; v1 submitted 27 November, 2024; originally announced November 2024.

  23. arXiv:2411.14568  [pdf, other

    cs.RO

    Maximum Solar Energy Tracking Leverage High-DoF Robotics System with Deep Reinforcement Learning

    Authors: Anjie Jiang, Kangtong Mo, Satoshi Fujimoto, Michael Taylor, Sanjay Kumar, Chiotis Dimitrios, Emilia Ruiz

    Abstract: Solar trajectory monitoring is a pivotal challenge in solar energy systems, underpinning applications such as autonomous energy harvesting and environmental sensing. A prevalent failure mode in sustained solar tracking arises when the predictive algorithm erroneously diverges from the solar locus, erroneously anchoring to extraneous celestial or terrestrial features. This phenomenon is attributabl… ▽ More

    Submitted 21 November, 2024; originally announced November 2024.

  24. arXiv:2410.21406  [pdf, other

    cs.RO

    Investigating the Benefits of Nonlinear Action Maps in Data-Driven Teleoperation

    Authors: Michael Przystupa, Gauthier Gidel, Matthew E. Taylor, Martin Jagersand, Justus Piater, Samuele Tosatto

    Abstract: As robots become more common for both able-bodied individuals and those living with a disability, it is increasingly important that lay people be able to drive multi-degree-of-freedom platforms with low-dimensional controllers. One approach is to use state-conditioned action mapping methods to learn mappings between low-dimensional controllers and high DOF manipulators -- prior research suggests t… ▽ More

    Submitted 28 October, 2024; originally announced October 2024.

    Comments: 13 Pages, 7 Figures, presented at Collaborative AI and Modeling of Humans AAAI Bridge Program Submission

  25. arXiv:2409.15521  [pdf, other

    cs.LG cs.AI

    CANDERE-COACH: Reinforcement Learning from Noisy Feedback

    Authors: Yuxuan Li, Srijita Das, Matthew E. Taylor

    Abstract: In recent times, Reinforcement learning (RL) has been widely applied to many challenging tasks. However, in order to perform well, it requires access to a good reward function which is often sparse or manually engineered with scope for error. Introducing human prior knowledge is often seen as a possible solution to the above-mentioned problem, such as imitation learning, learning from preference,… ▽ More

    Submitted 23 September, 2024; originally announced September 2024.

  26. arXiv:2409.11948  [pdf

    cs.DL

    Research Citations Building Trust in Wikipedia

    Authors: Michael Taylor, Carlos Areia, Kath Burton, Charles Watkinson

    Abstract: The use of Wikipedia citations in scholarly research has been the topic of much inquiry over the past decade. A cross-publisher study (Taylor & Francis and University of Michigan Press) convened by Digital Science was established in late 2022 to explore author sentiment towards Wikipedia as a trusted source of information. A short survey was designed to poll published authors about views and uses… ▽ More

    Submitted 18 September, 2024; originally announced September 2024.

  27. arXiv:2408.14488  [pdf

    cs.LG cond-mat.mtrl-sci

    Multi-Task Multi-Fidelity Learning of Properties for Energetic Materials

    Authors: Robert J. Appleton, Daniel Klinger, Brian H. Lee, Michael Taylor, Sohee Kim, Samuel Blankenship, Brian C. Barnes, Steven F. Son, Alejandro Strachan

    Abstract: Data science and artificial intelligence are playing an increasingly important role in the physical sciences. Unfortunately, in the field of energetic materials data scarcity limits the accuracy and even applicability of ML tools. To address data limitations, we compiled multi-modal data: both experimental and computational results for several properties. We find that multi-task neural networks ca… ▽ More

    Submitted 21 August, 2024; originally announced August 2024.

    Comments: 16 pages, 4 figures, 2 tables

  28. arXiv:2408.05609  [pdf, ps, other

    eess.SY cs.AI cs.LG cs.MA cs.RO

    Mitigating Metropolitan Carbon Emissions with Dynamic Eco-driving at Scale

    Authors: Vindula Jayawardana, Baptiste Freydt, Ao Qu, Cameron Hickert, Edgar Sanchez, Catherine Tang, Mark Taylor, Blaine Leonard, Cathy Wu

    Abstract: The sheer scale and diversity of transportation make it a formidable sector to decarbonize. Here, we consider an emerging opportunity to reduce carbon emissions: the growing adoption of semi-autonomous vehicles, which can be programmed to mitigate stop-and-go traffic through intelligent speed commands and, thus, reduce emissions. But would such dynamic eco-driving move the needle on climate change… ▽ More

    Submitted 27 June, 2025; v1 submitted 10 August, 2024; originally announced August 2024.

    Comments: Accepted for publication at Transportation Research Part C: Emerging Technologies

  29. arXiv:2407.16220  [pdf, other

    cs.AI cs.LG

    ODGR: Online Dynamic Goal Recognition

    Authors: Matan Shamir, Osher Elhadad, Matthew E. Taylor, Reuth Mirsky

    Abstract: Traditionally, Reinforcement Learning (RL) problems are aimed at optimization of the behavior of an agent. This paper proposes a novel take on RL, which is used to learn the policy of another agent, to allow real-time recognition of that agent's goals. Goal Recognition (GR) has traditionally been framed as a planning problem where one must recognize an agent's objectives based on its observed acti… ▽ More

    Submitted 23 July, 2024; originally announced July 2024.

    Comments: 8 pages, 1 figure, RLC workshop, WAHT workshop

  30. arXiv:2407.09533  [pdf, other

    cs.CV cs.AI

    Video Occupancy Models

    Authors: Manan Tomar, Philippe Hansen-Estruch, Philip Bachman, Alex Lamb, John Langford, Matthew E. Taylor, Sergey Levine

    Abstract: We introduce a new family of video prediction models designed to support downstream control tasks. We call these models Video Occupancy models (VOCs). VOCs operate in a compact latent space, thus avoiding the need to make predictions about individual pixels. Unlike prior latent-space world models, VOCs directly predict the discounted distribution of future states in a single step, thus avoiding th… ▽ More

    Submitted 25 June, 2024; originally announced July 2024.

  31. arXiv:2407.08633  [pdf, other

    cs.AI

    A Novel Framework for Automated Warehouse Layout Generation

    Authors: Atefeh Shahroudnejad, Payam Mousavi, Oleksii Perepelytsia, Sahir, David Staszak, Matthew E. Taylor, Brent Bawel

    Abstract: Optimizing warehouse layouts is crucial due to its significant impact on efficiency and productivity. We present an AI-driven framework for automated warehouse layout generation. This framework employs constrained beam search to derive optimal layouts within given spatial parameters, adhering to all functional requirements. The feasibility of the generated layouts is verified based on criteria suc… ▽ More

    Submitted 12 July, 2024; v1 submitted 11 July, 2024; originally announced July 2024.

  32. arXiv:2406.13049  [pdf, other

    cs.CY cs.AI

    Assessing AI vs Human-Authored Spear Phishing SMS Attacks: An Empirical Study

    Authors: Jerson Francia, Derek Hansen, Ben Schooley, Matthew Taylor, Shydra Murray, Greg Snow

    Abstract: This paper explores the use of Large Language Models (LLMs) in spear phishing message generation and evaluates their performance compared to human-authored counterparts. Our pilot study examines the effectiveness of smishing (SMS phishing) messages created by GPT-4 and human authors, which have been personalized for willing targets. The targets assessed these messages in a modified ranked-order ex… ▽ More

    Submitted 18 March, 2025; v1 submitted 18 June, 2024; originally announced June 2024.

    Comments: 18 pages, 5 figures, 1 table

  33. arXiv:2406.10535  [pdf

    cs.DL

    Evaluating Open Access Advantages for Citations and Altmetrics (2011-21): A Dynamic and Evolving Relationship

    Authors: Michael Taylor

    Abstract: Differences between the impacts of Open Access (OA) and non-OA research have been observed over a wide range of citation and altmetric indicators, usually finding an Open Access Advantage (OAA) within specific fields. However, science-wide analyses covering multiple years, indicators and disciplines are lacking. Using citation counts and six altmetrics for 38.7M articles published 2011-21, we comp… ▽ More

    Submitted 15 June, 2024; originally announced June 2024.

  34. arXiv:2406.06495  [pdf, ps, other

    cs.LG

    Boosting Robustness in Preference-Based Reinforcement Learning with Dynamic Sparsity

    Authors: Calarina Muslimani, Bram Grooten, Deepak Ranganatha Sastry Mamillapalli, Mykola Pechenizkiy, Decebal Constantin Mocanu, Matthew E. Taylor

    Abstract: To integrate into human-centered environments, autonomous agents must learn from and adapt to humans in their native settings. Preference-based reinforcement learning (PbRL) can enable this by learning reward functions from human preferences. However, humans live in a world full of diverse information, most of which is irrelevant to completing any particular task. It then becomes essential that ag… ▽ More

    Submitted 3 July, 2025; v1 submitted 10 June, 2024; originally announced June 2024.

  35. arXiv:2405.19296  [pdf, other

    cs.CV cs.AI cs.GR cs.LG

    Neural Isometries: Taming Transformations for Equivariant ML

    Authors: Thomas W. Mitchel, Michael Taylor, Vincent Sitzmann

    Abstract: Real-world geometry and 3D vision tasks are replete with challenging symmetries that defy tractable analytical expression. In this paper, we introduce Neural Isometries, an autoencoder framework which learns to map the observation space to a general-purpose latent space wherein encodings are related by isometries whenever their corresponding observations are geometrically related in world space. S… ▽ More

    Submitted 29 October, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

    Comments: NeurIPS 2024

  36. arXiv:2405.00746  [pdf, other

    cs.LG cs.AI cs.RO

    Leveraging Sub-Optimal Data for Human-in-the-Loop Reinforcement Learning

    Authors: Calarina Muslimani, Matthew E. Taylor

    Abstract: To create useful reinforcement learning (RL) agents, step zero is to design a suitable reward function that captures the nuances of the task. However, reward engineering can be a difficult and time-consuming process. Instead, human-in-the-loop RL methods hold the promise of learning reward functions from human feedback. Despite recent successes, many of the human-in-the-loop RL methods still requi… ▽ More

    Submitted 7 April, 2025; v1 submitted 30 April, 2024; originally announced May 2024.

  37. arXiv:2404.13142  [pdf, other

    eess.SY cs.AI cs.LG cs.MA

    Decentralized Coordination of Distributed Energy Resources through Local Energy Markets and Deep Reinforcement Learning

    Authors: Daniel May, Matthew Taylor, Petr Musilek

    Abstract: As distributed energy resources (DERs) grow, the electricity grid faces increased net load variability at the grid edge, impacting operability and reliability. Transactive energy, facilitated through local energy markets, offers a decentralized, indirect demand response solution, with model-free control techniques, such as deep reinforcement learning (DRL), enabling automated, decentralized partic… ▽ More

    Submitted 14 November, 2024; v1 submitted 19 April, 2024; originally announced April 2024.

    Comments: preprint, submitted to Energy and AI

  38. arXiv:2404.13061  [pdf, other

    cs.AR cs.AI cs.LG

    FPGA Divide-and-Conquer Placement using Deep Reinforcement Learning

    Authors: Shang Wang, Deepak Ranganatha Sastry Mamillapalli, Tianpei Yang, Matthew E. Taylor

    Abstract: This paper introduces the problem of learning to place logic blocks in Field-Programmable Gate Arrays (FPGAs) and a learning-based method. In contrast to previous search-based placement algorithms, we instead employ Reinforcement Learning (RL) with the goal of minimizing wirelength. In addition to our preliminary learning results, we also evaluated a novel decomposition to address the nature of la… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: accepted by ISEDA2024

  39. arXiv:2402.06819  [pdf, other

    cs.LG

    Monitored Markov Decision Processes

    Authors: Simone Parisi, Montaser Mohammedalamen, Alireza Kazemipour, Matthew E. Taylor, Michael Bowling

    Abstract: In reinforcement learning (RL), an agent learns to perform a task by interacting with an environment and receiving feedback (a numerical reward) for its actions. However, the assumption that rewards are always observable is often not applicable in real-world problems. For example, the agent may need to ask a human to supervise its actions or activate a monitoring system to receive feedback. There… ▽ More

    Submitted 13 February, 2024; v1 submitted 9 February, 2024; originally announced February 2024.

    Comments: AAMAS 2024, Main Track

  40. arXiv:2401.02991  [pdf, other

    cs.CL cs.AI cs.LG

    GLIDE-RL: Grounded Language Instruction through DEmonstration in RL

    Authors: Chaitanya Kharyal, Sai Krishna Gottipati, Tanmay Kumar Sinha, Srijita Das, Matthew E. Taylor

    Abstract: One of the final frontiers in the development of complex human - AI collaborative systems is the ability of AI agents to comprehend the natural language and perform tasks accordingly. However, training efficient Reinforcement Learning (RL) agents grounded in natural language has been a long-standing challenge due to the complexity and ambiguity of the language and sparsity of the rewards, among ot… ▽ More

    Submitted 3 January, 2024; originally announced January 2024.

    Comments: 12 pages, 6 figures, to be presented at AAMAS 2024

  41. arXiv:2401.00907  [pdf, other

    cs.LG cs.AI cs.CL

    LaFFi: Leveraging Hybrid Natural Language Feedback for Fine-tuning Language Models

    Authors: Qianxi Li, Yingyue Cao, Jikun Kang, Tianpei Yang, Xi Chen, Jun Jin, Matthew E. Taylor

    Abstract: Fine-tuning Large Language Models (LLMs) adapts a trained model to specific downstream tasks, significantly improving task-specific performance. Supervised Fine-Tuning (SFT) is a common approach, where an LLM is trained to produce desired answers. However, LLMs trained with SFT sometimes make simple mistakes and result in hallucinations on reasoning tasks such as question-answering. Without extern… ▽ More

    Submitted 31 December, 2023; originally announced January 2024.

    Comments: Paper accepted in Human-Centric Representation Learning workshop at AAAI 2024 (https://hcrl-workshop.github.io/2024/)

  42. arXiv:2312.15339  [pdf, other

    cs.LG cs.AI cs.CV cs.RO

    MaDi: Learning to Mask Distractions for Generalization in Visual Deep Reinforcement Learning

    Authors: Bram Grooten, Tristan Tomilin, Gautham Vasan, Matthew E. Taylor, A. Rupam Mahmood, Meng Fang, Mykola Pechenizkiy, Decebal Constantin Mocanu

    Abstract: The visual world provides an abundance of information, but many input pixels received by agents often contain distracting stimuli. Autonomous agents need the ability to distinguish useful information from task-irrelevant perceptions, enabling them to generalize to unseen environments with new distractions. Existing works approach this problem using data augmentation or large auxiliary networks wit… ▽ More

    Submitted 23 December, 2023; originally announced December 2023.

    Comments: Accepted as full-paper (oral) at AAMAS 2024. Code is available at https://github.com/bramgrooten/mask-distractions and see our 40-second video at https://youtu.be/2oImF0h1k48

  43. arXiv:2312.14322  [pdf, other

    cond-mat.mes-hall cs.DB cs.LG quant-ph

    Data needs and challenges for quantum dot devices automation

    Authors: Justyna P. Zwolak, Jacob M. Taylor, Reed W. Andrews, Jared Benson, Garnett W. Bryant, Donovan Buterakos, Anasua Chatterjee, Sankar Das Sarma, Mark A. Eriksson, Eliška Greplová, Michael J. Gullans, Fabian Hader, Tyler J. Kovach, Pranav S. Mundada, Mick Ramsey, Torbjørn Rasmussen, Brandon Severin, Anthony Sigillito, Brennan Undseth, Brian Weber

    Abstract: Gate-defined quantum dots are a promising candidate system for realizing scalable, coupled qubit systems and serving as a fundamental building block for quantum computers. However, present-day quantum dot devices suffer from imperfections that must be accounted for, which hinders the characterization, tuning, and operation process. Moreover, with an increasing number of quantum dot qubits, the rel… ▽ More

    Submitted 5 November, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

    Comments: A meeting report from a workshop held at the National Institute of Standards and Technology, Gaithersburg, MD

    Journal ref: npj Quantum Inf. 10, 105 (2024)

  44. arXiv:2312.11768  [pdf, other

    cs.AI cs.LG cs.MA

    Curriculum Learning for Cooperation in Multi-Agent Reinforcement Learning

    Authors: Rupali Bhati, Sai Krishna Gottipati, Clodéric Mars, Matthew E. Taylor

    Abstract: While there has been significant progress in curriculum learning and continuous learning for training agents to generalize across a wide variety of environments in the context of single-agent reinforcement learning, it is unclear if these algorithms would still be valid in a multi-agent setting. In a competitive setting, a learning agent can be trained by making it compete with a curriculum of inc… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

    Comments: 9 pages, 5 figures. Presented at Agent Learning in Open-Endedness Workshop at Neural Information Processing Systems (NeurIPS 2023)

  45. arXiv:2312.11718  [pdf, other

    cs.AI cs.HC cs.LG cs.MA stat.AP

    Human-Machine Teaming for UAVs: An Experimentation Platform

    Authors: Laila El Moujtahid, Sai Krishna Gottipati, Clodéric Mars, Matthew E. Taylor

    Abstract: Full automation is often not achievable or desirable in critical systems with high-stakes decisions. Instead, human-AI teams can achieve better results. To research, develop, evaluate, and validate algorithms suited for such teaming, lightweight experimentation platforms that enable interactions between humans and multiple AI agents are necessary. However, there are limited examples of such platfo… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

    Comments: 9 pages, 6 figures Presented at Conference on Artificial Intelligence for Defense (CAID) 2023

  46. arXiv:2311.14891  [pdf, other

    cs.CY

    Simpson's Paradox and Lagging Progress in Completion Trends of Underrepresented Students in Computer Science

    Authors: John Mason Taylor, Rebecca Drucker, Chris Alvin, Syed Fahad Sultan

    Abstract: It is imperative for the Computer Science (CS) community to ensure active participation and success of students from diverse backgrounds. This work compares CS to other areas of study with respect to success of students from three underrepresented groups: Women, Black and Hispanic or Latino. Using a data-driven approach, we show that trends of success over the years for underrepresented groups in… ▽ More

    Submitted 24 November, 2023; originally announced November 2023.

  47. arXiv:2311.00810  [pdf, other

    cs.CY cs.CV cs.HC

    A Call to Arms: AI Should be Critical for Social Media Analysis of Conflict Zones

    Authors: Afia Abedin, Abdul Bais, Cody Buntain, Laura Courchesne, Brian McQuinn, Matthew E. Taylor, Muhib Ullah

    Abstract: The massive proliferation of social media data represents a transformative opportunity for conflict studies and for tracking the proliferation and use of weaponry, as conflicts are increasingly documented in these online spaces. At the same time, the scale and types of data available are problematic for traditional open-source intelligence. This paper focuses on identifying specific weapon systems… ▽ More

    Submitted 14 May, 2025; v1 submitted 1 November, 2023; originally announced November 2023.

  48. Cocoon: Static Information Flow Control in Rust

    Authors: Ada Lamba, Max Taylor, Vincent Beardsley, Jacob Bambeck, Michael D. Bond, Zhiqiang Lin

    Abstract: Information flow control (IFC) provides confidentiality by enforcing noninterference, which ensures that high-secrecy values cannot affect low-secrecy values. Prior work introduces fine-grained IFC approaches that modify the programming language and use nonstandard compilation tools, impose run-time overhead, or report false secrecy leaks -- all of which hinder adoption. This paper presents Coco… ▽ More

    Submitted 18 March, 2024; v1 submitted 31 October, 2023; originally announced November 2023.

    Comments: Will be published in PACMPL(OOPSLA) in October 2024

  49. arXiv:2307.05603  [pdf, other

    cs.SE cs.LG cs.PL

    Can You Improve My Code? Optimizing Programs with Local Search

    Authors: Fatemeh Abdollahi, Saqib Ameen, Matthew E. Taylor, Levi H. S. Lelis

    Abstract: This paper introduces a local search method for improving an existing program with respect to a measurable objective. Program Optimization with Locally Improving Search (POLIS) exploits the structure of a program, defined by its lines. POLIS improves a single line of the program while keeping the remaining lines fixed, using existing brute-force synthesis algorithms, and continues iterating until… ▽ More

    Submitted 10 July, 2023; originally announced July 2023.

    Comments: International Joint Conference on Artificial Intelligence (IJCAI) 2023

  50. arXiv:2307.02666  [pdf, other

    cs.AR

    Chiplet Cloud: Building AI Supercomputers for Serving Large Generative Language Models

    Authors: Huwan Peng, Scott Davidson, Richard Shi, Shuaiwen Leon Song, Michael Taylor

    Abstract: Large language models (LLMs) such as OpenAI's ChatGPT and Google's Gemini have demonstrated unprecedented capabilities of autoregressive AI models across multiple tasks triggering disruptive technology innovations around the world. However, as models continue to grow the cost to serve these models also continues to grow threatening the democratization of LLMs. To address this issue, we propose C… ▽ More

    Submitted 20 May, 2024; v1 submitted 5 July, 2023; originally announced July 2023.