Skip to main content

Showing 1–50 of 134 results for author: Brown, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2507.01342  [pdf, ps, other

    cs.CV

    Learning Camera-Agnostic White-Balance Preferences

    Authors: Luxi Zhao, Mahmoud Afifi, Michael S. Brown

    Abstract: The image signal processor (ISP) pipeline in modern cameras consists of several modules that transform raw sensor data into visually pleasing images in a display color space. Among these, the auto white balance (AWB) module is essential for compensating for scene illumination. However, commercial AWB systems often strive to compute aesthetic white-balance preferences rather than accurate neutral c… ▽ More

    Submitted 2 July, 2025; originally announced July 2025.

  2. arXiv:2506.11140  [pdf, ps, other

    cs.CV cs.AI cs.MA

    Autonomous Computer Vision Development with Agentic AI

    Authors: Jin Kim, Muhammad Wahi-Anwa, Sangyun Park, Shawn Shin, John M. Hoffman, Matthew S. Brown

    Abstract: Agentic Artificial Intelligence (AI) systems leveraging Large Language Models (LLMs) exhibit significant potential for complex reasoning, planning, and tool utilization. We demonstrate that a specialized computer vision system can be built autonomously from a natural language prompt using Agentic AI methods. This involved extending SimpleMind (SM), an open-source Cognitive AI environment with conf… ▽ More

    Submitted 19 June, 2025; v1 submitted 10 June, 2025; originally announced June 2025.

    Comments: The paper is 13 pages long and contains 4 figures

  3. arXiv:2505.22783  [pdf, ps, other

    eess.SP cs.LG

    Temporal Convolutional Autoencoder for Interference Mitigation in FMCW Radar Altimeters

    Authors: Charles E. Thornton, Jamie Sloop, Samuel Brown, Aaron Orndorff, William C. Headley, Stephen Young

    Abstract: We investigate the end-to-end altitude estimation performance of a convolutional autoencoder-based interference mitigation approach for frequency-modulated continuous-wave (FMCW) radar altimeters. Specifically, we show that a Temporal Convolutional Network (TCN) autoencoder effectively exploits temporal correlations in the received signal, providing superior interference suppression compared to a… ▽ More

    Submitted 28 May, 2025; originally announced May 2025.

    Comments: 10 pages, 10 figures

  4. arXiv:2505.19167  [pdf

    cs.AI

    Amplifying Human Creativity and Problem Solving with AI Through Generative Collective Intelligence

    Authors: Thomas P. Kehler, Scott E. Page, Alex Pentland, Martin Reeves, John Seely Brown

    Abstract: We propose a general framework for human-AI collaboration that amplifies the distinct capabilities of both types of intelligence. We refer to this as Generative Collective Intelligence (GCI). GCI employs AI in dual roles: as interactive agents and as technology that accumulates, organizes, and leverages knowledge. In this second role, AI creates a cognitive bridge between human reasoning and AI mo… ▽ More

    Submitted 4 June, 2025; v1 submitted 25 May, 2025; originally announced May 2025.

  5. How much to Dereverberate? Low-Latency Single-Channel Speech Enhancement in Distant Microphone Scenarios

    Authors: Satvik Venkatesh, Philip Coleman, Arthur Benilov, Simon Brown, Selim Sheta, Frederic Roskam

    Abstract: Dereverberation is an important sub-task of Speech Enhancement (SE) to improve the signal's intelligibility and quality. However, it remains challenging because the reverberation is highly correlated with the signal. Furthermore, the single-channel SE literature has predominantly focused on rooms with short reverb times (typically under 1 second), smaller rooms (under volumes of 1000 cubic meters)… ▽ More

    Submitted 2 May, 2025; originally announced May 2025.

    Comments: Published in ICASSP 2025

    ACM Class: I.5.1; I.5.4

  6. arXiv:2504.07959  [pdf, other

    cs.CV

    CCMNet: Leveraging Calibrated Color Correction Matrices for Cross-Camera Color Constancy

    Authors: Dongyoung Kim, Mahmoud Afifi, Dongyun Kim, Michael S. Brown, Seon Joo Kim

    Abstract: Computational color constancy, or white balancing, is a key module in a camera's image signal processor (ISP) that corrects color casts from scene lighting. Because this operation occurs in the camera-specific raw color space, white balance algorithms must adapt to different cameras. This paper introduces a learning-based method for cross-camera color constancy that generalizes to new cameras with… ▽ More

    Submitted 10 April, 2025; originally announced April 2025.

  7. arXiv:2504.05623  [pdf, ps, other

    cs.CV

    Time-Aware Auto White Balance in Mobile Photography

    Authors: Mahmoud Afifi, Luxi Zhao, Abhijith Punnappurath, Mohammed A. Abdelsalam, Ran Zhang, Michael S. Brown

    Abstract: Cameras rely on auto white balance (AWB) to correct undesirable color casts caused by scene illumination and the camera's spectral sensitivity. This is typically achieved using an illuminant estimator that determines the global color cast solely from the color information in the camera's raw sensor image. Mobile devices provide valuable additional metadata-such as capture timestamp and geolocation… ▽ More

    Submitted 25 June, 2025; v1 submitted 7 April, 2025; originally announced April 2025.

  8. arXiv:2503.22026  [pdf, other

    cs.CV eess.IV

    Multispectral Demosaicing via Dual Cameras

    Authors: SaiKiran Tedla, Junyong Lee, Beixuan Yang, Mahmoud Afifi, Michael S. Brown

    Abstract: Multispectral (MS) images capture detailed scene information across a wide range of spectral bands, making them invaluable for applications requiring rich spectral data. Integrating MS imaging into multi camera devices, such as smartphones, has the potential to enhance both spectral applications and RGB image quality. A critical step in processing MS data is demosaicing, which reconstructs color i… ▽ More

    Submitted 8 April, 2025; v1 submitted 27 March, 2025; originally announced March 2025.

  9. arXiv:2503.14774  [pdf, other

    cs.CV

    Revisiting Image Fusion for Multi-Illuminant White-Balance Correction

    Authors: David Serrano-Lozano, Aditya Arora, Luis Herranz, Konstantinos G. Derpanis, Michael S. Brown, Javier Vazquez-Corral

    Abstract: White balance (WB) correction in scenes with multiple illuminants remains a persistent challenge in computer vision. Recent methods explored fusion-based approaches, where a neural network linearly blends multiple sRGB versions of an input image, each processed with predefined WB presets. However, we demonstrate that these methods are suboptimal for common multi-illuminant scenarios. Additionally,… ▽ More

    Submitted 18 March, 2025; originally announced March 2025.

    Comments: 10 pages

  10. arXiv:2503.08735  [pdf

    eess.IV cs.CV cs.LG

    A Bi-channel Aided Stitching of Atomic Force Microscopy Images

    Authors: Huanhuan Zhao, Ruben Millan-Solsona, Marti Checa, Spenser R. Brown, Jennifer L. Morrell-Falvey, Liam Collins, Arpan Biswas

    Abstract: Microscopy is an essential tool in scientific research, enabling the visualization of structures at micro- and nanoscale resolutions. However, the field of microscopy often encounters limitations in field-of-view (FOV), restricting the amount of sample that can be imaged in a single capture. To overcome this limitation, image stitching techniques have been developed to seamlessly merge multiple ov… ▽ More

    Submitted 13 March, 2025; v1 submitted 11 March, 2025; originally announced March 2025.

    Comments: The manuscript has 21 pages with 8 figures in main-text and 2 figures in Supplementary materials

  11. arXiv:2502.15937  [pdf, other

    cs.RO cs.AI cs.MA

    Discovery and Deployment of Emergent Robot Swarm Behaviors via Representation Learning and Real2Sim2Real Transfer

    Authors: Connor Mattson, Varun Raveendra, Ricardo Vega, Cameron Nowzari, Daniel S. Drew, Daniel S. Brown

    Abstract: Given a swarm of limited-capability robots, we seek to automatically discover the set of possible emergent behaviors. Prior approaches to behavior discovery rely on human feedback or hand-crafted behavior metrics to represent and evolve behaviors and only discover behaviors in simulation, without testing or considering the deployment of these new behaviors on real robot swarms. In this work, we pr… ▽ More

    Submitted 21 February, 2025; originally announced February 2025.

    Comments: 10 pages, 5 figures. To be included in Proc. of the 24th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2025)

  12. arXiv:2502.07158  [pdf, other

    cs.LG cs.AI

    Early Risk Prediction of Pediatric Cardiac Arrest from Electronic Health Records via Multimodal Fused Transformer

    Authors: Jiaying Lu, Stephanie R. Brown, Songyuan Liu, Shifan Zhao, Kejun Dong, Del Bold, Michael Fundora, Alaa Aljiffry, Alex Fedorov, Jocelyn Grunwell, Xiao Hu

    Abstract: Early prediction of pediatric cardiac arrest (CA) is critical for timely intervention in high-risk intensive care settings. We introduce PedCA-FT, a novel transformer-based framework that fuses tabular view of EHR with the derived textual view of EHR to fully unleash the interactions of high-dimensional risk factors and their dynamics. By employing dedicated transformer modules for each modality v… ▽ More

    Submitted 20 May, 2025; v1 submitted 10 February, 2025; originally announced February 2025.

    Journal ref: in Proceedings of 47th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC 2025)

  13. Wearable AR in Everyday Contexts: Insights from a Digital Ethnography of YouTube Videos

    Authors: Tram Thi Minh Tran, Shane Brown, Oliver Weidlich, Soojeong Yoo, Callum Parker

    Abstract: With growing investment in consumer augmented reality (AR) headsets and glasses, wearable AR is moving from niche applications to everyday use. However, current research primarily examines AR in controlled settings, offering limited insights into its use in real-world daily life. To address this gap, we adopt a digital ethnographic approach, analysing 27 hours of 112 YouTube videos featuring early… ▽ More

    Submitted 11 February, 2025; v1 submitted 10 February, 2025; originally announced February 2025.

  14. arXiv:2502.03698  [pdf, other

    cs.LG cs.CR cs.RO

    How vulnerable is my policy? Adversarial attacks on modern behavior cloning policies

    Authors: Basavasagar Patil, Akansha Kalra, Guanhong Tao, Daniel S. Brown

    Abstract: Learning from Demonstration (LfD) algorithms have shown promising results in robotic manipulation tasks, but their vulnerability to adversarial attacks remains underexplored. This paper presents a comprehensive study of adversarial attacks on both classic and recently proposed algorithms, including Behavior Cloning (BC), LSTM-GMM, Implicit Behavior Cloning (IBC), Diffusion Policy (DP), and VQ-Beha… ▽ More

    Submitted 5 February, 2025; originally announced February 2025.

  15. arXiv:2501.10561  [pdf, other

    cs.RO

    Early Failure Detection in Autonomous Surgical Soft-Tissue Manipulation via Uncertainty Quantification

    Authors: Jordan Thompson, Ronald Koe, Anthony Le, Gabriella Goodman, Daniel S. Brown, Alan Kuntz

    Abstract: Autonomous surgical robots are a promising solution to the increasing demand for surgery amid a shortage of surgeons. Recent work has proposed learning-based approaches for the autonomous manipulation of soft tissue. However, due to variability in tissue geometries and stiffnesses, these methods do not always perform optimally, especially in out-of-distribution settings. We propose, develop, and t… ▽ More

    Submitted 17 January, 2025; originally announced January 2025.

    Comments: 8 pages, 6 figures

  16. arXiv:2501.08389  [pdf, other

    cs.RO cs.HC

    Toward Zero-Shot User Intent Recognition in Shared Autonomy

    Authors: Atharv Belsare, Zohre Karimi, Connor Mattson, Daniel S. Brown

    Abstract: A fundamental challenge of shared autonomy is to use high-DoF robots to assist, rather than hinder, humans by first inferring user intent and then empowering the user to achieve their intent. Although successful, prior methods either rely heavily on a priori knowledge of all possible human intents or require many demonstrations and interactions with the human to learn these intents before being ab… ▽ More

    Submitted 14 January, 2025; originally announced January 2025.

    Comments: 10 pages, 6 figures, Accepted to IEEE/ACM International Conference on Human-Robot Interaction (HRI), 2025. Equal Contribution from the first three authors

  17. arXiv:2412.15438  [pdf, other

    cs.CV cs.AI cs.LG eess.IV

    Efficient Neural Network Encoding for 3D Color Lookup Tables

    Authors: Vahid Zehtab, David B. Lindell, Marcus A. Brubaker, Michael S. Brown

    Abstract: 3D color lookup tables (LUTs) enable precise color manipulation by mapping input RGB values to specific output RGB values. 3D LUTs are instrumental in various applications, including video editing, in-camera processing, photographic filters, computer graphics, and color processing for displays. While an individual LUT does not incur a high memory overhead, software and devices may need to store do… ▽ More

    Submitted 19 December, 2024; originally announced December 2024.

    Comments: 14 pages, 13 figures; extended version; to appear in AAAI 2025

    ACM Class: I.4.0; I.4.2; I.5.0; I.5.1; I.5.4; I.2.0; I.2.6; I.2.10

  18. arXiv:2410.16444  [pdf, other

    cs.RO eess.SY

    Agent-Based Emulation for Deploying Robot Swarm Behaviors

    Authors: Ricardo Vega, Kevin Zhu, Connor Mattson, Daniel S. Brown, Cameron Nowzari

    Abstract: Despite significant research, robotic swarms have yet to be useful in solving real-world problems, largely due to the difficulty of creating and controlling swarming behaviors in multi-agent systems. Traditional top-down approaches in which a desired emergent behavior is produced often require complex, resource-heavy robots, limiting their practicality. This paper introduces a bottom-up approach b… ▽ More

    Submitted 21 October, 2024; originally announced October 2024.

    Comments: 8 pages, 6 figures, submitted to ICRA 2025

  19. arXiv:2410.16175  [pdf, other

    cs.NE cs.MA eess.SY

    Spiking Neural Networks as a Controller for Emergent Swarm Agents

    Authors: Kevin Zhu, Connor Mattson, Shay Snyder, Ricardo Vega, Daniel S. Brown, Maryam Parsa, Cameron Nowzari

    Abstract: Drones which can swarm and loiter in a certain area cost hundreds of dollars, but mosquitos can do the same and are essentially worthless. To control swarms of low-cost robots, researchers may end up spending countless hours brainstorming robot configurations and policies to ``organically" create behaviors which do not need expensive sensors and perception. Existing research explores the possible… ▽ More

    Submitted 21 October, 2024; originally announced October 2024.

    Comments: 8 pages, 7 figures, presented at the 2024 International Conference on Neuromorphic Systems

  20. arXiv:2410.03423  [pdf, other

    eess.SP cs.LG

    Aircraft Radar Altimeter Interference Mitigation Through a CNN-Layer Only Denoising Autoencoder Architecture

    Authors: Samuel B. Brown, Stephen Young, Adam Wagenknecht, Daniel Jakubisin, Charles E. Thornton, Aaron Orndorff, William C. Headley

    Abstract: Denoising autoencoders for signal processing applications have been shown to experience significant difficulty in learning to reconstruct radio frequency communication signals, particularly in the large sample regime. In communication systems, this challenge is primarily due to the need to reconstruct the modulated data stream which is generally highly stochastic in nature. In this work, we take a… ▽ More

    Submitted 4 October, 2024; originally announced October 2024.

    Comments: To be presented at MILCOM 2024, Washington DC

  21. arXiv:2408.12633  [pdf

    cs.SD eess.AS physics.soc-ph

    Melody predominates over harmony in the evolution of musical scales across 96 countries

    Authors: John M McBride, Elizabeth Phillips, Patrick E Savage, Steven Brown, Tsvi Tlusty

    Abstract: The standard theory of musical scales since antiquity has been based on harmony, rather than melody. While recent analyses provide mixed support for a role of melody as well as harmony, we lack a comparative analysis based on cross-cultural data. We address this longstanding problem through a rigorous computational comparison of the main theories using 1,314 scales from 96 countries. There is near… ▽ More

    Submitted 2 July, 2025; v1 submitted 22 August, 2024; originally announced August 2024.

  22. arXiv:2408.05610  [pdf, other

    cs.RO cs.AI

    Representation Alignment from Human Feedback for Cross-Embodiment Reward Learning from Mixed-Quality Demonstrations

    Authors: Connor Mattson, Anurag Aribandi, Daniel S. Brown

    Abstract: We study the problem of cross-embodiment inverse reinforcement learning, where we wish to learn a reward function from video demonstrations in one or more embodiments and then transfer the learned reward to a different embodiment (e.g., different action space, dynamics, size, shape, etc.). Learning reward functions that transfer across embodiments is important in settings such as teaching a robot… ▽ More

    Submitted 10 August, 2024; originally announced August 2024.

    Comments: First Two Authors Share Equal Contribution. 19 Pages, 4 Figures

  23. arXiv:2407.09892  [pdf, other

    cs.CV

    NamedCurves: Learned Image Enhancement via Color Naming

    Authors: David Serrano-Lozano, Luis Herranz, Michael S. Brown, Javier Vazquez-Corral

    Abstract: A popular method for enhancing images involves learning the style of a professional photo editor using pairs of training images comprised of the original input with the editor-enhanced version. When manipulating images, many editing tools offer a feature that allows the user to manipulate a limited selection of familiar colors. Editing by color name allows easy adjustment of elements like the "blu… ▽ More

    Submitted 13 July, 2024; originally announced July 2024.

    Comments: European Conference on Computer Vision ECCV 2024

  24. arXiv:2406.07358  [pdf, other

    cs.AI cs.CL cs.CY cs.LG

    AI Sandbagging: Language Models can Strategically Underperform on Evaluations

    Authors: Teun van der Weij, Felix Hofstätter, Ollie Jaffe, Samuel F. Brown, Francis Rhys Ward

    Abstract: Trustworthy capability evaluations are crucial for ensuring the safety of AI systems, and are becoming a key component of AI regulation. However, the developers of an AI system, or the AI system itself, may have incentives for evaluations to understate the AI's actual capability. These conflicting interests lead to the problem of sandbagging, which we define as strategic underperformance on an eva… ▽ More

    Submitted 6 February, 2025; v1 submitted 11 June, 2024; originally announced June 2024.

  25. arXiv:2405.09733  [pdf, other

    cs.CL

    SCI 3.0: A Web-based Schema Curation Interface for Graphical Event Representations

    Authors: Reece Suchocki, Mary Martin, Martha Palmer, Susan Brown

    Abstract: To understand the complexity of global events, one must navigate a web of interwoven sub-events, identifying those most impactful elements within the larger, abstract macro-event framework at play. This concept can be extended to the field of natural language processing (NLP) through the creation of structured event schemas which can serve as representations of these abstract events. Central to ou… ▽ More

    Submitted 16 May, 2024; v1 submitted 15 May, 2024; originally announced May 2024.

  26. arXiv:2404.16244  [pdf, other

    cs.CY

    The Ethics of Advanced AI Assistants

    Authors: Iason Gabriel, Arianna Manzini, Geoff Keeling, Lisa Anne Hendricks, Verena Rieser, Hasan Iqbal, Nenad Tomašev, Ira Ktena, Zachary Kenton, Mikel Rodriguez, Seliem El-Sayed, Sasha Brown, Canfer Akbulut, Andrew Trask, Edward Hughes, A. Stevie Bergman, Renee Shelby, Nahema Marchal, Conor Griffin, Juan Mateos-Garcia, Laura Weidinger, Winnie Street, Benjamin Lange, Alex Ingerman, Alison Lentz , et al. (32 additional authors not shown)

    Abstract: This paper focuses on the opportunities and the ethical and societal risks posed by advanced AI assistants. We define advanced AI assistants as artificial agents with natural language interfaces, whose function is to plan and execute sequences of actions on behalf of a user, across one or more domains, in line with the user's expectations. The paper starts by considering the technology itself, pro… ▽ More

    Submitted 28 April, 2024; v1 submitted 24 April, 2024; originally announced April 2024.

  27. arXiv:2404.15058  [pdf, other

    cs.CY cs.AI

    A Mechanism-Based Approach to Mitigating Harms from Persuasive Generative AI

    Authors: Seliem El-Sayed, Canfer Akbulut, Amanda McCroskery, Geoff Keeling, Zachary Kenton, Zaria Jalan, Nahema Marchal, Arianna Manzini, Toby Shevlane, Shannon Vallor, Daniel Susser, Matija Franklin, Sophie Bridgers, Harry Law, Matthew Rahtz, Murray Shanahan, Michael Henry Tessler, Arthur Douillard, Tom Everitt, Sasha Brown

    Abstract: Recent generative AI systems have demonstrated more advanced persuasive capabilities and are increasingly permeating areas of life where they can influence decision-making. Generative AI presents a new risk profile of persuasion due the opportunity for reciprocal exchange and prolonged interactions. This has led to growing concerns about harms from AI persuasion and how they can be mitigated, high… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

  28. Reward Learning from Suboptimal Demonstrations with Applications in Surgical Electrocautery

    Authors: Zohre Karimi, Shing-Hei Ho, Bao Thach, Alan Kuntz, Daniel S. Brown

    Abstract: Automating robotic surgery via learning from demonstration (LfD) techniques is extremely challenging. This is because surgical tasks often involve sequential decision-making processes with complex interactions of physical objects and have low tolerance for mistakes. Prior works assume that all demonstrations are fully observable and optimal, which might not be practical in the real world. This pap… ▽ More

    Submitted 15 April, 2024; v1 submitted 10 April, 2024; originally announced April 2024.

    Comments: In proceedings of the International Symposium on Medical Robotics (ISMR) 2024. Equal contribution from two first authors

    Journal ref: 2024 International Symposium on Medical Robotics (ISMR), pp. 1-7, 2024

  29. arXiv:2404.04241  [pdf, other

    cs.RO

    Modeling Kinematic Uncertainty of Tendon-Driven Continuum Robots via Mixture Density Networks

    Authors: Jordan Thompson, Brian Y. Cho, Daniel S. Brown, Alan Kuntz

    Abstract: Tendon-driven continuum robot kinematic models are frequently computationally expensive, inaccurate due to unmodeled effects, or both. In particular, unmodeled effects produce uncertainties that arise during the robot's operation that lead to variability in the resulting geometry. We propose a novel solution to these issues through the development of a Gaussian mixture kinematic model. We train a… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

  30. arXiv:2404.02164  [pdf, other

    physics.soc-ph cs.SI econ.TH

    Exploring Correlation Patterns in the Ethereum Validator Network

    Authors: Simon Brown, Leonardo Bautista-Gomez

    Abstract: There have been several studies into measuring the level of decentralization in Ethereum through applying various indices to indicate the relative dominance of entities in different domains in the ecosystem. However, these indices do not capture any correlation between those different entities, that could potentially make them the subject of external coercion, or covert collusion. We propose an in… ▽ More

    Submitted 22 March, 2024; originally announced April 2024.

    Comments: 11 pages, 7 figures, 3 tables

  31. arXiv:2403.13793  [pdf, other

    cs.LG

    Evaluating Frontier Models for Dangerous Capabilities

    Authors: Mary Phuong, Matthew Aitchison, Elliot Catt, Sarah Cogan, Alexandre Kaskasoli, Victoria Krakovna, David Lindner, Matthew Rahtz, Yannis Assael, Sarah Hodkinson, Heidi Howard, Tom Lieberum, Ramana Kumar, Maria Abi Raad, Albert Webson, Lewis Ho, Sharon Lin, Sebastian Farquhar, Marcus Hutter, Gregoire Deletang, Anian Ruoss, Seliem El-Sayed, Sasha Brown, Anca Dragan, Rohin Shah , et al. (2 additional authors not shown)

    Abstract: To understand the risks posed by a new AI system, we must understand what it can and cannot do. Building on prior work, we introduce a programme of new "dangerous capability" evaluations and pilot them on Gemini 1.0 models. Our evaluations cover four areas: (1) persuasion and deception; (2) cyber-security; (3) self-proliferation; and (4) self-reasoning. We do not find evidence of strong dangerous… ▽ More

    Submitted 5 April, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

  32. arXiv:2403.02431  [pdf, other

    cs.RO

    Bayesian Constraint Inference from User Demonstrations Based on Margin-Respecting Preference Models

    Authors: Dimitris Papadimitriou, Daniel S. Brown

    Abstract: It is crucial for robots to be aware of the presence of constraints in order to acquire safe policies. However, explicitly specifying all constraints in an environment can be a challenging task. State-of-the-art constraint inference algorithms learn constraints from demonstrations, but tend to be computationally expensive and prone to instability issues. In this paper, we propose a novel Bayesian… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

  33. A Framework for Assurance Audits of Algorithmic Systems

    Authors: Khoa Lam, Benjamin Lange, Borhane Blili-Hamelin, Jovana Davidovic, Shea Brown, Ali Hasan

    Abstract: An increasing number of regulations propose AI audits as a mechanism for achieving transparency and accountability for artificial intelligence (AI) systems. Despite some converging norms around various forms of AI auditing, auditing for the purpose of compliance and assurance currently lacks agreed-upon practices, procedures, taxonomies, and standards. We propose the criterion audit as an operatio… ▽ More

    Submitted 28 May, 2024; v1 submitted 26 January, 2024; originally announced January 2024.

    Journal ref: The 2024 ACM Conference on Fairness, Accountability, and Transparency

  34. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1326 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 9 May, 2025; v1 submitted 18 December, 2023; originally announced December 2023.

  35. arXiv:2312.03093  [pdf, other

    cs.HC cs.AI cs.CL

    RESIN-EDITOR: A Schema-guided Hierarchical Event Graph Visualizer and Editor

    Authors: Khanh Duy Nguyen, Zixuan Zhang, Reece Suchocki, Sha Li, Martha Palmer, Susan Brown, Jiawei Han, Heng Ji

    Abstract: In this paper, we present RESIN-EDITOR, an interactive event graph visualizer and editor designed for analyzing complex events. Our RESIN-EDITOR system allows users to render and freely edit hierarchical event graphs extracted from multimedia and multi-document news clusters with guidance from human-curated event schemas. RESIN-EDITOR's unique features include hierarchical graph visualization, com… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

    Comments: The first two authors contribute equally to this paper

  36. arXiv:2311.06989  [pdf

    cs.SE cs.AI

    Creating a Discipline-specific Commons for Infectious Disease Epidemiology

    Authors: Michael M. Wagner, William Hogan, John Levander, Adam Darr, Matt Diller, Max Sibilla, Alexander T. Loiacono. Terence Sperringer, Jr., Shawn T. Brown

    Abstract: Objective: To create a commons for infectious disease (ID) epidemiology in which epidemiologists, public health officers, data producers, and software developers can not only share data and software, but receive assistance in improving their interoperability. Materials and Methods: We represented 586 datasets, 54 software, and 24 data formats in OWL 2 and then used logical queries to infer potenti… ▽ More

    Submitted 12 November, 2023; originally announced November 2023.

    Comments: 12 pages, 6 figures

  37. arXiv:2310.16941  [pdf, other

    cs.RO cs.LG cs.MA

    Exploring Behavior Discovery Methods for Heterogeneous Swarms of Limited-Capability Robots

    Authors: Connor Mattson, Jeremy C. Clark, Daniel S. Brown

    Abstract: We study the problem of determining the emergent behaviors that are possible given a functionally heterogeneous swarm of robots with limited capabilities. Prior work has considered behavior search for homogeneous swarms and proposed the use of novelty search over either a hand-specified or learned behavior space followed by clustering to return a taxonomy of emergent behaviors to the user. In this… ▽ More

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: 11 pages, 9 figures, To be published in Proceedings IEEE International Symposium on Multi-Robot & Multi-Agent Systems (MRS 2023)

  38. arXiv:2310.10610  [pdf, other

    cs.AI cs.LG cs.RO

    Quantifying Assistive Robustness Via the Natural-Adversarial Frontier

    Authors: Jerry Zhi-Yang He, Zackory Erickson, Daniel S. Brown, Anca D. Dragan

    Abstract: Our ultimate goal is to build robust policies for robots that assist people. What makes this hard is that people can behave unexpectedly at test time, potentially interacting with the robot outside its training distribution and leading to failures. Even just measuring robustness is a challenge. Adversarial perturbations are the default, but they can paint the wrong picture: they can correspond to… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

  39. arXiv:2309.11408  [pdf, other

    cs.RO eess.SY

    Indirect Swarm Control: Characterization and Analysis of Emergent Swarm Behaviors

    Authors: Ricardo Vega, Connor Mattson, Daniel S. Brown, Cameron Nowzari

    Abstract: Emergence and emergent behaviors are often defined as cases where changes in local interactions between agents at a lower level effectively changes what occurs in the higher level of the system (i.e., the whole swarm) and its properties. However, the manner in which these collective emergent behaviors self-organize is less understood. The focus of this paper is in presenting a new framework for ch… ▽ More

    Submitted 28 March, 2024; v1 submitted 20 September, 2023; originally announced September 2023.

    Comments: 8 pages, 13 figures, submitted to IROS 2024 conference

  40. arXiv:2309.04542  [pdf, other

    cs.CV

    Examining Autoexposure for Challenging Scenes

    Authors: SaiKiran Tedla, Beixuan Yang, Michael S. Brown

    Abstract: Autoexposure (AE) is a critical step applied by camera systems to ensure properly exposed images. While current AE algorithms are effective in well-lit environments with constant illumination, these algorithms still struggle in environments with bright light sources or scenes with abrupt changes in lighting. A significant hurdle in developing new AE algorithms for challenging environments, especia… ▽ More

    Submitted 8 September, 2023; originally announced September 2023.

    Comments: ICCV 2023

  41. arXiv:2307.10026  [pdf, other

    cs.LG

    Contextual Reliability: When Different Features Matter in Different Contexts

    Authors: Gaurav Ghosal, Amrith Setlur, Daniel S. Brown, Anca D. Dragan, Aditi Raghunathan

    Abstract: Deep neural networks often fail catastrophically by relying on spurious correlations. Most prior work assumes a clear dichotomy into spurious and reliable features; however, this is often unrealistic. For example, most of the time we do not want an autonomous car to simply copy the speed of surrounding cars -- we don't want our car to run a red light if a neighboring car does so. However, we canno… ▽ More

    Submitted 19 July, 2023; originally announced July 2023.

    Comments: ICML 2023 Camera Ready Version

  42. arXiv:2306.13004  [pdf, other

    cs.LG cs.AI

    Can Differentiable Decision Trees Enable Interpretable Reward Learning from Human Feedback?

    Authors: Akansha Kalra, Daniel S. Brown

    Abstract: Reinforcement Learning from Human Feedback (RLHF) has emerged as a popular paradigm for capturing human intent to alleviate the challenges of hand-crafting the reward values. Despite the increasing interest in RLHF, most works learn black box reward functions that while expressive are difficult to interpret and often require running the whole costly process of RL before we can even decipher if the… ▽ More

    Submitted 10 October, 2024; v1 submitted 22 June, 2023; originally announced June 2023.

    Report number: Reinforcement Learning Journal, vol. 4, 2024, pp. 1887--1910

  43. arXiv:2306.11920  [pdf, other

    cs.CV

    NILUT: Conditional Neural Implicit 3D Lookup Tables for Image Enhancement

    Authors: Marcos V. Conde, Javier Vazquez-Corral, Michael S. Brown, Radu Timofte

    Abstract: 3D lookup tables (3D LUTs) are a key component for image enhancement. Modern image signal processors (ISPs) have dedicated support for these as part of the camera rendering pipeline. Cameras typically provide multiple options for picture styles, where each style is usually obtained by applying a unique handcrafted 3D LUT. Current approaches for learning and applying 3D LUTs are notably fast, yet n… ▽ More

    Submitted 24 December, 2023; v1 submitted 20 June, 2023; originally announced June 2023.

    Comments: AAAI 2024 - The 38th Annual AAAI Conference on Artificial Intelligence

  44. arXiv:2306.02183  [pdf

    cs.DC q-bio.NC q-bio.QM

    brainlife.io: A decentralized and open source cloud platform to support neuroscience research

    Authors: Soichi Hayashi, Bradley A. Caron, Anibal Sólon Heinsfeld, Sophia Vinci-Booher, Brent McPherson, Daniel N. Bullock, Giulia Bertò, Guiomar Niso, Sandra Hanekamp, Daniel Levitas, Kimberly Ray, Anne MacKenzie, Lindsey Kitchell, Josiah K. Leong, Filipi Nascimento-Silva, Serge Koudoro, Hanna Willis, Jasleen K. Jolly, Derek Pisner, Taylor R. Zuidema, Jan W. Kurzawski, Kyriaki Mikellidou, Aurore Bussalb, Christopher Rorden, Conner Victory , et al. (39 additional authors not shown)

    Abstract: Neuroscience research has expanded dramatically over the past 30 years by advancing standardization and tool development to support rigor and transparency. Consequently, the complexity of the data pipeline has also increased, hindering access to FAIR (Findable, Accessible, Interoperabile, and Reusable) data analysis to portions of the worldwide research community. brainlife.io was developed to red… ▽ More

    Submitted 11 August, 2023; v1 submitted 3 June, 2023; originally announced June 2023.

  45. arXiv:2305.16148  [pdf, other

    cs.MA cs.LG cs.RO

    Leveraging Human Feedback to Evolve and Discover Novel Emergent Behaviors in Robot Swarms

    Authors: Connor Mattson, Daniel S. Brown

    Abstract: Robot swarms often exhibit emergent behaviors that are fascinating to observe; however, it is often difficult to predict what swarm behaviors can emerge under a given set of agent capabilities. We seek to efficiently leverage human input to automatically discover a taxonomy of collective behaviors that can emerge from a particular multi-agent system, without requiring the human to know beforehand… ▽ More

    Submitted 16 July, 2023; v1 submitted 25 April, 2023; originally announced May 2023.

    Comments: 13 pages, 10 figures, To be published in Proceedings Genetic and Evolutionary Computation Conference (GECCO 2023)

  46. arXiv:2305.14600  [pdf, other

    cs.CL cs.LG

    Learning Semantic Role Labeling from Compatible Label Sequences

    Authors: Tao Li, Ghazaleh Kazeminejad, Susan W. Brown, Martha Palmer, Vivek Srikumar

    Abstract: Semantic role labeling (SRL) has multiple disjoint label sets, e.g., VerbNet and PropBank. Creating these datasets is challenging, therefore a natural question is how to use each one to help the other. Prior work has shown that cross-task interaction helps, but only explored multitask learning so far. A common issue with multi-task setup is that argument sequences are still separately decoded, run… ▽ More

    Submitted 19 October, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: Accepted at Findings of EMNLP 2023

  47. arXiv:2304.14095  [pdf, other

    cs.NI

    Securing Autonomous Air Traffic Management: Blockchain Networks Driven by Explainable AI

    Authors: Louise Axon, Dimitrios Panagiotakopoulos, Samuel Ayo, Carolina Sanchez-Hernandez, Yan Zong, Simon Brown, Lei Zhang, Michael Goldsmith, Sadie Creese, Weisi Guo

    Abstract: Air Traffic Management data systems today are inefficient and not scalable to enable future unmanned systems. Current data is fragmented, siloed, and not easily accessible. There is data conflict, misuse, and eroding levels of trust in provenance and accuracy. With increased autonomy in aviation, Artificially Intelligent (AI) enabled unmanned traffic management (UTM) will be more reliant on secure… ▽ More

    Submitted 27 April, 2023; originally announced April 2023.

    Comments: under review in IEEE

  48. arXiv:2304.13907  [pdf, other

    cs.SI

    Network Analysis as a Tool for Shaping Conservation and Development Policy: A Case Study of Timber Market Optimization in India

    Authors: Xiou Ge, Sarah E. Brown, Pushpendra Rana, Lav R. Varshney, Daniel C. Miller

    Abstract: The incorporation of trees on farms can help to improve livelihoods and build resilience among small-holder farmers in developing countries. On-farm trees can help gen- erate additional income from commercial tree harvest as well as contribute significant environmental benefits and ecosystem services to increase resiliency. Long-term benefits from tree-based livelihoods, however, depend on sustain… ▽ More

    Submitted 26 April, 2023; originally announced April 2023.

    Comments: Paper accepted to proceedings of the 5th Data for Good Exchange (D4GX)

  49. arXiv:2304.11743  [pdf, other

    cs.CV

    GamutMLP: A Lightweight MLP for Color Loss Recovery

    Authors: Hoang M. Le, Brian Price, Scott Cohen, Michael S. Brown

    Abstract: Cameras and image-editing software often process images in the wide-gamut ProPhoto color space, encompassing 90% of all visible colors. However, when images are encoded for sharing, this color-rich representation is transformed and clipped to fit within the small-gamut standard RGB (sRGB) color space, representing only 30% of visible colors. Recovering the lost color information is challenging due… ▽ More

    Submitted 23 April, 2023; originally announced April 2023.

  50. Human-in-the-Loop Schema Induction

    Authors: Tianyi Zhang, Isaac Tham, Zhaoyi Hou, Jiaxuan Ren, Liyang Zhou, Hainiu Xu, Li Zhang, Lara J. Martin, Rotem Dror, Sha Li, Heng Ji, Martha Palmer, Susan Brown, Reece Suchocki, Chris Callison-Burch

    Abstract: Schema induction builds a graph representation explaining how events unfold in a scenario. Existing approaches have been based on information retrieval (IR) and information extraction(IE), often with limited human curation. We demonstrate a human-in-the-loop schema induction system powered by GPT-3. We first describe the different modules of our system, including prompting to generate schematic el… ▽ More

    Submitted 25 February, 2023; originally announced February 2023.

    Comments: 10 pages, ACL2023 demo track