Skip to main content

Showing 1–50 of 107 results for author: Williams, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2507.00909  [pdf, ps, other

    cs.DC cs.AI cs.PF eess.SY

    Turning AI Data Centers into Grid-Interactive Assets: Results from a Field Demonstration in Phoenix, Arizona

    Authors: Philip Colangelo, Ayse K. Coskun, Jack Megrue, Ciaran Roberts, Shayan Sengupta, Varun Sivaram, Ethan Tiao, Aroon Vijaykar, Chris Williams, Daniel C. Wilson, Zack MacFarland, Daniel Dreiling, Nathan Morey, Anuja Ratnayake, Baskar Vairamohan

    Abstract: Artificial intelligence (AI) is fueling exponential electricity demand growth, threatening grid reliability, raising prices for communities paying for new energy infrastructure, and stunting AI innovation as data centers wait for interconnection to constrained grids. This paper presents the first field demonstration, in collaboration with major corporate partners, of a software-only approach--Emer… ▽ More

    Submitted 1 July, 2025; originally announced July 2025.

    Comments: 10 pages, 6 figures, 1 table

  2. arXiv:2505.14599  [pdf, ps, other

    cs.CL cs.AI

    Toward Reliable Scientific Hypothesis Generation: Evaluating Truthfulness and Hallucination in Large Language Models

    Authors: Guangzhi Xiong, Eric Xie, Corey Williams, Myles Kim, Amir Hassan Shariatmadari, Sikun Guo, Stefan Bekiranov, Aidong Zhang

    Abstract: Large language models (LLMs) have shown significant potential in scientific disciplines such as biomedicine, particularly in hypothesis generation, where they can analyze vast literature, identify patterns, and suggest research directions. However, a key challenge lies in evaluating the truthfulness of generated hypotheses, as verifying their accuracy often requires substantial time and resources.… ▽ More

    Submitted 8 June, 2025; v1 submitted 20 May, 2025; originally announced May 2025.

    Comments: Accepted to IJCAI 2025

  3. arXiv:2505.01249  [pdf, other

    cs.CV cs.LG

    Fusing Foveal Fixations Using Linear Retinal Transformations and Bayesian Experimental Design

    Authors: Christopher K. I. Williams

    Abstract: Humans (and many vertebrates) face the problem of fusing together multiple fixations of a scene in order to obtain a representation of the whole, where each fixation uses a high-resolution fovea and decreasing resolution in the periphery. In this paper we explicitly represent the retinal transformation of a fixation as a linear downsampling of a high-resolution latent image of the scene, exploitin… ▽ More

    Submitted 2 May, 2025; originally announced May 2025.

    Comments: 19 pages, 4 figures

  4. arXiv:2504.20910  [pdf, other

    cs.CY cs.AI cs.HC

    When Testing AI Tests Us: Safeguarding Mental Health on the Digital Frontlines

    Authors: Sachin R. Pendse, Darren Gergle, Rachel Kornfield, Jonah Meyerhoff, David Mohr, Jina Suh, Annie Wescott, Casey Williams, Jessica Schleider

    Abstract: Red-teaming is a core part of the infrastructure that ensures that AI models do not produce harmful content. Unlike past technologies, the black box nature of generative AI systems necessitates a uniquely interactional mode of testing, one in which individuals on red teams actively interact with the system, leveraging natural language to simulate malicious actors and solicit harmful outputs. This… ▽ More

    Submitted 29 April, 2025; originally announced April 2025.

    Comments: Accepted to ACM Conference on Fairness, Accountability, and Transparency (FAccT 2025)

  5. arXiv:2502.09716  [pdf, ps, other

    cs.CY

    Principles and Policy Recommendations for Comprehensive Genetic Data Governance

    Authors: Vivek Ramanan, Ria Vinod, Cole Williams, Sohini Ramachandran, Suresh Venkatasubramanian

    Abstract: Genetic data collection has become ubiquitous, producing genetic information about health, ancestry, and social traits. However, unregulated use, especially amid evolving scientific understanding, poses serious privacy and discrimination risks. These risks are intensified by advancing AI, particularly multi-modal systems integrating genetic, clinical, behavioral, and environmental data. In this wo… ▽ More

    Submitted 30 May, 2025; v1 submitted 13 February, 2025; originally announced February 2025.

    ACM Class: K.4.1

  6. arXiv:2412.07877  [pdf, other

    stat.ML cs.LG

    Score-Optimal Diffusion Schedules

    Authors: Christopher Williams, Andrew Campbell, Arnaud Doucet, Saifuddin Syed

    Abstract: Denoising diffusion models (DDMs) offer a flexible framework for sampling from high dimensional data distributions. DDMs generate a path of probability distributions interpolating between a reference Gaussian distribution and a data distribution by incrementally injecting noise into the data. To numerically simulate the sampling process, a discretisation schedule from the reference back towards cl… ▽ More

    Submitted 10 December, 2024; originally announced December 2024.

    Comments: NeurIPS 2024 accepted paper

  7. arXiv:2409.04676  [pdf, other

    cs.HC

    Exploring Crowdworkers' Perceptions, Current Practices, and Desired Practices Regarding Using Non-Workstation Devices for Crowdwork

    Authors: Senjuti Dutta, Scott Ruoti, Rhema Linder, Alex C. Williams, Anastasia Kuzminykh

    Abstract: Despite a plethora of research dedicated to designing HITs for non-workstations, there is a lack of research looking specifically into workers' perceptions of the suitability of these devices for managing and completing work. In this work, we fill this research gap by conducting an online survey of 148 workers on Amazon Mechanical Turk to explore 1. how crowdworkers currently use their non-worksta… ▽ More

    Submitted 6 September, 2024; originally announced September 2024.

  8. arXiv:2409.04658  [pdf, other

    cs.HC

    Unveiling the Inter-Related Preferences of Crowdworkers: Implications for Personalized and Flexible Platform Design

    Authors: Senjuti Dutta, Rhema Linder, Alex C. Williams, Anastasia Kuzminykh, Scott Ruoti

    Abstract: Crowdsourcing platforms have traditionally been designed with a focus on workstation interfaces, restricting the flexibility that crowdworkers need. Recognizing this limitation and the need for more adaptable platforms, prior research has highlighted the diverse work processes of crowdworkers, influenced by factors such as device type and work stage. However, these variables have largely been stud… ▽ More

    Submitted 6 September, 2024; originally announced September 2024.

  9. arXiv:2408.07532  [pdf, other

    eess.IV cs.CV

    Improved 3D Whole Heart Geometry from Sparse CMR Slices

    Authors: Yiyang Xu, Hao Xu, Matthew Sinclair, Esther Puyol-Antón, Steven A Niederer, Amedeo Chiribiri, Steven E Williams, Michelle C Williams, Alistair A Young

    Abstract: Cardiac magnetic resonance (CMR) imaging and computed tomography (CT) are two common non-invasive imaging methods for assessing patients with cardiovascular disease. CMR typically acquires multiple sparse 2D slices, with unavoidable respiratory motion artefacts between slices, whereas CT acquires isotropic dense data but uses ionising radiation. In this study, we explore the combination of Slice S… ▽ More

    Submitted 14 August, 2024; originally announced August 2024.

    Comments: 13 pages, STACOM2024

  10. arXiv:2404.18190  [pdf, other

    cs.LG stat.ML

    Naive Bayes Classifiers and One-hot Encoding of Categorical Variables

    Authors: Christopher K. I. Williams

    Abstract: This paper investigates the consequences of encoding a $K$-valued categorical variable incorrectly as $K$ bits via one-hot encoding, when using a Naïve Bayes classifier. This gives rise to a product-of-Bernoullis (PoB) assumption, rather than the correct categorical Naïve Bayes classifier. The differences between the two classifiers are analysed mathematically and experimentally. In our experiment… ▽ More

    Submitted 28 April, 2024; originally announced April 2024.

    Comments: 7 pages, 3 figures

  11. arXiv:2404.07063  [pdf, other

    cs.RO cs.AI

    LaPlaSS: Latent Space Planning for Stochastic Systems

    Authors: Marlyse Reeves, Brian C. Williams

    Abstract: Autonomous mobile agents often operate in hazardous environments, necessitating an awareness of safety. These agents can have non-linear, stochastic dynamics that must be considered during planning to guarantee bounded risk. Most state of the art methods require closed-form dynamics to verify plan correctness and safety however modern robotic systems often have dynamics that are learned from data.… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

  12. arXiv:2403.07925  [pdf, other

    q-bio.BM cs.LG physics.chem-ph

    Physics-informed generative model for drug-like molecule conformers

    Authors: David C. Williams, Neil Inala

    Abstract: We present a diffusion-based, generative model for conformer generation. Our model is focused on the reproduction of bonded structure and is constructed from the associated terms traditionally found in classical force fields to ensure a physically relevant representation. Techniques in deep learning are used to infer atom typing and geometric parameters from a training set. Conformer sampling is a… ▽ More

    Submitted 14 March, 2024; v1 submitted 29 February, 2024; originally announced March 2024.

    Comments: To appear in the Journal of Chemical Information and Modeling

  13. arXiv:2403.02558  [pdf

    cs.CL cs.CV

    The Minimum Information about CLinical Artificial Intelligence Checklist for Generative Modeling Research (MI-CLAIM-GEN)

    Authors: Brenda Y. Miao, Irene Y. Chen, Christopher YK Williams, Jaysón Davidson, Augusto Garcia-Agundez, Shenghuan Sun, Travis Zack, Suchi Saria, Rima Arnaout, Giorgio Quer, Hossein J. Sadaei, Ali Torkamani, Brett Beaulieu-Jones, Bin Yu, Milena Gianfrancesco, Atul J. Butte, Beau Norgeot, Madhumita Sushil

    Abstract: Recent advances in generative models, including large language models (LLMs), vision language models (VLMs), and diffusion models, have accelerated the field of natural language and image processing in medicine and marked a significant paradigm shift in how biomedical models can be developed and deployed. While these models are highly adaptable to new tasks, scaling and evaluating their usage pres… ▽ More

    Submitted 11 July, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

  14. arXiv:2403.01485  [pdf, other

    stat.ML cs.CV cs.LG

    Approximations to the Fisher Information Metric of Deep Generative Models for Out-Of-Distribution Detection

    Authors: Sam Dauncey, Chris Holmes, Christopher Williams, Fabian Falck

    Abstract: Likelihood-based deep generative models such as score-based diffusion models and variational autoencoders are state-of-the-art machine learning models approximating high-dimensional distributions of data such as images, text, or audio. One of many downstream tasks they can be naturally applied to is out-of-distribution (OOD) detection. However, seminal work by Nalisnick et al. which we reproduce s… ▽ More

    Submitted 25 May, 2024; v1 submitted 3 March, 2024; originally announced March 2024.

  15. arXiv:2402.18480  [pdf, other

    cs.DC

    Libfork: portable continuation-stealing with stackless coroutines

    Authors: Conor John Williams, James Elliott

    Abstract: Fully-strict fork-join parallelism is a powerful model for shared-memory programming due to its optimal time scaling and strong bounds on memory scaling. The latter is rarely achieved due to the difficulty of implementing continuation stealing in traditional High Performance Computing (HPC) languages -- where it is often impossible without modifying the compiler or resorting to non-portable techni… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

  16. arXiv:2402.15589  [pdf, other

    cs.CL cs.AI cs.LG cs.NE

    LLMs as Meta-Reviewers' Assistants: A Case Study

    Authors: Eftekhar Hossain, Sanjeev Kumar Sinha, Naman Bansal, Alex Knipper, Souvika Sarkar, John Salvador, Yash Mahajan, Sri Guttikonda, Mousumi Akter, Md. Mahadi Hassan, Matthew Freestone, Matthew C. Williams Jr., Dongji Feng, Santu Karmaker

    Abstract: One of the most important yet onerous tasks in the academic peer-reviewing process is composing meta-reviews, which involves assimilating diverse opinions from multiple expert peers, formulating one's self-judgment as a senior expert, and then summarizing all these perspectives into a concise holistic overview to make an overall recommendation. This process is time-consuming and can be compromised… ▽ More

    Submitted 8 February, 2025; v1 submitted 23 February, 2024; originally announced February 2024.

    Comments: Accepted to NAACL 2025, 41 pages

    ACM Class: I.2.7

  17. arXiv:2402.03597  [pdf

    cs.CL cs.IR cs.LG

    Identifying Reasons for Contraceptive Switching from Real-World Data Using Large Language Models

    Authors: Brenda Y. Miao, Christopher YK Williams, Ebenezer Chinedu-Eneh, Travis Zack, Emily Alsentzer, Atul J. Butte, Irene Y. Chen

    Abstract: Prescription contraceptives play a critical role in supporting women's reproductive health. With nearly 50 million women in the United States using contraceptives, understanding the factors that drive contraceptives selection and switching is of significant interest. However, many factors related to medication switching are often only captured in unstructured clinical notes and can be difficult to… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

  18. arXiv:2312.13103  [pdf

    cs.CL cs.CV

    Exploring Multimodal Large Language Models for Radiology Report Error-checking

    Authors: Jinge Wu, Yunsoo Kim, Eva C. Keller, Jamie Chow, Adam P. Levine, Nikolas Pontikos, Zina Ibrahim, Paul Taylor, Michelle C. Williams, Honghan Wu

    Abstract: This paper proposes one of the first clinical applications of multimodal large language models (LLMs) as an assistant for radiologists to check errors in their reports. We created an evaluation dataset from real-world radiology datasets (including X-rays and CT scans). A subset of original reports was modified to contain synthetic errors by introducing three types of mistakes: "insert", "remove",… ▽ More

    Submitted 3 March, 2024; v1 submitted 20 December, 2023; originally announced December 2023.

  19. Adaptation and Communication in Human-Robot Teaming to Handle Discrepancies in Agents' Beliefs about Plans

    Authors: Yuening Zhang, Brian C. Williams

    Abstract: When agents collaborate on a task, it is important that they have some shared mental model of the task routines -- the set of feasible plans towards achieving the goals. However, in reality, situations often arise that such a shared mental model cannot be guaranteed, such as in ad-hoc teams where agents may follow different conventions or when contingent constraints arise that only some agents are… ▽ More

    Submitted 6 July, 2023; originally announced July 2023.

    Comments: 10 pages, Published at ICAPS 2023 (Main Track)

  20. arXiv:2306.09877  [pdf

    cs.CL

    Revealing the impact of social circumstances on the selection of cancer therapy through natural language processing of social work notes

    Authors: Shenghuan Sun, Travis Zack, Christopher Y. K. Williams, Atul J. Butte, Madhumita Sushil

    Abstract: We aimed to investigate the impact of social circumstances on cancer therapy selection using natural language processing to derive insights from social worker documentation. We developed and employed a Bidirectional Encoder Representations from Transformers (BERT) based approach, using a hierarchical multi-step BERT model (BERT-MS) to predict the prescription of targeted cancer therapy to patients… ▽ More

    Submitted 16 June, 2023; originally announced June 2023.

    Comments: 18 pages, 4 figures, 2 Tables

  21. arXiv:2306.03066  [pdf, other

    cs.CV cs.LG stat.ML

    Of Mice and Mates: Automated Classification and Modelling of Mouse Behaviour in Groups using a Single Model across Cages

    Authors: Michael P. J. Camilleri, Rasneer S. Bains, Christopher K. I. Williams

    Abstract: Behavioural experiments often happen in specialised arenas, but this may confound the analysis. To address this issue, we provide tools to study mice in the home-cage environment, equipping biologists with the possibility to capture the temporal aspect of the individual's behaviour and model the interaction and interdependence between cage-mates with minimal human intervention. Our main contributi… ▽ More

    Submitted 24 June, 2024; v1 submitted 5 June, 2023; originally announced June 2023.

    Comments: International Journal of Computer Vision (2024)

  22. arXiv:2306.01268  [pdf, other

    cs.CV cs.DL cs.IR

    DeepScribe: Localization and Classification of Elamite Cuneiform Signs Via Deep Learning

    Authors: Edward C. Williams, Grace Su, Sandra R. Schloen, Miller C. Prosser, Susanne Paulus, Sanjay Krishnan

    Abstract: Twenty-five hundred years ago, the paperwork of the Achaemenid Empire was recorded on clay tablets. In 1933, archaeologists from the University of Chicago's Oriental Institute (OI) found tens of thousands of these tablets and fragments during the excavation of Persepolis. Many of these tablets have been painstakingly photographed and annotated by expert cuneiformists, and now provide a rich datase… ▽ More

    Submitted 1 February, 2025; v1 submitted 2 June, 2023; originally announced June 2023.

    Comments: Accepted to ACM JOCCH

  23. arXiv:2305.19638  [pdf, other

    stat.ML cs.CV cs.LG eess.IV

    A Unified Framework for U-Net Design and Analysis

    Authors: Christopher Williams, Fabian Falck, George Deligiannidis, Chris Holmes, Arnaud Doucet, Saifuddin Syed

    Abstract: U-Nets are a go-to, state-of-the-art neural architecture across numerous tasks for continuous signals on a square such as images and Partial Differential Equations (PDE), however their design and architecture is understudied. In this paper, we provide a framework for designing and analysing general U-Net architectures. We present theoretical results which characterise the role of the encoder and d… ▽ More

    Submitted 10 January, 2024; v1 submitted 31 May, 2023; originally announced May 2023.

  24. arXiv:2303.09648  [pdf, other

    cs.CV

    Shifted-Windows Transformers for the Detection of Cerebral Aneurysms in Microsurgery

    Authors: Jinfan Zhou, William Muirhead, Simon C. Williams, Danail Stoyanov, Hani J. Marcus, Evangelos B. Mazomenos

    Abstract: Purpose: Microsurgical Aneurysm Clipping Surgery (MACS) carries a high risk for intraoperative aneurysm rupture. Automated recognition of instances when the aneurysm is exposed in the surgical video would be a valuable reference point for neuronavigation, indicating phase transitioning and more importantly designating moments of high risk for rupture. This article introduces the MACS dataset conta… ▽ More

    Submitted 16 March, 2023; originally announced March 2023.

  25. Augmenting Pathologists with NaviPath: Design and Evaluation of a Human-AI Collaborative Navigation System

    Authors: Hongyan Gu, Chunxu Yang, Mohammad Haeri, Jing Wang, Shirley Tang, Wenzhong Yan, Shujin He, Christopher Kazu Williams, Shino Magaki, Xiang 'Anthony' Chen

    Abstract: Artificial Intelligence (AI) brings advancements to support pathologists in navigating high-resolution tumor images to search for pathology patterns of interest. However, existing AI-assisted tools have not realized this promised potential due to a lack of insight into pathology and HCI considerations for pathologists' navigation workflows in practice. We first conducted a formative study with six… ▽ More

    Submitted 14 February, 2023; originally announced February 2023.

    Comments: Accepted ACM CHI Conference on Human Factors in Computing Systems (CHI '23)

  26. Structured Generative Models for Scene Understanding

    Authors: Christopher K. I. Williams

    Abstract: This position paper argues for the use of \emph{structured generative models} (SGMs) for the understanding of static scenes. This requires the reconstruction of a 3D scene from an input image (or a set of multi-view images), whereby the contents of the image(s) are causally explained in terms of models of instantiated objects, each with their own type, shape, appearance and pose, along with global… ▽ More

    Submitted 2 September, 2024; v1 submitted 7 February, 2023; originally announced February 2023.

    Comments: 32 pages, 10 figures

    Journal ref: International Journal of Computer Vision, 2024

  27. arXiv:2301.08187  [pdf, other

    stat.ML cs.CV cs.LG eess.SP

    A Multi-Resolution Framework for U-Nets with Applications to Hierarchical VAEs

    Authors: Fabian Falck, Christopher Williams, Dominic Danks, George Deligiannidis, Christopher Yau, Chris Holmes, Arnaud Doucet, Matthew Willetts

    Abstract: U-Net architectures are ubiquitous in state-of-the-art deep learning, however their regularisation properties and relationship to wavelets are understudied. In this paper, we formulate a multi-resolution framework which identifies U-Nets as finite-dimensional truncations of models on an infinite-dimensional function space. We provide theoretical results which prove that average pooling corresponds… ▽ More

    Submitted 19 January, 2023; originally announced January 2023.

    Comments: NeurIPS 2022 (selected as oral)

  28. arXiv:2211.01634  [pdf, other

    cs.RO cs.AI cs.CV

    P4P: Conflict-Aware Motion Prediction for Planning in Autonomous Driving

    Authors: Qiao Sun, Xin Huang, Brian C. Williams, Hang Zhao

    Abstract: Motion prediction is crucial in enabling safe motion planning for autonomous vehicles in interactive scenarios. It allows the planner to identify potential conflicts with other traffic agents and generate safe plans. Existing motion predictors often focus on reducing prediction errors, yet it remains an open question on how well they help identify the conflicts for the planner. In this paper, we e… ▽ More

    Submitted 3 November, 2022; originally announced November 2022.

    Comments: 7 pages, 4 figures, 3 tables

  29. arXiv:2211.00192  [pdf, other

    cs.DB

    AI Assistants: A Framework for Semi-Automated Data Wrangling

    Authors: Tomas Petricek, Gerrit J. J. van den Burg, Alfredo Nazábal, Taha Ceritli, Ernesto Jiménez-Ruiz, Christopher K. I. Williams

    Abstract: Data wrangling tasks such as obtaining and linking data from various sources, transforming data formats, and correcting erroneous records, can constitute up to 80% of typical data engineering work. Despite the rise of machine learning and artificial intelligence, data wrangling remains a tedious and manual task. We introduce AI assistants, a class of semi-automatic interactive tools to streamline… ▽ More

    Submitted 31 October, 2022; originally announced November 2022.

    Comments: Accepted for publication in IEEE Transactions on Knowledge and Data Engineering

  30. Incorporating Crowdsourced Annotator Distributions into Ensemble Modeling to Improve Classification Trustworthiness for Ancient Greek Papyri

    Authors: Graham West, Matthew I. Swindall, Ben Keener, Timothy Player, Alex C. Williams, James H. Brusuelas, John F. Wallin

    Abstract: Performing classification on noisy, crowdsourced image datasets can prove challenging even for the best neural networks. Two issues which complicate the problem on such datasets are class imbalance and ground-truth uncertainty in labeling. The AL-ALL and AL-PUB datasets - consisting of tightly cropped, individual characters from images of ancient Greek papyri - are strongly affected by both issues… ▽ More

    Submitted 26 January, 2024; v1 submitted 28 October, 2022; originally announced October 2022.

    Journal ref: Journal of Data Mining & Digital Humanities, Historical Documents and automatic text recognition, Digital humanities in languages (February 7, 2024) jdmdh:10297

  31. arXiv:2210.14413  [pdf, other

    cs.RO

    InterSim: Interactive Traffic Simulation via Explicit Relation Modeling

    Authors: Qiao Sun, Xin Huang, Brian C. Williams, Hang Zhao

    Abstract: Interactive traffic simulation is crucial to autonomous driving systems by enabling testing for planners in a more scalable and safe way compared to real-world road testing. Existing approaches learn an agent model from large-scale driving data to simulate realistic traffic scenarios, yet it remains an open question to produce consistent and diverse multi-agent interactive behaviors in crowded sce… ▽ More

    Submitted 25 October, 2022; originally announced October 2022.

    Comments: Accepted at IROS 2022. Author version with 8 pages, 4 figures, and 2 tables. Code and demo available at paper website: https://tsinghua-mars-lab.github.io/InterSim/

  32. arXiv:2210.04221  [pdf, other

    stat.ME cs.IT math.ST

    The Elliptical Quartic Exponential Distribution: An Annular Distribution Obtained via Maximum Entropy

    Authors: Christopher K I Williams

    Abstract: This paper describes the Elliptical Quartic Exponential distribution in $\mathbb{R}^D$, obtained via a maximum entropy construction by imposing second and fourth moment constraints. I discuss relationships to related work, analytical expressions for the normalization constant and the entropy, and the conditional and marginal distributions.

    Submitted 9 October, 2022; originally announced October 2022.

    Comments: 6 pages, 1 figure

  33. arXiv:2210.04023  [pdf, other

    cs.LG

    Multi-Task Dynamical Systems

    Authors: Alex Bird, Christopher K. I. Williams, Christopher Hawthorne

    Abstract: Time series datasets are often composed of a variety of sequences from the same domain, but from different entities, such as individuals, products, or organizations. We are interested in how time series models can be specialized to individual sequences (capturing the specific characteristics) while still retaining statistical power by sharing commonalities across the sequences. This paper describe… ▽ More

    Submitted 8 October, 2022; originally announced October 2022.

    Comments: 52 pages, 17 figures

    Journal ref: Journal of Machine Learning Research 23 (2022)

  34. arXiv:2210.00058  [pdf, other

    cs.CR cs.AR

    Hardware Trojan Threats to Cache Coherence in Modern 2.5D Chiplet Systems

    Authors: Gino A. Chacon, Charles Williams, Johann Knechtel, Ozgur Sinanoglu, Paul V. Gratz

    Abstract: As industry moves toward chiplet-based designs, the insertion of hardware Trojans poses a significant threat to the security of these systems. These systems rely heavily on cache coherence for coherent data communication, making coherence an attractive target. Critically, unlike prior work, which focuses only on malicious packet modifications, a Trojan attack that exploits coherence can modify dat… ▽ More

    Submitted 30 September, 2022; originally announced October 2022.

  35. Inference and Learning for Generative Capsule Models

    Authors: Alfredo Nazabal, Nikolaos Tsagkas, Christopher K. I. Williams

    Abstract: Capsule networks (see e.g. Hinton et al., 2018) aim to encode knowledge of and reason about the relationship between an object and its parts. In this paper we specify a generative model for such data, and derive a variational algorithm for inferring the transformation of each model object in a scene, and the assignments of observed parts to the objects. We derive a learning algorithm for the objec… ▽ More

    Submitted 21 October, 2022; v1 submitted 7 September, 2022; originally announced September 2022.

    Comments: 31 pages, 6 figures. This paper extends our previous work (arxiv:2103.06676) by covering the learning of the models as well as inference. Paper accepted for publication in Neural Computation

    Journal ref: Neural Computation 35(4) (2023) 727-761

  36. arXiv:2208.12437  [pdf, other

    cs.CV

    Detecting Mitoses with a Convolutional Neural Network for MIDOG 2022 Challenge

    Authors: Hongyan Gu, Mohammad Haeri, Shuo Ni, Christopher Kazu Williams, Neda Zarrin-Khameh, Shino Magaki, Xiang 'Anthony' Chen

    Abstract: This work presents a mitosis detection method with only one vanilla Convolutional Neural Network (CNN). Our method consists of two steps: given an image, we first apply a CNN using a sliding window technique to extract patches that have mitoses; we then calculate each extracted patch's class activation map to obtain the mitosis's precise location. To increase the model performance on high-domain-v… ▽ More

    Submitted 30 October, 2022; v1 submitted 26 August, 2022; originally announced August 2022.

    Comments: 3 pages, 2 figures

  37. arXiv:2205.05228  [pdf, other

    cs.AI

    Hierarchical Constrained Stochastic Shortest Path Planning via Cost Budget Allocation

    Authors: Sungkweon Hong, Brian C. Williams

    Abstract: Stochastic sequential decision making often requires hierarchical structure in the problem where each high-level action should be further planned with primitive states and actions. In addition, many real-world applications require a plan that satisfies constraints on the secondary costs such as risk measure or fuel consumption. In this paper, we propose a hierarchical constrained stochastic shorte… ▽ More

    Submitted 10 May, 2022; originally announced May 2022.

  38. arXiv:2204.03408  [pdf, other

    eess.IV cs.CV q-bio.NC

    Surface Vision Transformers: Flexible Attention-Based Modelling of Biomedical Surfaces

    Authors: Simon Dahan, Hao Xu, Logan Z. J. Williams, Abdulah Fawaz, Chunhui Yang, Timothy S. Coalson, Michelle C. Williams, David E. Newby, A. David Edwards, Matthew F. Glasser, Alistair A. Young, Daniel Rueckert, Emma C. Robinson

    Abstract: Recent state-of-the-art performances of Vision Transformers (ViT) in computer vision tasks demonstrate that a general-purpose architecture, which implements long-range self-attention, could replace the local feature learning operations of convolutional neural networks. In this paper, we extend ViTs to surfaces by reformulating the task of surface learning as a sequence-to-sequence learning problem… ▽ More

    Submitted 7 April, 2022; originally announced April 2022.

    Comments: 10 pages, 3 figures, Submitted to IEEE Transactions on Medical Imaging

  39. On Suspicious Coincidences and Pointwise Mutual Information

    Authors: Christopher K. I. Williams

    Abstract: Barlow (1985) hypothesized that the co-occurrence of two events $A$ and $B$ is "suspicious" if $P(A,B) \gg P(A) P(B)$. We first review classical measures of association for $2 \times 2$ contingency tables, including Yule's $Y$ (Yule, 1912), which depends only on the odds ratio $λ$, and is independent of the marginal probabilities of the table. We then discuss the mutual information (MI) and pointw… ▽ More

    Submitted 2 March, 2023; v1 submitted 15 March, 2022; originally announced March 2022.

    Comments: 9 pages, 1 figure. Addendum added March 2023

    Journal ref: Neural Computation 34(10) 2037-2046 (2022)

  40. arXiv:2203.04694  [pdf, other

    cs.CV cs.AI cs.LG stat.ML

    Align-Deform-Subtract: An Interventional Framework for Explaining Object Differences

    Authors: Cian Eastwood, Li Nanbo, Christopher K. I. Williams

    Abstract: Given two object images, how can we explain their differences in terms of the underlying object properties? To address this question, we propose Align-Deform-Subtract (ADS) -- an interventional framework for explaining object differences. By leveraging semantic alignments in image-space as counterfactual interventions on the underlying object properties, ADS iteratively quantifies and removes diff… ▽ More

    Submitted 20 July, 2022; v1 submitted 9 March, 2022; originally announced March 2022.

    Comments: ICLR 2022 Workshop on Objects, Structure and Causality

  41. arXiv:2203.02475  [pdf, other

    cs.RO cs.AI

    Cooperative Task and Motion Planning for Multi-Arm Assembly Systems

    Authors: Jingkai Chen, Jiaoyang Li, Yijiang Huang, Caelan Garrett, Dawei Sun, Chuchu Fan, Andreas Hofmann, Caitlin Mueller, Sven Koenig, Brian C. Williams

    Abstract: Multi-robot assembly systems are becoming increasingly appealing in manufacturing due to their ability to automatically, flexibly, and quickly construct desired structural designs. However, effectively planning for these systems in a manner that ensures each robot is simultaneously productive, and not idle, is challenging due to (1) the close proximity that the robots must operate in to manipulate… ▽ More

    Submitted 4 March, 2022; originally announced March 2022.

    Comments: 8 pages, 6 figures, 1 table

  42. arXiv:2202.11884  [pdf, other

    cs.RO cs.AI cs.CV

    M2I: From Factored Marginal Trajectory Prediction to Interactive Prediction

    Authors: Qiao Sun, Xin Huang, Junru Gu, Brian C. Williams, Hang Zhao

    Abstract: Predicting future motions of road participants is an important task for driving autonomously in urban scenes. Existing models excel at predicting marginal trajectories for single agents, yet it remains an open question to jointly predict scene compliant trajectories over multiple agents. The challenge is due to exponentially increasing prediction space as a function of the number of agents. In thi… ▽ More

    Submitted 27 March, 2022; v1 submitted 23 February, 2022; originally announced February 2022.

    Comments: Accepted at CVPR 2022. Author version with 15 pages, 8 figures, and 3 tables. Code and demo available at paper website: https://tsinghua-mars-lab.github.io/M2I/

  43. Persistent Animal Identification Leveraging Non-Visual Markers

    Authors: Michael P. J. Camilleri, Li Zhang, Rasneer S. Bains, Andrew Zisserman, Christopher K. I. Williams

    Abstract: Our objective is to locate and provide a unique identifier for each mouse in a cluttered home-cage environment through time, as a precursor to automated behaviour recognition for biological research. This is a very challenging problem due to (i) the lack of distinguishing visual features for each mouse, and (ii) the close confines of the scene with constant occlusion, making standard visual tracki… ▽ More

    Submitted 19 July, 2023; v1 submitted 13 December, 2021; originally announced December 2021.

    Journal ref: Machine Vision and Applications 34, 68 (2023)

  44. arXiv:2111.11959  [pdf, other

    cs.LG cs.DB

    Identifying the Units of Measurement in Tabular Data

    Authors: Taha Ceritli, Christopher K. I. Williams

    Abstract: We consider the problem of identifying the units of measurement in a data column that contains both numeric values and unit symbols in each row, e.g., "5.2 l", "7 pints". In this case we seek to identify the dimension of the column (e.g. volume) and relate the unit symbols to valid units (e.g. litre, pint) obtained from a knowledge graph. Below we present PUC, a Probabilistic Unit Canonicalizer th… ▽ More

    Submitted 23 November, 2021; originally announced November 2021.

  45. arXiv:2111.11956  [pdf, other

    cs.LG cs.DB

    ptype-cat: Inferring the Type and Values of Categorical Variables

    Authors: Taha Ceritli, Christopher K. I. Williams

    Abstract: Type inference is the task of identifying the type of values in a data column and has been studied extensively in the literature. Most existing type inference methods support data types such as Boolean, date, float, integer and string. However, these methods do not consider non-Boolean categorical variables, where there are more than two possible values encoded by integers or strings. Therefore, s… ▽ More

    Submitted 23 November, 2021; originally announced November 2021.

  46. arXiv:2110.08750  [pdf, other

    cs.RO cs.AI cs.LG

    TIP: Task-Informed Motion Prediction for Intelligent Vehicles

    Authors: Xin Huang, Guy Rosman, Ashkan Jasour, Stephen G. McGill, John J. Leonard, Brian C. Williams

    Abstract: When predicting trajectories of road agents, motion predictors usually approximate the future distribution by a limited number of samples. This constraint requires the predictors to generate samples that best support the task given task specifications. However, existing predictors are often optimized and evaluated via task-agnostic measures without accounting for the use of predictions in downstre… ▽ More

    Submitted 26 May, 2022; v1 submitted 17 October, 2021; originally announced October 2021.

    Comments: 9 pages, 5 figures, 5 tables

  47. arXiv:2110.02344  [pdf, other

    cs.RO cs.AI cs.LG

    HYPER: Learned Hybrid Trajectory Prediction via Factored Inference and Adaptive Sampling

    Authors: Xin Huang, Guy Rosman, Igor Gilitschenski, Ashkan Jasour, Stephen G. McGill, John J. Leonard, Brian C. Williams

    Abstract: Modeling multi-modal high-level intent is important for ensuring diversity in trajectory prediction. Existing approaches explore the discrete nature of human intent before predicting continuous trajectories, to improve accuracy and support explainability. However, these approaches often assume the intent to remain fixed over the prediction horizon, which is problematic in practice, especially over… ▽ More

    Submitted 5 October, 2021; originally announced October 2021.

    Comments: 12 pages, 10 figures, 4 tables

  48. arXiv:2109.09975  [pdf, other

    cs.LG cs.AI cs.RO

    Fast nonlinear risk assessment for autonomous vehicles using learned conditional probabilistic models of agent futures

    Authors: Ashkan Jasour, Xin Huang, Allen Wang, Brian C. Williams

    Abstract: This paper presents fast non-sampling based methods to assess the risk for trajectories of autonomous vehicles when probabilistic predictions of other agents' futures are generated by deep neural networks (DNNs). The presented methods address a wide range of representations for uncertain predictions including both Gaussian and non-Gaussian mixture models to predict both agent positions and control… ▽ More

    Submitted 22 September, 2021; v1 submitted 21 September, 2021; originally announced September 2021.

    Comments: Accepted at Autonomous Robots. Author version, with 11 pages, 5 figures, 2 tables. Journal extension of "Fast Risk Assessment for Autonomous Vehicles Using Learned Models of Agent Futures" (Wang et al. RSS 2020, arXiv:2005.13458)

  49. arXiv:2107.05446  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Source-Free Adaptation to Measurement Shift via Bottom-Up Feature Restoration

    Authors: Cian Eastwood, Ian Mason, Christopher K. I. Williams, Bernhard Schölkopf

    Abstract: Source-free domain adaptation (SFDA) aims to adapt a model trained on labelled data in a source domain to unlabelled data in a target domain without access to the source-domain data during adaptation. Existing methods for SFDA leverage entropy-minimization techniques which: (i) apply only to classification; (ii) destroy model calibration; and (iii) rely on the source model achieving a good level o… ▽ More

    Submitted 17 March, 2022; v1 submitted 12 July, 2021; originally announced July 2021.

    Comments: ICLR 2022 (Spotlight)

  50. arXiv:2106.03216  [pdf, other

    cs.LG stat.ML

    On Memorization in Probabilistic Deep Generative Models

    Authors: Gerrit J. J. van den Burg, Christopher K. I. Williams

    Abstract: Recent advances in deep generative models have led to impressive results in a variety of application domains. Motivated by the possibility that deep learning models might memorize part of the input data, there have been increased efforts to understand how memorization arises. In this work, we extend a recently proposed measure of memorization for supervised learning (Feldman, 2019) to the unsuperv… ▽ More

    Submitted 29 December, 2021; v1 submitted 6 June, 2021; originally announced June 2021.

    Comments: Accepted for publication at NeurIPS 2021

    MSC Class: 68T07