Search | arXiv e-print repository

Large Language Models Miss the Multi-Agent Mark

Authors: Emanuele La Malfa, Gabriele La Malfa, Samuele Marro, Jie M. Zhang, Elizabeth Black, Michael Luck, Philip Torr, Michael Wooldridge

Abstract: Recent interest in Multi-Agent Systems of Large Language Models (MAS LLMs) has led to an increase in frameworks leveraging multiple LLMs to tackle complex tasks. However, much of this literature appropriates the terminology of MAS without engaging with its foundational principles. In this position paper, we highlight critical discrepancies between MAS theory and current MAS LLMs implementations, focusing on four key areas: the social aspect of agency, environment design, coordination and communication protocols, and measuring emergent behaviours. Our position is that many MAS LLMs lack multi-agent characteristics such as autonomy, social interaction, and structured environments, and often rely on oversimplified, LLM-centric architectures. The field may slow down and lose traction by revisiting problems the MAS literature has already addressed. Therefore, we systematically analyse this issue and outline associated research opportunities; we advocate for better integrating established MAS concepts and more precise terminology to avoid mischaracterisation and missed opportunities.

Submitted 11 June, 2025; v1 submitted 27 May, 2025; originally announced May 2025.

arXiv:2505.15229 [pdf, other]

Multilingual Prompting for Improving LLM Generation Diversity

Authors: Qihan Wang, Shidong Pan, Tal Linzen, Emily Black

Abstract: Large Language Models (LLMs) are known to lack cultural representation and overall diversity in their generations, from expressing opinions to answering factual questions. To mitigate this problem, we propose multilingual prompting: a prompting method which generates several variations of a base prompt with added cultural and linguistic cues from several cultures, generates responses, and then combines the results. Building on evidence that LLMs have language-specific knowledge, multilingual prompting seeks to increase diversity by activating a broader range of cultural knowledge embedded in model training data. Through experiments across multiple models (GPT-4o, GPT-4o-mini, LLaMA 70B, and LLaMA 8B), we show that multilingual prompting consistently outperforms existing diversity-enhancing techniques such as high-temperature sampling, step-by-step recall, and personas prompting. Further analyses show that the benefits of multilingual prompting vary with language resource level and model size, and that aligning the prompting language with the cultural cues reduces hallucination about culturally-specific information.

Submitted 21 May, 2025; originally announced May 2025.

arXiv:2502.03953 [pdf, other]

Fairness Aware Reinforcement Learning via Proximal Policy Optimization

Authors: Gabriele La Malfa, Jie M. Zhang, Michael Luck, Elizabeth Black

Abstract: Fairness in multi-agent systems (MAS) focuses on equitable reward distribution among agents in scenarios involving sensitive attributes such as race, gender, or socioeconomic status. This paper introduces fairness in Proximal Policy Optimization (PPO) with a penalty term derived from demographic parity, counterfactual fairness, and conditional statistical parity. The proposed method balances reward maximisation with fairness by integrating two penalty components: a retrospective component that minimises disparities in past outcomes and a prospective component that ensures fairness in future decision-making. We evaluate our approach in the Allelopathic Harvest game, a cooperative and competitive MAS focused on resource collection, where some agents possess a sensitive attribute. Experiments demonstrate that fair-PPO achieves fairer policies across all fairness metrics than classic PPO. Fairness comes at the cost of reduced rewards, namely the Price of Fairness, although agents with and without the sensitive attribute renounce comparable amounts of rewards. Additionally, the retrospective and prospective penalties effectively change the agents' behaviour and improve fairness. These findings underscore the potential of fair-PPO to address fairness challenges in MAS.

Submitted 6 February, 2025; originally announced February 2025.

arXiv:2501.15634 [pdf, other]

Be Intentional About Fairness!: Fairness, Size, and Multiplicity in the Rashomon Set

Authors: Gordon Dai, Pavan Ravishankar, Rachel Yuan, Daniel B. Neill, Emily Black

Abstract: When selecting a model from a set of equally performant models, how much unfairness can you really reduce? Is it important to be intentional about fairness when choosing among this set, or is arbitrarily choosing among the set of "good" models good enough? Recent work has highlighted that the phenomenon of model multiplicity, where multiple models with nearly identical predictive accuracy exist for the same task, has both positive and negative implications for fairness, from strengthening the enforcement of civil rights law in AI systems to showcasing arbitrariness in AI decision-making. Despite the enormous implications of model multiplicity, there is little work that explores the properties of sets of equally accurate models, or Rashomon sets, in general. In this paper, we present five main theoretical and methodological contributions which help us to understand the relatively unexplored properties of the Rashomon set, in particular with regard to fairness. Our contributions include methods for efficiently sampling models from this set and techniques for identifying the fairest models according to key fairness metrics such as statistical parity. We also derive the probability that an individual's prediction will be flipped within the Rashomon set, as well as expressions for the set's size and the distribution of error tolerance used across models. These results lead to policy-relevant takeaways, such as the importance of intentionally looking for fair models within the Rashomon set, and understanding which individuals or groups may be more susceptible to arbitrary decisions.

Submitted 26 January, 2025; originally announced January 2025.

Comments: 34 pages
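
As a rough illustration of the idea in the abstract above, the following sketch filters a list of candidate models down to a Rashomon set (all models whose error is within a tolerance of the best one) and then picks the member with the smallest statistical parity gap. The dictionary keys `name`, `error`, and `parity_gap`, the function names, and the numbers are hypothetical placeholders, not taken from the paper, and the paper's own sampling and selection methods are not reproduced here.

```python
def rashomon_set(models, epsilon):
    """Return the models whose error is within epsilon of the best error."""
    best_error = min(m["error"] for m in models)
    return [m for m in models if m["error"] <= best_error + epsilon]


def fairest_model(models, epsilon):
    """Pick the Rashomon-set member with the smallest absolute parity gap."""
    candidates = rashomon_set(models, epsilon)
    return min(candidates, key=lambda m: abs(m["parity_gap"]))


# Made-up candidates: (hypothetical) error rate and statistical parity gap.
candidates = [
    {"name": "A", "error": 0.10, "parity_gap": 0.20},
    {"name": "B", "error": 0.11, "parity_gap": 0.05},
    {"name": "C", "error": 0.20, "parity_gap": 0.00},
]

good_models = rashomon_set(candidates, epsilon=0.02)
chosen = fairest_model(candidates, epsilon=0.02)
print([m["name"] for m in good_models], chosen["name"])
```

Model C is the fairest overall but falls outside the tolerance, so an intentional search inside the set selects B rather than the most accurate model A.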

arXiv:2412.21052 [pdf, other]

Towards Effective Discrimination Testing for Generative AI

Authors: Thomas P. Zollo, Nikita Rajaneesh, Richard Zemel, Talia B. Gillis, Emily Black

Abstract: Generative AI (GenAI) models present new challenges in regulating against discriminatory behavior. In this paper, we argue that GenAI fairness research still has not met these challenges; instead, a significant gap remains between existing bias assessment methods and regulatory goals. This leads to ineffective regulation that can allow deployment of reportedly fair, yet actually discriminatory, GenAI systems. Towards remedying this problem, we connect the legal and technical literature around GenAI bias evaluation and identify areas of misalignment. Through four case studies, we demonstrate how this misalignment between fairness testing techniques and regulatory goals can result in discriminatory outcomes in real-world deployments, especially in adaptive or complex environments. We offer practical recommendations for improving discrimination testing to better align with regulatory goals and enhance the reliability of fairness assessments in future deployments.

Submitted 30 December, 2024; originally announced December 2024.

Comments: 38 pages, 9 tables, 8 figures

arXiv:2410.12889 [pdf, other]

Using Protected Attributes to Consider Fairness in Multi-Agent Systems

Authors: Gabriele La Malfa, Jie M. Zhang, Michael Luck, Elizabeth Black

Abstract: Fairness in Multi-Agent Systems (MAS) has been extensively studied, particularly in reward distribution among agents in scenarios such as goods allocation, resource division, lotteries, and bargaining systems. Fairness in MAS depends on various factors, including the system's governing rules, the behaviour of the agents, and their characteristics. Yet, fairness in human society often involves evaluating disparities between disadvantaged and privileged groups, guided by principles of Equality, Diversity, and Inclusion (EDI). Taking inspiration from the work on algorithmic fairness, which addresses bias in machine learning-based decision-making, we define protected attributes for MAS as characteristics that should not disadvantage an agent in terms of its expected rewards. We adapt fairness metrics from the algorithmic fairness literature -- namely, demographic parity, counterfactual fairness, and conditional statistical parity -- to the multi-agent setting, where self-interested agents interact within an environment. These metrics allow us to evaluate the fairness of MAS, with the ultimate aim of designing MAS that do not disadvantage agents based on protected attributes.

Submitted 16 October, 2024; originally announced October 2024.
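
One possible reading of demographic parity in this setting is that agents with and without a protected attribute should receive similar average rewards. The sketch below computes that gap from a list of per-agent rewards. The function name, the input layout, and the reward values are assumptions made for illustration; the abstract does not state the paper's exact formula.

```python
from statistics import mean


def demographic_parity_gap(rewards, protected):
    """Absolute difference in mean reward between the two agent groups.

    rewards:   one total reward per agent
    protected: one boolean per agent, True if the agent has the attribute
    """
    with_attr = [r for r, p in zip(rewards, protected) if p]
    without_attr = [r for r, p in zip(rewards, protected) if not p]
    if not with_attr or not without_attr:
        raise ValueError("both groups must contain at least one agent")
    return abs(mean(with_attr) - mean(without_attr))


# Made-up rewards for four agents; the last two carry the protected attribute.
gap = demographic_parity_gap([10.0, 12.0, 6.0, 8.0], [False, False, True, True])
print(gap)
```

A gap of zero would mean both groups earn the same reward on average; larger values indicate a larger disparity under this simplified measure.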

arXiv:2408.12029 [pdf]

Federated Diabetes Prediction in Canadian Adults Using Real-world Cross-Province Primary Care Data

Authors: Guojun Tang, Jason E. Black, Tyler S. Williamson, Steve H. Drew

Abstract: Integrating Electronic Health Records (EHR) and the application of machine learning present opportunities for enhancing the accuracy and accessibility of data-driven diabetes prediction. In particular, developing data-driven machine learning models can provide early identification of patients with high risk for diabetes, potentially leading to more effective therapeutic strategies and reduced healthcare costs. However, regulatory restrictions create barriers to developing centralized predictive models. This paper addresses the challenges by introducing a federated learning approach, which amalgamates predictive models without centralized data storage and processing, thus avoiding privacy issues. This marks the first application of federated learning to predict diabetes using real clinical datasets in Canada extracted from the Canadian Primary Care Sentinel Surveillance Network (CPCSSN) without cross-province patient data sharing. We address class-imbalance issues through downsampling techniques and compare federated learning performance against province-based and centralized models. Experimental results show that the federated MLP model presents a similar or higher performance compared to the model trained with the centralized approach. However, the federated logistic regression model showed inferior performance compared to its centralized peer.

Submitted 21 August, 2024; originally announced August 2024.

Comments: 10 pages

arXiv:2407.02209 [pdf, other]

Generative Monoculture in Large Language Models

Authors: Fan Wu, Emily Black, Varun Chandrasekaran

Abstract: We introduce generative monoculture, a behavior observed in large language models (LLMs) characterized by a significant narrowing of model output diversity relative to available training data for a given task: for example, generating only positive book reviews for books with a mixed reception. While in some cases, generative monoculture enhances performance (e.g., LLMs more often produce efficient code), the dangers are exacerbated in others (e.g., LLMs refuse to share diverse opinions). As LLMs are increasingly used in high-impact settings such as education and web search, careful maintenance of LLM output diversity is essential to ensure a variety of facts and perspectives are preserved over time. We experimentally demonstrate the prevalence of generative monoculture through analysis of book review and code generation tasks, and find that simple countermeasures such as altering sampling or prompting strategies are insufficient to mitigate the behavior. Moreover, our results suggest that the root causes of generative monoculture are likely embedded within the LLM's alignment processes, suggesting a need for developing fine-tuning paradigms that preserve or promote diversity.

Submitted 2 July, 2024; originally announced July 2024.

arXiv:2406.06936 [pdf, other]

Random Shadows of Fixed Polytopes

Authors: Alexander E. Black, Francisco Criado

Abstract: Estimating the number of vertices of a two dimensional projection, called a shadow, of a polytope is a fundamental tool for understanding the performance of the shadow simplex method for linear programming among other applications. We prove multiple upper bounds on the expected number of vertices of a random shadow of a fixed polytope. Our bounds are in terms of various parameters in the literature including geometric diameter and edge lengths, minimal and maximal slack, maximal coordinates for lattice polytopes, and maximum absolute values of subdeterminants. For the case of geometric diameter and edge lengths, we prove lower bounds and argue that our upper and lower bounds are both tight for zonotopes.

Submitted 11 June, 2024; originally announced June 2024.

Comments: 14 pages, 1 figure

MSC Class: 52B12; 52B55; 52A22

arXiv:2406.06817 [pdf, other]

The Legal Duty to Search for Less Discriminatory Algorithms

Authors: Emily Black, Logan Koepke, Pauline Kim, Solon Barocas, Mingwei Hsu

Abstract: Work in computer science has established that, contrary to conventional wisdom, for a given prediction problem there are almost always multiple possible models with equivalent performance--a phenomenon often termed model multiplicity. Critically, different models of equivalent performance can produce different predictions for the same individual, and, in aggregate, exhibit different levels of impacts across demographic groups. Thus, when an algorithmic system displays a disparate impact, model multiplicity suggests that developers could discover an alternative model that performs equally well, but has less discriminatory impact. Indeed, the promise of model multiplicity is that an equally accurate, but less discriminatory algorithm (LDA) almost always exists. But without dedicated exploration, it is unlikely developers will discover potential LDAs. Model multiplicity and the availability of LDAs have significant ramifications for the legal response to discriminatory algorithms, in particular for disparate impact doctrine, which has long taken into account the availability of alternatives with less disparate effect when assessing liability. A close reading of legal authorities over the decades reveals that the law has on numerous occasions recognized that the existence of a less discriminatory alternative is sometimes relevant to a defendant's burden of justification at the second step of disparate impact analysis. Indeed, under disparate impact doctrine, it makes little sense to say that a given algorithmic system used by an employer, creditor, or housing provider is "necessary" if an equally accurate model that exhibits less disparate effect is available and possible to discover with reasonable effort. As a result, we argue that the law should place a duty of a reasonable search for LDAs on entities that develop and deploy predictive models in covered civil rights domains.

Submitted 10 June, 2024; originally announced June 2024.

Comments: Non-archival work presented at FAccT 2024. Short version of a law review paper with the same authors: "Less Discriminatory Algorithms", forthcoming in Georgetown Law Journal, October 2024. (Available at: https://papers.ssrn.com/sol3/papers.cfm?abstract_id=4590481 )

ACM Class: K.4.1; C.4

arXiv:2405.08506 [pdf, other]

From linear programming to colliding particles

Authors: Alexander E. Black, Niklas Lütjeharms, Raman Sanyal

Abstract: Although simplices are trivial from a linear optimization standpoint, the simplex algorithm can exhibit quite complex behavior. In this paper we study the behavior of max-slope pivot rules on (products of) simplices and describe the associated pivot rule polytopes. For simplices, the pivot rule polytopes are combinatorially isomorphic to associahedra. To prove this correspondence, we interpret max-slope pivot rules in terms of the combinatorics of colliding particles on a line. For prisms over simplices, we recover Stasheff's multiplihedra. For products of two simplices we get new realizations of constrainahedra, that capture the combinatorics of certain particle systems in the plane.

Submitted 14 May, 2024; originally announced May 2024.

Comments: 20 pages, 13 figures

MSC Class: 90C05; 90C57; 52B12; 52B11

arXiv:2403.04886 [pdf, other]

Exponential Lower Bounds for Many Pivot Rules for the Simplex Method

Authors: Alexander E. Black

Abstract: The existence of a pivot rule for the simplex method that guarantees a strongly polynomial run-time is a longstanding, fundamental open problem in the theory of linear programming. The leading pivot rule in theory is the shadow pivot rule, which solves a linear program by projecting the feasible region onto a polygon. It has been shown to perform in expected strongly polynomial time on uniformly random instances and in smoothed analysis. In practice, the pivot rule of choice is the steepest edge rule, which normalizes the set of improving neighbors and then chooses a maximally improving normalized neighbor. Exponential lower bounds are known for both rules in worst-case analysis. However, for the shadow simplex method, all exponential examples were only proven for one choice of projection, and for the steepest edge rule, the lower bounds were only proven for the Euclidean norm. In this work, we construct linear programs for which any choice of projection for shadow rule variants will lead to an exponential run-time and exponential examples for any choice of norm for a steepest edge variant.

Submitted 8 May, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

Comments: 14 pages, 4 figures, changed name from previous version

MSC Class: 90C05; 52B12; 52B11

arXiv:2312.13025 [pdf, other]

Effect of molecular rotation and concentration on the adsorption of pentacene molecules on two-dimensional monolayer transition metal dichalcogenides

Authors: Edward Black, Juliana Morbec

Abstract: Heterostructures composed of pentacene (PEN) molecules and transition metal dichalcogenides (TMDs) are promising materials for small, flexible and lightweight photovoltaic devices and various other optoelectronic applications. The effects of changing concentration and orientation of adsorbed pentacene molecules on two-dimensional monolayer substrates of TMDs, namely MoS$_2$, MoSe$_2$, WS$_2$ and WSe$_2$, were investigated using first-principles calculations based on density functional theory. We examined the structural and electronic properties of the corresponding PEN/TMD heterostructures and compared these between differing pentacene concentrations and the orientations of pentacene with respect to the underlying substrate crystal structure. We analyse the band alignment of the heterostructures and demonstrate a concentration-dependent staggered-to-straddling (type-II to type-I) band gap transition in PEN/MoSe$_2$.

Submitted 20 December, 2023; originally announced December 2023.

arXiv:2310.01679 [pdf, other]

Estimating and Implementing Conventional Fairness Metrics With Probabilistic Protected Features

Authors: Hadi Elzayn, Emily Black, Patrick Vossler, Nathanael Jo, Jacob Goldin, Daniel E. Ho

Abstract: The vast majority of techniques to train fair models require access to the protected attribute (e.g., race, gender), either at train time or in production. However, in many important applications this protected attribute is largely unavailable. In this paper, we develop methods for measuring and reducing fairness violations in a setting with limited access to protected attribute labels. Specifically, we assume access to protected attribute labels on a small subset of the dataset of interest, but only probabilistic estimates of protected attribute labels (e.g., via Bayesian Improved Surname Geocoding) for the rest of the dataset. With this setting in mind, we propose a method to estimate bounds on common fairness metrics for an existing model, as well as a method for training a model to limit fairness violations by solving a constrained non-convex optimization problem. Unlike similar existing approaches, our methods take advantage of contextual information -- specifically, the relationships between a model's predictions and the probabilistic prediction of protected attributes, given the true protected attribute, and vice versa -- to provide tighter bounds on the true disparity. We provide an empirical illustration of our methods using voting data. First, we show our measurement method can bound the true disparity up to 5.5x tighter than previous methods in these applications. Then, we demonstrate that our training technique effectively reduces disparity while incurring lesser fairness-accuracy trade-offs than other fair optimization methods with limited access to protected attributes.

Submitted 2 October, 2023; originally announced October 2023.

arXiv:2309.17337 [pdf, other]

doi 10.1145/3617694.3623259

Toward Operationalizing Pipeline-aware ML Fairness: A Research Agenda for Developing Practical Guidelines and Tools

Authors: Emily Black, Rakshit Naidu, Rayid Ghani, Kit T. Rodolfa, Daniel E. Ho, Hoda Heidari

Abstract: While algorithmic fairness is a thriving area of research, in practice, mitigating issues of bias often gets reduced to enforcing an arbitrarily chosen fairness metric, either by enforcing fairness constraints during the optimization step, post-processing model outputs, or by manipulating the training data. Recent work has called on the ML community to take a more holistic approach to tackle fairness issues by systematically investigating the many design choices made through the ML pipeline, and identifying interventions that target the issue's root cause, as opposed to its symptoms. While we share the conviction that this pipeline-based approach is the most appropriate for combating algorithmic unfairness on the ground, we believe there are currently very few methods of operationalizing this approach in practice. Drawing on our experience as educators and practitioners, we first demonstrate that without clear guidelines and toolkits, even individuals with specialized ML knowledge find it challenging to hypothesize how various design choices influence model behavior. We then consult the fair-ML literature to understand the progress to date toward operationalizing the pipeline-aware approach: we systematically collect and organize the prior work that attempts to detect, measure, and mitigate various sources of unfairness through the ML pipeline. We utilize this extensive categorization of previous contributions to sketch a research agenda for the community. We hope this work serves as the stepping stone toward a more comprehensive set of resources for ML researchers, practitioners, and students interested in exploring, designing, and testing pipeline-oriented approaches to algorithmic fairness.

Submitted 29 September, 2023; originally announced September 2023.

Comments: EAAMO'23 (Archival)

arXiv:2304.08619 [pdf, ps, other]

doi 10.1039/D3CP01895D

Interaction between pentacene molecules and monolayer transition metal dichalcogenides

Authors: E. Black, P. Kratzer, J. M. Morbec

Abstract: Using first-principles calculations based on density-functional theory, we investigated the adsorption of pentacene molecules on monolayer two-dimensional transition metal dichalcogenides (TMD). We considered the four most popular TMDs, namely, MoS$_2$, MoSe$_2$, WS$_2$ and WSe$_2$, and we examined the structural and electronic properties of pentacene/TMD systems. We discuss how monolayer pentacene interacts with the TMDs, and how this interaction affects the charge transfer and work function of the heterostructure. We also analyse the type of band alignment formed in the heterostructure and how it is affected by molecule-molecule and molecule-substrate interactions. Such analysis is valuable since pentacene/TMD heterostructures are considered to be promising for application in flexible, thin and lightweight photovoltaics and photodetectors.

Submitted 17 April, 2023; originally announced April 2023.

Comments: 13 pages, 4 figures

arXiv:2304.05318 [pdf, ps, other]

Sampling planar tanglegrams and pairs of disjoint triangulations

Authors: Alexander E. Black, Kevin Liu, Alex Mcdonough, Garrett Nelson, Michael C. Wigal, Mei Yin, Youngho Yoo

Abstract: A tanglegram consists of two rooted binary trees and a perfect matching between their leaves, and a planar tanglegram is one that admits a layout with no crossings. We show that the problem of generating planar tanglegrams uniformly at random reduces to the corresponding problem for irreducible planar tanglegram layouts, which are known to be in bijection with pairs of disjoint triangulations of a convex polygon. We extend the flip operation on a single triangulation to a flip operation on pairs of disjoint triangulations. Interestingly, the resulting flip graph is both connected and regular, and hence a random walk on this graph converges to the uniform distribution. We also show that the restriction of the flip graph to the pairs with a fixed triangulation in either coordinate is connected, and give diameter bounds that are near optimal. Our results furthermore yield new insight into the flip graph of triangulations of a convex $n$-gon with a geometric interpretation on the associahedron.

Submitted 11 April, 2023; originally announced April 2023.

Comments: 16 pages, 8 figures

MSC Class: 05C05; 05C30

arXiv:2302.09194 [pdf, other]

Realizable Standard Young Tableaux

Authors: Igor Araujo, Alexander E. Black, Amanda Burcroff, Yibo Gao, Robert A. Krueger, Alex McDonough

Abstract: Given two vectors $u$ and $v$, their outer sum is given by the matrix $A$ with entries $A_{ij} = u_{i} + v_{j}$. If the entries of $u$ and $v$ are increasing and sufficiently generic, the total ordering of the entries of the matrix is a standard Young tableau of rectangular shape. We call standard Young tableaux arising in this way realizable. The set of realizable tableaux was defined by Mallows and Vanderbei for studying a deconvolution algorithm, but we show they have appeared in many other contexts including sorting algorithms, quantum computing, random sorting networks, reflection arrangements, fiber polytopes, and Goodman and Pollack's theory of allowable sequences. In our work, we prove tight bounds on the asymptotic number of realizable rectangular tableaux. We also derive tight asymptotics for the number of realizable allowable sequences, which are in bijection with realizable staircase-shaped standard Young tableaux with the notion of realizability coming from the theory of sorting networks. As a consequence, we resolve an open question of Angel, Gorin, and Holroyd from 2012 and improve upon a 1986 result of Goodman and Pollack.

Submitted 17 February, 2023; originally announced February 2023.

Comments: 19 pages, 7 figures

MSC Class: 05A16; 20F55; 52C35

arXiv:2302.03977 [pdf, other]

On the Circuit Diameter Conjecture for Counterexamples to the Hirsch Conjecture

Authors: Alexander E. Black, Steffen Borgwardt, Matthias Brugger

Abstract: Circuit diameters of polyhedra are a fundamental tool for studying the complexity of circuit augmentation schemes for linear programming and for finding lower bounds on combinatorial diameters. The main open problem in this area is the circuit diameter conjecture, the analogue of the Hirsch conjecture in the circuit setting. A natural question is whether the well-known counterexamples to the Hirsch conjecture carry over. Previously, Stephen and Yusun showed that the Klee-Walkup counterexample to the unbounded Hirsch conjecture does not transfer to the circuit setting. Our main contribution is to show that the original counterexamples for the other variants, for bounded polytopes and using monotone walks, also do not transfer. Our results rely on new observations on structural properties of these counterexamples. To resolve the bounded case, we exploit the geometry of certain $2$-faces of the polytopes underlying all known bounded Hirsch counterexamples in Santos' work. For Todd's monotone Hirsch counterexample, we provide two alternative approaches. The first one uses sign-compatible circuit walks, and the second one uses the observation that Todd's polytope is anti-blocking. Along the way, we enumerate all linear programs over the polytope and find four new orientations that contradict the monotone Hirsch conjecture, while the remaining $7107$ satisfy the bound.

Submitted 8 April, 2024; v1 submitted 8 February, 2023; originally announced February 2023.

MSC Class: 52B05; 52B12; 90C05

arXiv:2207.12221 [pdf, other]

Underlying Flag Polymatroids

Authors: Alexander E. Black, Raman Sanyal

Abstract: We describe a natural geometric relationship between matroids and underlying flag matroids by relating the geometry of the greedy algorithm to monotone path polytopes. This perspective allows us to generalize the construction of underlying flag matroids to polymatroids. We show that the polytopes associated to underlying flag polymatroids are simple by proving that they are normally equivalent to certain nestohedra. We use this to show that polymatroids realized by subspace arrangements give rise to smooth toric varieties in flag varieties and we interpret our construction in terms of toric quotients. We give various examples that illustrate the rich combinatorial structure of flag polymatroids. Finally, we study general monotone paths on polymatroid polytopes that relate to the enumeration of certain Young tableaux.

Submitted 24 June, 2024; v1 submitted 25 July, 2022; originally announced July 2022.

Comments: 31 pages, 4 figures. New title, improved exposition, minor fixes; final version

arXiv:2206.09875 [pdf, other]

Algorithmic Fairness and Vertical Equity: Income Fairness with IRS Tax Audit Models

Authors: Emily Black, Hadi Elzayn, Alexandra Chouldechova, Jacob Goldin, Daniel E. Ho

Abstract: This study examines issues of algorithmic fairness in the context of systems that inform tax audit selection by the United States Internal Revenue Service (IRS). While the field of algorithmic fairness has developed primarily around notions of treating like individuals alike, we instead explore the concept of vertical equity -- appropriately accounting for relevant differences across individuals -- which is a central component of fairness in many public policy settings. Applied to the design of the U.S. individual income tax system, vertical equity relates to the fair allocation of tax and enforcement burdens across taxpayers of different income levels. Through a unique collaboration with the Treasury Department and IRS, we use access to anonymized individual taxpayer microdata, risk-selected audits, and random audits from 2010-14 to study vertical equity in tax administration. In particular, we assess how the use of modern machine learning methods for selecting audits may affect vertical equity. First, we show how the use of more flexible machine learning (classification) methods -- as opposed to simpler models -- shifts audit burdens from high to middle-income taxpayers. Second, we show that while existing algorithmic fairness techniques can mitigate some disparities across income, they can incur a steep cost to performance. Third, we show that the choice of whether to treat risk of underreporting as a classification or regression problem is highly consequential. Moving from classification to regression models to predict underreporting shifts audit burden substantially toward high income individuals, while increasing revenue. Last, we explore the role of differential audit cost in shaping the audit distribution. We show that a narrow focus on return-on-investment can undermine vertical equity. Our results have implications for the design of algorithmic tools across the public sector.

Submitted 20 June, 2022; originally announced June 2022.

arXiv:2204.09129 [pdf, ps, other]

Small Shadows of Lattice Polytopes

Authors: Alexander E. Black

Abstract: The diameter of the graph of a $d$-dimensional lattice polytope $P \subseteq [0,k]^{n}$ is known to be at most $dk$ due to work by Kleinschmidt and Onn. However, it is an open question whether the monotone diameter, the shortest guaranteed length of a monotone path, of a $d$-dimensional lattice polytope $P = \{\mathbf{x}: A\mathbf{x} \leq \mathbf{b}\} \subseteq [0,k]^{n}$ is bounded by a polynomial in $d$ and $k$. This question is of particular interest in linear optimization, since paths traced by the Simplex method must be monotone. We introduce partial results in this direction including a monotone diameter bound of $3d$ for $k = 2$, a monotone diameter bound of $(d-1)m+1$ for $d$-dimensional $(m+1)$-level polytopes, a pivot rule such that the Simplex method is guaranteed to take at most $dnk||A||_{\infty}$ non-degenerate steps to solve an LP on $P$, and a bound of $dk$ for lengths of paths from certain fixed starting points. Finally, we present a constructive approach to a diameter bound of $(3/2)dk$ and describe how to translate this final bound into an algorithm that solves a linear program by tracing such a path.

Submitted 19 April, 2022; originally announced April 2022.

Comments: 11 pages

MSC Class: 52B05; 52B20; 90C05 ACM Class: G.2

arXiv:2203.06060 [pdf, other]

ROOD-MRI: Benchmarking the robustness of deep learning segmentation models to out-of-distribution and corrupted data in MRI

Authors: Lyndon Boone, Mahdi Biparva, Parisa Mojiri Forooshani, Joel Ramirez, Mario Masellis, Robert Bartha, Sean Symons, Stephen Strother, Sandra E. Black, Chris Heyn, Anne L. Martel, Richard H. Swartz, Maged Goubran

Abstract: Deep artificial neural networks (DNNs) have moved to the forefront of medical image analysis due to their success in classification, segmentation, and detection challenges. A principal challenge in large-scale deployment of DNNs in neuroimage analysis is the potential for shifts in signal-to-noise ratio, contrast, resolution, and presence of artifacts from site to site due to variances in scanners and acquisition protocols. DNNs are famously susceptible to these distribution shifts in computer vision. Currently, there are no benchmarking platforms or frameworks to assess the robustness of new and existing models to specific distribution shifts in MRI, and accessible multi-site benchmarking datasets are still scarce or task-specific. To address these limitations, we propose ROOD-MRI: a platform for benchmarking the Robustness of DNNs to Out-Of-Distribution (OOD) data, corruptions, and artifacts in MRI. The platform provides modules for generating benchmarking datasets using transforms that model distribution shifts in MRI, implementations of newly derived benchmarking metrics for image segmentation, and examples for using the methodology with new models and tasks. We apply our methodology to hippocampus, ventricle, and white matter hyperintensity segmentation in several large studies, providing the hippocampus dataset as a publicly available benchmark. By evaluating modern DNNs on these datasets, we demonstrate that they are highly susceptible to distribution shifts and corruptions in MRI. We show that while data augmentation strategies can substantially improve robustness to OOD data for anatomical segmentation tasks, modern DNNs using augmentation still lack robustness in more challenging lesion-based segmentation tasks. We finally benchmark U-Nets and transformer-based models, finding consistent differences in robustness to particular classes of transforms across architectures.

Submitted 11 March, 2022; originally announced March 2022.

Comments: 30 pages, 13 figures. For associated GitHub repository, see https://github.com/AICONSlab/roodmri

arXiv:2201.05134 [pdf, other]

The Polyhedral Geometry of Pivot Rules and Monotone Paths

Authors: Alexander E. Black, Jesús A. De Loera, Niklas Lütjeharms, Raman Sanyal

Abstract: Motivated by the analysis of the performance of the simplex method, we study the behavior of families of pivot rules of linear programs. We introduce normalized-weight pivot rules, which are fundamental for the following reasons: First, they are memory-less, in the sense that the pivots are governed by local information encoded by an arborescence. Second, many of the most used pivot rules belong to that class, and we show this subclass is critical for understanding the complexity of all pivot rules. Finally, normalized-weight pivot rules can be parametrized in a natural continuous manner. We show the existence of two polytopes, the pivot rule polytopes and the neighbotopes, that capture the behavior of normalized-weight pivot rules on polytopes and linear programs. We explain their face structure in terms of multi-arborescences. We compute upper bounds on the number of coherent arborescences, that is, vertices of our polytopes. Beyond optimization, our constructions provide new perspectives on classical geometric combinatorics. We introduce a normalized-weight pivot rule, which we call the max-slope pivot rule, that generalizes the shadow-vertex pivot rule. The corresponding pivot rule polytopes and neighbotopes refine monotone path polytopes of Billera--Sturmfels. Moreover, special cases of our polytopes yield permutahedra, associahedra, and multiplihedra. For the greatest improvement pivot rules we draw connections to sweep polytopes and polymatroids.

Submitted 13 January, 2022; originally announced January 2022.

Comments: 27 pages, 6 figures

arXiv:2112.02899 [pdf, other]

Reduced-bias estimation of the residual dependence index with unnamed marginals

Authors: Jennifer Israelsson, Emily Black, Claudia Neves, David Walshaw

Abstract: This paper addresses important weaknesses in current methodology for the estimation of multivariate extreme event distributions. The estimation of the residual dependence index $η\in (0,1]$ is notoriously problematic. We introduce a flexible class of reduced-bias estimators for this parameter, designed to ameliorate the usual problems of threshold selection through a unified approach to familiar marginal standardisations. We derive the asymptotic properties of the proposed class of gradient estimators for $η$. Their efficiency stems from a hitherto neglected exponentially decaying term in the characterisation of the asymptotic independence based on the theory of regular variation. Simulation studies demonstrate the finite-sample efficacy of the new gradient estimation across a wealth of bivariate distributions belonging to some max-domain of attraction that enjoy the asymptotic independence property. Our leading application illustrates how asymptotic independence can be discerned from monsoon-related rainfall occurrences at different locations in Ghana. The considerations involved in extending this framework to the estimation of the extreme value index attached to univariate domains of attraction associated with heavy-tailed distributions are briefly discussed.

Submitted 2 May, 2025; v1 submitted 6 December, 2021; originally announced December 2021.

MSC Class: 60F17; 62G32; 62A99; 62P12

arXiv:2111.12623 [pdf, other]

Rheological dynamics of active Myxococcus xanthus populations during development

Authors: Matthew E. Black, Joshua W. Shaevitz

Abstract: The bacterium Myxococcus xanthus produces multicellular protective droplets called fruiting bodies when starved. These structures form initially through the active dewetting of cells into surface-bound droplets, where substantial flows of the material are needed as the fruiting bodies grow and become round. These dynamics are followed by a primitive developmental process in which the fluid-like droplets of motile cells mature into mechanically-resilient mounds of non-motile spores that can resist significant mechanical perturbation from the external environment. To date, the mechanical properties of fruiting bodies and the changes in cellular behavior that lead to maturation have not been studied. We use atomic force microscopy to probe the rheology of droplets throughout their development and find that relaxation occurs on two time scales, $\sim$1~s and $\sim$100~s. We use a two-element Maxwell-Wiechert model to quantify the viscoelastic relaxation and find that at early developmental times, cellular motility is responsible for the flow of the material but that this flow ceases when cells stop moving and become non-motile spores. Later in development there is a dramatic increase in the modulus of the droplet as cells sporulate and the fruiting body matures, resulting in a mostly elastic structure that can protect spores from harsh environmental insult.

Submitted 24 November, 2021; originally announced November 2021.

arXiv:2111.08230 [pdf, other]

Selective Ensembles for Consistent Predictions

Authors: Emily Black, Klas Leino, Matt Fredrikson

Abstract: Recent work has shown that models trained to the same objective, and which achieve similar measures of accuracy on consistent test data, may nonetheless behave very differently on individual predictions. This inconsistency is undesirable in high-stakes contexts, such as medical diagnosis and finance. We show that this inconsistent behavior extends beyond predictions to feature attributions, which may likewise have negative implications for the intelligibility of a model, and one's ability to find recourse for subjects. We then introduce selective ensembles to mitigate such inconsistencies by applying hypothesis testing to the predictions of a set of models trained using randomly-selected starting conditions; importantly, selective ensembles can abstain in cases where a consistent outcome cannot be achieved up to a specified confidence level. We prove that prediction disagreement between selective ensembles is bounded, and empirically demonstrate that selective ensembles achieve consistent predictions and feature attributions while maintaining low abstention rates. On several benchmark datasets, selective ensembles reach zero inconsistently predicted points, with abstention rates as low as 1.5%.

Submitted 16 November, 2021; originally announced November 2021.

Comments: Preprint

arXiv:2111.02523 [pdf, other]

Adding Safety Rules to Surgeon-Authored VR Training

Authors: Ruiliang Gao, Sergei Kurenov, Erik W. Black, Jorg Peters

Abstract: Introduction: Safety criteria in surgical VR training are typically hard-coded and informally summarized. The Virtual Reality (VR) content creation interface, TIPS-author, for the Toolkit for Illustration of Procedures in Surgery (TIPS) allows surgeon-educators (SEs) to create laparoscopic VR-training modules with force feedback. TIPS-author initializes anatomy shape and physical properties selected by the SE accessing a cloud database of physics-enabled pieces of anatomy. Methods: A new addition to TIPS-author is a set of safety rules that are set by the SE and are automatically monitored during simulation. Errors are recorded as visual snapshots for feedback to the trainee. This paper reports on the implementation and opportunistic evaluation of the snapshot mechanism as a trainee feedback mechanism. TIPS was field tested at two surgical conferences, one before and one after adding the snapshot feature. Results: While other ratings of TIPS remained unchanged for an overall Likert scale score of 5.24 out of 7 (7 equals very useful), the rating of the statement `The TIPS interface helps learners understand the force necessary to explore the anatomy' improved from 5.04 to 5.35 out of 7 after the snapshot mechanism was added. Conclusions: The ratings indicate the viability of the TIPS open-source SE-authored surgical training units. Presenting SE-determined procedural missteps via the snapshot mechanism at the end of the training increases acceptance

Submitted 3 November, 2021; originally announced November 2021.

Comments: How do I migrate this to cs.HC ? I need the identifier for a deadline and

arXiv:2110.03109 [pdf, other]

Consistent Counterfactuals for Deep Models

Authors: Emily Black, Zifan Wang, Matt Fredrikson, Anupam Datta

Abstract: Counterfactual examples are one of the most commonly-cited methods for explaining the predictions of machine learning models in key areas such as finance and medical diagnosis. Counterfactuals are often discussed under the assumption that the model on which they will be used is static, but in deployment models may be periodically retrained or fine-tuned. This paper studies the consistency of model prediction on counterfactual examples in deep networks under small changes to initial training conditions, such as weight initialization and leave-one-out variations in data, as often occurs during model deployment. We demonstrate experimentally that counterfactual examples for deep models are often inconsistent across such small changes, and that increasing the cost of the counterfactual, a stability-enhancing mitigation suggested by prior work in the context of simpler models, is not a reliable heuristic in deep networks. Rather, our analysis shows that a model's local Lipschitz continuity around the counterfactual is key to its consistency across related models. To this end, we propose Stable Neighbor Search as a way to generate more consistent counterfactual explanations, and illustrate the effectiveness of this approach on several benchmark datasets.

Submitted 6 October, 2021; originally announced October 2021.

arXiv:2107.12850 [pdf, other]

doi 10.6028/NIST.IR.8397

Guidelines on Minimum Standards for Developer Verification of Software

Authors: Paul E. Black, Barbara Guttman, Vadim Okun

Abstract: Executive Order (EO) 14028, "Improving the Nation's Cybersecurity", 12 May 2021, directs the National Institute of Standards and Technology (NIST) to recommend minimum standards for software testing within 60 days. This document describes eleven recommendations for software verification techniques as well as providing supplemental information about the techniques and references for further information. It recommends the following techniques: threat modeling to look for design-level security issues; automated testing for consistency and to minimize human effort; static code scanning to look for top bugs; heuristic tools to look for possible hardcoded secrets; use of built-in checks and protections; "black box" test cases; code-based structural test cases; historical test cases; fuzzing; web app scanners, if applicable; and addressing included code (libraries, packages, services). The document does not address the totality of software verification, but instead, recommends techniques that are broadly applicable and form the minimum standards. The document was developed by NIST in consultation with the National Security Agency (NSA). Additionally, we received input from numerous outside organizations through papers submitted to a NIST workshop on the Executive Order held in early June 2021, discussion at the workshop, as well as follow up with several of the submitters.

Submitted 7 October, 2021; v1 submitted 27 July, 2021; originally announced July 2021.

Comments: 33 pages, 2 figures

Report number: NIST IR 8397

arXiv:2107.10171 [pdf, other]

Leave-one-out Unfairness

Authors: Emily Black, Matt Fredrikson

Abstract: We introduce leave-one-out unfairness, which characterizes how likely a model's prediction for an individual will change due to the inclusion or removal of a single other person in the model's training data. Leave-one-out unfairness appeals to the idea that fair decisions are not arbitrary: they should not be based on the chance event of any one person's inclusion in the training data. Leave-one-out unfairness is closely related to algorithmic stability, but it focuses on the consistency of an individual point's prediction outcome over unit changes to the training data, rather than the error of the model in aggregate. Beyond formalizing leave-one-out unfairness, we characterize the extent to which deep models behave leave-one-out unfairly on real data, including in cases where the generalization error is small. Further, we demonstrate that adversarial training and randomized smoothing techniques have opposite effects on leave-one-out fairness, which sheds light on the relationships between robustness, memorization, individual fairness, and leave-one-out fairness in deep models. Finally, we discuss salient practical applications that may be negatively affected by leave-one-out unfairness.

Submitted 21 July, 2021; originally announced July 2021.

Comments: FAccT '21

ACM Class: I.2.0; K.4.0

Journal ref: FAccT '21: Proceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency 2021, Pages 285-295

arXiv:1906.09218 [pdf, other]

doi 10.1145/3351095.3372845

FlipTest: Fairness Testing via Optimal Transport

Authors: Emily Black, Samuel Yeom, Matt Fredrikson

Abstract: We present FlipTest, a black-box technique for uncovering discrimination in classifiers. FlipTest is motivated by the intuitive question: had an individual been of a different protected status, would the model have treated them differently? Rather than relying on causal information to answer this question, FlipTest leverages optimal transport to match individuals in different protected groups, creating similar pairs of in-distribution samples. We show how to use these instances to detect discrimination by constructing a "flipset": the set of individuals whose classifier output changes post-translation, which corresponds to the set of people who may be harmed because of their group membership. To shed light on why the model treats a given subgroup differently, FlipTest produces a "transparency report": a ranking of features that are most associated with the model's behavior on the flipset. Evaluating the approach on three case studies, we show that this provides a computationally inexpensive way to identify subgroups that may be harmed by model discrimination, including in cases where the model satisfies group fairness criteria.

Submitted 6 December, 2019; v1 submitted 21 June, 2019; originally announced June 2019.

Comments: Accepted to ACM FAT* 2020; The first two authors contributed equally

arXiv:1812.08999 [pdf, other]

Feature-Wise Bias Amplification

Authors: Klas Leino, Emily Black, Matt Fredrikson, Shayak Sen, Anupam Datta

Abstract: We study the phenomenon of bias amplification in classifiers, wherein a machine learning model learns to predict classes with a greater disparity than the underlying ground truth. We demonstrate that bias amplification can arise via an inductive bias in gradient descent methods that results in the overestimation of the importance of moderately-predictive "weak" features if insufficient training data is available. This overestimation gives rise to feature-wise bias amplification -- a previously unreported form of bias that can be traced back to the features of a trained model. Through analysis and experiments, we show that while some bias cannot be mitigated without sacrificing accuracy, feature-wise bias amplification can be mitigated through targeted feature selection. We present two new feature selection algorithms for mitigating bias amplification in linear models, and show how they can be adapted to convolutional neural networks efficiently. Our experiments on synthetic and real data demonstrate that these algorithms consistently lead to reduced bias without harming accuracy, in some cases eliminating predictive bias altogether while providing modest gains in accuracy.

Submitted 21 October, 2019; v1 submitted 21 December, 2018; originally announced December 2018.

Comments: Published in ICLR 2019

arXiv:1805.11212 [pdf, ps, other]

doi 10.1103/PhysRevD.99.124053

Effect of quantum and thermal jitter on the feasibility of Bekenstein's proposed experiment to search for Planck-scale signals

Authors: G. Jordan Maclay, S. A. Wadood, Eric D. Black, Peter W. Milonni

Abstract: A proposed experiment to test whether space is discretized [J. D. Bekenstein, Phys. Rev. D 86, 124040 (2012); Found. Phys. 44, 452 (2014)] is based on the supposed impossibility of an incident photon causing a displacement of a transparent block by less than the Planck length. An analysis of the quantum and thermal jitter of the block shows that it greatly diminishes the possibility that the experiment could reveal Planck-scale signals.

Submitted 9 June, 2019; v1 submitted 28 May, 2018; originally announced May 2018.

Comments: 5 pages, accepted for publication in Physical Review D

Journal ref: Phys. Rev. D 99, 124053 (2019)

arXiv:1704.07699 [pdf, other]

doi 10.1038/s41598-018-19781-5

Perivascular Spaces Segmentation in Brain MRI Using Optimal 3D Filtering

Authors: Lucia Ballerini, Ruggiero Lovreglio, Maria del C. Valdes-Hernandez, Joel Ramirez, Bradley J. MacIntosh, Sandra E. Black, Joanna M. Wardlaw

Abstract: Perivascular Spaces (PVS) are a recently recognised feature of Small Vessel Disease (SVD), also indicating neuroinflammation, and are an important part of the brain's circulation and glymphatic drainage system. Quantitative analysis of PVS on Magnetic Resonance Images (MRI) is important for understanding their relationship with neurological diseases. In this work, we propose a segmentation technique based on the 3D Frangi filtering for extraction of PVS from MRI. Based on prior knowledge from neuroradiological ratings of PVS, we used ordered logit models to optimise Frangi filter parameters in response to the variability in the scanner's parameters and study protocols. We optimized and validated our proposed models on two independent cohorts, a dementia sample (N=20) and patients who previously had mild to moderate stroke (N=48). Results demonstrate the robustness and generalisability of our segmentation method. Segmentation-based PVS burden estimates correlated with neuroradiological assessments (Spearman's $\rho = 0.74$, $p < 0.001$), suggesting the great potential of our proposed method.

Submitted 25 April, 2017; originally announced April 2017.

arXiv:1704.05373 [pdf, ps, other]

Strengthened Euler's Inequality in Spherical and Hyperbolic Geometries

Authors: Ren Guo, Estonia Black, Caleb Smith

Abstract: Euler's inequality is a well known inequality relating the inradius and circumradius of a triangle. In Euclidean geometry, this inequality takes the form $R \geq 2r$ where $R$ is the circumradius and $r$ is the inradius. In spherical geometry, the inequality takes the form $\tan(R) \geq 2\tan(r)$ as proved in \cite{MPV}; similarly, we have $\tanh(R) \geq 2\tanh(r)$ for hyperbolic triangles (see \cite{SV} for proof). In Euclidean geometry, this inequality can be strengthened as discussed in \cite{SV}. We prove an analogous version of this strengthened inequality which holds in spherical geometry, as well as an additional strengthening of Euler's inequality which holds in Euclidean geometry and can be generalized into both spherical and hyperbolic geometry.

Submitted 16 April, 2017; originally announced April 2017.
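
For readers who want to see the Euclidean form $R \geq 2r$ from the abstract above in concrete terms, the following is a minimal sketch, not taken from the paper. It assumes the standard formulas $R = abc/(4K)$ and $r = K/s$, with the area $K$ from Heron's formula; the function name `circumradius_inradius` is hypothetical. The spherical and hyperbolic cases are not covered.

```python
import math


def circumradius_inradius(a, b, c):
    """Return (R, r) for a Euclidean triangle with side lengths a, b, c.

    Hypothetical helper for illustration only; it uses Heron's formula
    for the area K, then R = abc / (4K) and r = K / s.
    """
    s = (a + b + c) / 2.0  # semi-perimeter
    area_sq = s * (s - a) * (s - b) * (s - c)
    if area_sq <= 0:
        raise ValueError("side lengths do not form a non-degenerate triangle")
    area = math.sqrt(area_sq)
    return a * b * c / (4.0 * area), area / s


# Euler's inequality R >= 2r, with equality for the equilateral triangle.
R, r = circumradius_inradius(3.0, 4.0, 5.0)
print(R, r, R >= 2 * r)
```

For the 3-4-5 right triangle the circumradius is half the hypotenuse, so the inequality holds with some slack; for an equilateral triangle the two sides agree up to floating-point rounding.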

arXiv:1604.00439 [pdf, other]

doi 10.1103/PhysRevD.93.112004

The Sensitivity of the Advanced LIGO Detectors at the Beginning of Gravitational Wave Astronomy

Authors: D. V. Martynov, E. D. Hall, B. P. Abbott, R. Abbott, T. D. Abbott, C. Adams, R. X. Adhikari, R. A. Anderson, S. B. Anderson, K. Arai, M. A. Arain, S. M. Aston, L. Austin, S. W. Ballmer, M. Barbet, D. Barker, B. Barr, L. Barsotti, J. Bartlett, M. A. Barton, I. Bartos, J. C. Batch, A. S. Bell, I. Belopolski, J. Bergman, et al. (239 additional authors not shown)

Abstract: The Laser Interferometer Gravitational Wave Observatory (LIGO) consists of two widely separated 4 km laser interferometers designed to detect gravitational waves from distant astrophysical sources in the frequency range from 10 Hz to 10 kHz. The first observation run of the Advanced LIGO detectors started in September 2015 and ended in January 2016. A strain sensitivity of better than $10^{-23}/\sqrt{\text{Hz}}$ was achieved around 100 Hz. Understanding both the fundamental and the technical noise sources was critical for increasing the observable volume in the universe. The average distance at which coalescing binary black hole systems with individual masses of 30 $M_\odot$ could be detected was 1.3 Gpc. Similarly, the range for binary neutron star inspirals was about 75 Mpc. With respect to the initial detectors, the observable volume of the Universe increased by factors of 69 and 43, respectively. These improvements allowed Advanced LIGO to detect the gravitational wave signal from the binary black hole coalescence, known as GW150914.

Submitted 10 February, 2018; v1 submitted 1 April, 2016; originally announced April 2016.

Journal ref: Phys. Rev. D 93, 112004 (2016)

arXiv:1501.06371 [pdf]

doi 10.1103/PhysRevD.91.022005

Material loss angles from direct measurements of broadband thermal noise

Authors: Maria Principe, Innocenzo M. Pinto, Vincenzo Pierro, Riccardo DeSalvo, Ilaria Taurasi, Akira E. Villar, Eric D. Black, Kenneth G. Libbrecht, Christophe Michel, Nazario Morgado, Laurent Pinard

Abstract: We estimate the loss angles of the materials currently used in the highly reflective test-mass coatings of interferometric detectors of gravitational waves, namely Silica, Tantala, and Ti-doped Tantala, from direct measurement of coating thermal noise in an optical interferometer testbench, the Caltech TNI. We also present a simple predictive theory for the material properties of amorphous glassy oxide mixtures, which gives results in good agreement with our measurements on Ti-doped Tantala. Alternative measurement methods and results are reviewed, and some critical issues are discussed.

Submitted 26 January, 2015; originally announced January 2015.

Journal ref: Phys. Rev. D Vol. 91, 022005 (2015)

arXiv:1410.7764 [pdf, other]

doi 10.1088/0264-9381/32/11/115012

Characterization of the LIGO detectors during their sixth science run

Authors: The LIGO Scientific Collaboration, The Virgo Collaboration, J. Aasi, J. Abadie, B. P. Abbott, R. Abbott, T. Abbott, M. R. Abernathy, T. Accadia, F. Acernese, C. Adams, T. Adams, R. X. Adhikari, C. Affeldt, M. Agathos, N. Aggarwal, O. D. Aguiar, P. Ajith, B. Allen, A. Allocca, E. Amador Ceron, D. Amariutei, R. A. Anderson, S. B. Anderson, W. G. Anderson, et al. (846 additional authors not shown)

Abstract: In 2009-2010, the Laser Interferometer Gravitational-wave Observatory (LIGO) operated together with international partners Virgo and GEO600 as a network to search for gravitational waves of astrophysical origin. The sensitivity of these detectors was limited by a combination of noise sources inherent to the instrumental design and its environment, often localized in time or frequency, that couple into the gravitational-wave readout. Here we review the performance of the LIGO instruments during this epoch, the work done to characterize the detectors and their data, and the effect that transient and continuous noise artefacts have on the sensitivity of LIGO to a variety of astrophysical sources.

Submitted 18 November, 2014; v1 submitted 28 October, 2014; originally announced October 2014.

Comments: 31 pages, 13 figures

arXiv:1410.6211 [pdf, ps, other]

doi 10.1103/PhysRevD.91.022003

Searching for stochastic gravitational waves using data from the two co-located LIGO Hanford detectors

Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, J. Aasi, J. Abadie, B. P. Abbott, R. Abbott, T. Abbott, M. R. Abernathy, T. Accadia, F. Acernese, C. Adams, T. Adams, P. Addesso, R. X. Adhikari, C. Affeldt, M. Agathos, N. Aggarwal, O. D. Aguiar, P. Ajith, B. Allen, A. Allocca, E. Amador Ceron, D. Amariutei, R. A. Anderson, S. B. Anderson, et al. (852 additional authors not shown)

Abstract: Searches for a stochastic gravitational-wave background (SGWB) using terrestrial detectors typically involve cross-correlating data from pairs of detectors. The sensitivity of such cross-correlation analyses depends, among other things, on the separation between the two detectors: the smaller the separation, the better the sensitivity. Hence, a co-located detector pair is more sensitive to a gravitational-wave background than a non-co-located detector pair. However, co-located detectors are also expected to suffer from correlated noise from instrumental and environmental effects that could contaminate the measurement of the background. Hence, methods to identify and mitigate the effects of correlated noise are necessary to achieve the potential increase in sensitivity of co-located detectors. Here we report on the first SGWB analysis using the two LIGO Hanford detectors and address the complications arising from correlated environmental noise. We apply correlated noise identification and mitigation techniques to data taken by the two LIGO Hanford detectors, H1 and H2, during LIGO's fifth science run. At low frequencies, 40-460 Hz, we are unable to sufficiently mitigate the correlated noise to a level where we may confidently measure or bound the stochastic gravitational-wave signal. However, at high frequencies, 460-1000 Hz, these techniques are sufficient to set a 95% confidence level (C.L.) upper limit on the gravitational-wave energy density of $\Omega(f) < 7.7 \times 10^{-4} (f/900\,\mathrm{Hz})^3$, which improves on the previous upper limit by a factor of $\sim 180$. In doing so, we demonstrate techniques that will be useful for future searches using advanced detectors, where correlated noise (e.g., from global magnetic fields) may affect even widely separated detectors.

Submitted 2 December, 2014; v1 submitted 22 October, 2014; originally announced October 2014.

Comments: 21 pages, 10 figures, 5 tables

Journal ref: Phys. Rev. D 91, 022003 (2015)

arXiv:1409.6262 [pdf, ps, other]

doi 10.1119/1.4902185

Particle Dynamics in Damped Nonlinear Quadrupole Ion Traps

Authors: Eugene A. Vinitsky, Eric D. Black, Kenneth G. Libbrecht

Abstract: We examine the motions of particles in quadrupole ion traps as a function of damping and trapping forces, including cases where nonlinear damping or nonlinearities in the electric field geometry play significant roles. In the absence of nonlinearities, particles are either damped to the trap center or ejected, while their addition brings about a rich spectrum of stable closed particle trajectories. In three-dimensional (3D) quadrupole traps, the extended orbits are typically confined to the trap axis, and for this case we present a 1D analysis of the relevant equation of motion. We follow this with an analysis of 2D quadrupole traps that frequently show diamond-shaped closed orbits. For both the 1D and 2D cases we present experimental observations of the calculated trajectories in microparticle ion traps. We also report the discovery of a new collective behavior in damped 2D microparticle ion traps, where particles spontaneously assemble into a remarkable knot of overlapping, corotating diamond orbits, self-stabilized by air currents arising from the particle motion.

Submitted 14 August, 2014; originally announced September 2014.

arXiv:1407.1042 [pdf, other]

doi 10.1103/PhysRevD.90.102002

Multimessenger Search for Sources of Gravitational Waves and High-Energy Neutrinos: Results for Initial LIGO-Virgo and IceCube

Authors: The IceCube Collaboration, The LIGO Scientific Collaboration, The Virgo Collaboration, M. G. Aartsen, M. Ackermann, J. Adams, J. A. Aguilar, M. Ahlers, M. Ahrens, D. Altmann, T. Anderson, C. Arguelles, T. C. Arlen, J. Auffenberg, X. Bai, S. W. Barwick, V. Baum, J. J. Beatty, J. Becker Tjus, K. -H. Becker, S. BenZvi, P. Berghaus, D. Berley, E. Bernardini, A. Bernhard, et al. (1166 additional authors not shown)

Abstract: We report the results of a multimessenger search for coincident signals from the LIGO and Virgo gravitational-wave observatories and the partially completed IceCube high-energy neutrino detector, including periods of joint operation between 2007 and 2010. These include parts of the 2005-2007 run and the 2009-2010 run for LIGO-Virgo, and IceCube's observation periods with 22, 59 and 79 strings. We find no significant coincident events, and use the search results to derive upper limits on the rate of joint sources for a range of source emission parameters. For the optimistic assumption of gravitational-wave emission energy of $10^{-2}$\,M$_\odot$c$^2$ at $\sim 150$\,Hz with $\sim 60$\,ms duration, and high-energy neutrino emission of $10^{51}$\,erg comparable to the isotropic gamma-ray energy of gamma-ray bursts, we limit the source rate below $1.6 \times 10^{-2}$\,Mpc$^{-3}$yr$^{-1}$. We also examine how combining information from gravitational waves and neutrinos will aid discovery in the advanced gravitational-wave detector era.

Submitted 9 October, 2014; v1 submitted 3 July, 2014; originally announced July 2014.

Journal ref: Phys. Rev. D 90, 102002 (2014)

arXiv:1407.0075 [pdf, ps, other]

doi 10.1119/1.4901972

A Laser Interferometer for the Undergraduate Teaching Laboratory Demonstrating Picometer Sensitivity

Authors: Kenneth G. Libbrecht, Eric D. Black

Abstract: We describe a laser interferometer experiment for the undergraduate teaching laboratory that achieves picometer sensitivity in a hands-on table-top instrument. In addition to providing an introduction to interferometer physics and optical hardware, the experiment also focuses on precision measurement techniques including servo control, signal modulation, phase-sensitive detection, and different types of signal averaging. After students assemble, align, and characterize the interferometer, they then use it to measure nanoscale motions of a simple harmonic oscillator system, as a substantive example of how laser interferometry can be used as an effective tool in experimental science.

Submitted 30 June, 2014; originally announced July 2014.

arXiv:1406.4556 [pdf, ps, other]

doi 10.1103/PhysRevLett.113.231101

Improved Upper Limits on the Stochastic Gravitational-Wave Background from 2009-2010 LIGO and Virgo Data

Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, J. Aasi, B. P. Abbott, R. Abbott, T. Abbott, M. R. Abernathy, T. Accadia, F. Acernese, K. Ackley, C. Adams, T. Adams, P. Addesso, R. X. Adhikari, C. Affeldt, M. Agathos, N. Aggarwal, O. D. Aguiar, A. Ain, P. Ajith, A. Alemic, B. Allen, A. Allocca, D. Amariutei, M. Andersen, et al. (824 additional authors not shown)

Abstract: Gravitational waves from a variety of sources are predicted to superpose to create a stochastic background. This background is expected to contain unique information from throughout the history of the universe that is unavailable through standard electromagnetic observations, making its study of fundamental importance to understanding the evolution of the universe. We carry out a search for the stochastic background with the latest data from LIGO and Virgo. Consistent with predictions from most stochastic gravitational-wave background models, the data display no evidence of a stochastic gravitational-wave signal. Assuming a gravitational-wave spectrum of $\Omega_{\rm GW}(f)=\Omega_\alpha (f/f_{\rm ref})^\alpha$, we place 95% confidence level upper limits on the energy density of the background in each of four frequency bands spanning 41.5-1726 Hz. In the frequency band of 41.5-169.25 Hz for a spectral index of $\alpha=0$, we constrain the energy density of the stochastic background to be $\Omega_{\rm GW}(f)<5.6\times10^{-6}$. For the 600-1000 Hz band, $\Omega_{\rm GW}(f)<0.14\,(f/900\,\mathrm{Hz})^3$, a factor of 2.5 lower than the best previously reported upper limits. We find $\Omega_{\rm GW}(f)<1.8\times10^{-4}$ using a spectral index of zero for 170-600 Hz and $\Omega_{\rm GW}(f)<1.0\,(f/1300\,\mathrm{Hz})^3$ for 1000-1726 Hz, bands in which no previous direct limits have been placed. The limits in these four bands are the lowest direct measurements to date on the stochastic background. We discuss the implications of these results in light of the recent claim by the BICEP2 experiment of the possible evidence for inflationary gravitational waves.

Submitted 12 August, 2014; v1 submitted 17 June, 2014; originally announced June 2014.

Comments: 11 pages, 2 figures, 1 table

Journal ref: Phys. Rev. Lett. 113, 231101 (2014)
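
The power-law spectrum assumed in the abstract above can be evaluated directly. The sketch below is illustrative only and is not the collaboration's analysis code: the function name `omega_gw` and its parameter names are hypothetical, and the numbers plugged in are the quoted 95% upper limits, so the returned values are bounds rather than measurements.

```python
def omega_gw(f_hz, omega_alpha, f_ref_hz, alpha):
    """Evaluate Omega_alpha * (f / f_ref)**alpha at frequency f_hz.

    Hypothetical helper for illustration; it only evaluates the power-law
    form quoted in the abstract.
    """
    if f_hz <= 0 or f_ref_hz <= 0:
        raise ValueError("frequencies must be positive")
    return omega_alpha * (f_hz / f_ref_hz) ** alpha


# Quoted limit for the 600-1000 Hz band: 0.14 * (f / 900 Hz)^3.
limit_at_900 = omega_gw(900.0, 0.14, 900.0, 3)
limit_at_600 = omega_gw(600.0, 0.14, 900.0, 3)

# Quoted flat-spectrum limit (alpha = 0) for 41.5-169.25 Hz; with alpha = 0
# the reference frequency drops out, so 100 Hz here is an arbitrary choice.
flat_limit = omega_gw(100.0, 5.6e-6, 100.0, 0)
print(limit_at_900, limit_at_600, flat_limit)
```

With a spectral index of 3 the bound tightens quickly toward the low end of the band, which is why each band's limit is quoted at a reference frequency.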

arXiv:1405.7904 [pdf, ps, other]

doi 10.1103/PhysRevD.90.062010

First all-sky search for continuous gravitational waves from unknown sources in binary systems

Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, J. Aasi, B. P. Abbott, R. Abbott, T. Abbott, M. R. Abernathy, T. Accadia, F. Acernese, K. Ackley, C. Adams, T. Adams, P. Addesso, R. X. Adhikari, C. Affeldt, M. Agathos, N. Aggarwal, O. D. Aguiar, A. Ain, P. Ajith, A. Alemic, B. Allen, A. Allocca, D. Amariutei, M. Andersen, et al. (827 additional authors not shown)

Abstract: We present the first results of an all-sky search for continuous gravitational waves from unknown spinning neutron stars in binary systems using LIGO and Virgo data. Using a specially developed analysis program, the TwoSpect algorithm, the search was carried out on data from the sixth LIGO Science Run and the second and third Virgo Science Runs. The search covers a range of frequencies from 20 Hz to 520 Hz, a range of orbital periods from 2 to ~2,254 h and a frequency- and period-dependent range of frequency modulation depths from 0.277 to 100 mHz. This corresponds to a range of projected semi-major axes of the orbit from ~0.6e-3 ls to ~6,500 ls assuming the orbit of the binary is circular. While no plausible candidate gravitational wave events survive the pipeline, upper limits are set on the analyzed data. The most sensitive 95% confidence upper limit obtained on gravitational wave strain is 2.3e-24 at 217 Hz, assuming the source waves are circularly polarized. Although this search has been optimized for circular binary orbits, the upper limits obtained remain valid for orbital eccentricities as large as 0.9. In addition, upper limits are placed on continuous gravitational wave emission from the low-mass x-ray binary Scorpius X-1 between 20 Hz and 57.25 Hz.

Submitted 17 September, 2014; v1 submitted 30 May, 2014; originally announced May 2014.

Comments: 16 pages, 6 figures

Journal ref: Phys. Rev. D 90, 062010 (2014)

arXiv:1405.1053 [pdf, other]

doi 10.1103/PhysRevD.89.122004

Methods and results of a search for gravitational waves associated with gamma-ray bursts using the GEO600, LIGO, and Virgo detectors

Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, J. Aasi, B. P. Abbott, R. Abbott, T. Abbott, M. R. Abernathy, F. Acernese, K. Ackley, C. Adams, T. Adams, P. Addesso, R. X. Adhikari, C. Affeldt, M. Agathos, N. Aggarwal, O. D. Aguiar, P. Ajith, A. Alemic, B. Allen, A. Allocca, D. Amariutei, M. Andersen, R. A. Anderson, S. B. Anderson, et al. (868 additional authors not shown)

Abstract: In this paper we report on a search for short-duration gravitational wave bursts in the frequency range 64 Hz-1792 Hz associated with gamma-ray bursts (GRBs), using data from GEO600 and one of the LIGO or Virgo detectors. We introduce the method of a linear search grid to analyse GRB events with large sky localisation uncertainties such as the localisations provided by the Fermi Gamma-ray Burst Monitor (GBM). Coherent searches for gravitational waves (GWs) can be computationally intensive when the GRB sky position is not well-localised, due to the corrections required for the difference in arrival time between detectors. Using a linear search grid we are able to reduce the computational cost of the analysis by a factor of O(10) for GBM events. Furthermore, we demonstrate that our analysis pipeline can improve upon the sky localisation of GRBs detected by the GBM, if a high-frequency GW signal is observed in coincidence. We use the linear search grid method in a search for GWs associated with 129 GRBs observed by satellite-based gamma-ray experiments between 2006 and 2011. The GRBs in our sample had not been previously analysed for GW counterparts. A fraction of our GRB events are analysed using data from GEO600 while the detector was using squeezed-light states to improve its sensitivity; this is the first search for GWs using data from a squeezed-light interferometric observatory. We find no evidence for GW signals, either with any individual GRB in this sample or with the population as a whole. For each GRB we place lower bounds on the distance to the progenitor, assuming a fixed GW emission energy of $10^{-2} M_{\odot}c^{2}$, with a median exclusion distance of 0.8 Mpc for emission at 500 Hz and 0.3 Mpc at 1 kHz. The reduced computational cost associated with a linear search grid will enable rapid searches for GWs associated with Fermi GBM events in the Advanced detector era.

Submitted 1 July, 2014; v1 submitted 5 May, 2014; originally announced May 2014.

Comments: 18 pages, 6 figures, 1 table. For a science summary, see http://www.ligo.org/science/Publication-GEOGRB/ . For the public data release of GW search results for each GRB in the paper, see https://dcc.ligo.org/LIGO-P1300086/public

Journal ref: Phys. Rev. D 89, 122004 (2014)

arXiv:1404.2199 [pdf, other]

doi 10.1103/PhysRevD.89.122003

Search for gravitational radiation from intermediate mass black hole binaries in data from the second LIGO-Virgo joint science run

Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, J. Aasi, B. P. Abbott, R. Abbott, T. Abbott, M. R. Abernathy, T. Accadia, F. Acernese, K. Ackley, C. Adams, T. Adams, P. Addesso, R. X. Adhikari, C. Affeldt, M. Agathos, N. Aggarwal, O. D. Aguiar, A. Ain, P. Ajith, A. Alemic, B. Allen, A. Allocca, D. Amariutei, M. Andersen, et al. (825 additional authors not shown)

Abstract: This paper reports on an unmodeled, all-sky search for gravitational waves from merging intermediate mass black hole binaries (IMBHB). The search was performed on data from the second joint science run of the LIGO and Virgo detectors (July 2009 - October 2010) and was sensitive to IMBHBs with a range up to $\sim 200$ Mpc, averaged over the possible sky positions and inclinations of the binaries with respect to the line of sight. No significant candidate was found. Upper limits on the coalescence-rate density of nonspinning IMBHBs with total masses between 100 and $450 \ \mbox{M}_{\odot}$ and mass ratios between $0.25$ and $1\,$ were placed by combining this analysis with an analogous search performed on data from the first LIGO-Virgo joint science run (November 2005 - October 2007). The most stringent limit was set for systems consisting of two $88 \ \mbox{M}_{\odot}$ black holes and is equal to $0.12 \ \mbox{Mpc}^{-3} \ \mbox{Myr}^{-1}$ at the $90\%$ confidence level. This paper also presents the first estimate, for the case of an unmodeled analysis, of the impact on the search range of IMBHB spin configurations: the visible volume for IMBHBs with nonspinning components is roughly doubled for a population of IMBHBs with spins aligned with the binary's orbital angular momentum and uniformly distributed in the dimensionless spin parameter up to 0.8, whereas an analogous population with antialigned spins decreases the visible volume by $\sim 20\%\,$.

Submitted 19 June, 2014; v1 submitted 8 April, 2014; originally announced April 2014.

Comments: 15 pages, 10 figures, data for plots and archived public version at https://dcc.ligo.org/LIGO-P1300158/public, see also the public announcement at http://www.ligo.org/science/Publication-S6IMBH/index.php, updated bibliography

Report number: LIGO-P1300158

Journal ref: Phys. Rev. D 89, 122003 (2014)

arXiv:1403.6639 [pdf, other]

doi 10.1103/PhysRevLett.113.011102

Search for gravitational waves associated with gamma-ray bursts detected by the InterPlanetary Network

Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, J. Aasi, B. P. Abbott, R. Abbott, T. Abbott, M. R. Abernathy, F. Acernese, K. Ackley, C. Adams, T. Adams, P. Addesso, R. X. Adhikari, C. Affeldt, M. Agathos, N. Aggarwal, O. D. Aguiar, P. Ajith, A. Alemic, B. Allen, A. Allocca, D. Amariutei, M. Andersen, R. A. Anderson, S. B. Anderson, et al. (879 additional authors not shown)

Abstract: We present the results of a search for gravitational waves associated with 223 gamma-ray bursts (GRBs) detected by the InterPlanetary Network (IPN) in 2005-2010 during LIGO's fifth and sixth science runs and Virgo's first, second and third science runs. The IPN satellites provide accurate times of the bursts and sky localizations that vary significantly from degree scale to hundreds of square degrees. We search for both a well-modeled binary coalescence signal, the favored progenitor model for short GRBs, and for generic, unmodeled gravitational wave bursts. Both searches use the event time and sky localization to improve the gravitational-wave search sensitivity as compared to corresponding all-time, all-sky searches. We find no evidence of a gravitational-wave signal associated with any of the IPN GRBs in the sample, nor do we find evidence for a population of weak gravitational-wave signals associated with the GRBs. For all IPN-detected GRBs, for which a sufficient duration of quality gravitational-wave data is available, we place lower bounds on the distance to the source in accordance with an optimistic assumption of gravitational-wave emission energy of $10^{-2}M_{\odot}c^2$ at 150 Hz, and find a median of 13 Mpc. For the 27 short-hard GRBs we place 90% confidence exclusion distances to two source models: a binary neutron star coalescence, with a median distance of 12 Mpc, or the coalescence of a neutron star and black hole, with a median distance of 22 Mpc. Finally, we combine this search with previously published results to provide a population statement for GRB searches in first-generation LIGO and Virgo gravitational-wave detectors, and a resulting examination of prospects for the advanced gravitational-wave detectors.

Submitted 17 April, 2014; v1 submitted 26 March, 2014; originally announced March 2014.

Comments: 15 pages, 8 figures, 2 tables; for the science summary, see http://www.ligo.org/science/Publication-IPNS56VSR123grb/index.php

Report number: LIGO-P1300226

Journal ref: Phys. Rev. Lett. 113, 011102 (2014)

arXiv:1403.5306 [pdf, other]

doi 10.1103/PhysRevD.89.102006

Search for gravitational wave ringdowns from perturbed intermediate mass black holes in LIGO-Virgo data from 2005-2010

Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, J. Aasi, B. P. Abbott, R. Abbott, T. Abbott, M. R. Abernathy, F. Acernese, K. Ackley, C. Adams, T. Adams, P. Addesso, R. X. Adhikari, C. Affeldt, M. Agathos, N. Aggarwal, O. D. Aguiar, A. Ain, P. Ajith, A. Alemic, B. Allen, A. Allocca, D. Amariutei, M. Andersen, R. Anderson, et al. (827 additional authors not shown)

Abstract: We report results from a search for gravitational waves produced by perturbed intermediate mass black holes (IMBH) in data collected by LIGO and Virgo between 2005 and 2010. The search was sensitive to astrophysical sources that produced damped sinusoid gravitational wave signals, also known as ringdowns, with frequency $50\le f_{0}/\mathrm{Hz} \le 2000$ and decay timescale $0.0001\lesssim \tau/\mathrm{s} \lesssim 0.1$ characteristic of those produced in mergers of IMBH pairs. No significant gravitational wave candidate was detected. We report upper limits on the astrophysical coalescence rates of IMBHs with total binary mass $50 \le M/\mathrm{M}_\odot \le 450$ and component mass ratios of either 1:1 or 4:1. For systems with total mass $100 \le M/\mathrm{M}_\odot \le 150$, we report a 90%-confidence upper limit on the rate of binary IMBH mergers with non-spinning and equal mass components of $6.9\times10^{-8}\,$Mpc$^{-3}$yr$^{-1}$. We also report a rate upper limit for ringdown waveforms from perturbed IMBHs, radiating 1% of their mass as gravitational waves in the fundamental, $\ell=m=2$, oscillation mode, that is nearly three orders of magnitude more stringent than previous results.

Submitted 21 May, 2014; v1 submitted 20 March, 2014; originally announced March 2014.

Comments: for the science summary, see http://www.ligo.org/science/Publication-S5S6RD/index.php

Report number: LIGO-P1300156

Journal ref: Phys. Rev. D 89, 102006 (2014)

arXiv:1402.4974 [pdf, ps, other]

doi 10.1088/0264-9381/31/16/165014

Implementation of an F-statistic all-sky search for continuous gravitational waves in Virgo VSR1 data

Authors: J. Aasi, B. P. Abbott, R. Abbott, T. Abbott, M. R. Abernathy, T. Accadia, F. Acernese, K. Ackley, C. Adams, T. Adams, P. Addesso, R. X. Adhikari, C. Affeldt, M. Agathos, N. Aggarwal, O. D. Aguiar, A. Ain, P. Ajith, A. Alemic, B. Allen, A. Allocca, D. Amariutei, M. Andersen, R. Anderson, S. B. Anderson, et al. (826 additional authors not shown)

Abstract: We present an implementation of the $\mathcal{F}$-statistic to carry out the first search in data from the Virgo laser interferometric gravitational wave detector for periodic gravitational waves from a priori unknown, isolated rotating neutron stars. We searched a frequency $f_0$ range from 100 Hz to 1 kHz and the frequency dependent spindown $f_1$ range from $-1.6\,(f_0/100\,{\rm Hz}) \times 10^{-9}\,$ Hz/s to zero. A large part of this frequency-spindown space was unexplored by any of the all-sky searches published so far. Our method consisted of a coherent search over two-day periods using the $\mathcal{F}$-statistic, followed by a search for coincidences among the candidates from the two-day segments. We have introduced a number of novel techniques and algorithms that allow the use of the Fast Fourier Transform (FFT) algorithm in the coherent part of the search resulting in a fifty-fold speed-up in computation of the $\mathcal{F}$-statistic with respect to the algorithm used in the other pipelines. No significant gravitational wave signal was found. The sensitivity of the search was estimated by injecting signals into the data. In the most sensitive parts of the detector band more than 90% of signals would have been detected with dimensionless gravitational-wave amplitude greater than $5 \times 10^{-24}$.

Submitted 10 April, 2014; v1 submitted 20 February, 2014; originally announced February 2014.

Comments: 27 pages, 10 figures, submitted to CQG, corrected affiliations and authors list; Science summary of results available at http://www.ligo.org/science/Publication-VSR1CWAllSkyFstat/index.php

Report number: LIGO Document No. LIGO-P1300133

Showing 1–50 of 97 results for author: Black, E