-
The KnowWhereGraph: A Large-Scale Geo-Knowledge Graph for Interdisciplinary Knowledge Discovery and Geo-Enrichment
Authors:
Rui Zhu,
Cogan Shimizu,
Shirly Stephen,
Colby K. Fisher,
Thomas Thelen,
Kitty Currier,
Krzysztof Janowicz,
Pascal Hitzler,
Mark Schildhauer,
Wenwen Li,
Dean Rehberger,
Adrita Barua,
Antrea Christou,
Ling Cai,
Abhilekha Dalal,
Anthony D'Onofrio,
Andrew Eells,
Mitchell Faulk,
Zilong Liu,
Gengchen Mai,
Mohammad Saeid Mahdavinejad,
Bryce Mecum,
Sanaz Saki Norouzi,
Meilin Shi,
Yuanyuan Tian
, et al. (3 additional authors not shown)
Abstract:
Global challenges such as food supply chain disruptions, public health crises, and natural hazard responses require access to and integration of diverse datasets, many of which are geospatial. Over the past few years, a growing number of (geo)portals have been developed to address this need. However, most existing (geo)portals are stacked by separated or sparsely connected data "silos" impeding ef…
▽ More
Global challenges such as food supply chain disruptions, public health crises, and natural hazard responses require access to and integration of diverse datasets, many of which are geospatial. Over the past few years, a growing number of (geo)portals have been developed to address this need. However, most existing (geo)portals are stacked by separated or sparsely connected data "silos" impeding effective data consolidation. A new way of sharing and reusing geospatial data is therefore urgently needed. In this work, we introduce KnowWhereGraph, a knowledge graph-based data integration, enrichment, and synthesis framework that not only includes schemas and data related to human and environmental systems but also provides a suite of supporting tools for accessing this information. The KnowWhereGraph aims to address the challenge of data integration by building a large-scale, cross-domain, pre-integrated, FAIR-principles-based, and AI-ready data warehouse rooted in knowledge graphs. We highlight the design principles of KnowWhereGraph, emphasizing the roles of space, place, and time in bridging various data "silos". Additionally, we demonstrate multiple use cases where the proposed geospatial knowledge graph and its associated tools empower decision-makers to uncover insights that are often hidden within complex and poorly interoperable datasets.
△ Less
Submitted 20 February, 2025; v1 submitted 19 February, 2025;
originally announced February 2025.
-
Can Bayesian Neural Networks Make Confident Predictions?
Authors:
Katharine Fisher,
Youssef Marzouk
Abstract:
Bayesian inference promises a framework for principled uncertainty quantification of neural network predictions. Barriers to adoption include the difficulty of fully characterizing posterior distributions on network parameters and the interpretability of posterior predictive distributions. We demonstrate that under a discretized prior for the inner layer weights, we can exactly characterize the po…
▽ More
Bayesian inference promises a framework for principled uncertainty quantification of neural network predictions. Barriers to adoption include the difficulty of fully characterizing posterior distributions on network parameters and the interpretability of posterior predictive distributions. We demonstrate that under a discretized prior for the inner layer weights, we can exactly characterize the posterior predictive distribution as a Gaussian mixture. This setting allows us to define equivalence classes of network parameter values which produce the same likelihood (training error) and to relate the elements of these classes to the network's scaling regime -- defined via ratios of the training sample size, the size of each layer, and the number of final layer parameters. Of particular interest are distinct parameter realizations that map to low training error and yet correspond to distinct modes in the posterior predictive distribution. We identify settings that exhibit such predictive multimodality, and thus provide insight into the accuracy of unimodal posterior approximations. We also characterize the capacity of a model to "learn from data" by evaluating contraction of the posterior predictive in different scaling regimes.
△ Less
Submitted 20 January, 2025;
originally announced January 2025.
-
Rango: Adaptive Retrieval-Augmented Proving for Automated Software Verification
Authors:
Kyle Thompson,
Nuno Saavedra,
Pedro Carrott,
Kevin Fisher,
Alex Sanchez-Stern,
Yuriy Brun,
João F. Ferreira,
Sorin Lerner,
Emily First
Abstract:
Formal verification using proof assistants, such as Coq, enables the creation of high-quality software. However, the verification process requires significant expertise and manual effort to write proofs. Recent work has explored automating proof synthesis using machine learning and large language models (LLMs). This work has shown that identifying relevant premises, such as lemmas and definitions,…
▽ More
Formal verification using proof assistants, such as Coq, enables the creation of high-quality software. However, the verification process requires significant expertise and manual effort to write proofs. Recent work has explored automating proof synthesis using machine learning and large language models (LLMs). This work has shown that identifying relevant premises, such as lemmas and definitions, can aid synthesis. We present Rango, a fully automated proof synthesis tool for Coq that automatically identifies relevant premises and also similar proofs from the current project and uses them during synthesis. Rango uses retrieval augmentation at every step of the proof to automatically determine which proofs and premises to include in the context of its fine-tuned LLM. In this way, Rango adapts to the project and to the evolving state of the proof. We create a new dataset, CoqStoq, of 2,226 open-source Coq projects and 196,929 theorems from GitHub, which includes both training data and a curated evaluation benchmark of well-maintained projects. On this benchmark, Rango synthesizes proofs for 32.0% of the theorems, which is 29% more theorems than the prior state-of-the-art tool Tactician. Our evaluation also shows that Rango adding relevant proofs to its context leads to a 47% increase in the number of theorems proven.
△ Less
Submitted 28 January, 2025; v1 submitted 18 December, 2024;
originally announced December 2024.
-
The KnowWhereGraph Ontology
Authors:
Cogan Shimizu,
Shirly Stephe,
Adrita Barua,
Ling Cai,
Antrea Christou,
Kitty Currier,
Abhilekha Dalal,
Colby K. Fisher,
Pascal Hitzler,
Krzysztof Janowicz,
Wenwen Li,
Zilong Liu,
Mohammad Saeid Mahdavinejad,
Gengchen Mai,
Dean Rehberger,
Mark Schildhauer,
Meilin Shi,
Sanaz Saki Norouzi,
Yuanyuan Tian,
Sizhe Wang,
Zhangyu Wang,
Joseph Zalewski,
Lu Zhou,
Rui Zhu
Abstract:
KnowWhereGraph is one of the largest fully publicly available geospatial knowledge graphs. It includes data from 30 layers on natural hazards (e.g., hurricanes, wildfires), climate variables (e.g., air temperature, precipitation), soil properties, crop and land-cover types, demographics, and human health, various place and region identifiers, among other themes. These have been leveraged through t…
▽ More
KnowWhereGraph is one of the largest fully publicly available geospatial knowledge graphs. It includes data from 30 layers on natural hazards (e.g., hurricanes, wildfires), climate variables (e.g., air temperature, precipitation), soil properties, crop and land-cover types, demographics, and human health, various place and region identifiers, among other themes. These have been leveraged through the graph by a variety of applications to address challenges in food security and agricultural supply chains; sustainability related to soil conservation practices and farm labor; and delivery of emergency humanitarian aid following a disaster. In this paper, we introduce the ontology that acts as the schema for KnowWhereGraph. This broad overview provides insight into the requirements and design specifications for the graph and its schema, including the development methodology (modular ontology modeling) and the resources utilized to implement, materialize, and deploy KnowWhereGraph with its end-user interfaces and public query SPARQL endpoint.
△ Less
Submitted 17 October, 2024;
originally announced October 2024.
-
Digital Twin Generators for Disease Modeling
Authors:
Nameyeh Alam,
Jake Basilico,
Daniele Bertolini,
Satish Casie Chetty,
Heather D'Angelo,
Ryan Douglas,
Charles K. Fisher,
Franklin Fuller,
Melissa Gomes,
Rishabh Gupta,
Alex Lang,
Anton Loukianov,
Rachel Mak-McCully,
Cary Murray,
Hanalei Pham,
Susanna Qiao,
Elena Ryapolova-Webb,
Aaron Smith,
Dimitri Theoharatos,
Anil Tolwani,
Eric W. Tramel,
Anna Vidovszky,
Judy Viduya,
Jonathan R. Walsh
Abstract:
A patient's digital twin is a computational model that describes the evolution of their health over time. Digital twins have the potential to revolutionize medicine by enabling individual-level computer simulations of human health, which can be used to conduct more efficient clinical trials or to recommend personalized treatment options. Due to the overwhelming complexity of human biology, machine…
▽ More
A patient's digital twin is a computational model that describes the evolution of their health over time. Digital twins have the potential to revolutionize medicine by enabling individual-level computer simulations of human health, which can be used to conduct more efficient clinical trials or to recommend personalized treatment options. Due to the overwhelming complexity of human biology, machine learning approaches that leverage large datasets of historical patients' longitudinal health records to generate patients' digital twins are more tractable than potential mechanistic models. In this manuscript, we describe a neural network architecture that can learn conditional generative models of clinical trajectories, which we call Digital Twin Generators (DTGs), that can create digital twins of individual patients. We show that the same neural network architecture can be trained to generate accurate digital twins for patients across 13 different indications simply by changing the training set and tuning hyperparameters. By introducing a general purpose architecture, we aim to unlock the ability to scale machine learning approaches to larger datasets and across more indications so that a digital twin could be created for any patient in the world.
△ Less
Submitted 2 May, 2024;
originally announced May 2024.
-
Prognostic Covariate Adjustment for Logistic Regression in Randomized Controlled Trials
Authors:
Yunfan Li,
Arman Sabbaghi,
Jonathan R. Walsh,
Charles K. Fisher
Abstract:
Randomized controlled trials (RCTs) with binary primary endpoints introduce novel challenges for inferring the causal effects of treatments. The most significant challenge is non-collapsibility, in which the conditional odds ratio estimand under covariate adjustment differs from the unconditional estimand in the logistic regression analysis of RCT data. This issue gives rise to apparent paradoxes,…
▽ More
Randomized controlled trials (RCTs) with binary primary endpoints introduce novel challenges for inferring the causal effects of treatments. The most significant challenge is non-collapsibility, in which the conditional odds ratio estimand under covariate adjustment differs from the unconditional estimand in the logistic regression analysis of RCT data. This issue gives rise to apparent paradoxes, such as the variance of the estimator for the conditional odds ratio from a covariate-adjusted model being greater than the variance of the estimator from the unadjusted model. We address this challenge in the context of adjustment based on predictions of control outcomes from generative artificial intelligence (AI) algorithms, which are referred to as prognostic scores. We demonstrate that prognostic score adjustment in logistic regression increases the power of the Wald test for the conditional odds ratio under a fixed sample size, or alternatively reduces the necessary sample size to achieve a desired power, compared to the unadjusted analysis. We derive formulae for prospective calculations of the power gain and sample size reduction that can result from adjustment for the prognostic score. Furthermore, we utilize g-computation to expand the scope of prognostic score adjustment to inferences on the marginal risk difference, relative risk, and odds ratio estimands. We demonstrate the validity of our formulae via extensive simulation studies that encompass different types of logistic regression model specifications. Our simulation studies also indicate how prognostic score adjustment can reduce the variance of g-computation estimators for the marginal estimands while maintaining frequentist properties such as asymptotic unbiasedness and Type I error rate control. Our methodology can ultimately enable more definitive and conclusive analyses for RCTs with binary primary endpoints.
△ Less
Submitted 29 February, 2024;
originally announced February 2024.
-
Multitask methods for predicting molecular properties from heterogeneous data
Authors:
Katharine Fisher,
Michael Herbst,
Youssef Marzouk
Abstract:
Data generation remains a bottleneck in training surrogate models to predict molecular properties. We demonstrate that multitask Gaussian process regression overcomes this limitation by leveraging both expensive and cheap data sources. In particular, we consider training sets constructed from coupled-cluster (CC) and density functional theory (DFT) data. We report that multitask surrogates can pre…
▽ More
Data generation remains a bottleneck in training surrogate models to predict molecular properties. We demonstrate that multitask Gaussian process regression overcomes this limitation by leveraging both expensive and cheap data sources. In particular, we consider training sets constructed from coupled-cluster (CC) and density functional theory (DFT) data. We report that multitask surrogates can predict at CC-level accuracy with a reduction to data generation cost by over an order of magnitude. Of note, our approach allows the training set to include DFT data generated by a heterogeneous mix of exchange-correlation functionals without imposing any artificial hierarchy on functional accuracy. More generally, the multitask framework can accommodate a wider range of training set structures -- including full disparity between the different levels of fidelity -- than existing kernel approaches based on $Δ$-learning, though we show that the accuracy of the two approaches can be similar. Consequently, multitask regression can be a tool for reducing data generation costs even further by opportunistically exploiting existing data sources.
△ Less
Submitted 24 May, 2024; v1 submitted 31 January, 2024;
originally announced January 2024.
-
Bayesian Prognostic Covariate Adjustment With Additive Mixture Priors
Authors:
Alyssa M. Vanderbeek,
Arman Sabbaghi,
Jon R. Walsh,
Charles K. Fisher
Abstract:
Effective and rapid decision-making from randomized controlled trials (RCTs) requires unbiased and precise treatment effect inferences. Two strategies to address this requirement are to adjust for covariates that are highly correlated with the outcome, and to leverage historical control information via Bayes' theorem. We propose a new Bayesian prognostic covariate adjustment methodology, referred…
▽ More
Effective and rapid decision-making from randomized controlled trials (RCTs) requires unbiased and precise treatment effect inferences. Two strategies to address this requirement are to adjust for covariates that are highly correlated with the outcome, and to leverage historical control information via Bayes' theorem. We propose a new Bayesian prognostic covariate adjustment methodology, referred to as Bayesian PROCOVA, that combines these two strategies. Covariate adjustment in Bayesian PROCOVA is based on generative artificial intelligence (AI) algorithms that construct a digital twin generator (DTG) for RCT participants. The DTG is trained on historical control data and yields a digital twin (DT) probability distribution for each RCT participant's outcome under the control treatment. The expectation of the DT distribution, referred to as the prognostic score, defines the covariate for adjustment. Historical control information is leveraged via an additive mixture prior with two components: an informative prior probability distribution specified based on historical control data, and a weakly informative prior distribution. The mixture weight determines the extent to which posterior inferences are drawn from the informative component, versus the weakly informative component. This weight has a prior distribution as well, and so the entire additive mixture prior is completely pre-specifiable without involving any RCT information. We establish an efficient Gibbs algorithm for sampling from the posterior distribution, and derive closed-form expressions for the posterior mean and variance of the treatment effect parameter conditional on the weight, in Bayesian PROCOVA. We evaluate efficiency gains of Bayesian PROCOVA via its bias control and variance reduction compared to frequentist PROCOVA in simulation studies that encompass different discrepancies. These gains translate to smaller RCTs.
△ Less
Submitted 28 February, 2024; v1 submitted 27 October, 2023;
originally announced October 2023.
-
A Weighted Prognostic Covariate Adjustment Method for Efficient and Powerful Treatment Effect Inferences in Randomized Controlled Trials
Authors:
Alyssa M. Vanderbeek,
Anna A. Vidovszky,
Jessica L. Ross,
Arman Sabbaghi,
Jonathan R. Walsh,
Charles K. Fisher,
the Critical Path for Alzheimer's Disease,
the Alzheimer's Disease Neuroimaging Initiative,
the European Prevention of Alzheimer's Disease,
Consortium,
the Alzheimer's Disease Cooperative Study
Abstract:
A crucial task for a randomized controlled trial (RCT) is to specify a statistical method that can yield an efficient estimator and powerful test for the treatment effect. A novel and effective strategy to obtain efficient and powerful treatment effect inferences is to incorporate predictions from generative artificial intelligence (AI) algorithms into covariate adjustment for the regression analy…
▽ More
A crucial task for a randomized controlled trial (RCT) is to specify a statistical method that can yield an efficient estimator and powerful test for the treatment effect. A novel and effective strategy to obtain efficient and powerful treatment effect inferences is to incorporate predictions from generative artificial intelligence (AI) algorithms into covariate adjustment for the regression analysis of a RCT. Training a generative AI algorithm on historical control data enables one to construct a digital twin generator (DTG) for RCT participants, which utilizes a participant's baseline covariates to generate a probability distribution for their potential control outcome. Summaries of the probability distribution from the DTG are highly predictive of the trial outcome, and adjusting for these features via regression can thus improve the quality of treatment effect inferences, while satisfying regulatory guidelines on statistical analyses, for a RCT. However, a critical assumption in this strategy is homoskedasticity, or constant variance of the outcome conditional on the covariates. In the case of heteroskedasticity, existing covariate adjustment methods yield inefficient estimators and underpowered tests. We propose to address heteroskedasticity via a weighted prognostic covariate adjustment methodology (Weighted PROCOVA) that adjusts for both the mean and variance of the regression model using information obtained from the DTG. We prove that our method yields unbiased treatment effect estimators, and demonstrate via comprehensive simulation studies and case studies from Alzheimer's disease that it can reduce the variance of the treatment effect estimator, maintain the Type I error rate, and increase the power of the test for the treatment effect from 80% to 85%~90% when the variances from the DTG can explain 5%~10% of the variation in the RCT participants' outcomes.
△ Less
Submitted 25 September, 2023;
originally announced September 2023.
-
Identifying and Mitigating the Security Risks of Generative AI
Authors:
Clark Barrett,
Brad Boyd,
Elie Burzstein,
Nicholas Carlini,
Brad Chen,
Jihye Choi,
Amrita Roy Chowdhury,
Mihai Christodorescu,
Anupam Datta,
Soheil Feizi,
Kathleen Fisher,
Tatsunori Hashimoto,
Dan Hendrycks,
Somesh Jha,
Daniel Kang,
Florian Kerschbaum,
Eric Mitchell,
John Mitchell,
Zulfikar Ramzan,
Khawaja Shams,
Dawn Song,
Ankur Taly,
Diyi Yang
Abstract:
Every major technical invention resurfaces the dual-use dilemma -- the new technology has the potential to be used for good as well as for harm. Generative AI (GenAI) techniques, such as large language models (LLMs) and diffusion models, have shown remarkable capabilities (e.g., in-context learning, code-completion, and text-to-image generation and editing). However, GenAI can be used just as well…
▽ More
Every major technical invention resurfaces the dual-use dilemma -- the new technology has the potential to be used for good as well as for harm. Generative AI (GenAI) techniques, such as large language models (LLMs) and diffusion models, have shown remarkable capabilities (e.g., in-context learning, code-completion, and text-to-image generation and editing). However, GenAI can be used just as well by attackers to generate new attacks and increase the velocity and efficacy of existing attacks.
This paper reports the findings of a workshop held at Google (co-organized by Stanford University and the University of Wisconsin-Madison) on the dual-use dilemma posed by GenAI. This paper is not meant to be comprehensive, but is rather an attempt to synthesize some of the interesting findings from the workshop. We discuss short-term and long-term goals for the community on this topic. We hope this paper provides both a launching point for a discussion on this important topic as well as interesting problems that the research community can work to address.
△ Less
Submitted 28 December, 2023; v1 submitted 28 August, 2023;
originally announced August 2023.
-
Saggitarius: A DSL for Specifying Grammatical Domains
Authors:
Anders Miltner,
Devon Loehr,
Arnold Mong,
Kathleen Fisher,
David Walker
Abstract:
Common data types like dates, addresses, phone numbers and tables can have multiple textual representations, and many heavily-used languages, such as SQL, come in several dialects. These variations can cause data to be misinterpreted, leading to silent data corruption, failure of data processing systems, or even security vulnerabilities. Saggitarius is a new language and system designed to help pr…
▽ More
Common data types like dates, addresses, phone numbers and tables can have multiple textual representations, and many heavily-used languages, such as SQL, come in several dialects. These variations can cause data to be misinterpreted, leading to silent data corruption, failure of data processing systems, or even security vulnerabilities. Saggitarius is a new language and system designed to help programmers reason about the format of data, by describing grammatical domains -- that is, sets of context-free grammars that describe the many possible representations of a datatype. We describe the design of Saggitarius via example and provide a relational semantics. We show how Saggitarius may be used to analyze a data set: given example data, it uses an algorithm based on semi-ring parsing and MaxSAT to infer which grammar in a given domain best matches that data. We evaluate the effectiveness of the algorithm on a benchmark suite of 110 example problems, and we demonstrate that our system typically returns a satisfying grammar within a few seconds with only a small number of examples. We also delve deeper into a more extensive case study on using Saggitarius for CSV dialect detection. Despite being general-purpose, we find that Saggitarius offers comparable results to hand-tuned, specialized tools; in the case of CSV, it infers grammars for 84% of benchmarks within 60 seconds, and has comparable accuracy to custom-built dialect detection tools.
△ Less
Submitted 23 August, 2023;
originally announced August 2023.
-
A Rule of Thumb for the Power Gain due to Covariate Adjustment in Randomized Controlled Trials with Continuous Outcomes
Authors:
Charles K. Fisher
Abstract:
Randomized Controlled Trials (RCTs) often adjust for baseline covariates in order to increase power. This technical note provides a short derivation of a simple rule of thumb for approximating the ratio of the power of an adjusted analysis to that of an unadjusted analysis. Specifically, if the unadjusted analysis is powered to approximately 80\%, then the ratio of the power of the adjusted analys…
▽ More
Randomized Controlled Trials (RCTs) often adjust for baseline covariates in order to increase power. This technical note provides a short derivation of a simple rule of thumb for approximating the ratio of the power of an adjusted analysis to that of an unadjusted analysis. Specifically, if the unadjusted analysis is powered to approximately 80\%, then the ratio of the power of the adjusted analysis to the power of the unadjusted analysis is approximately $1 + \frac{1}{2} R^2$, where $R$ is the correlation between the baseline covariate and the outcome.
△ Less
Submitted 9 August, 2023;
originally announced August 2023.
-
Neural Boltzmann Machines
Authors:
Alex H. Lang,
Anton D. Loukianov,
Charles K. Fisher
Abstract:
Conditional generative models are capable of using contextual information as input to create new imaginative outputs. Conditional Restricted Boltzmann Machines (CRBMs) are one class of conditional generative models that have proven to be especially adept at modeling noisy discrete or continuous data, but the lack of expressivity in CRBMs have limited their widespread adoption. Here we introduce Ne…
▽ More
Conditional generative models are capable of using contextual information as input to create new imaginative outputs. Conditional Restricted Boltzmann Machines (CRBMs) are one class of conditional generative models that have proven to be especially adept at modeling noisy discrete or continuous data, but the lack of expressivity in CRBMs have limited their widespread adoption. Here we introduce Neural Boltzmann Machines (NBMs) which generalize CRBMs by converting each of the CRBM parameters to their own neural networks that are allowed to be functions of the conditional inputs. NBMs are highly flexible conditional generative models that can be trained via stochastic gradient descent to approximately maximize the log-likelihood of the data. We demonstrate the utility of NBMs especially with normally distributed data which has historically caused problems for Gaussian-Bernoulli CRBMs. Code to reproduce our results can be found at https://github.com/unlearnai/neural-boltzmann-machines.
△ Less
Submitted 15 May, 2023;
originally announced May 2023.
-
Technical Report: Match-reference regular expressions and lenses
Authors:
Jeanne-Marie Musca,
Anders Miltner,
Kathleen Fisher,
David Walker
Abstract:
A lens is a single program that specifies two data transformations at once: one transformation converts data from source format to target format and a second transformation inverts the process. Over the past decade, researchers have developed many different kinds of lenses with different properties. One class of such languages operate over regular languages. In other words, these lenses convert st…
▽ More
A lens is a single program that specifies two data transformations at once: one transformation converts data from source format to target format and a second transformation inverts the process. Over the past decade, researchers have developed many different kinds of lenses with different properties. One class of such languages operate over regular languages. In other words, these lenses convert strings drawn from one regular language to strings drawn from another regular language (and back again). In this paper, we define a more powerful language of lenses, which we call match-reference lenses, that is capable of translating between non-regular formats that contain repeated substrings, which is a primitive form of dependency. To define the non-regular formats themselves, we develop a new language, match-reference regular expressions, which are regular expressions that can bind variables to substrings and use those substrings repeatedly. These match-reference regular expressions are closely related to the familiar ``back-references" that can be found in traditional regular expression packages, but are redesigned to adhere to conventional programming language lexical scoping conventions and to interact smoothly with lens language infrastructure. We define the semantics of match-reference regular expressions and match-reference lenses. We also define a new kind of automaton, the match-reference regex automaton system (MRRAS), for deciding string membership in the language match-reference regular expressions. We illustrate our definitions with a variety of examples.
△ Less
Submitted 8 December, 2022;
originally announced December 2022.
-
Can RBMs be trained with zero step contrastive divergence?
Authors:
Charles K. Fisher
Abstract:
Restricted Boltzmann Machines (RBMs) are probabilistic generative models that can be trained by maximum likelihood in principle, but are usually trained by an approximate algorithm called Contrastive Divergence (CD) in practice. In general, a CD-k algorithm estimates an average with respect to the model distribution using a sample obtained from a k-step Markov Chain Monte Carlo Algorithm (e.g., bl…
▽ More
Restricted Boltzmann Machines (RBMs) are probabilistic generative models that can be trained by maximum likelihood in principle, but are usually trained by an approximate algorithm called Contrastive Divergence (CD) in practice. In general, a CD-k algorithm estimates an average with respect to the model distribution using a sample obtained from a k-step Markov Chain Monte Carlo Algorithm (e.g., block Gibbs sampling) starting from some initial configuration. Choices of k typically vary from 1 to 100. This technical report explores if it's possible to leverage a simple approximate sampling algorithm with a modified version of CD in order to train an RBM with k=0. As usual, the method is illustrated on MNIST.
△ Less
Submitted 3 November, 2022;
originally announced November 2022.
-
Modeling Disease Progression in Mild Cognitive Impairment and Alzheimer's Disease with Digital Twins
Authors:
Daniele Bertolini,
Anton D. Loukianov,
Aaron M. Smith,
David Li-Bland,
Yannick Pouliot,
Jonathan R. Walsh,
Charles K. Fisher
Abstract:
Alzheimer's Disease (AD) is a neurodegenerative disease that affects subjects in a broad range of severity and is assessed in clinical trials with multiple cognitive and functional instruments. As clinical trials in AD increasingly focus on earlier stages of the disease, especially Mild Cognitive Impairment (MCI), the ability to model subject outcomes across the disease spectrum is extremely impor…
▽ More
Alzheimer's Disease (AD) is a neurodegenerative disease that affects subjects in a broad range of severity and is assessed in clinical trials with multiple cognitive and functional instruments. As clinical trials in AD increasingly focus on earlier stages of the disease, especially Mild Cognitive Impairment (MCI), the ability to model subject outcomes across the disease spectrum is extremely important. We use unsupervised machine learning models called Conditional Restricted Boltzmann Machines (CRBMs) to create Digital Twins of AD subjects. Digital Twins are simulated clinical records that share baseline data with actual subjects and comprehensively model their outcomes under standard-of-care. The CRBMs are trained on a large set of records from subjects in observational studies and the placebo arms of clinical trials across the AD spectrum. These data exhibit a challenging, but common, patchwork of measured and missing observations across subjects in the dataset, and we present a novel model architecture designed to learn effectively from it. We evaluate performance against a held-out test dataset and show how Digital Twins simultaneously capture the progression of a number of key endpoints in clinical trials across a broad spectrum of disease severity, including MCI and mild-to-moderate AD.
△ Less
Submitted 24 December, 2020;
originally announced December 2020.
-
The Case for Non-Cryogenic Comet Nucleus Sample Return
Authors:
Keiko Nakamura-Messenger,
Alexander G. Hayes,
Scott Sandford,
Carol Raymond,
Steven W. Squyres,
Larry R. Nittler,
Samuel Birch,
Denis Bodewits,
Nancy Chabot,
Meenakshi Wadhwa,
Mathieu Choukroun,
Simon J. Clemett,
Maitrayee Bose,
Neil Dello Russo,
Jason P. Dworkin,
Jamie E. Elsila,
Kenton Fisher,
Perry Gerakines,
Daniel P. Glavin,
Julie Mitchell,
Michael Mumma,
Ann. N. Nguyen,
Lisa Pace,
Jason Soderblom,
Jessica M. Sunshine
Abstract:
Comets hold answers to mysteries of the Solar System by recording presolar history, the initial states of planet formation and prebiotic organics and volatiles to the early Earth. Analysis of returned samples from a comet nucleus will provide unparalleled knowledge about the Solar System starting materials and how they came together to form planets and give rise to life:
1. How did comets form?…
▽ More
Comets hold answers to mysteries of the Solar System by recording presolar history, the initial states of planet formation and prebiotic organics and volatiles to the early Earth. Analysis of returned samples from a comet nucleus will provide unparalleled knowledge about the Solar System starting materials and how they came together to form planets and give rise to life:
1. How did comets form?
2. Is comet material primordial, or has it undergone a complex alteration history?
3. Does aqueous alteration occur in comets?
4. What is the composition of cometary organics?
5. Did comets supply a substantial fraction of Earth's volatiles?
6. Did cometary organics contribute to the homochirality in life on Earth?
7. How do complex organic molecules form and evolve in interstellar, nebular, and planetary environments?
8. What can comets tell us about the mixing of materials in the protosolar nebula?
△ Less
Submitted 29 September, 2020;
originally announced September 2020.
-
Improved isotope-shift-based bounds on bosons beyond the Standard Model through measurements of the $^2$D$_{3/2} - ^2$D$_{5/2}$ interval in Ca$^+$
Authors:
Cyrille Solaro,
Steffen Meyer,
Karin Fisher,
Julian C. Berengut,
Elina Fuchs,
Michael Drewsen
Abstract:
We perform high-resolution spectroscopy of the $3$d$~^2$D$_{3/2} - 3$d$~^2$D$_{5/2}$ interval in all stable even isotopes of $^A$Ca$^+$ (A = 40, 42, 44, 46 and 48) with an accuracy of $\sim$ 20 Hz using direct frequency-comb Raman spectroscopy. Combining these data with isotope shift measurements of the 4s$~^2$S$_{1/2} \leftrightarrow 3$d$~^2$D$_{5/2}$ transition, we carry out a King plot analysis…
▽ More
We perform high-resolution spectroscopy of the $3$d$~^2$D$_{3/2} - 3$d$~^2$D$_{5/2}$ interval in all stable even isotopes of $^A$Ca$^+$ (A = 40, 42, 44, 46 and 48) with an accuracy of $\sim$ 20 Hz using direct frequency-comb Raman spectroscopy. Combining these data with isotope shift measurements of the 4s$~^2$S$_{1/2} \leftrightarrow 3$d$~^2$D$_{5/2}$ transition, we carry out a King plot analysis with unprecedented sensitivity to coupling between electrons and neutrons by bosons beyond the Standard Model. Furthermore, we estimate the sensitivity to such bosons from equivalent spectroscopy in Ba$^+$ and Yb$^+$. Finally, the data yield isotope shifts of the 4s$~^2$S$_{1/2} \leftrightarrow 3$d$~^2$D$_{3/2}$ transition at 10 part-per-billion through combination with recent data of Knollmann et al (2019).
△ Less
Submitted 1 May, 2020;
originally announced May 2020.
-
Unresolved sideband photon recoil spectroscopy of molecular ions
Authors:
Emilie H. Clausen,
Vincent Jarlaud,
Karin Fisher,
Steffen Meyer,
Cyrille Solaro,
Michael Drewsen
Abstract:
We reflect on the prospect of exploiting the recoil associated with absorption and emission of photons to perform spectroscopy of a single molecular ion. For this recoil to be detectable, the molecular ion is sympathetically cooled by a laser-cooled atomic ion to near their common quantum mechanical ground state within a trapping potential. More specifically, we present a general framework for sim…
▽ More
We reflect on the prospect of exploiting the recoil associated with absorption and emission of photons to perform spectroscopy of a single molecular ion. For this recoil to be detectable, the molecular ion is sympathetically cooled by a laser-cooled atomic ion to near their common quantum mechanical ground state within a trapping potential. More specifically, we present a general framework for simulating the expected photon recoil spectra in regimes where either the natural transition linewidth $Γ_t$ of the molecular ion or the spectral width $Γ_L$ of the exciting light source exceeds the motional frequencies of the two-ion system. To exemplify the framework, we present two complementary cases: spectroscopy of the broad 3s $^2$S$_{1/2}$ - 3p $^2$P$_{3/2}$ electronic transition ($Γ_t/2π= 41.8$ MHz) of a single $^{24}$Mg$^+$ ion at $λ=279.6$ nm by a narrow laser source ($Γ_L/2π\lesssim 1$ MHz) and mid-infrared vibrational spectroscopy of the very narrow $|v=0,J=1\rangle$ - $|v'=1,J'=0\rangle$ transition ($Γ_t/2π= 2.50 $ Hz) at $λ=6.17$ $μ$m in the $^1Σ^+$ electronic ground state of $^{24}$MgH$^+$ by a broadband laser source ($Γ_L/2π\gtrsim$ 50 MHz). The atomic ion $^{24}$Mg$^+$ has been picked to introduce a simple system to make comparisons with experimental results while still capturing most of the physics involved in electronic excitations of molecular ions.
△ Less
Submitted 10 February, 2022; v1 submitted 6 April, 2020;
originally announced April 2020.
-
Generating Digital Twins with Multiple Sclerosis Using Probabilistic Neural Networks
Authors:
Jonathan R. Walsh,
Aaron M. Smith,
Yannick Pouliot,
David Li-Bland,
Anton Loukianov,
Charles K. Fisher
Abstract:
Multiple Sclerosis (MS) is a neurodegenerative disorder characterized by a complex set of clinical assessments. We use an unsupervised machine learning model called a Conditional Restricted Boltzmann Machine (CRBM) to learn the relationships between covariates commonly used to characterize subjects and their disease progression in MS clinical trials. A CRBM is capable of generating digital twins,…
▽ More
Multiple Sclerosis (MS) is a neurodegenerative disorder characterized by a complex set of clinical assessments. We use an unsupervised machine learning model called a Conditional Restricted Boltzmann Machine (CRBM) to learn the relationships between covariates commonly used to characterize subjects and their disease progression in MS clinical trials. A CRBM is capable of generating digital twins, which are simulated subjects having the same baseline data as actual subjects. Digital twins allow for subject-level statistical analyses of disease progression. The CRBM is trained using data from 2395 subjects enrolled in the placebo arms of clinical trials across the three primary subtypes of MS. We discuss how CRBMs are trained and show that digital twins generated by the model are statistically indistinguishable from their actual subject counterparts along a number of measures.
△ Less
Submitted 19 April, 2020; v1 submitted 3 February, 2020;
originally announced February 2020.
-
TxForest: A DSL for Concurrent Filestores
Authors:
Jonathan DiLorenzo,
Katie Mancini,
Kathleen Fisher,
Nate Foster
Abstract:
Many systems use ad hoc collections of files and directories to store persistent data. For consumers of this data, the process of properly parsing, using, and updating these filestores using conventional APIs is cumbersome and error-prone. Making matters worse, most filestores are too big to fit in memory, so applications must process the data incrementally while managing concurrent accesses by mu…
▽ More
Many systems use ad hoc collections of files and directories to store persistent data. For consumers of this data, the process of properly parsing, using, and updating these filestores using conventional APIs is cumbersome and error-prone. Making matters worse, most filestores are too big to fit in memory, so applications must process the data incrementally while managing concurrent accesses by multiple users. This paper presents Transactional Forest (TxForest), which builds on earlier work on Forest to provide a simpler, more powerful API for managing filestores, including a mechanism for managing concurrent accesses using serializable transactions. Under the hood, TxForest implements an optimistic concurrency control scheme using Huet's zippers to track the data associated with filestores. We formalize TxForest in a core calculus, develop a proof of serializability, and describe our OCaml prototype, which we have used to build several practical applications.
△ Less
Submitted 27 August, 2019;
originally announced August 2019.
-
Synthesizing Symmetric Lenses
Authors:
Anders Miltner,
Solomon Maina,
Kathleen Fisher,
Benjamin C. Pierce,
David Walker,
Steve Zdancewic
Abstract:
Lenses are programs that can be run both "front to back" and "back to front," allowing updates to either their source or their target data to be transferred in both directions. Lenses have been extensively studied, extended, and applied. Recent work has demonstrated how techniques from type-directed program synthesis can be used to efficiently synthesize a simple class of lenses---bijective lenses…
▽ More
Lenses are programs that can be run both "front to back" and "back to front," allowing updates to either their source or their target data to be transferred in both directions. Lenses have been extensively studied, extended, and applied. Recent work has demonstrated how techniques from type-directed program synthesis can be used to efficiently synthesize a simple class of lenses---bijective lenses over string data---given a pair of types (regular expressions) and examples.
We extend this synthesis algorithm to a broader class of lenses, called simple symmetric lenses, including all bijective lenses, all of the popular category of "asymmetric" lenses, and a subset of the "symmetric lenses" proposed by Hofmann et al. Intuitively, simple symmetric lenses allow some information to be present on one side but not the other and vice versa. They are of independent theoretical interest, being the largest class of symmetric lenses that do not use persistent internal state.
Synthesizing simple symmetric lenses is more challenging than synthesizing bijective lenses: Since some of the information on each side can be "disconnected" from the other side, there will typically be many lenses that agree with a given example. To guide the search process, we use stochastic regular expressions and information theory to estimate the amount of information propagated by a candidate lens, preferring lenses that propagate more information, as well as user annotations marking parts of the source and target formats as either irrelevant or essential.
We describe an implementation of simple symmetric lenses and our synthesis procedure as extensions to the Boomerang language. We evaluate its performance on 48 benchmark examples drawn from Flash Fill, Augeas, and the bidirectional programming literature. Our implementation can synthesize each of these lenses in under 30 seconds.
△ Less
Submitted 25 June, 2019; v1 submitted 26 October, 2018;
originally announced October 2018.
-
Deep learning for comprehensive forecasting of Alzheimer's Disease progression
Authors:
Charles K. Fisher,
Aaron M. Smith,
Jonathan R. Walsh,
the Coalition Against Major Diseases
Abstract:
Most approaches to machine learning from electronic health data can only predict a single endpoint. Here, we present an alternative that uses unsupervised deep learning to simulate detailed patient trajectories. We use data comprising 18-month trajectories of 44 clinical variables from 1908 patients with Mild Cognitive Impairment or Alzheimer's Disease to train a model for personalized forecasting…
▽ More
Most approaches to machine learning from electronic health data can only predict a single endpoint. Here, we present an alternative that uses unsupervised deep learning to simulate detailed patient trajectories. We use data comprising 18-month trajectories of 44 clinical variables from 1908 patients with Mild Cognitive Impairment or Alzheimer's Disease to train a model for personalized forecasting of disease progression. We simulate synthetic patient data including the evolution of each sub-component of cognitive exams, laboratory tests, and their associations with baseline clinical characteristics, generating both predictions and their confidence intervals. Our unsupervised model predicts changes in total ADAS-Cog scores with the same accuracy as specifically trained supervised models and identifies sub-components associated with word recall as predictive of progression. The ability to simultaneously simulate dozens of patient characteristics is a crucial step towards personalized medicine for Alzheimer's Disease.
△ Less
Submitted 7 November, 2018; v1 submitted 10 July, 2018;
originally announced July 2018.
-
Boltzmann Encoded Adversarial Machines
Authors:
Charles K. Fisher,
Aaron M. Smith,
Jonathan R. Walsh
Abstract:
Restricted Boltzmann Machines (RBMs) are a class of generative neural network that are typically trained to maximize a log-likelihood objective function. We argue that likelihood-based training strategies may fail because the objective does not sufficiently penalize models that place a high probability in regions where the training data distribution has low probability. To overcome this problem, w…
▽ More
Restricted Boltzmann Machines (RBMs) are a class of generative neural network that are typically trained to maximize a log-likelihood objective function. We argue that likelihood-based training strategies may fail because the objective does not sufficiently penalize models that place a high probability in regions where the training data distribution has low probability. To overcome this problem, we introduce Boltzmann Encoded Adversarial Machines (BEAMs). A BEAM is an RBM trained against an adversary that uses the hidden layer activations of the RBM to discriminate between the training data and the probability distribution generated by the model. We present experiments demonstrating that BEAMs outperform RBMs and GANs on multiple benchmarks.
△ Less
Submitted 23 April, 2018;
originally announced April 2018.
-
A high-bias, low-variance introduction to Machine Learning for physicists
Authors:
Pankaj Mehta,
Marin Bukov,
Ching-Hao Wang,
Alexandre G. R. Day,
Clint Richardson,
Charles K. Fisher,
David J. Schwab
Abstract:
Machine Learning (ML) is one of the most exciting and dynamic areas of modern research and application. The purpose of this review is to provide an introduction to the core concepts and tools of machine learning in a manner easily understood and intuitive to physicists. The review begins by covering fundamental concepts in ML and modern statistics such as the bias-variance tradeoff, overfitting, r…
▽ More
Machine Learning (ML) is one of the most exciting and dynamic areas of modern research and application. The purpose of this review is to provide an introduction to the core concepts and tools of machine learning in a manner easily understood and intuitive to physicists. The review begins by covering fundamental concepts in ML and modern statistics such as the bias-variance tradeoff, overfitting, regularization, generalization, and gradient descent before moving on to more advanced topics in both supervised and unsupervised learning. Topics covered in the review include ensemble models, deep learning and neural networks, clustering and data visualization, energy-based models (including MaxEnt models and Restricted Boltzmann Machines), and variational methods. Throughout, we emphasize the many natural connections between ML and statistical physics. A notable aspect of the review is the use of Python Jupyter notebooks to introduce modern ML/statistical packages to readers using physics-inspired datasets (the Ising Model and Monte-Carlo simulations of supersymmetric decays of proton-proton collisions). We conclude with an extended outlook discussing possible uses of machine learning for furthering our understanding of the physical world as well as open problems in ML where physicists may be able to contribute. (Notebooks are available at https://physics.bu.edu/~pankajm/MLnotebooks.html )
△ Less
Submitted 27 May, 2019; v1 submitted 23 March, 2018;
originally announced March 2018.
-
Direct frequency-comb-driven Raman transitions in the terahertz range
Authors:
Cyrille Solaro,
Steffen Meyer,
Karin Fisher,
Michael V. DePalatis,
Michael Drewsen
Abstract:
We demonstrate the use of a femtosecond frequency comb to coherently drive stimulated Raman transitions between terahertz-spaced atomic energy levels. More specifically, we address the $3d~^2D_{3/2}$ and $3d~^2D_{5/2}$ fine structure levels of a single trapped $^{40}$Ca$^+$ ion and spectroscopically resolve the transition frequency to be $ν_D = 1{,}819{,}599{,}021{,}534 \pm 8$ Hz. The achieved acc…
▽ More
We demonstrate the use of a femtosecond frequency comb to coherently drive stimulated Raman transitions between terahertz-spaced atomic energy levels. More specifically, we address the $3d~^2D_{3/2}$ and $3d~^2D_{5/2}$ fine structure levels of a single trapped $^{40}$Ca$^+$ ion and spectroscopically resolve the transition frequency to be $ν_D = 1{,}819{,}599{,}021{,}534 \pm 8$ Hz. The achieved accuracy is nearly a factor of five better than the previous best Raman spectroscopy, and is currently limited by the stability of our atomic clock reference. Furthermore, the population dynamics of frequency-comb-driven Raman transitions can be fully predicted from the spectral properties of the frequency comb, and Rabi oscillations with a contrast of 99.3(6)\% and millisecond coherence time has been achieved. Importantly, the technique can be easily generalized to transitions in the sub-kHz to tens of THz range and should be applicable for driving, e.g., spin-resolved rovibrational transitions in molecules and hyperfine transitions in highly charged ions.
△ Less
Submitted 19 June, 2018; v1 submitted 20 December, 2017;
originally announced December 2017.
-
Abdominal aortic aneurysms and endovascular sealing: deformation and dynamic response
Authors:
L. P. Argani,
F. Torella,
R. K. Fisher,
R. G. McWilliams,
M. L. Wall,
A. B. Movchan
Abstract:
Endovascular sealing is a new technique for the repair of abdominal aortic aneurysms. Commercially available in Europe since~2013, it takes a revolutionary approach to aneurysm repair through minimally invasive techniques. Although aneurysm sealing may be thought as more stable than conventional endovascular stent graft repairs, post-implantation movement of the endoprosthesis has been described,…
▽ More
Endovascular sealing is a new technique for the repair of abdominal aortic aneurysms. Commercially available in Europe since~2013, it takes a revolutionary approach to aneurysm repair through minimally invasive techniques. Although aneurysm sealing may be thought as more stable than conventional endovascular stent graft repairs, post-implantation movement of the endoprosthesis has been described, potentially leading to late complications. The paper presents for the first time a model, which explains the nature of forces, in static and dynamic regimes, acting on sealed abdominal aortic aneurysms, with references to real case studies. It is shown that elastic deformation of the aorta and of the endoprosthesis induced by static forces and vibrations during daily activities can potentially promote undesired movements of the endovascular sealing structure.
△ Less
Submitted 23 October, 2017; v1 submitted 19 October, 2017;
originally announced October 2017.
-
Synthesizing Bijective Lenses
Authors:
Anders Miltner,
Kathleen Fisher,
Benjamin C. Pierce,
David Walker,
Steve Zdancewic
Abstract:
Bidirectional transformations between different data representations occur frequently in modern software systems. They appear as serializers and deserializers, as database views and view updaters, and more. Manually building bidirectional transformations---by writing two separate functions that are intended to be inverses---is tedious and error prone. A better approach is to use a domain-specific…
▽ More
Bidirectional transformations between different data representations occur frequently in modern software systems. They appear as serializers and deserializers, as database views and view updaters, and more. Manually building bidirectional transformations---by writing two separate functions that are intended to be inverses---is tedious and error prone. A better approach is to use a domain-specific language in which both directions can be written as a single expression. However, these domain-specific languages can be difficult to program in, requiring programmers to manage fiddly details while working in a complex type system.
To solve this, we present Optician, a tool for type-directed synthesis of bijective string transformers. The inputs to Optician are two ordinary regular expressions representing two data formats and a few concrete examples for disambiguation. The output is a well-typed program in Boomerang (a bidirectional language based on the theory of lenses). The main technical challenge involves navigating the vast program search space efficiently enough. Unlike most prior work on type-directed synthesis, our system operates in the context of a language with a rich equivalence relation on types (the theory of regular expressions). We synthesize terms of a equivalent language and convert those generated terms into our lens language. We prove the correctness of our synthesis algorithm. We also demonstrate empirically that our new language changes the synthesis problem from one that admits intractable solutions to one that admits highly efficient solutions. We evaluate Optician on a benchmark suite of 39 examples including both microbenchmarks and realistic examples derived from other data management systems including Flash Fill, a tool for synthesizing string transformations in spreadsheets, and Augeas, a tool for bidirectional processing of Linux system configuration files.
△ Less
Submitted 9 October, 2017;
originally announced October 2017.
-
Storage of polarization-entangled THz-bandwidth photons in a diamond quantum memory
Authors:
Kent A. G. Fisher,
Duncan G. England,
Jean-Philippe W. MacLean,
Philip J. Bustard,
Khabat Heshami,
Kevin J. Resch,
Benjamin J. Sussman
Abstract:
Bulk diamond phonons have been shown to be a versatile platform for the generation, storage, and manipulation of high-bandwidth quantum states of light. Here we demonstrate a diamond quantum memory that stores, and releases on demand, an arbitrarily polarized $\sim$250 fs duration photonic qubit. The single-mode nature of the memory is overcome by mapping the two degrees of polarization of the qub…
▽ More
Bulk diamond phonons have been shown to be a versatile platform for the generation, storage, and manipulation of high-bandwidth quantum states of light. Here we demonstrate a diamond quantum memory that stores, and releases on demand, an arbitrarily polarized $\sim$250 fs duration photonic qubit. The single-mode nature of the memory is overcome by mapping the two degrees of polarization of the qubit, via Raman transitions, onto two spatially distinct optical phonon modes located in the same diamond crystal. The two modes are coherently recombined upon retrieval and quantum process tomography confirms that the memory faithfully reproduces the input state with average fidelity $0.784\pm0.004$ with a total memory efficiency of $(0.76\pm0.03)\%$. In an additional demonstration, one photon of a polarization-entangled pair is stored in the memory. We report that entanglement persists in the retrieved state for up to 1.3 ps of storage time. These results demonstrate that the diamond phonon platform can be used in concert with polarization qubits, a key requirement for polarization-encoded photonic processing.
△ Less
Submitted 19 June, 2017;
originally announced June 2017.
-
Pure State Tomography with Pauli Measurements
Authors:
Xian Ma,
Tyler Jackson,
Hui Zhou,
Jianxin Chen,
Dawei Lu,
Michael D. Mazurek,
Kent A. G. Fisher,
Xinhua Peng,
David Kribs,
Kevin J. Resch,
Zhengfeng Ji,
Bei Zeng,
Raymond Laflamme
Abstract:
We examine the problem of finding the minimum number of Pauli measurements needed to uniquely determine an arbitrary $n$-qubit pure state among all quantum states. We show that only $11$ Pauli measurements are needed to determine an arbitrary two-qubit pure state compared to the full quantum state tomography with $16$ measurements, and only $31$ Pauli measurements are needed to determine an arbitr…
▽ More
We examine the problem of finding the minimum number of Pauli measurements needed to uniquely determine an arbitrary $n$-qubit pure state among all quantum states. We show that only $11$ Pauli measurements are needed to determine an arbitrary two-qubit pure state compared to the full quantum state tomography with $16$ measurements, and only $31$ Pauli measurements are needed to determine an arbitrary three-qubit pure state compared to the full quantum state tomography with $64$ measurements. We demonstrate that our protocol is robust under depolarizing error with simulated random pure states. We experimentally test the protocol on two- and three-qubit systems with nuclear magnetic resonance techniques. We show that the pure state tomography protocol saves us a number of measurements without considerable loss of fidelity. We compare our protocol with same-size sets of randomly selected Pauli operators and find that our selected set of Pauli measurements significantly outperforms those random sampling sets. As a direct application, our scheme can also be used to reduce the number of settings needed for pure-state tomography in quantum optical systems.
△ Less
Submitted 20 January, 2016;
originally announced January 2016.
-
An analytically tractable model for community ecology with many species
Authors:
Benjamin Dickens,
Charles K. Fisher,
Pankaj Mehta
Abstract:
A fundamental problem in community ecology is to understand how ecological processes such as selection, drift, and immigration give rise to observed patterns in species composition and diversity. Here, we present a simple, analytically tractable, presence-absence (PA) model for community assembly and use it to ask how ecological traits such as the strength of competition, the amount of diversity,…
▽ More
A fundamental problem in community ecology is to understand how ecological processes such as selection, drift, and immigration give rise to observed patterns in species composition and diversity. Here, we present a simple, analytically tractable, presence-absence (PA) model for community assembly and use it to ask how ecological traits such as the strength of competition, the amount of diversity, and demographic and environmental stochasticity affect species composition in a community. In the PA model, species are treated as stochastic binary variables that can either be present or absent in a community: species can immigrate into the community from a regional species pool and can go extinct due to competition and stochasticity. Despite its simplicity, the PA model reproduces the qualitative features of more complicated models of community assembly. In agreement with recent work on large, competitive Lotka-Volterra systems, the PA model exhibits distinct ecological behaviors organized around a special ("critical") point corresponding to Hubbell's neutral theory of biodiversity. These results suggest that the concepts of ecological "phases" and phase diagrams can provide a powerful framework for thinking about community ecology and that the PA model captures the essential ecological dynamics of community assembly.
△ Less
Submitted 30 November, 2015;
originally announced November 2015.
-
Certifying the presence of a photonic qubit by splitting it in two
Authors:
Evan Meyer-Scott,
Daniel McCloskey,
Klaudia Gołos,
Jeff Z. Salvail,
Kent A. G. Fisher,
Deny Hamel,
Adán Cabello,
Kevin J. Resch,
Thomas Jennewein
Abstract:
We present an implementation of photonic qubit precertification that performs the delicate task of detecting the presence of a flying photon without destroying its qubit state, allowing loss-sensitive quantum cryptography and tests of nonlocality even over long distance. By splitting an incoming single photon in two via parametric down-conversion, we herald the photon's arrival from an independent…
▽ More
We present an implementation of photonic qubit precertification that performs the delicate task of detecting the presence of a flying photon without destroying its qubit state, allowing loss-sensitive quantum cryptography and tests of nonlocality even over long distance. By splitting an incoming single photon in two via parametric down-conversion, we herald the photon's arrival from an independent photon source while preserving its quantum information with up to $92.3\pm0.6$ % fidelity. With reduced detector dark counts, precertification will be immediately useful in quantum communication.
△ Less
Submitted 1 March, 2016; v1 submitted 15 October, 2015;
originally announced October 2015.
-
Habitat Fluctuations Drive Species Covariation in the Human Microbiota
Authors:
Charles K. Fisher,
Thierry Mora,
Aleksandra M. Walczak
Abstract:
Two species with similar resource requirements respond in a characteristic way to variations in their habitat -- their abundances rise and fall in concert. We use this idea to learn how bacterial populations in the microbiota respond to habitat conditions that vary from person-to-person across the human population. Our mathematical framework shows that habitat fluctuations are sufficient for expla…
▽ More
Two species with similar resource requirements respond in a characteristic way to variations in their habitat -- their abundances rise and fall in concert. We use this idea to learn how bacterial populations in the microbiota respond to habitat conditions that vary from person-to-person across the human population. Our mathematical framework shows that habitat fluctuations are sufficient for explaining intra-bodysite correlations in relative species abundances from the Human Microbiome Project. We explicitly show that the relative abundances of phylogenetically related species are positively correlated and can be predicted from taxonomic relationships. We identify a small set of functional pathways related to metabolism and maintenance of the cell wall that form the basis of a common resource sharing niche space of the human microbiota.
△ Less
Submitted 1 October, 2015;
originally announced October 2015.
-
Quantum optical signal processing in diamond
Authors:
Kent A. G. Fisher,
Duncan. G. England,
Jean-Philippe W. MacLean,
Philip J. Bustard,
Kevin J. Resch,
Benjamin J. Sussman
Abstract:
Controlling the properties of single photons is essential for a wide array of emerging optical quantum technologies spanning quantum sensing, quantum computing, and quantum communications. Essential components for these technologies include single photon sources, quantum memories, waveguides, and detectors. The ideal spectral operating parameters (wavelength and bandwidth) of these components are…
▽ More
Controlling the properties of single photons is essential for a wide array of emerging optical quantum technologies spanning quantum sensing, quantum computing, and quantum communications. Essential components for these technologies include single photon sources, quantum memories, waveguides, and detectors. The ideal spectral operating parameters (wavelength and bandwidth) of these components are rarely similar; thus, frequency conversion and spectral control are key enabling steps for component hybridization. Here we perform signal processing of single photons by coherently manipulating their spectra via a modified quantum memory. We store 723.5 nm photons, with 4.1 nm bandwidth, in a room-temperature diamond crystal; upon retrieval we demonstrate centre frequency tunability over 4.2 times the input bandwidth, and bandwidth modulation between 0.5 to 1.9 times the input bandwidth. Our results demonstrate the potential for diamond, and Raman memories in general, to be an integrated platform for photon storage and spectral conversion.
△ Less
Submitted 18 September, 2015; v1 submitted 16 September, 2015;
originally announced September 2015.
-
Bayesian feature selection with strongly-regularizing priors maps to the Ising Model
Authors:
Charles K. Fisher,
Pankaj Mehta
Abstract:
Identifying small subsets of features that are relevant for prediction and/or classification tasks is a central problem in machine learning and statistics. The feature selection task is especially important, and computationally difficult, for modern datasets where the number of features can be comparable to, or even exceed, the number of samples. Here, we show that feature selection with Bayesian…
▽ More
Identifying small subsets of features that are relevant for prediction and/or classification tasks is a central problem in machine learning and statistics. The feature selection task is especially important, and computationally difficult, for modern datasets where the number of features can be comparable to, or even exceed, the number of samples. Here, we show that feature selection with Bayesian inference takes a universal form and reduces to calculating the magnetizations of an Ising model, under some mild conditions. Our results exploit the observation that the evidence takes a universal form for strongly-regularizing priors --- priors that have a large effect on the posterior probability even in the infinite data limit. We derive explicit expressions for feature selection for generalized linear models, a large class of statistical techniques that include linear and logistic regression. We illustrate the power of our approach by analyzing feature selection in a logistic regression-based classifier trained to distinguish between the letters B and D in the notMNIST dataset.
△ Less
Submitted 3 November, 2014;
originally announced November 2014.
-
Variational Pseudolikelihood for Regularized Ising Inference
Authors:
Charles K. Fisher
Abstract:
I propose a variational approach to maximum pseudolikelihood inference of the Ising model. The variational algorithm is more computationally efficient, and does a better job predicting out-of-sample correlations than $L_2$ regularized maximum pseudolikelihood inference as well as mean field and isolated spin pair approximations with pseudocount regularization. The key to the approach is a variatio…
▽ More
I propose a variational approach to maximum pseudolikelihood inference of the Ising model. The variational algorithm is more computationally efficient, and does a better job predicting out-of-sample correlations than $L_2$ regularized maximum pseudolikelihood inference as well as mean field and isolated spin pair approximations with pseudocount regularization. The key to the approach is a variational energy that regularizes the inference problem by shrinking the couplings towards zero, while still allowing some large couplings to explain strong correlations. The utility of the variational pseudolikelihood approach is illustrated by training an Ising model to represent the letters A-J using samples of letters from different computer fonts.
△ Less
Submitted 24 September, 2014;
originally announced September 2014.
-
Storage and retrieval of ultrafast single photons using a room-temperature diamond quantum memory
Authors:
Duncan G. England,
Kent A. G. Fisher,
Jean-Philippe W. MacLean,
Philip J. Bustard,
Rune Lausten,
Kevin J. Resch,
Benjamin J. Sussman
Abstract:
We report the storage and retrieval of single photons, via a quantum memory, in the optical phonons of room-temperature bulk diamond. The THz-bandwidth heralded photons are generated by spontaneous parametric downconversion and mapped to phonons via a Raman transition, stored for a variable delay, and released on demand. The second-order correlation of the memory output is…
▽ More
We report the storage and retrieval of single photons, via a quantum memory, in the optical phonons of room-temperature bulk diamond. The THz-bandwidth heralded photons are generated by spontaneous parametric downconversion and mapped to phonons via a Raman transition, stored for a variable delay, and released on demand. The second-order correlation of the memory output is $g^{(2)}(0) = 0.65 \pm 0.07$, demonstrating preservation of non-classical photon statistics throughout storage and retrieval. The memory is low-noise, high-speed and broadly tunable; it therefore promises to be a versatile light-matter interface for local quantum processing applications.
△ Less
Submitted 11 September, 2014; v1 submitted 9 September, 2014;
originally announced September 2014.
-
Fast Bayesian Feature Selection for High Dimensional Linear Regression in Genomics via the Ising Approximation
Authors:
Charles K. Fisher,
Pankaj Mehta
Abstract:
Feature selection, identifying a subset of variables that are relevant for predicting a response, is an important and challenging component of many methods in statistics and machine learning. Feature selection is especially difficult and computationally intensive when the number of variables approaches or exceeds the number of samples, as is often the case for many genomic datasets. Here, we intro…
▽ More
Feature selection, identifying a subset of variables that are relevant for predicting a response, is an important and challenging component of many methods in statistics and machine learning. Feature selection is especially difficult and computationally intensive when the number of variables approaches or exceeds the number of samples, as is often the case for many genomic datasets. Here, we introduce a new approach -- the Bayesian Ising Approximation (BIA) -- to rapidly calculate posterior probabilities for feature relevance in L2 penalized linear regression. In the regime where the regression problem is strongly regularized by the prior, we show that computing the marginal posterior probabilities for features is equivalent to computing the magnetizations of an Ising model. Using a mean field approximation, we show it is possible to rapidly compute the feature selection path described by the posterior probabilities as a function of the L2 penalty. We present simulations and analytical results illustrating the accuracy of the BIA on some simple regression problems. Finally, we demonstrate the applicability of the BIA to high dimensional regression by analyzing a gene expression dataset with nearly 30,000 features.
△ Less
Submitted 30 July, 2014;
originally announced July 2014.
-
Thermodynamics of statistical inference by cells
Authors:
Alex H. Lang,
Charles K. Fisher,
Thierry Mora,
Pankaj Mehta
Abstract:
The deep connection between thermodynamics, computation, and information is now well established both theoretically and experimentally. Here, we extend these ideas to show that thermodynamics also places fundamental constraints on statistical estimation and learning. To do so, we investigate the constraints placed by (nonequilibrium) thermodynamics on the ability of biochemical signaling networks…
▽ More
The deep connection between thermodynamics, computation, and information is now well established both theoretically and experimentally. Here, we extend these ideas to show that thermodynamics also places fundamental constraints on statistical estimation and learning. To do so, we investigate the constraints placed by (nonequilibrium) thermodynamics on the ability of biochemical signaling networks within cells to estimate the concentration of an external signal. We show that accuracy is limited by energy consumption, suggesting that there are fundamental thermodynamic constraints on statistical inference.
△ Less
Submitted 6 October, 2014; v1 submitted 15 May, 2014;
originally announced May 2014.
-
Identifying Keystone Species in the Human Gut Microbiome from Metagenomic Timeseries using Sparse Linear Regression
Authors:
Charles K. Fisher,
Pankaj Mehta
Abstract:
Human associated microbial communities exert tremendous influence over human health and disease. With modern metagenomic sequencing methods it is possible to follow the relative abundance of microbes in a community over time. These microbial communities exhibit rich ecological dynamics and an important goal of microbial ecology is to infer the interactions between species from sequence data. Any a…
▽ More
Human associated microbial communities exert tremendous influence over human health and disease. With modern metagenomic sequencing methods it is possible to follow the relative abundance of microbes in a community over time. These microbial communities exhibit rich ecological dynamics and an important goal of microbial ecology is to infer the interactions between species from sequence data. Any algorithm for inferring species interactions must overcome three obstacles: 1) a correlation between the abundances of two species does not imply that those species are interacting, 2) the sum constraint on the relative abundances obtained from metagenomic studies makes it difficult to infer the parameters in timeseries models, and 3) errors due to experimental uncertainty, or mis-assignment of sequencing reads into operational taxonomic units, bias inferences of species interactions. Here we introduce an approach, Learning Interactions from MIcrobial Time Series (LIMITS), that overcomes these obstacles. LIMITS uses sparse linear regression with boostrap aggregation to infer a discrete-time Lotka-Volterra model for microbial dynamics. We tested LIMITS on synthetic data and showed that it could reliably infer the topology of the inter-species ecological interactions. We then used LIMITS to characterize the species interactions in the gut microbiomes of two individuals and found that the interaction networks varied significantly between individuals. Furthermore, we found that the interaction networks of the two individuals are dominated by distinct "keystone species", Bacteroides fragilis and Bacteroided stercosis, that have a disproportionate influence on the structure of the gut microbiome even though they are only found in moderate abundance. Based on our results, we hypothesize that the abundances of certain keystone species may be responsible for individuality in the human gut microbiome.
△ Less
Submitted 3 February, 2014;
originally announced February 2014.
-
Quantum computing on encrypted data
Authors:
K. Fisher,
A. Broadbent,
L. K. Shalm,
Z. Yan,
J. Lavoie,
R. Prevedel,
T. Jennewein,
K. J. Resch
Abstract:
The ability to perform computations on encrypted data is a powerful tool for protecting privacy. Recently, protocols to achieve this on classical computing systems have been found. Here we present an efficient solution to the quantum analogue of this problem that enables arbitrary quantum computations to be carried out on encrypted quantum data. We prove that an untrusted server can implement a un…
▽ More
The ability to perform computations on encrypted data is a powerful tool for protecting privacy. Recently, protocols to achieve this on classical computing systems have been found. Here we present an efficient solution to the quantum analogue of this problem that enables arbitrary quantum computations to be carried out on encrypted quantum data. We prove that an untrusted server can implement a universal set of quantum gates on encrypted quantum bits (qubits) without learning any information about the inputs, while the client, knowing the decryption key, can easily decrypt the results of the computation. We experimentally demonstrate, using single photons and linear optics, the encryption and decryption scheme on a set of gates sufficient for arbitrary quantum computations. Because our protocol requires few extra resources compared to other schemes it can be easily incorporated into the design of future quantum servers. These results will play a key role in enabling the development of secure distributed quantum systems.
△ Less
Submitted 10 September, 2013;
originally announced September 2013.
-
Experimental Three-Particle Quantum Nonlocality under Strict Locality Conditions
Authors:
C. Erven,
E. Meyer-Scott,
K. Fisher,
J. Lavoie,
B. L. Higgins,
Z. Yan,
C. J. Pugh,
J. -P. Bourgoin,
R. Prevedel,
L. K. Shalm,
L. Richards,
N. Gigov,
R. Laflamme,
G. Weihs,
T. Jennewein,
K. J. Resch
Abstract:
Quantum correlations are critical to our understanding of nature, with far-reaching technological and fundamental impact. These often manifest as violations of Bell's inequalities, bounds derived from the assumptions of locality and realism, concepts integral to classical physics. Many tests of Bell's inequalities have studied pairs of correlated particles; however, the immense interest in multi-p…
▽ More
Quantum correlations are critical to our understanding of nature, with far-reaching technological and fundamental impact. These often manifest as violations of Bell's inequalities, bounds derived from the assumptions of locality and realism, concepts integral to classical physics. Many tests of Bell's inequalities have studied pairs of correlated particles; however, the immense interest in multi-particle quantum correlations is driving the experimental frontier to test systems beyond just pairs. All experimental violations of Bell's inequalities to date require supplementary assumptions, opening the results to one or more loopholes, the closing of which is one of the most important challenges in quantum science. Individual loopholes have been closed in experiments with pairs of particles and a very recent result closed the detection loophole in a six ion experiment. No experiment thus far has closed the locality loopholes with three or more particles. Here, we distribute three-photon Greenberger-Horne-Zeilinger entangled states using optical fibre and free-space links to independent measurement stations. The measured correlations constitute a test of Mermin's inequality while closing both the locality and related freedom-of-choice loopholes due to our experimental configuration and timing. We measured a Mermin parameter of 2.77 +/- 0.08, violating the inequality bound of 2 by over 9 standard deviations, with minimum tolerances for the locality and freedom-of-choice loopholes of 264 +/- 28 ns and 304 +/- 25 ns, respectively. These results represent a significant advance towards definitive tests of the foundations of quantum mechanics and practical multi-party quantum communications protocols.
△ Less
Submitted 5 September, 2013;
originally announced September 2013.
-
A phase transition between the niche and neutral regimes in ecology
Authors:
Charles K. Fisher,
Pankaj Mehta
Abstract:
An ongoing debate in ecology concerns the impacts of ecological drift and selection on community assembly. Here, we show that there is a sharp phase transition in diverse ecological communities between a selection dominated regime (the niche phase) and a drift dominated regime (the neutral phase). Simulations and analytic arguments show that the niche phase is favored in communities with large pop…
▽ More
An ongoing debate in ecology concerns the impacts of ecological drift and selection on community assembly. Here, we show that there is a sharp phase transition in diverse ecological communities between a selection dominated regime (the niche phase) and a drift dominated regime (the neutral phase). Simulations and analytic arguments show that the niche phase is favored in communities with large population sizes and relatively constant environments, whereas the neutral phase is favored in communities with small population sizes and fluctuating environments. Our results demonstrate how apparently neutral populations may arise even in communities inhabited by species with varying traits.
△ Less
Submitted 28 March, 2014; v1 submitted 13 August, 2013;
originally announced August 2013.
-
Optimal linear optical implementation of a single-qubit damping channel
Authors:
Kent Fisher,
Robert Prevedel,
Rainer Kaltenbaek,
Kevin J. Resch
Abstract:
We experimentally demonstrate a single-qubit decohering quantum channel using linear optics. We implement the channel, whose special cases include both the amplitude-damping channel and the bit-flip channel, using a single, static optical setup. Following a recent theoretical result [M. Piani et al., Phys. Rev. A, 84, 032304 (2011)], we realize the channel in an optimal way, maximizing the probabi…
▽ More
We experimentally demonstrate a single-qubit decohering quantum channel using linear optics. We implement the channel, whose special cases include both the amplitude-damping channel and the bit-flip channel, using a single, static optical setup. Following a recent theoretical result [M. Piani et al., Phys. Rev. A, 84, 032304 (2011)], we realize the channel in an optimal way, maximizing the probability of success, i.e., the probability for the photonic qubit to remain in its encoding. Using a two-photon entangled resource, we characterize the channel using ancilla-assisted process tomography and find average process fidelities of 0.9808 \pm 0.0002 and 0.9762 \pm 0.0002 for amplitude-damping and the bit-flip case, respectively.
△ Less
Submitted 9 September, 2011;
originally announced September 2011.
-
Electron pockets and pseudogap Dirac point in underdoped cuprate superconductors
Authors:
K. A. G. Fisher,
E. J. Nicol,
J. P. Carbotte
Abstract:
We consider a model of the pseudogap specifically designed to describe the underdoped cuprates and which exhibits particle-hole asymmetry. The presence of electron pockets, besides the usual hole pockets, leads to the appearance of new vectors beyond the usual so-called octet model in the joint density of states (JDOS), which underlies the analysis of Fourier-transform scanning tunneling spectrosc…
▽ More
We consider a model of the pseudogap specifically designed to describe the underdoped cuprates and which exhibits particle-hole asymmetry. The presence of electron pockets, besides the usual hole pockets, leads to the appearance of new vectors beyond the usual so-called octet model in the joint density of states (JDOS), which underlies the analysis of Fourier-transform scanning tunneling spectroscopy (FT-STS) data. These new vectors are associated with distinct patterns of large amplitude in the JDOS and are expected to occur primarily at positive bias. Likewise a pseudogap Dirac point occurs at positive bias and this point can be determined either through FT-STS or through extrapolation of data from the autocorrelation function of angle-resolved photoemission spectroscopy.
△ Less
Submitted 27 July, 2011;
originally announced July 2011.
-
Experimental investigation of the uncertainty principle in the presence of quantum memory
Authors:
Robert Prevedel,
Deny R. Hamel,
Roger Colbeck,
Kent Fisher,
Kevin J. Resch
Abstract:
Heisenberg's uncertainty principle provides a fundamental limitation on an observer's ability to simultaneously predict the outcome when one of two measurements is performed on a quantum system. However, if the observer has access to a particle (stored in a quantum memory) which is entangled with the system, his uncertainty is generally reduced. This effect has recently been quantified by Berta et…
▽ More
Heisenberg's uncertainty principle provides a fundamental limitation on an observer's ability to simultaneously predict the outcome when one of two measurements is performed on a quantum system. However, if the observer has access to a particle (stored in a quantum memory) which is entangled with the system, his uncertainty is generally reduced. This effect has recently been quantified by Berta et al. [Nature Physics 6, 659 (2010)] in a new, more general uncertainty relation, formulated in terms of entropies. Using entangled photon pairs, an optical delay line serving as a quantum memory and fast, active feed-forward we experimentally probe the validity of this new relation. The behaviour we find agrees with the predictions of quantum theory and satisfies the new uncertainty relation. In particular, we find lower uncertainties about the measurement outcomes than would be possible without the entangled particle. This shows not only that the reduction in uncertainty enabled by entanglement can be significant in practice, but also demonstrates the use of the inequality to witness entanglement.
△ Less
Submitted 1 December, 2010;
originally announced December 2010.
-
Effect of pseudogap formation on the penetration depth of underdoped high $T_c$ cuprates
Authors:
J. P. Carbotte,
K. A. G. Fisher,
J. P. F. LeBlanc,
E. J. Nicol
Abstract:
The penetration depth is calculated over the entire doping range of the cuprate phase diagram with emphasis on the underdoped regime. Pseudogap formation on approaching the Mott transition, for doping below a quantum critical point, is described within a model based on the resonating valence bond spin liquid which provides an ansatz for the coherent piece of the Green's function. Fermi surface r…
▽ More
The penetration depth is calculated over the entire doping range of the cuprate phase diagram with emphasis on the underdoped regime. Pseudogap formation on approaching the Mott transition, for doping below a quantum critical point, is described within a model based on the resonating valence bond spin liquid which provides an ansatz for the coherent piece of the Green's function. Fermi surface reconstruction, which is an essential element of the model, has a strong effect on the superfluid density at T=0 producing a sharp drop in magnitude, but does not change the slope of the linear low temperature variation. Comparison with recent data on Bi-based cuprates provides validation of the theory and shows that the effects of correlations, captured by Gutzwiller factors, are essential for a qualitative understanding of the data. We find that the Ferrell-Glover-Tinkham sum rule still holds and we compare our results with those for the Fermi arc and the nodal liquid models.
△ Less
Submitted 21 September, 2009;
originally announced September 2009.
-
Dynamics of polydisperse irreversible adsorption: a pharmacological example
Authors:
Radek Erban,
Jonathan Chapman,
Kerry D. Fisher,
Ioannis G. Kevrekidis,
Leonard W. Seymour
Abstract:
Many drug delivery systems suffer from undesirable interactions with the host immune system. It has been experimentally established that covalent attachment (irreversible adsorption) of suitable macromolecules to the surface of the drug carrier can reduce such undesirable interactions. A fundamental understanding of the adsorption process is still lacking. In this paper, the classical random irr…
▽ More
Many drug delivery systems suffer from undesirable interactions with the host immune system. It has been experimentally established that covalent attachment (irreversible adsorption) of suitable macromolecules to the surface of the drug carrier can reduce such undesirable interactions. A fundamental understanding of the adsorption process is still lacking. In this paper, the classical random irreversible adsorption model is generalized to capture certain essential processes involved in pharmacological applications, allowing for macromolecules of different sizes, partial overlapping of the tails of macromolecules, and the influence of reactions with the solvent on the adsorption process. Working in one dimension, an integro-differential evolution equation for the adsorption process is derived and the asymptotic behaviour of the surface area covered and the number of molecules attached to the surface is studied. Finally, equation-free dynamic renormalization tools are applied to study the asymptotically self-similar behaviour of the adsorption statistics.
△ Less
Submitted 31 January, 2006;
originally announced February 2006.
-
A Framework for Interoperability
Authors:
Kathleen Fisher,
Riccardo Pucella,
John Reppy
Abstract:
Practical implementations of high-level languages must provide access to libraries and system services that have APIs specified in a low-level language (usually C). An important characteristic of such mechanisms is the foreign-interface policy that defines how to bridge the semantic gap between the high-level language and C. For example, IDL-based tools generate code to marshal data into and out…
▽ More
Practical implementations of high-level languages must provide access to libraries and system services that have APIs specified in a low-level language (usually C). An important characteristic of such mechanisms is the foreign-interface policy that defines how to bridge the semantic gap between the high-level language and C. For example, IDL-based tools generate code to marshal data into and out of the high-level representation according to user annotations. The design space of foreign-interface policies is large and there are pros and cons to each approach. Rather than commit to a particular policy, we choose to focus on the problem of supporting a gamut of interoperability policies. In this paper, we describe a framework for language interoperability that is expressive enough to support very efficient implementations of a wide range of different foreign-interface policies. We describe two tools that implement substantially different policies on top of our framework and present benchmarks that demonstrate their efficiency.
△ Less
Submitted 23 May, 2004;
originally announced May 2004.
-
Fractional vortices in the XY model with $π$ bonds
Authors:
R. V. Kulkarni,
E. Almaas,
K. D. Fisher,
D. Stroud
Abstract:
We define a new set of excitations in the XY model which we call ``fractional vortices''. In the frustrated XY model containing $π$ bonds, we make the ansatz that the ground state configurations can be characterized by pairs of oppositely charged fractional vortices. For a chain of $π$ bonds, the ground state energy and the phase configurations calculated on the basis of this ansatz agree well w…
▽ More
We define a new set of excitations in the XY model which we call ``fractional vortices''. In the frustrated XY model containing $π$ bonds, we make the ansatz that the ground state configurations can be characterized by pairs of oppositely charged fractional vortices. For a chain of $π$ bonds, the ground state energy and the phase configurations calculated on the basis of this ansatz agree well with the results from direct numerical simulations. Finally, we discuss the possible connection of these results to some recent experiments by Kirtley {\it et al} [Phys. Rev. B {\bf 51}, R12057 (1995)] on high-T$_c$ superconductors where fractional flux trapping was observed along certain grain boundaries.
△ Less
Submitted 7 November, 2000; v1 submitted 31 August, 2000;
originally announced August 2000.