Search | arXiv e-print repository

Penalized Subgrouping of Heterogeneous Time Series

Authors: Christopher M. Crawford, Jonathan J. Park, Sy-Miin Chow, Anja F. Ernst, Vladas Pipiras, Zachary F. Fisher

Abstract: Interest in the study and analysis of dynamic processes in the social, behavioral, and health sciences has burgeoned in recent years due to the increased availability of intensive longitudinal data. However, how best to model and account for the persistent heterogeneity characterizing such processes remains an open question. The multi-VAR framework, a recent methodological development built on the vector autoregressive model, accommodates heterogeneous dynamics in multiple-subject time series through structured penalization. In the original multi-VAR proposal, individual-level transition matrices are decomposed into common and unique dynamics, allowing for generalizable and person-specific features. The current project extends this framework to allow additionally for the identification and penalized estimation of subgroup-specific dynamics; that is, patterns of dynamics that are shared across subsets of individuals. The performance of the proposed subgrouping extension is evaluated in the context of both a simulation study and empirical application, and results are compared to alternative methods for subgrouping multiple-subject, multivariate time series.

Submitted 4 September, 2024; originally announced September 2024.

arXiv:2403.19710 [pdf, other]

STRUM-LLM: Attributed and Structured Contrastive Summarization

Authors: Beliz Gunel, James B. Wendt, Jing Xie, Yichao Zhou, Nguyen Vo, Zachary Fisher, Sandeep Tata

Abstract: Users often struggle with decision-making between two options (A vs B), as it usually requires time-consuming research across multiple web pages. We propose STRUM-LLM that addresses this challenge by generating attributed, structured, and helpful contrastive summaries that highlight key differences between the two options. STRUM-LLM identifies helpful contrast: the specific attributes along which the two options differ significantly and which are most likely to influence the user's decision. Our technique is domain-agnostic, and does not require any human-labeled data or fixed attribute list as supervision. STRUM-LLM attributes all extractions back to the input sources along with textual evidence, and it does not have a limit on the length of input sources that it can process. STRUM-LLM Distilled has 100x more throughput than the models with comparable performance while being 10x smaller. In this paper, we provide extensive evaluations for our method and lay out future directions for our currently deployed system.

Submitted 25 March, 2024; originally announced March 2024.

arXiv:2403.05530 [pdf, other]

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1112 additional authors not shown)

Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February version on the great majority of capabilities and benchmarks; (2) Gemini 1.5 Flash, a more lightweight variant designed for efficiency with minimal regression in quality. Gemini 1.5 models achieve near-perfect recall on long-context retrieval tasks across modalities, improve the state-of-the-art in long-document QA, long-video QA and long-context ASR, and match or surpass Gemini 1.0 Ultra's state-of-the-art performance across a broad set of benchmarks. Studying the limits of Gemini 1.5's long-context ability, we find continued improvement in next-token prediction and near-perfect retrieval (>99%) up to at least 10M tokens, a generational leap over existing models such as Claude 3.0 (200k) and GPT-4 Turbo (128k).
Finally, we highlight real-world use cases, such as Gemini 1.5 collaborating with professionals on completing their tasks achieving 26 to 75% time savings across 10 different job categories, as well as surprising new capabilities of large language models at the frontier; when given a grammar manual for Kalamang, a language with fewer than 200 speakers worldwide, the model learns to translate English to Kalamang at a similar level to a person who learned from the same content.

Submitted 16 December, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

arXiv:2312.10003 [pdf, other]

ReST meets ReAct: Self-Improvement for Multi-Step Reasoning LLM Agent

Authors: Renat Aksitov, Sobhan Miryoosefi, Zonglin Li, Daliang Li, Sheila Babayan, Kavya Kopparapu, Zachary Fisher, Ruiqi Guo, Sushant Prakash, Pranesh Srinivasan, Manzil Zaheer, Felix Yu, Sanjiv Kumar

Abstract: Answering complex natural language questions often necessitates multi-step reasoning and integrating external information. Several systems have combined knowledge retrieval with a large language model (LLM) to answer such questions. These systems, however, suffer from various failure cases, and we cannot directly train them end-to-end to fix such failures, as interaction with external knowledge is non-differentiable. To address these deficiencies, we define a ReAct-style LLM agent with the ability to reason and act upon external knowledge. We further refine the agent through a ReST-like method that iteratively trains on previous trajectories, employing growing-batch reinforcement learning with AI feedback for continuous self-improvement and self-distillation. Starting from a prompted large model and after just two iterations of the algorithm, we can produce a fine-tuned small model that achieves comparable performance on challenging compositional question-answering benchmarks with two orders of magnitude fewer parameters.

Submitted 15 December, 2023; originally announced December 2023.

Comments: 19 pages, 4 figures, 4 tables, 8 listings

arXiv:2311.08658 [pdf, other]

Structured Estimation of Heterogeneous Time Series

Authors: Zachary F. Fisher, Younghoon Kim, Vladas Pipiras, Christopher Crawford, Daniel J. Petrie, Michael D. Hunter, Charles F. Geier

Abstract: How best to model structurally heterogeneous processes is a foundational question in the social, health and behavioral sciences. Recently, Fisher et al. (2022) introduced the multi-VAR approach for simultaneously estimating multiple-subject multivariate time series characterized by common and individualizing features using penalized estimation. This approach differs from many popular modeling approaches for multiple-subject time series in that qualitative and quantitative differences in a large number of individual dynamics are well-accommodated. The current work extends the multi-VAR framework to include new adaptive weighting schemes that greatly improve estimation performance. In a small set of simulation studies we compare adaptive multi-VAR with these new penalty weights to common alternative estimators in terms of path recovery and bias. Furthermore, we provide toy examples and code demonstrating the utility of multi-VAR under different heterogeneity regimes using the multivar package for R (Fisher, 2022).

Submitted 14 November, 2023; originally announced November 2023.

arXiv:2307.15330 [pdf, other]

Group integrative dynamic factor models with application to multiple subject brain connectivity

Authors: Younghoon Kim, Zachary F. Fisher, Vladas Pipiras

Abstract: This work introduces a novel framework for dynamic factor model-based group-level analysis of multiple subjects time series data, called GRoup Integrative DYnamic factor (GRIDY) models. The framework identifies and characterizes inter-subject similarities and differences between two pre-determined groups by considering a combination of group spatial information and individual temporal dynamics. Furthermore, it enables the identification of intra-subject similarities and differences over time by employing different model configurations for each subject. Methodologically, the framework combines a novel principal angle-based rank selection algorithm and a non-iterative integrative analysis framework. Inspired by simultaneous component analysis, this approach also reconstructs identifiable latent factor series with flexible covariance structures. The performance of the GRIDY models is evaluated through simulations conducted under various scenarios. An application is also presented to compare resting-state functional MRI data collected from multiple subjects in autism spectrum disorder and control groups.

Submitted 25 July, 2024; v1 submitted 28 July, 2023; originally announced July 2023.

arXiv:2307.10454 [pdf, other]

Latent Gaussian dynamic factor modeling and forecasting for multivariate count time series

Authors: Younghoon Kim, Marie-Christine Düker, Zachary F. Fisher, Vladas Pipiras

Abstract: This work considers estimation and forecasting in a multivariate, possibly high-dimensional count time series model constructed from a transformation of a latent Gaussian dynamic factor series. The estimation of the latent model parameters is based on second-order properties of the count and underlying Gaussian time series, yielding estimators of the underlying covariance matrices for which standard principal component analysis applies. Theoretical consistency results are established for the proposed estimation, building on certain concentration results for the models of the type considered. They also involve the memory of the latent Gaussian process, quantified through a spectral gap, shown to be suitably bounded as the model dimension increases, which is of independent interest. In addition, novel cross-validation schemes are suggested for model selection. The forecasting is carried out through a particle-based sequential Monte Carlo, leveraging Kalman filtering techniques. A simulation study and an application are also considered.

Submitted 3 April, 2025; v1 submitted 19 July, 2023; originally announced July 2023.

arXiv:2212.10770 [pdf, other]

ImPaKT: A Dataset for Open-Schema Knowledge Base Construction

Authors: Luke Vilnis, Zach Fisher, Bhargav Kanagal, Patrick Murray, Sumit Sanghai

Abstract: Large language models have ushered in a golden age of semantic parsing. The seq2seq paradigm allows for open-schema and abstractive attribute and relation extraction given only small amounts of finetuning data. Language model pretraining has simultaneously enabled great strides in natural language inference, reasoning about entailment and implication in free text. These advances motivate us to construct ImPaKT, a dataset for open-schema information extraction, consisting of around 2500 text snippets from the C4 corpus, in the shopping domain (product buying guides), professionally annotated with extracted attributes, types, attribute summaries (attribute schema discovery from idiosyncratic text), many-to-one relations between compound and atomic attributes, and implication relations. We release this data in hope that it will be useful in fine tuning semantic parsers for information extraction and knowledge base construction across a variety of domains. We evaluate the power of this approach by fine-tuning the open source UL2 language model on a subset of the dataset, extracting a set of implication relations from a corpus of product buying guides, and conducting human evaluations of the resulting predictions.

Submitted 21 December, 2022; originally announced December 2022.

Comments: 14 pages. Preprint

arXiv:2203.15099 [pdf, other]

LogicInference: A New Dataset for Teaching Logical Inference to seq2seq Models

Authors: Santiago Ontanon, Joshua Ainslie, Vaclav Cvicek, Zachary Fisher

Abstract: Machine learning models such as Transformers or LSTMs struggle with tasks that are compositional in nature such as those involving reasoning/inference. Although many datasets exist to evaluate compositional generalization, when it comes to evaluating inference abilities, options are more limited. This paper presents LogicInference, a new dataset to evaluate the ability of models to perform logical inference. The dataset focuses on inference using propositional logic and a small subset of first-order logic, represented both in semi-formal logical notation, as well as in natural language. We also report initial results using a collection of machine learning models to establish an initial baseline in this dataset.

Submitted 11 April, 2022; v1 submitted 28 March, 2022; originally announced March 2022.

Comments: Accepted at ICLR 2022 OSC workshop (v3 contains updated results after fixing a problem in dataset generation)

arXiv:2108.04378 [pdf, other]

Making Transformers Solve Compositional Tasks

Authors: Santiago Ontañón, Joshua Ainslie, Vaclav Cvicek, Zachary Fisher

Abstract: Several studies have reported the inability of Transformer models to generalize compositionally, a key type of generalization in many NLP tasks such as semantic parsing. In this paper we explore the design space of Transformer models showing that the inductive biases given to the model by several design decisions significantly impact compositional generalization. Through this exploration, we identified Transformer configurations that generalize compositionally significantly better than previously reported in the literature in a diverse set of compositional tasks, and that achieve state-of-the-art results in a semantic parsing compositional generalization benchmark (COGS), and a string edit operation composition benchmark (PCFG).

Submitted 3 March, 2022; v1 submitted 9 August, 2021; originally announced August 2021.

Comments: Source code: https://github.com/google-research/google-research/tree/master/compositional_transformers

Journal ref: ACL 2022

arXiv:2007.11658 [pdf, other]

Evaporating Black Holes Coupled to a Thermal Bath

Authors: Hong Zhe Chen, Zachary Fisher, Juan Hernandez, Robert C. Myers, Shan-Ming Ruan

Abstract: We study the doubly holographic model of [arXiv:1908.10996] in the situation where a black hole in two-dimensional JT gravity theory is coupled to an auxiliary bath system at arbitrary finite temperature. Depending on the initial temperature of the black hole relative to the bath temperature, the black hole can lose mass by emitting Hawking radiation, stay in equilibrium with the bath or gain mass by absorbing thermal radiation from the bath. In all of these scenarios, a unitary Page curve is obtained by applying the usual prescription for holographic entanglement entropy and identifying the quantum extremal surface for the generalized entropy, using both analytical and numeric calculations. As the application of the entanglement wedge reconstruction, we further investigate the reconstruction of the black hole interior from a subsystem containing the Hawking radiation. We examine the roles of the Hawking radiation and also the purification of the thermal bath in this reconstruction.

Submitted 9 August, 2020; v1 submitted 22 July, 2020; originally announced July 2020.

Comments: 77 pages, 14 figures; v2: corrected typos; added a simpler derivation to QES; expanded discussion about critical temperature

arXiv:2007.09672 [pdf, other]

A Square-Root Second-Order Extended Kalman Filtering Approach for Estimating Smoothly Time-Varying Parameters

Authors: Zachary F. Fisher, Sy-Miin Chow, Peter C. M. Molenaar, Barbara L. Fredrickson, Vladas Pipiras, Kathleen M. Gates

Abstract: Researchers collecting intensive longitudinal data (ILD) are increasingly looking to model psychological processes, such as emotional dynamics, that organize and adapt across time in complex and meaningful ways. This is also the case for researchers looking to characterize the impact of an intervention on individual behavior. To be useful, statistical models must be capable of characterizing these processes as complex, time-dependent phenomena, otherwise only a fraction of the system dynamics will be recovered. In this paper we introduce a Square-Root Second-Order Extended Kalman Filtering approach for estimating smoothly time-varying parameters. This approach is capable of handling dynamic factor models where the relations between variables underlying the processes of interest change in a manner that may be difficult to specify in advance. We examine the performance of our approach in a Monte Carlo simulation and show the proposed algorithm accurately recovers the unobserved states in the case of a bivariate dynamic factor model with time-varying dynamics and treatment effects. Furthermore, we illustrate the utility of our approach in characterizing the time-varying effect of a meditation intervention on day-to-day emotional experiences.

Submitted 19 July, 2020; originally announced July 2020.

arXiv:2007.05052 [pdf, other]

doi 10.1007/s11336-021-09825-7

Penalized Estimation and Forecasting of Multiple Subject Intensive Longitudinal Data

Authors: Zachary F. Fisher, Younghoon Kim, Barbara Fredrickson, Vladas Pipiras

Abstract: Intensive Longitudinal Data (ILD) is increasingly available to social and behavioral scientists. With this increased availability come new opportunities for modeling and predicting complex biological, behavioral, and physiological phenomena. Despite these new opportunities, psychological researchers have not taken full advantage of a promising opportunity inherent to this data: the potential to forecast psychological processes at the individual level. To address this gap in the literature we present a novel modeling framework that addresses a number of topical challenges and open questions in the psychological literature on modeling dynamic processes. First, how can we model and forecast ILD when the length of individual time series and the number of variables collected are roughly equivalent, or when time series lengths are shorter than what is typically required for time series analyses? Second, how can we best take advantage of the cross-sectional (between-person) information inherent to most ILD scenarios while acknowledging individuals differ both quantitatively (e.g. in parameter magnitude) and qualitatively (e.g. in structural dynamics)? Despite the acknowledged between-person heterogeneity in many psychological processes, is it possible to leverage group-level information to support improved forecasting at the individual level? In the remainder of the manuscript, we attempt to address these and other pressing questions relevant to the forecasting of multiple-subject ILD.

Submitted 4 February, 2022; v1 submitted 9 July, 2020; originally announced July 2020.

Journal ref: Psychometrika 87 (2022) 403-431

arXiv:2004.08483 [pdf, other]

ETC: Encoding Long and Structured Inputs in Transformers

Authors: Joshua Ainslie, Santiago Ontanon, Chris Alberti, Vaclav Cvicek, Zachary Fisher, Philip Pham, Anirudh Ravula, Sumit Sanghai, Qifan Wang, Li Yang

Abstract: Transformer models have advanced the state of the art in many Natural Language Processing (NLP) tasks. In this paper, we present a new Transformer architecture, Extended Transformer Construction (ETC), that addresses two key challenges of standard Transformer architectures, namely scaling input length and encoding structured inputs. To scale attention to longer inputs, we introduce a novel global-local attention mechanism between global tokens and regular input tokens. We also show that combining global-local attention with relative position encodings and a Contrastive Predictive Coding (CPC) pre-training objective allows ETC to encode structured inputs. We achieve state-of-the-art results on four natural language datasets requiring long and/or structured inputs.

Submitted 27 October, 2020; v1 submitted 17 April, 2020; originally announced April 2020.

Comments: Accepted at EMNLP 2020

arXiv:1911.03402 [pdf, other]

doi 10.1007/JHEP03(2020)152

Information Flow in Black Hole Evaporation

Authors: Hong Zhe Chen, Zachary Fisher, Juan Hernandez, Robert C. Myers, Shan-Ming Ruan

Abstract: Recently, new holographic models of black hole evaporation have given fresh insights into the information paradox [arXiv:1905.08255, arXiv:1905.08762, arXiv:1908.10996]. In these models, the black hole evaporates into an auxiliary bath space after a quantum quench, wherein the holographic theory and the bath are joined. One particularly exciting development is the appearance of "ER=EPR"-like wormholes in the (doubly) holographic model of [arXiv:1908.10996]. At late times, the entanglement wedge of the bath includes the interior of the black hole. In this paper, we employ both numerical and analytic methods to study how information about the black hole interior is encoded in the Hawking radiation. In particular, we systematically excise intervals from the bath from the system and study the corresponding Page transition. Repeating this process ad infinitum, we end up with a fractal structure on which the black hole interior is encoded, implementing the uberholography protocol of [arXiv:1612.00017].

Submitted 22 July, 2020; v1 submitted 8 November, 2019; originally announced November 2019.

Comments: 56 pages, 22 figures; v2: published version, minor typos corrected

Journal ref: JHEP03(2020)152

arXiv:1910.07393 [pdf, other]

doi 10.1007/s11336-020-09721-6

An Instrumental Variable Estimator for Mixed Indicators: Analytic Derivatives and Alternative Parameterizations

Authors: Zachary F. Fisher, Kenneth A. Bollen

Abstract: Methodological development of the Model-implied Instrumental Variable (MIIV) estimation framework has proved fruitful over the last three decades. Major milestones include Bollen's (1996) original development of the MIIV estimator and its robustness properties for continuous endogenous variable SEMs, the extension of the MIIV estimator to ordered categorical endogenous variables (Bollen & Maydeu-Olivares, 2007), and the introduction of a Generalized Method of Moments (GMM) estimator (Bollen, Kolenikov & Bauldry, 2014). This paper furthers these developments by making several unique contributions not present in the prior literature: (1) we use matrix calculus to derive the analytic derivatives of the PIV estimator, (2) we extend the PIV estimator to apply to any mixture of binary, ordinal, and continuous variables, (3) we generalize the PIV model to include intercepts and means, (4) we devise a method to input known threshold values for ordinal observed variables, and (5) we enable a general parameterization that permits the estimation of means, variances, and covariances of the underlying variables to use as input into a SEM analysis with PIV. An empirical example illustrates a mixture of continuous variables and ordinal variables with fixed thresholds. We also include a simulation study to compare the performance of this novel estimator to WLSMV.

Submitted 27 April, 2020; v1 submitted 16 October, 2019; originally announced October 2019.

Journal ref: Psychometrika 85 (2020) 660-683

arXiv:1808.10522 [pdf, other]

Optimal Instrument Selection using Bayesian Model Averaging for Model Implied Instrumental Variable Two Stage Least Squares Estimators

Authors: Teague R. Henry, Zachary F. Fisher, Kenneth A. Bollen

Abstract: Model-Implied Instrumental Variable Two-Stage Least Squares (MIIV-2SLS) is a limited information, equation-by-equation, non-iterative estimator for latent variable models. Associated with this estimator are equation specific tests of model misspecification. One issue with equation specific tests is that they lack specificity, in that they indicate that some instruments are problematic without revealing which specific ones. Instruments that are poor predictors of their target variables (weak instruments) are a second potential problem. We propose a novel extension to detect instrument specific tests of misspecification and weak instruments. We term this the Model-Implied Instrumental Variable Two-Stage Bayesian Model Averaging (MIIV-2SBMA) estimator. We evaluate the performance of MIIV-2SBMA against MIIV-2SLS in a simulation study and show that it has comparable performance in terms of parameter estimation. Additionally, our instrument specific overidentification tests developed within the MIIV-2SBMA framework show increased power to detect specific problematic and weak instruments. Finally, we demonstrate MIIV-2SBMA using an empirical example.

Submitted 16 April, 2024; v1 submitted 30 August, 2018; originally announced August 2018.

Comments: 31 pages, 8 figures

arXiv:1706.08257 [pdf, ps, other]

A Semiclassical, Entropic Proof of a Weak Gravity Conjecture

Authors: Zachary Fisher, Christopher J. Mogni

Abstract: We present a semiclassical proof of the weak gravity conjecture in $D = 4$ spacetime dimensions for scalar matter gauged under a $U(1)^N$ gauge group. We compute the non-perturbative macroscopic entropy of a scalar field in an extremal black hole background at the level of linearized backreaction on the metric. The scalar field is assumed to violate or saturate the weak gravity conjecture. The scalar contributes a logarithmic correction to the entropy in the black hole geometry that outgrows the classical contribution. We demonstrate that the entropy of the gauged scalar violates the generalized second law in the limit of large black hole charge. Our result suggests that entropy inequalities may directly discriminate between effective field theories that live in the landscape versus the swampland.

Submitted 17 October, 2017; v1 submitted 26 June, 2017; originally announced June 2017.

Comments: v3: 25 pages, clarified main result of the paper and created section on consistency checks on main argument;v2: 25 pages, significantly expanded discussion on renormalization and backreaction to demonstrate validity of our results

arXiv:1509.02542 [pdf, other]

doi 10.1103/PhysRevD.93.024017

Proof of the Quantum Null Energy Condition

Authors: Raphael Bousso, Zachary Fisher, Jason Koeller, Stefan Leichenauer, Aron C. Wall

Abstract: We prove the Quantum Null Energy Condition (QNEC), a lower bound on the stress tensor in terms of the second variation in a null direction of the entropy of a region. The QNEC arose previously as a consequence of the Quantum Focussing Conjecture, a proposal about quantum gravity. The QNEC itself does not involve gravity, so a proof within quantum field theory is possible. Our proof is somewhat nontrivial, suggesting that there may be alternative formulations of quantum field theory that make the QNEC more manifest. Our proof applies to free and superrenormalizable bosonic field theories, and to any points that lie on stationary null surfaces. An example is Minkowski space, where any point $p$ and null vector $k^a$ define a null plane $N$ (a Rindler horizon). Given any codimension-2 surface $Σ$ that contains $p$ and lies on $N$, one can consider the von Neumann entropy $S_\text{out}$ of the quantum state restricted to one side of $Σ$. A second variation $S_\text{out}^{\prime\prime}$ can be defined by deforming $Σ$ along $N$, in a small neighborhood of $p$ with area $\cal A$. The QNEC states that $\langle T_{kk}(p) \rangle \ge \frac{\hbar}{2π} \lim_{{\cal A}\to 0}S_\text{out}^{ \prime\prime}/{\cal A}$.

Submitted 15 September, 2015; v1 submitted 8 September, 2015; originally announced September 2015.

Comments: 32 pages, 3 figures. v2: references and minor typos

Journal ref: Phys. Rev. D 93, 024017 (2016)

arXiv:1506.02669 [pdf, other]

doi 10.1103/PhysRevD.93.064044

A Quantum Focussing Conjecture

Authors: Raphael Bousso, Zachary Fisher, Stefan Leichenauer, and Aron C. Wall

Abstract: We propose a universal inequality that unifies the Bousso bound with the classical focussing theorem. Given a surface $σ$ that need not lie on a horizon, we define a finite generalized entropy $S_\text{gen}$ as the area of $σ$ in Planck units, plus the von Neumann entropy of its exterior. Given a null congruence $N$ orthogonal to $σ$, the rate of change of $S_\text{gen}$ per unit area defines a quantum expansion. We conjecture that the quantum expansion cannot increase along $N$. This extends the notion of universal focussing to cases where quantum matter may violate the null energy condition. Integrating the conjecture yields a precise version of the Strominger-Thompson Quantum Bousso Bound. Applied to locally parallel light-rays, the conjecture implies a Quantum Null Energy Condition: a lower bound on the stress tensor in terms of the second derivative of the von Neumann entropy. We sketch a proof of this novel relation in quantum field theory.

Submitted 8 June, 2015; originally announced June 2015.

Comments: 45 pages, 6 figures

Journal ref: Phys. Rev. D 93, 064044 (2016)

arXiv:1503.02220 [pdf, other]

robumeta: An R-package for robust variance estimation in meta-analysis

Authors: Zachary Fisher, Elizabeth Tipton

Abstract: Meta-regression models are commonly used to synthesize and compare effect sizes. Unfortunately, traditional meta-regression methods are ill-equipped to handle the complex and often unknown correlations among non-independent effect sizes. Robust variance estimation (RVE) is a recently proposed meta-analytic method for dealing with dependent effect sizes. The robumeta package provides functions for performing robust variance meta-regression using both large and small sample RVE estimators under various weighting schemes. These methods are distribution free and provide valid point estimates, standard errors and hypothesis tests even when the degree and structure of dependence between effect sizes is unknown.

Submitted 7 March, 2015; originally announced March 2015.

arXiv:1406.4545 [pdf, ps, other]

doi 10.1103/PhysRevD.91.084030

Entropy on a null surface for interacting quantum field theories and the Bousso bound

Authors: Raphael Bousso, Horacio Casini, Zachary Fisher, Juan Maldacena

Abstract: We study the vacuum-subtracted von Neumann entropy of a segment on a null plane. We argue that for interacting quantum field theories in more than two dimensions, this entropy has a simple expression in terms of the expectation value of the null components of the stress tensor on the null interval. More explicitly $ΔS = 2π\int d^{d-2}y \int_0^1 dx^+\, g(x^+)\, \langle T_{++}\rangle$, where $g(x^+)$ is a theory-dependent function. This function is constrained by general properties of quantum relative entropy. These constraints are enough to extend our recent free field proof of the quantum Bousso bound to the interacting case. This unusual expression for the entropy as the expectation value of an operator implies that the entropy is equal to the modular Hamiltonian, $ΔS = \langle ΔK \rangle $, where $K$ is the operator in the right hand side. We explain how this equality is compatible with a non-zero value for $ΔS$. Finally, we also compute explicitly the function $g(x^+)$ for theories that have a gravity dual.

Submitted 17 June, 2014; originally announced June 2014.

Comments: 35 pages, 6 figures

Journal ref: Phys. Rev. D 91, 084030 (2015)

arXiv:1404.5635 [pdf, other]

doi 10.1103/PhysRevD.90.044002

Proof of a Quantum Bousso Bound

Authors: Raphael Bousso, Horacio Casini, Zachary Fisher, Juan Maldacena

Abstract: We prove the generalized Covariant Entropy Bound, $ΔS\leq (A-A')/4G\hbar$, for light-sheets with initial area $A$ and final area $A'$. The entropy $ΔS$ is defined as a difference of von Neumann entropies of an arbitrary state and the vacuum, with both states restricted to the light-sheet under consideration. The proof applies to free fields, in the limit where gravitational backreaction is small. We do not assume the null energy condition. In regions where it is violated, we find that the bound is protected by the defining property of light-sheets: that their null generators are nowhere expanding.

Submitted 8 May, 2014; v1 submitted 22 April, 2014; originally announced April 2014.

Comments: 19 pages, 3 figures; v2: references added

Journal ref: Phys. Rev. D 90, 044002 (2014)

arXiv:gr-qc/9911008 [pdf, ps, other]

Scalar mesostatic field with regard for gravitational effects

Authors: I. Z. Fisher

Abstract: (Foreword by translator.) The aim of present translation is to clarify the historically important question who was the pioneer in obtaining of exact static solutions of Einstein equations minimally coupled with scalar field. Usually, people cite the works by Janis, Newman, Winicour (Phys. Rev. Lett. 20 (1968) 878) and others authors whereas it is clear that JNW rediscovered (in other coordinates) the Fisher's solution which was obtained 20 years before, in 1947. Regrettably, up to now I continue to meet many papers (even very fresh ones) whose authors evidently do not know about the Fisher's work, so I try to remove this gap by virtue of present translation and putting it into the LANL e-print archive. (Original Abstract.) It is considered the scalar mesostatic field of a point source with the regard for spacetime curvature caused by this field. For the field with $\mass = 0$ the exact solution of Einstein equations was obtained. It was demonstrated that at small distance from a source the gravitational effects are so large that they cause the significant changes in behavior of meson field. In particular, the total energy of static field diverges logarithmically.

Submitted 3 November, 1999; originally announced November 1999.

Comments: who was the pioneer in obtaining of exact static solutions of Einstein equations coupled with scalar field?

Journal ref: Zh.Eksp.Teor.Fiz. 18 (1948) 636-640

Showing 1–24 of 24 results for author: Fisher, Z