Capturing the Complexity of Human Strategic Decision-Making with Machine Learning

Authors: Jian-Qiao Zhu, Joshua C. Peterson, Benjamin Enke, Thomas L. Griffiths

Abstract: Understanding how people behave in strategic settings--where they make decisions based on their expectations about the behavior of others--is a long-standing problem in the behavioral sciences. We conduct the largest study to date of strategic decision-making in the context of initial play in two-player matrix games, analyzing over 90,000 human decisions across more than 2,400 procedurally generated games that span a much wider space than previous datasets. We show that a deep neural network trained on these data predicts people's choices better than leading theories of strategic behavior, indicating that there is systematic variation that is not explained by those theories. We then modify the network to produce a new, interpretable behavioral model, revealing what the original network learned about people: their ability to optimally respond and their capacity to reason about others are dependent on the complexity of individual games. This context-dependence is critical in explaining deviations from the rational Nash equilibrium, response times, and uncertainty in strategic decisions. More broadly, our results demonstrate how machine learning can be applied beyond prediction to further help generate novel explanations of complex human behavior.

Submitted 14 August, 2024; originally announced August 2024.

arXiv:2405.19313 [pdf, other]

Language Models Trained to do Arithmetic Predict Human Risky and Intertemporal Choice

Authors: Jian-Qiao Zhu, Haijiang Yan, Thomas L. Griffiths

Abstract: The observed similarities in the behavior of humans and Large Language Models (LLMs) have prompted researchers to consider the potential of using LLMs as models of human cognition. However, several significant challenges must be addressed before LLMs can be legitimately regarded as cognitive models. For instance, LLMs are trained on far more data than humans typically encounter, and may have been directly trained on human data in specific cognitive tasks or aligned with human preferences. Consequently, the origins of these behavioral similarities are not well understood. In this paper, we propose a novel way to enhance the utility of LLMs as cognitive models. This approach involves (i) leveraging computationally equivalent tasks that both an LLM and a rational agent need to master for solving a cognitive problem and (ii) examining the specific task distributions required for an LLM to exhibit human-like behaviors. We apply this approach to decision-making -- specifically risky and intertemporal choice -- where the key computationally equivalent task is the arithmetic of expected value calculations. We show that an LLM pretrained on an ecologically valid arithmetic dataset, which we call Arithmetic-GPT, predicts human behavior better than many traditional cognitive models. Pretraining LLMs on ecologically valid arithmetic datasets is sufficient to produce a strong correspondence between these models and human decision-making. Our results also suggest that LLMs used as cognitive models should be carefully investigated via ablation studies of the pretraining data.

Submitted 5 May, 2025; v1 submitted 29 May, 2024; originally announced May 2024.

Journal ref: ICLR 2025

arXiv:2403.05803 [pdf, ps, other]

Semiparametric Inference for Regression-Discontinuity Designs

Authors: Weiwei Jiang, Rong J. B. Zhu

Abstract: Treatment effects in regression discontinuity designs (RDDs) are often estimated using local regression methods. \cite{Hahn:01} demonstrated that the identification of the average treatment effect at the cutoff in RDDs relies on the unconfoundedness assumption and that, without this assumption, only the local average treatment effect at the cutoff can be identified. In this paper, we propose a semiparametric framework tailored for identifying the average treatment effect in RDDs, eliminating the need for the unconfoundedness assumption. Our approach globally conceptualizes the identification as a partially linear modeling problem, with the coefficient of a specified polynomial function of propensity score in the linear component capturing the average treatment effect. This identification result underpins our semiparametric inference for RDDs, employing the $P$-spline method to approximate the nonparametric function and establishing a procedure for conducting inference within this framework. Through theoretical analysis, we demonstrate that our global approach achieves a faster convergence rate compared to the local method. Monte Carlo simulations further confirm that the proposed method consistently outperforms alternatives across various scenarios. Furthermore, applications to real-world datasets illustrate that our global approach can provide more reliable inference for practical problems.

Submitted 24 December, 2024; v1 submitted 9 March, 2024; originally announced March 2024.

arXiv:2402.07960 [pdf]

doi 10.4236/ojbm.2023.113061

An Analysis of the Recovery Path of the Consumer Sector in the Post-Pandemic Era

Authors: Wenbo Lyu, Jiayi Zhu, Yunan Ding, Keming Zhang

Abstract: This paper proposes a referenceable pattern of the recovery of the consumption sector, a new dimension to observe and evaluate the intrinsic value of the consumption sector, and proposes the concept of sensory-based consumption and the ranking of the weights of different categories; creates the concept of digital consumption index, coupled with digital RMB index and China-style digital economy index. Finally, we explain the internal logic of digital consumption as a consumption upgrade tool and a higher valuation target in the context of China's economic performance in 2022 and the Chinese government's policy in 2023, leading to the investment strategy of roller conduction effect.

Submitted 11 February, 2024; originally announced February 2024.

arXiv:2310.04585 [pdf, ps, other]

Interventions Against Machine-Assisted Statistical Discrimination

Authors: John Y. Zhu

Abstract: I study statistical discrimination driven by verifiable beliefs, such as those generated by machine learning, rather than by humans. When beliefs are verifiable, interventions against statistical discrimination can move beyond simple, belief-free designs like affirmative action, to more sophisticated ones, that constrain decision makers based on what they are thinking. I design a belief-contingent intervention I call common identity. I show that it is effective at eliminating equilibrium statistical discrimination, even when training data exhibit the various statistical biases that often plague algorithmic decision problems.

Submitted 19 June, 2025; v1 submitted 6 October, 2023; originally announced October 2023.

arXiv:2306.02584 [pdf, other]

Synthetic Regressing Control Method

Authors: Rong J. B. Zhu

Abstract: Estimating weights in the synthetic control method, typically resulting in sparse weights where only a few control units have non-zero weights, involves an optimization procedure that simultaneously selects and aligns control units to closely match the treated unit. However, this simultaneous selection and alignment of control units may lead to a loss of efficiency. Another concern arising from the aforementioned procedure is its susceptibility to under-fitting due to imperfect pre-treatment fit. It is not uncommon for the linear combination, using nonnegative weights, of pre-treatment period outcomes for the control units to inadequately approximate the pre-treatment outcomes for the treated unit. To address both of these issues, this paper proposes a simple and effective method called Synthetic Regressing Control (SRC). The SRC method begins by performing the univariate linear regression to appropriately align the pre-treatment periods of the control units with the treated unit. Subsequently, an SRC estimator is obtained by synthesizing (taking a weighted average) the fitted controls. To determine the weights in the synthesis procedure, we propose an approach that utilizes a criterion of unbiased risk estimator. Theoretically, we show that the synthesis way is asymptotically optimal in the sense of achieving the lowest possible squared error. Extensive numerical experiments highlight the advantages of the SRC method.

Submitted 23 October, 2023; v1 submitted 5 June, 2023; originally announced June 2023.

arXiv:2301.10541 [pdf, other]

Educational Game on Cryptocurrency Investment: Using Microeconomic Decision Making to Understand Macroeconomics Principles

Authors: Jiasheng Zhu, Luyao Zhang

Abstract: Gamification is an effective strategy for motivating and engaging users, which is grounded in business, marketing, and management by designing games in nongame contexts. Gamifying education, which consists of the design and study of educational games, is an emerging trend. However, the existing classroom games for understanding macroeconomics have weak connections to the microfoundations of individual decision-making. We design an educational game on cryptocurrency investment for understanding macroeconomic concepts in microeconomic decisions. We contribute to the literature by designing game-based learning that engages students in understanding macroeconomics in incentivized individual investment decisions. Our game can be widely implemented in online, in-person, and hybrid classrooms. We also reflect on strategies for improving the user experience for future educational game implementations.

Submitted 9 February, 2023; v1 submitted 25 January, 2023; originally announced January 2023.

ACM Class: J.4; K.3.1; K.3.2; H.5.2

arXiv:2210.16525 [pdf, other]

Spectral Representation Learning for Conditional Moment Models

Authors: Ziyu Wang, Yucen Luo, Yueru Li, Jun Zhu, Bernhard Schölkopf

Abstract: Many problems in causal inference and economics can be formulated in the framework of conditional moment models, which characterize the target function through a collection of conditional moment restrictions. For nonparametric conditional moment models, efficient estimation often relies on preimposed conditions on various measures of ill-posedness of the hypothesis space, which are hard to validate when flexible models are used. In this work, we address this issue by proposing a procedure that automatically learns representations with controlled measures of ill-posedness. Our method approximates a linear representation defined by the spectral decomposition of a conditional expectation operator, which can be used for kernelized estimators and is known to facilitate minimax optimal estimation in certain settings. We show this representation can be efficiently estimated from data, and establish L2 consistency for the resulting estimator. We evaluate the proposed method on proximal causal inference tasks, exhibiting promising performance on high-dimensional, semi-synthetic data.

Submitted 28 December, 2022; v1 submitted 29 October, 2022; originally announced October 2022.

arXiv:2205.10772 [pdf, other]

Fast Instrument Learning with Faster Rates

Authors: Ziyu Wang, Yuhao Zhou, Jun Zhu

Abstract: We investigate nonlinear instrumental variable (IV) regression given high-dimensional instruments. We propose a simple algorithm which combines kernelized IV methods and an arbitrary, adaptive regression algorithm, accessed as a black box. Our algorithm enjoys faster-rate convergence and adapts to the dimensionality of informative latent features, while avoiding an expensive minimax optimization procedure, which has been necessary to establish similar guarantees. It further brings the benefit of flexible machine learning models to quasi-Bayesian uncertainty quantification, likelihood-based model selection, and model averaging. Simulation studies demonstrate the competitive performance of our method.

Submitted 22 October, 2022; v1 submitted 22 May, 2022; originally announced May 2022.

Comments: NeurIPS camera ready. Code available at https://github.com/meta-inf/fil

arXiv:2205.03393 [pdf, other]

The Right Tool for the Job: Matching Active Learning Techniques to Learning Objectives

Authors: Sarah A. Jacobson, Luyao Zhang, Jiasheng Zhu

Abstract: Active learning comprises many varied techniques that engage students actively in the construction of their understanding. Because of this variation, different active learning techniques may be best suited to achieving different learning objectives. We study students' perceptions of a set of active learning techniques (including a Python simulation and an interactive game) and some traditional techniques (like lecture). We find that students felt they engaged fairly actively with all of the techniques, though more with those with a heavy grade weight and some of the active learning techniques, and they reported enjoying the active learning techniques the most except for an assignment that required soliciting peer advice on a research idea. All of the techniques were rated as relatively effective for achieving each of six learning objectives, but to varying extents. The most traditional techniques like exams were rated highest for achieving an objective associated with lower order cognitive skills, remembering concepts. In contrast, some active learning techniques like class presentations and the Python simulation were rated highest for achieving objectives related to higher order cognitive skills, including learning to conduct research, though lectures also performed surprisingly well for these objectives. Other technique-objective matches are intuitive; for example, the debate is rated highly for understanding pros and cons of an issue, and small group discussion is rated highly for collaborative learning. Our results support the idea that different teaching techniques are best suited for different outcomes, which implies that a mix of techniques may be optimal in course design.

Submitted 12 July, 2022; v1 submitted 6 May, 2022; originally announced May 2022.

arXiv:2108.10453

Continuous Treatment Recommendation with Deep Survival Dose Response Function

Authors: Jie Zhu, Blanca Gallego

Abstract: We propose a general formulation for continuous treatment recommendation problems in settings with clinical survival data, which we call the Deep Survival Dose Response Function (DeepSDRF). That is, we consider the problem of learning the conditional average dose response (CADR) function solely from historical data in which observed factors (confounders) affect both observed treatment and time-to-event outcomes. The estimated treatment effect from DeepSDRF enables us to develop recommender algorithms with the correction for selection bias. We compared two recommender approaches based on random search and reinforcement learning and found similar performance in terms of patient outcome. We tested the DeepSDRF and the corresponding recommender on extensive simulation studies and the eICU Research Institute (eRI) database. To the best of our knowledge, this is the first time that causal models are used to address the continuous treatment effect with observational data in a medical context.

Submitted 26 September, 2023; v1 submitted 23 August, 2021; originally announced August 2021.

Comments: results is outdated

arXiv:2104.08213 [pdf, other]

doi 10.1098/rsif.2021.0662

The spatial dissemination of COVID-19 and associated socio-economic consequences

Authors: Yafei Zhang, Lin Wang, Jonathan J. H. Zhu, Xiaofan Wang

Abstract: The ongoing coronavirus disease 2019 (COVID-19) pandemic has wreaked havoc worldwide with millions of lives claimed, human travel restricted and economic development halted. Leveraging city-level mobility and case data, our analysis shows that the spatial dissemination of COVID-19 can be well explained by a local diffusion process in the mobility network rather than a global diffusion process, indicating the effectiveness of the implemented disease prevention and control measures. Based on the constructed case prediction model, it is estimated that there could be distinct social consequences if the COVID-19 outbreak happened in different areas. During the epidemic control period, human mobility experienced substantial reductions and the mobility network underwent remarkable local and global structural changes toward containing the spread of COVID-19. Our work has important implications for the mitigation of disease and the evaluation of the socio-economic consequences of COVID-19 on society.

Submitted 28 June, 2022; v1 submitted 16 April, 2021; originally announced April 2021.

Comments: 9 pages, 4 figures

Journal ref: J. R. Soc. Interface. 19 (2022) 20210662

Showing 1–12 of 12 results for author: Zhu, J