Search | arXiv e-print repository

Machine-learning Growth at Risk

Authors: Tobias Adrian, Hongqi Chen, Max-Sebastian Dovì, Ji Hyung Lee

Abstract: We analyse growth vulnerabilities in the US using quantile partial correlation regression, a selection-based machine-learning method that achieves model selection consistency under time series. We find that downside risk is primarily driven by financial, labour-market, and housing variables, with their importance changing over time. Decomposing downside risk into its individual components, we cons… ▽ More We analyse growth vulnerabilities in the US using quantile partial correlation regression, a selection-based machine-learning method that achieves model selection consistency under time series. We find that downside risk is primarily driven by financial, labour-market, and housing variables, with their importance changing over time. Decomposing downside risk into its individual components, we construct sector-specific indices that predict it, while controlling for information from other sectors, thereby isolating the downside risks emanating from each sector. △ Less

Submitted 31 May, 2025; originally announced June 2025.

arXiv:2505.22873 [pdf, ps, other]

Forecasting Residential Heating and Electricity Demand with Scalable, High-Resolution, Open-Source Models

Authors: Stephen J. Lee, Cailinn Drouin

Abstract: We present a novel framework for high-resolution forecasting of residential heating and electricity demand using probabilistic deep learning models. We focus specifically on providing hourly building-level electricity and heating demand forecasts for the residential sector. Leveraging multimodal building-level information -- including data on building footprint areas, heights, nearby building dens… ▽ More We present a novel framework for high-resolution forecasting of residential heating and electricity demand using probabilistic deep learning models. We focus specifically on providing hourly building-level electricity and heating demand forecasts for the residential sector. Leveraging multimodal building-level information -- including data on building footprint areas, heights, nearby building density, nearby building size, land use patterns, and high-resolution weather data -- and probabilistic modeling, our methods provide granular insights into demand heterogeneity. Validation at the building level underscores a step change improvement in performance relative to NREL's ResStock model, which has emerged as a research community standard for residential heating and electricity demand characterization. In building-level heating and electricity estimation backtests, our probabilistic models respectively achieve RMSE scores 18.3\% and 35.1\% lower than those based on ResStock. By offering an open-source, scalable, high-resolution platform for demand estimation and forecasting, this research advances the tools available for policymakers and grid planners, contributing to the broader effort to decarbonize the U.S. building stock and meeting climate objectives. △ Less

Submitted 28 May, 2025; originally announced May 2025.

Comments: 16 pages, 4 figures, 1 table, preprint

arXiv:2502.05353 [pdf, other]

Point-Identifying Semiparametric Sample Selection Models with No Excluded Variable

Authors: Dongwoo Kim, Young Jun Lee

Abstract: Sample selection is pervasive in applied economic studies. This paper develops semiparametric selection models that achieve point identification without relying on exclusion restrictions, an assumption long believed necessary for identification in semiparametric selection models. Our identification conditions require at least one continuously distributed covariate and certain nonlinearity in the s… ▽ More Sample selection is pervasive in applied economic studies. This paper develops semiparametric selection models that achieve point identification without relying on exclusion restrictions, an assumption long believed necessary for identification in semiparametric selection models. Our identification conditions require at least one continuously distributed covariate and certain nonlinearity in the selection process. We propose a two-step plug-in estimator that is root-n-consistent, asymptotically normal, and computationally straightforward (readily available in statistical software), allowing for heteroskedasticity. Our approach provides a middle ground between Lee (2009)'s nonparametric bounds and Honoré and Hu (2020)'s linear selection bounds, while ensuring point identification. Simulation evidence confirms its excellent finite-sample performance. We apply our method to estimate the racial and gender wage disparity using data from the US Current Population Survey. Our estimates tend to lie outside the Honoré and Hu bounds. △ Less

Submitted 7 February, 2025; originally announced February 2025.

arXiv:2412.14778 [pdf, ps, other]

Testing linearity of spatial interaction functions à la Ramsey

Authors: Abhimanyu Gupta, Jungyoon Lee, Francesca Rossi

Abstract: We propose a computationally straightforward test for the linearity of a spatial interaction function. Such functions arise commonly, either as practitioner imposed specifications or due to optimizing behaviour by agents. Our conditional heteroskedasticity robust test is nonparametric, but based on the Lagrange Multiplier principle and reminiscent of the Ramsey RESET approach. This entails estimat… ▽ More We propose a computationally straightforward test for the linearity of a spatial interaction function. Such functions arise commonly, either as practitioner imposed specifications or due to optimizing behaviour by agents. Our conditional heteroskedasticity robust test is nonparametric, but based on the Lagrange Multiplier principle and reminiscent of the Ramsey RESET approach. This entails estimation only under the null hypothesis, which yields an easy to estimate linear spatial autoregressive model. Monte Carlo simulations show excellent size control and power. An empirical study with Finnish data illustrates the test's practical usefulness, shedding light on debates on the presence of tax competition among neighbouring municipalities. △ Less

Submitted 29 April, 2025; v1 submitted 19 December, 2024; originally announced December 2024.

arXiv:2410.15097 [pdf, ps, other]

Predictive Quantile Regression with High-Dimensional Predictors: The Variable Screening Approach

Authors: Hongqi Chen, Ji Hyung Lee

Abstract: This paper advances a variable screening approach to enhance conditional quantile forecasts using high-dimensional predictors. We have refined and augmented the quantile partial correlation (QPC)-based variable screening proposed by Ma et al. (2017) to accommodate $β$-mixing time-series data. Our approach is inclusive of i.i.d scenarios but introduces new convergence bounds for time-series context… ▽ More This paper advances a variable screening approach to enhance conditional quantile forecasts using high-dimensional predictors. We have refined and augmented the quantile partial correlation (QPC)-based variable screening proposed by Ma et al. (2017) to accommodate $β$-mixing time-series data. Our approach is inclusive of i.i.d scenarios but introduces new convergence bounds for time-series contexts, suggesting the performance of QPC-based screening is influenced by the degree of time-series dependence. Through Monte Carlo simulations, we validate the effectiveness of QPC under weak dependence. Our empirical assessment of variable selection for growth-at-risk (GaR) forecasting underscores the method's advantages, revealing that specific labor market determinants play a pivotal role in forecasting GaR. While prior empirical research has predominantly considered a limited set of predictors, we employ the comprehensive Fred-QD dataset, retaining a richer breadth of information for GaR forecasts. △ Less

Submitted 19 October, 2024; originally announced October 2024.

arXiv:2409.10030 [pdf, other]

Econometric Inference for High Dimensional Predictive Regressions

Authors: Zhan Gao, Ji Hyung Lee, Ziwei Mei, Zhentao Shi

Abstract: LASSO introduces shrinkage bias into estimated coefficients, which can adversely affect the desirable asymptotic normality and invalidate the standard inferential procedure based on the $t$-statistic. The desparsified LASSO has emerged as a well-known remedy for this issue. In the context of high dimensional predictive regression, the desparsified LASSO faces an additional challenge: the Stambaugh… ▽ More LASSO introduces shrinkage bias into estimated coefficients, which can adversely affect the desirable asymptotic normality and invalidate the standard inferential procedure based on the $t$-statistic. The desparsified LASSO has emerged as a well-known remedy for this issue. In the context of high dimensional predictive regression, the desparsified LASSO faces an additional challenge: the Stambaugh bias arising from nonstationary regressors. To restore the standard inferential procedure, we propose a novel estimator called IVX-desparsified LASSO (XDlasso). XDlasso eliminates the shrinkage bias and the Stambaugh bias simultaneously and does not require prior knowledge about the identities of nonstationary and stationary regressors. We establish the asymptotic properties of XDlasso for hypothesis testing, and our theoretical findings are supported by Monte Carlo simulations. Applying our method to real-world applications from the FRED-MD database -- which includes a rich set of control variables -- we investigate two important empirical questions: (i) the predictability of the U.S. stock returns based on the earnings-price ratio, and (ii) the predictability of the U.S. inflation using the unemployment rate. △ Less

Submitted 9 November, 2024; v1 submitted 16 September, 2024; originally announced September 2024.

arXiv:2406.00669 [pdf]

Multi-technology co-optimization approach for sustainable hydrogen and electricity supply chains considering variability and demand scale

Authors: Sunwoo Kim, Joungho Park, Jay H. Lee

Abstract: In the pursuit of a carbon-neutral future, hydrogen emerges as a pivotal element, serving as a carbon-free energy carrier and feedstock. As efforts to decarbonize sectors such as heating and transportation intensify, understanding and navigating through the dynamics of hydrogen demand expansion becomes critical. Transitioning to hydrogen economy is complicated by varying regional scales and types… ▽ More In the pursuit of a carbon-neutral future, hydrogen emerges as a pivotal element, serving as a carbon-free energy carrier and feedstock. As efforts to decarbonize sectors such as heating and transportation intensify, understanding and navigating through the dynamics of hydrogen demand expansion becomes critical. Transitioning to hydrogen economy is complicated by varying regional scales and types of hydrogen demand, with forecasts indicating a rise in variable demand that calls for diverse production technologies. Currently, steam methane reforming is prevalent, but its significant carbon emissions make a shift to cleaner alternatives like blue and green hydrogen imperative. Each production method possesses distinct characteristics, necessitating a thorough exploration and co-optimization with electricity supply chains as well as carbon capture, utilization, and storage systems. Our study fills existing research gaps by introducing a superstructure optimization framework that accommodates various demand scenarios and technologies. Through case studies in California, we underscore the critical role of demand profiles in shaping the optimal configurations and economics of supply chains and emphasize the need for diversified portfolios and co-optimization to facilitate sustainable energy transitions. △ Less

Submitted 2 June, 2024; originally announced June 2024.

arXiv:2406.00665 [pdf]

Integrating solid direct air capture systems with green hydrogen production: Economic synergy of sector coupling

Authors: Sunwoo Kim, Joungho Park, Jay H. Lee

Abstract: In the global pursuit of sustainable energy solutions, mitigating carbon dioxide (CO2) emissions stands as a pivotal challenge. With escalating atmospheric CO2 levels, the imperative of direct air capture (DAC) systems becomes evident. Simultaneously, green hydrogen (GH) emerges as a pivotal medium for renewable energy. Nevertheless, the substantial expenses associated with these technologies impe… ▽ More In the global pursuit of sustainable energy solutions, mitigating carbon dioxide (CO2) emissions stands as a pivotal challenge. With escalating atmospheric CO2 levels, the imperative of direct air capture (DAC) systems becomes evident. Simultaneously, green hydrogen (GH) emerges as a pivotal medium for renewable energy. Nevertheless, the substantial expenses associated with these technologies impede widespread adoption, primarily due to significant installation costs and underutilized operational advantages when deployed independently. Integration through sector coupling enhances system efficiency and sustainability, while shared power sources and energy storage devices offer additional economic benefits. In this study, we assess the economic viability of polymer electrolyte membrane electrolyzers versus alkaline electrolyzers within the context of sector coupling. Our findings indicate that combining GH production with solid DAC systems yields significant economic advantages, with approximately a 10% improvement for PEM electrolyzers and a 20% enhancement for alkaline electrolyzers. These results highlight a substantial opportunity to improve the efficiency and economic viability of renewable energy and green hydrogen initiatives, thereby facilitating the broader adoption of cleaner technologies. △ Less

Submitted 19 October, 2024; v1 submitted 2 June, 2024; originally announced June 2024.

Comments: We have corrected the errors from the previous version of the manuscript and uploaded the updated version

arXiv:2405.18089 [pdf, other]

Semi-nonparametric models of multidimensional matching: an optimal transport approach

Authors: Dongwoo Kim, Young Jun Lee

Abstract: This paper proposes empirically tractable multidimensional matching models, focusing on worker-job matching. We generalize the parametric model proposed by Lindenlaub (2017), which relies on the assumption of joint normality of observed characteristics of workers and jobs. In our paper, we allow unrestricted distributions of characteristics and show identification of the production technology, and… ▽ More This paper proposes empirically tractable multidimensional matching models, focusing on worker-job matching. We generalize the parametric model proposed by Lindenlaub (2017), which relies on the assumption of joint normality of observed characteristics of workers and jobs. In our paper, we allow unrestricted distributions of characteristics and show identification of the production technology, and equilibrium wage and matching functions using tools from optimal transport theory. Given identification, we propose efficient, consistent, asymptotically normal sieve estimators. We revisit Lindenlaub's empirical application and show that, between 1990 and 2010, the U.S. economy experienced much larger technological progress favoring cognitive abilities than the original findings suggest. Furthermore, our flexible model specifications provide a significantly better fit for patterns in the evolution of wage inequality. △ Less

Submitted 28 May, 2024; originally announced May 2024.

arXiv:2405.13241 [pdf, other]

Selecting Experimental Sites for External Validity

Authors: Michael Gechter, Keisuke Hirano, Jean Lee, Mahreen Mahmud, Orville Mondal, Jonathan Morduch, Saravana Ravindran, Abu S. Shonchoy

Abstract: Policy decisions often depend on evidence generated elsewhere. We take a Bayesian decision-theoretic approach to choosing where to experiment to optimize external validity. We frame external validity through a policy lens, developing a prior specification for the joint distribution of site-level treatment effects using a microeconometric structural model and allowing for other sources of heterogen… ▽ More Policy decisions often depend on evidence generated elsewhere. We take a Bayesian decision-theoretic approach to choosing where to experiment to optimize external validity. We frame external validity through a policy lens, developing a prior specification for the joint distribution of site-level treatment effects using a microeconometric structural model and allowing for other sources of heterogeneity. With data from South Asia, we show that, relative to basing policies on experiments in optimal sites, large efficiency losses result from instead using evidence from randomly-selected sites or, conversely, from sites with the largest expected treatment effects. △ Less

Submitted 21 May, 2024; originally announced May 2024.

arXiv:2405.02547 [pdf, other]

Crypto Market Analysis & Real-Estate Business Protocol Proposal | Application of Ethereum Blockchain

Authors: Sid Bhatia, Samuel Gedal, Himaya Jeyakumar Grace Lee, Ravinder Chopra, Daniel Roman, Shrijani Chakroborty

Abstract: This paper examines the dynamics of the cryptocurrency market and proposes a novel blockchain-based protocol for real estate transactions. Our analysis includes a detailed review of price trends, volatility, and correlations within the cryptocurrency market, focusing on major assets like Bitcoin, Ethereum, and Tether. We provide a critical assessment of the impact of significant market events, suc… ▽ More This paper examines the dynamics of the cryptocurrency market and proposes a novel blockchain-based protocol for real estate transactions. Our analysis includes a detailed review of price trends, volatility, and correlations within the cryptocurrency market, focusing on major assets like Bitcoin, Ethereum, and Tether. We provide a critical assessment of the impact of significant market events, such as the FTX bankruptcy, highlighting the vulnerabilities and resilience of the crypto market. The study also explores the potential of blockchain technology to innovate real estate transactions by enabling the secure and transparent handling of property deeds without traditional intermediaries. We introduce a blockchain protocol that reduces transaction costs, enhances security, and increases transparency, making real estate transactions more accessible and efficient. Our proposal aims to leverage the inherent benefits of blockchain to address real-world challenges in real estate transactions, providing a scalable and secure platform for property sales in a global market. △ Less

Submitted 3 May, 2024; originally announced May 2024.

arXiv:2311.11231 [pdf, other]

Workforce pDEI: Productivity Coupled with DEI

Authors: Lanqing Du, Jinwook Lee

Abstract: Ranking pertaining to the human-centered tasks -- underscoring their paramount significance in these domains such as evaluation and hiring process -- exhibits widespread prevalence across various industries. Consequently, decision-makers are taking proactive measurements to promote diversity, underscore equity, and advance inclusion. Their unwavering commitment to these ideals emanates from the fo… ▽ More Ranking pertaining to the human-centered tasks -- underscoring their paramount significance in these domains such as evaluation and hiring process -- exhibits widespread prevalence across various industries. Consequently, decision-makers are taking proactive measurements to promote diversity, underscore equity, and advance inclusion. Their unwavering commitment to these ideals emanates from the following convictions: (i) Diversity encompasses a broad spectrum of differences; (ii) Equity involves the assurance of equitable opportunities; and (iii) Inclusion revolves around the cultivation of a sense of value and impartiality, concurrently empowering individuals. Data-driven AI tools have been used for screening and ranking processes. However, there is a growing concern that the presence of pre-existing biases in databases may be exacerbated, particularly in the context of imbalanced datasets or the black-box-schema. In this research, we propose a model-driven recruitment decision support tool that addresses fairness together with equity in the screening phase. We introduce the term ``pDEI" to represent the output-input oriented production efficiency adjusted by socioeconomic disparity. Taking into account various aspects of interpreting socioeconomic disparity, our goals are (i) maximizing the relative efficiency of underrepresented groups and (ii) understanding how socioeconomic disparity affects the cultivation of a DEI-positive workplace. △ Less

Submitted 1 December, 2023; v1 submitted 19 November, 2023; originally announced November 2023.

arXiv:2308.09009 [pdf, other]

Closed-form approximations of moments and densities of continuous-time Markov models

Authors: Dennis Kristensen, Young Jun Lee, Antonio Mele

Abstract: This paper develops power series expansions of a general class of moment functions, including transition densities and option prices, of continuous-time Markov processes, including jump--diffusions. The proposed expansions extend the ones in Kristensen and Mele (2011) to cover general Markov processes. We demonstrate that the class of expansions nests the transition density and option price expans… ▽ More This paper develops power series expansions of a general class of moment functions, including transition densities and option prices, of continuous-time Markov processes, including jump--diffusions. The proposed expansions extend the ones in Kristensen and Mele (2011) to cover general Markov processes. We demonstrate that the class of expansions nests the transition density and option price expansions developed in Yang, Chen, and Wan (2019) and Wan and Yang (2021) as special cases, thereby connecting seemingly different ideas in a unified framework. We show how the general expansion can be implemented for fully general jump--diffusion models. We provide a new theory for the validity of the expansions which shows that series expansions are not guaranteed to converge as more terms are added in general. Thus, these methods should be used with caution. At the same time, the numerical studies in this paper demonstrate good performance of the proposed implementation in practice when a small number of terms are included. △ Less

Submitted 17 August, 2023; originally announced August 2023.

arXiv:2305.01209 [pdf, other]

Cooperation and Cognition in Social Networks

Authors: Edoardo Gallo, Joseph Lee, Yohanes Eko Riyanto, Erwin Wong

Abstract: Social networks can sustain cooperation by amplifying the consequences of a single defection through a cascade of relationship losses. Building on Jackson et al. (2012), we introduce a novel robustness notion to characterize low cognitive complexity (LCC) networks - a subset of equilibrium networks that imposes a minimal cognitive burden to calculate and comprehend the consequences of defection. W… ▽ More Social networks can sustain cooperation by amplifying the consequences of a single defection through a cascade of relationship losses. Building on Jackson et al. (2012), we introduce a novel robustness notion to characterize low cognitive complexity (LCC) networks - a subset of equilibrium networks that imposes a minimal cognitive burden to calculate and comprehend the consequences of defection. We test our theory in a laboratory experiment and find that cooperation is higher in equilibrium than in non-equilibrium networks. Within equilibrium networks, LCC networks exhibit higher levels of cooperation than non-LCC networks. Learning is essential for the emergence of equilibrium play. △ Less

Submitted 2 May, 2023; originally announced May 2023.

arXiv:2304.00651 [pdf, other]

Implicit Bias against a Capitalistic Society Predicts Market Earnings

Authors: Syngjoo Choi, Kyu Sup Hahn, Byung-Yeon Kim, Eungik Lee, Jungmin Lee, Sokbae Lee

Abstract: This paper investigates whether ideological indoctrination by living in a communist regime relates to low economic performance in a market economy. We recruit North Korean refugees and measure their implicit bias against South Korea by using the Implicit Association Test. Conducting double auction and bilateral bargaining market experiments, we find that North Korean refugees with a larger bias ag… ▽ More This paper investigates whether ideological indoctrination by living in a communist regime relates to low economic performance in a market economy. We recruit North Korean refugees and measure their implicit bias against South Korea by using the Implicit Association Test. Conducting double auction and bilateral bargaining market experiments, we find that North Korean refugees with a larger bias against the capitalistic society have lower expectations about their earning potential, exhibit trading behavior with lower target profits, and earn less profits. These associations are robust to conditioning on correlates of preferences, human capital, and assimilation experiences. △ Less

Submitted 2 April, 2023; originally announced April 2023.

arXiv:2302.02221 [pdf]

A quantification of how much crypto-miners are driving up the wholesale cost of energy in Texas

Authors: Jangho Lee, Lily Wu, Andrew E. Dessler

Abstract: The use of energy by cryptocurrency mining comes not just with an environmental cost but also an economic one through increases in electricity prices for other consumers. Here we investigate the increase in wholesale price on Texas ERCOT grid due to energy consumption from cryptocurrency mining. For every GW of cryptocurrency mining load on the grid, we find that the wholesale price of electricity… ▽ More The use of energy by cryptocurrency mining comes not just with an environmental cost but also an economic one through increases in electricity prices for other consumers. Here we investigate the increase in wholesale price on Texas ERCOT grid due to energy consumption from cryptocurrency mining. For every GW of cryptocurrency mining load on the grid, we find that the wholesale price of electricity on the ERCOT grid increases by 2 per Cent. Given that todays cryptocurrency mining load on the ERCOT grid is around 1 GW, it suggests that wholesale prices have already risen this amount. There are 27 GW of mining load waiting to be hooked up to the ERCOT grid. If cryptocurrency mining increases rapidly, the price of energy in Texas could skyrocket. △ Less

Submitted 4 February, 2023; originally announced February 2023.

arXiv:2301.08847 [pdf]

Learning Production Process Heterogeneity Across Industries: Implications of Deep Learning for Corporate M&A Decisions

Authors: Jongsub Lee, Hayong Yun

Abstract: Using deep learning techniques, we introduce a novel measure for production process heterogeneity across industries. For each pair of industries during 1990-2021, we estimate the functional distance between two industries' production processes via deep neural network. Our estimates uncover the underlying factors and weights reflected in the multi-stage production decision tree in each industry. We… ▽ More Using deep learning techniques, we introduce a novel measure for production process heterogeneity across industries. For each pair of industries during 1990-2021, we estimate the functional distance between two industries' production processes via deep neural network. Our estimates uncover the underlying factors and weights reflected in the multi-stage production decision tree in each industry. We find that the greater the functional distance between two industries' production processes, the lower are the number of M&As, deal completion rates, announcement returns, and post-M&A survival likelihood. Our results highlight the importance of structural heterogeneity in production technology to firms' business integration decisions. △ Less

Submitted 20 January, 2023; originally announced January 2023.

arXiv:2209.05684 [pdf]

doi 10.13140/RG.2.2.13099.52007

Moral Hazard on Productivity Among Work-From-Home Workers Amid the COVID-19 Pandemic

Authors: Jieun Lee

Abstract: After the outbreak of COVID 19, firms appear to monitor Work From Home (WFH) workers more than ever out of anxiety that workers may shirk at home or implement moral hazard at home. Using the Survey of Working Arrangements and Attitudes (SWAA, Barrero et al., 2021), the evidence of WFH workers' ex post moral hazard as well as its specific aspects are examined. The results show that the ex post mora… ▽ More After the outbreak of COVID 19, firms appear to monitor Work From Home (WFH) workers more than ever out of anxiety that workers may shirk at home or implement moral hazard at home. Using the Survey of Working Arrangements and Attitudes (SWAA, Barrero et al., 2021), the evidence of WFH workers' ex post moral hazard as well as its specific aspects are examined. The results show that the ex post moral hazard among the WFH workers is generally found. Interestingly, however, the moral hazard on specific type of productivity, efficiency, is not detected for the workers at firms with WFH friendly policy for long term. Moreover, the advantages & challenges for the WFH culture report that workers with health or disability issues improve their productivity, whereas certain conditions specific to the WFH environment must be met. △ Less

Submitted 12 September, 2022; originally announced September 2022.

arXiv:2209.05563 [pdf, other]

doi 10.13140/RG.2.2.33923.58409

Testing Endogeneity of Spatial Weights Matrices in Spatial Dynamic Panel Data Models

Authors: Jieun Lee

Abstract: I propose Robust Rao's Score (RS) test statistic to determine endogeneity of spatial weights matrices in a spatial dynamic panel data (SDPD) model (Qu, Lee, and Yu, 2017). I firstly introduce the bias-corrected score function since the score function is not centered around zero due to the two-way fixed effects. I further adjust score functions to rectify the over-rejection of the null hypothesis u… ▽ More I propose Robust Rao's Score (RS) test statistic to determine endogeneity of spatial weights matrices in a spatial dynamic panel data (SDPD) model (Qu, Lee, and Yu, 2017). I firstly introduce the bias-corrected score function since the score function is not centered around zero due to the two-way fixed effects. I further adjust score functions to rectify the over-rejection of the null hypothesis under a presence of local misspecification in contemporaneous dependence over space, dependence over time, or spatial time dependence. I then derive the explicit forms of our test statistic. A Monte Carlo simulation supports the analytics and shows nice finite sample properties. Finally, an empirical illustration is provided using data from Penn World Table version 6.1. △ Less

Submitted 12 September, 2022; originally announced September 2022.

arXiv:2209.05562 [pdf, other]

doi 10.13140/RG.2.2.20252.77443

Evidence and Strategy on Economic Distance in Spatially Augmented Solow-Swan Growth Model

Authors: Jieun Lee

Abstract: Economists' interests in growth theory have a very long history (Harrod, 1939; Domar, 1946; Solow, 1956; Swan 1956; Mankiw, Romer, and Weil, 1992). Recently, starting from the neoclassical growth model, Ertur and Koch (2007) developed the spatially augmented Solow-Swan growth model with the exogenous spatial weights matrices ($W$). While the exogenous $W$ assumption could be true only with the geo… ▽ More Economists' interests in growth theory have a very long history (Harrod, 1939; Domar, 1946; Solow, 1956; Swan 1956; Mankiw, Romer, and Weil, 1992). Recently, starting from the neoclassical growth model, Ertur and Koch (2007) developed the spatially augmented Solow-Swan growth model with the exogenous spatial weights matrices ($W$). While the exogenous $W$ assumption could be true only with the geographical/physical distance, it may not be true when economic/social distances play a role. Using Penn World Table version 7.1, which covers year 1960-2010, I conducted the robust Rao's score test (Bera, Dogan, and Taspinar, 2018) to determine if $W$ is endogeonus and used the maximum likelihood estimation (Qu and Lee, 2015). The key finding is that the significance and positive effects of physical capital externalities and spatial externalities (technological interdependence) in Ertur and Koch (2007) were no longer found with the exogenous $W$, but still they were with the endogenous $W$ models. I also found an empirical strategy on which economic distance to use when the data recently has been under heavy shocks of the worldwide financial crises during year 1996-2010. △ Less

Submitted 12 September, 2022; originally announced September 2022.

arXiv:2207.14727 [pdf, other]

Tangential Wasserstein Projections

Authors: Florian Gunsilius, Meng Hsuan Hsieh, Myung Jin Lee

Abstract: We develop a notion of projections between sets of probability measures using the geometric properties of the 2-Wasserstein space. It is designed for general multivariate probability measures, is computationally efficient to implement, and provides a unique solution in regular settings. The idea is to work on regular tangent cones of the Wasserstein space using generalized geodesics. Its structure… ▽ More We develop a notion of projections between sets of probability measures using the geometric properties of the 2-Wasserstein space. It is designed for general multivariate probability measures, is computationally efficient to implement, and provides a unique solution in regular settings. The idea is to work on regular tangent cones of the Wasserstein space using generalized geodesics. Its structure and computational properties make the method applicable in a variety of settings, from causal inference to the analysis of object data. An application to estimating causal effects yields a generalization of the notion of synthetic controls to multivariate data with individual-level heterogeneity, as well as a way to estimate optimal weights jointly over all time periods. △ Less

Submitted 2 August, 2022; v1 submitted 29 July, 2022; originally announced July 2022.

arXiv:2206.04257 [pdf, other]

Capital and Labor Income Pareto Exponents in the United States, 1916-2019

Authors: Ji Hyung Lee, Yuya Sasaki, Alexis Akira Toda, Yulong Wang

Abstract: Accurately estimating income Pareto exponents is challenging due to limitations in data availability and the applicability of statistical methods. Using tabulated summaries of incomes from tax authorities and a recent estimation method, we estimate income Pareto exponents in U.S. for 1916-2019. We find that during the past three decades, the capital and labor income Pareto exponents have been stab… ▽ More Accurately estimating income Pareto exponents is challenging due to limitations in data availability and the applicability of statistical methods. Using tabulated summaries of incomes from tax authorities and a recent estimation method, we estimate income Pareto exponents in U.S. for 1916-2019. We find that during the past three decades, the capital and labor income Pareto exponents have been stable at around 1.2 and 2. Our findings suggest that the top tail income and wealth inequality is higher and wealthy agents have twice as large an impact on the aggregate economy than previously thought but there is no clear trend post-1985. △ Less

Submitted 9 June, 2022; originally announced June 2022.

arXiv:2204.05480 [pdf, other]

doi 10.1016/j.jeconom.2023.105568

Tuning Parameter-Free Nonparametric Density Estimation from Tabulated Summary Data

Authors: Ji Hyung Lee, Yuya Sasaki, Alexis Akira Toda, Yulong Wang

Abstract: Administrative data are often easier to access as tabulated summaries than in the original format due to confidentiality concerns. Motivated by this practical feature, we propose a novel nonparametric density estimation method from tabulated summary data based on maximum entropy and prove its strong uniform consistency. Unlike existing kernel-based estimators, our estimator is free from tuning par… ▽ More Administrative data are often easier to access as tabulated summaries than in the original format due to confidentiality concerns. Motivated by this practical feature, we propose a novel nonparametric density estimation method from tabulated summary data based on maximum entropy and prove its strong uniform consistency. Unlike existing kernel-based estimators, our estimator is free from tuning parameters and admits a closed-form density that is convenient for post-estimation analysis. We apply the proposed method to the tabulated summary data of the U.S. tax returns to estimate the income distribution. △ Less

Submitted 17 May, 2023; v1 submitted 11 April, 2022; originally announced April 2022.

arXiv:2203.00349 [pdf, other]

Minimax Risk in Estimating Kink Threshold and Testing Continuity

Authors: Javier Hidalgo, Heejun Lee, Jungyoon Lee, Myung Hwan Seo

Abstract: We derive a risk lower bound in estimating the threshold parameter without knowing whether the threshold regression model is continuous or not. The bound goes to zero as the sample size $ n $ grows only at the cube root rate. Motivated by this finding, we develop a continuity test for the threshold regression model and a bootstrap to compute its \textit{p}-values. The validity of the bootstrap is… ▽ More We derive a risk lower bound in estimating the threshold parameter without knowing whether the threshold regression model is continuous or not. The bound goes to zero as the sample size $ n $ grows only at the cube root rate. Motivated by this finding, we develop a continuity test for the threshold regression model and a bootstrap to compute its \textit{p}-values. The validity of the bootstrap is established, and its finite sample property is explored through Monte Carlo simulations. △ Less

Submitted 1 March, 2022; originally announced March 2022.

Comments: arXiv admin note: text overlap with arXiv:1702.00836

arXiv:2109.06453 [pdf, other]

Vaccination strategies and transmission of COVID-19: evidence across advanced countries

Authors: Dongwoo Kim, Young Jun Lee

Abstract: Given limited supply of approved vaccines and constrained medical resources, design of a vaccination strategy to control a pandemic is an economic problem. We use time-series and panel methods with real-world country-level data to estimate effects on COVID-19 cases and deaths of two key elements of mass vaccination - time between doses and vaccine type. We find that new infections and deaths are b… ▽ More Given limited supply of approved vaccines and constrained medical resources, design of a vaccination strategy to control a pandemic is an economic problem. We use time-series and panel methods with real-world country-level data to estimate effects on COVID-19 cases and deaths of two key elements of mass vaccination - time between doses and vaccine type. We find that new infections and deaths are both significantly negatively associated with the fraction of the population vaccinated with at least one dose. Conditional on first-dose coverage, an increased fraction with two doses appears to offer no further reductions in new cases and deaths. For vaccines from China, however, we find significant effects on both health outcomes only after two doses. Our results support a policy of extending the interval between first and second doses of vaccines developed in Europe and the US. As vaccination progresses, population mobility increases, which partially offsets the direct effects of vaccination. This suggests that non-pharmaceutical interventions remain important to contain transmission as vaccination is rolled out. △ Less

Submitted 15 January, 2022; v1 submitted 14 September, 2021; originally announced September 2021.

arXiv:2108.08097 [pdf, other]

Why North Korean Refugees are Reluctant to Compete: The Roles of Cognitive Ability

Authors: Syngjoo Choi, Byung-Yeon Kim, Jungmin Lee, Sokbae Lee

Abstract: The study compares the competitiveness of three Korean groups raised in different institutional environments: South Korea, North Korea, and China. Laboratory experiments reveal that North Korean refugees are less likely to participate in competitive tournaments than South Koreans and Korean-Chinese immigrants. Analysis using a choice model with probability weighting suggests that lower cognitive a… ▽ More The study compares the competitiveness of three Korean groups raised in different institutional environments: South Korea, North Korea, and China. Laboratory experiments reveal that North Korean refugees are less likely to participate in competitive tournaments than South Koreans and Korean-Chinese immigrants. Analysis using a choice model with probability weighting suggests that lower cognitive ability may lead to lower expected performance, more pessimistic beliefs, and greater aversion to competition. △ Less

Submitted 24 August, 2023; v1 submitted 18 August, 2021; originally announced August 2021.

arXiv:2107.06174 [pdf]

doi 10.1016/j.energy.2021.122366

National-scale electricity peak load forecasting: Traditional, machine learning, or hybrid model?

Authors: Juyong Lee, Youngsang Cho

Abstract: As the volatility of electricity demand increases owing to climate change and electrification, the importance of accurate peak load forecasting is increasing. Traditional peak load forecasting has been conducted through time series-based models; however, recently, new models based on machine or deep learning are being introduced. This study performs a comparative analysis to determine the most acc… ▽ More As the volatility of electricity demand increases owing to climate change and electrification, the importance of accurate peak load forecasting is increasing. Traditional peak load forecasting has been conducted through time series-based models; however, recently, new models based on machine or deep learning are being introduced. This study performs a comparative analysis to determine the most accurate peak load-forecasting model for Korea, by comparing the performance of time series, machine learning, and hybrid models. Seasonal autoregressive integrated moving average with exogenous variables (SARIMAX) is used for the time series model. Artificial neural network (ANN), support vector regression (SVR), and long short-term memory (LSTM) are used for the machine learning models. SARIMAX-ANN, SARIMAX-SVR, and SARIMAX-LSTM are used for the hybrid models. The results indicate that the hybrid models exhibit significant improvement over the SARIMAX model. The LSTM-based models outperformed the others; the single and hybrid LSTM models did not exhibit a significant performance difference. In the case of Korea's highest peak load in 2019, the predictive power of the LSTM model proved to be greater than that of the SARIMAX-LSTM model. The LSTM, SARIMAX-SVR, and SARIMAX-LSTM models outperformed the current time series-based forecasting model used in Korea. Thus, Korea's peak load-forecasting performance can be improved by including machine learning or hybrid models. △ Less

Submitted 30 June, 2021; originally announced July 2021.

arXiv:2105.10007 [pdf, other]

Fixed-k Tail Regression: New Evidence on Tax and Wealth Inequality from Forbes 400

Authors: Ji Hyung Lee, Yuya Sasaki, Alexis Akira Toda, Yulong Wang

Abstract: We develop a novel fixed-k tail regression method that accommodates the unique feature in the Forbes 400 data that observations are truncated from below at the 400th largest order statistic. Applying this method, we find that higher maximum marginal income tax rates induce higher wealth Pareto exponents. Setting the maximum tax rate to 30-40% (as in U.S. currently) leads to a Pareto exponent of 1.… ▽ More We develop a novel fixed-k tail regression method that accommodates the unique feature in the Forbes 400 data that observations are truncated from below at the 400th largest order statistic. Applying this method, we find that higher maximum marginal income tax rates induce higher wealth Pareto exponents. Setting the maximum tax rate to 30-40% (as in U.S. currently) leads to a Pareto exponent of 1.5-1.8, while counterfactually setting it to 80% (as suggested by Piketty, 2014) would lead to a Pareto exponent of 2.6. We present a simple economic model that explains these findings and discuss the welfare implications of taxation. △ Less

Submitted 14 September, 2022; v1 submitted 20 May, 2021; originally announced May 2021.

arXiv:2102.05876 [pdf]

On the Fragility of Third-party Punishment: The Context Effect of a Dominated Risky Investment Option

Authors: Changkuk Im, Jinkwon Lee

Abstract: Experimental studies regularly show that third-party punishment (TPP) substantially exists in various settings. This study further investigates the robustness of TPP under an environment where context effects are involved. In our experiment, we offer a third party an additional but unattractive risky investment option. We find that, when the dominated investment option irrelevant to prosocial beha… ▽ More Experimental studies regularly show that third-party punishment (TPP) substantially exists in various settings. This study further investigates the robustness of TPP under an environment where context effects are involved. In our experiment, we offer a third party an additional but unattractive risky investment option. We find that, when the dominated investment option irrelevant to prosocial behavior is available, the demand for punishment decreases, whereas the demand for investment increases. These findings support our hypothesis that the seemingly unrelated and dominated investment option may work as a compromise and suggest the fragility of TPP in this setting. △ Less

Submitted 19 October, 2021; v1 submitted 11 February, 2021; originally announced February 2021.

arXiv:2101.11568 [pdf, other]

doi 10.1016/j.jeconom.2022.11.006

Predictive Quantile Regression with Mixed Roots and Increasing Dimensions: The ALQR Approach

Authors: Rui Fan, Ji Hyung Lee, Youngki Shin

Abstract: In this paper we propose the adaptive lasso for predictive quantile regression (ALQR). Reflecting empirical findings, we allow predictors to have various degrees of persistence and exhibit different signal strengths. The number of predictors is allowed to grow with the sample size. We study regularity conditions under which stationary, local unit root, and cointegrated predictors are present simul… ▽ More In this paper we propose the adaptive lasso for predictive quantile regression (ALQR). Reflecting empirical findings, we allow predictors to have various degrees of persistence and exhibit different signal strengths. The number of predictors is allowed to grow with the sample size. We study regularity conditions under which stationary, local unit root, and cointegrated predictors are present simultaneously. We next show the convergence rates, model selection consistency, and asymptotic distributions of ALQR. We apply the proposed method to the out-of-sample quantile prediction problem of stock returns and find that it outperforms the existing alternatives. We also provide numerical evidence from additional Monte Carlo experiments, supporting the theoretical results. △ Less

Submitted 3 December, 2022; v1 submitted 27 January, 2021; originally announced January 2021.

Comments: 71 pages, 5 figures, 18 tables

Journal ref: Journal of Econometrics, Vol 237, No 2, Part C, Article 105372, 2023

arXiv:2008.06660 [pdf, other]

doi 10.1038/s41467-021-24959-z

No COVID-19 Climate Silver Lining in the US Power Sector

Authors: Max Luke, Priyanshi Somani, Turner Cotterman, Dhruv Suri, Stephen J. Lee

Abstract: Recent studies conclude that the global coronavirus (COVID-19) pandemic decreased power sector CO$_2$ emissions globally and in the United States. In this paper, we analyze the statistical significance of CO2 emissions reductions in the U.S. power sector from March through December 2020. We use Gaussian process (GP) regression to assess whether CO2 emissions reductions would have occurred with rea… ▽ More Recent studies conclude that the global coronavirus (COVID-19) pandemic decreased power sector CO$_2$ emissions globally and in the United States. In this paper, we analyze the statistical significance of CO2 emissions reductions in the U.S. power sector from March through December 2020. We use Gaussian process (GP) regression to assess whether CO2 emissions reductions would have occurred with reasonable probability in the absence of COVID-19 considering uncertainty due to factors unrelated to the pandemic and adjusting for weather, seasonality, and recent emissions trends. We find that monthly CO2 emissions reductions are only statistically significant in April and May 2020 considering hypothesis tests at 5% significance levels. Separately, we consider the potential impact of COVID-19 on coal-fired power plant retirements through 2022. We find that only a small percentage of U.S. coal power plants are at risk of retirement due to a possible COVID-19-related sustained reduction in electricity demand and prices. We observe and anticipate a return to pre-COVID-19 CO2 emissions in the U.S. power sector. △ Less

Submitted 28 May, 2021; v1 submitted 15 August, 2020; originally announced August 2020.

Comments: 13 pages, 6 figures, preprint

arXiv:2003.03299 [pdf, other]

doi 10.1017/S0266466621000402

Complete Subset Averaging for Quantile Regressions

Authors: Ji Hyung Lee, Youngki Shin

Abstract: We propose a novel conditional quantile prediction method based on complete subset averaging (CSA) for quantile regressions. All models under consideration are potentially misspecified and the dimension of regressors goes to infinity as the sample size increases. Since we average over the complete subsets, the number of models is much larger than the usual model averaging method which adopts sophi… ▽ More We propose a novel conditional quantile prediction method based on complete subset averaging (CSA) for quantile regressions. All models under consideration are potentially misspecified and the dimension of regressors goes to infinity as the sample size increases. Since we average over the complete subsets, the number of models is much larger than the usual model averaging method which adopts sophisticated weighting schemes. We propose to use an equal weight but select the proper size of the complete subset based on the leave-one-out cross-validation method. Building upon the theory of Lu and Su (2015), we investigate the large sample properties of CSA and show the asymptotic optimality in the sense of Li (1987). We check the finite sample performance via Monte Carlo simulations and empirical applications. △ Less

Submitted 12 July, 2021; v1 submitted 6 March, 2020; originally announced March 2020.

Comments: 46 pages, 3 figures, 9 tables

arXiv:1904.13329 [pdf]

Supervised Machine Learning for Eliciting Individual Demand

Authors: John A. Clithero, Jae Joon Lee, Joshua Tasoff

Abstract: Direct elicitation, guided by theory, is the standard method for eliciting latent preferences. The canonical direct-elicitation approach for measuring individuals' valuations for goods is the Becker-DeGroot-Marschak procedure, which generates willingness-to-pay (WTP) values that are imprecise and systematically biased by understating valuations. We show that enhancing elicited WTP values with supe… ▽ More Direct elicitation, guided by theory, is the standard method for eliciting latent preferences. The canonical direct-elicitation approach for measuring individuals' valuations for goods is the Becker-DeGroot-Marschak procedure, which generates willingness-to-pay (WTP) values that are imprecise and systematically biased by understating valuations. We show that enhancing elicited WTP values with supervised machine learning (SML) can substantially improve estimates of peoples' out-of-sample purchase behavior. Furthermore, swapping WTP data with choice data generated from a simple task, two-alternative forced choice, leads to comparable performance. Combining all the data with the best-performing SML methods yields large improvements in predicting out-of-sample purchases. We quantify the benefit of using various SML methods in conjunction with using different types of data. Our results suggest that prices set by SML would increase revenue by 28% over using the stated WTP, with the same data. △ Less

Submitted 4 February, 2021; v1 submitted 30 April, 2019; originally announced April 2019.

arXiv:1904.05209 [pdf, other]

Local Polynomial Estimation of Time-Varying Parameters in Nonlinear Models

Authors: Dennis Kristensen, Young Jun Lee

Abstract: We develop a novel asymptotic theory for local polynomial (quasi-) maximum-likelihood estimators of time-varying parameters in a broad class of nonlinear time series models. Under weak regularity conditions, we show the proposed estimators are consistent and follow normal distributions in large samples. Our conditions impose weaker smoothness and moment conditions on the data-generating process an… ▽ More We develop a novel asymptotic theory for local polynomial (quasi-) maximum-likelihood estimators of time-varying parameters in a broad class of nonlinear time series models. Under weak regularity conditions, we show the proposed estimators are consistent and follow normal distributions in large samples. Our conditions impose weaker smoothness and moment conditions on the data-generating process and its likelihood compared to existing theories. Furthermore, the bias terms of the estimators take a simpler form. We demonstrate the usefulness of our general results by applying our theory to local (quasi-)maximum-likelihood estimators of a time-varying VAR's, ARCH and GARCH, and Poisson autogressions. For the first three models, we are able to substantially weaken the conditions found in the existing literature. For the Poisson autogression, existing theories cannot be be applied while our novel approach allows us to analyze it. △ Less

Submitted 24 August, 2023; v1 submitted 10 April, 2019; originally announced April 2019.

arXiv:1810.03140 [pdf, other]

On LASSO for Predictive Regression

Authors: Ji Hyung Lee, Zhentao Shi, Zhan Gao

Abstract: Explanatory variables in a predictive regression typically exhibit low signal strength and various degrees of persistence. Variable selection in such a context is of great importance. In this paper, we explore the pitfalls and possibilities of the LASSO methods in this predictive regression framework. In the presence of stationary, local unit root, and cointegrated predictors, we show that the ada… ▽ More Explanatory variables in a predictive regression typically exhibit low signal strength and various degrees of persistence. Variable selection in such a context is of great importance. In this paper, we explore the pitfalls and possibilities of the LASSO methods in this predictive regression framework. In the presence of stationary, local unit root, and cointegrated predictors, we show that the adaptive LASSO cannot asymptotically eliminate all cointegrating variables with zero regression coefficients. This new finding motivates a novel post-selection adaptive LASSO, which we call the twin adaptive LASSO (TAlasso), to restore variable selection consistency. Accommodating the system of heterogeneous regressors, TAlasso achieves the well-known oracle property. In contrast, conventional LASSO fails to attain coefficient estimation consistency and variable screening in all components simultaneously. We apply these LASSO methods to evaluate the short- and long-horizon predictability of S\&P 500 excess returns. △ Less

Submitted 14 February, 2021; v1 submitted 7 October, 2018; originally announced October 2018.

arXiv:1808.03482 [pdf, other]

Exeum: A Decentralized Financial Platform for Price-Stable Cryptocurrencies

Authors: Jaehyung Lee, Minhyung Cho

Abstract: Price stability has often been cited as a key reason that cryptocurrencies have not gained widespread adoption as a medium of exchange and continue to prove incapable of powering the economy of decentralized applications (DApps) efficiently. Exeum proposes a novel method to provide price stable digital tokens whose values are pegged to real world assets, serving as a bridge between the real world… ▽ More Price stability has often been cited as a key reason that cryptocurrencies have not gained widespread adoption as a medium of exchange and continue to prove incapable of powering the economy of decentralized applications (DApps) efficiently. Exeum proposes a novel method to provide price stable digital tokens whose values are pegged to real world assets, serving as a bridge between the real world and the decentralized economy. Pegged tokens issued by Exeum - for example, USDE refers to a stable token issued by the system whose value is pegged to USD - are backed by virtual assets in a virtual asset exchange where users can deposit the base token of the system and take long or short positions. Guaranteeing the stability of the pegged tokens boils down to the problem of maintaining the peg of the virtual assets to real world assets, and the main mechanism used by Exeum is controlling the swap rate of assets. If the swap rate is fully controlled by the system, arbitrageurs can be incentivized enough to restore a broken peg; Exeum distributes statistical arbitrage trading software to decentralize this type of market making activity. The last major component of the system is a central bank equivalent that determines the long term interest rate of the base token, pays interest on the deposit by inflating the supply if necessary, and removes the need for stability fees on pegged tokens, improving their usability. To the best of our knowledge, Exeum is the first to propose a truly decentralized method for developing a stablecoin that enables 1:1 value conversion between the base token and pegged assets, completely removing the mismatch between supply and demand. In this paper, we will also discuss its applications, such as improving staking based DApp token models, price stable gas fees, pegging to an index of DApp tokens, and performing cross-chain asset transfer of legacy crypto assets. △ Less

Submitted 10 August, 2018; originally announced August 2018.

arXiv:1206.2966 [pdf, ps, other]

Panel Data Models with Nonadditive Unobserved Heterogeneity: Estimation and Inference

Authors: Ivan Fernandez-Val, Joonhwah Lee

Abstract: This paper considers fixed effects estimation and inference in linear and nonlinear panel data models with random coefficients and endogenous regressors. The quantities of interest -- means, variances, and other moments of the random coefficients -- are estimated by cross sectional sample moments of GMM estimators applied separately to the time series of each individual. To deal with the incidenta… ▽ More This paper considers fixed effects estimation and inference in linear and nonlinear panel data models with random coefficients and endogenous regressors. The quantities of interest -- means, variances, and other moments of the random coefficients -- are estimated by cross sectional sample moments of GMM estimators applied separately to the time series of each individual. To deal with the incidental parameter problem introduced by the noise of the within-individual estimators in short panels, we develop bias corrections. These corrections are based on higher-order asymptotic expansions of the GMM estimators and produce improved point and interval estimates in moderately long panels. Under asymptotic sequences where the cross sectional and time series dimensions of the panel pass to infinity at the same rate, the uncorrected estimator has an asymptotic bias of the same order as the asymptotic variance. The bias corrections remove the bias without increasing variance. An empirical example on cigarette demand based on Becker, Grossman and Murphy (1994) shows significant heterogeneity in the price effect across U.S. states. △ Less

Submitted 11 October, 2013; v1 submitted 13 June, 2012; originally announced June 2012.

Comments: 51 pages, 4 tables, 1 figure, it includes supplementary appendix

Showing 1–37 of 37 results for author: Lee, J