Skip to main content

Showing 1–35 of 35 results for author: Shin, Y

Searching in archive stat. Search in all archives.
.
  1. Building nonstationary extreme value model using L-moments

    Authors: Yire Shin, Yonggwan Shin, Jeong-Soo Park

    Abstract: The maximum likelihood estimation for a time-dependent nonstationary (NS) extreme value model is often too sensitive to influential observations, such as large values toward the end of a sample. Thus, alternative methods using L-moments have been developed in NS models to address this problem while retaining the advantages of the stationary L-moment method. However, one method using L-moments disp… ▽ More

    Submitted 1 June, 2025; originally announced June 2025.

    Journal ref: Journal of the Korean Statistical Society, 2025

  2. arXiv:2505.21417  [pdf, ps, other

    stat.ME stat.CO

    Model averaging with mixed criteria for estimating high quantiles of extreme values: Application to heavy rainfall

    Authors: Yonggwan Shin, Yire Shin, Jeong-Soo Park

    Abstract: Accurately estimating high quantiles beyond the largest observed value is crucial in risk assessment and devising effective adaptation strategies to prevent a greater disaster. The generalized extreme value distribution is widely used for this purpose, with L-moment estimation (LME) and maximum likelihood estimation (MLE) being the primary methods. However, estimating high quantiles with a small s… ▽ More

    Submitted 27 May, 2025; originally announced May 2025.

  3. arXiv:2502.07033  [pdf, ps, other

    stat.ME

    Compatible Imputation for Hierarchical Linear Models with Incomplete Data: Interaction Effects of Continuous and Categorical Covariates MAR

    Authors: Dongho Shin, Yongyun Shin

    Abstract: This article focuses on Bayesian estimation of a hierarchical linear model (HLM) from incomplete data assumed missing at random where continuous covariates C and discrete categorical covariates $D$ have interaction effects on a continuous response $R$. Given small sample sizes, maximum likelihood estimation is suboptimal, and existing Gibbs samplers are based on a Bayesian joint distribution compa… ▽ More

    Submitted 10 February, 2025; originally announced February 2025.

    Comments: arXiv admin note: text overlap with arXiv:2405.21020

  4. Generalized logistic model for $r$ largest order statistics, with hydrological application

    Authors: Yire Shin, Jeong-Soo Park

    Abstract: The effective use of available information in extreme value analysis is critical because extreme values are scarce. Thus, using the $r$ largest order statistics (rLOS) instead of the block maxima is encouraged. Based on the four-parameter kappa model for the rLOS (rK4D), we introduce a new distribution for the rLOS as a special case of the rK4D. That is the generalized logistic model for rLOS (rGL… ▽ More

    Submitted 25 October, 2024; v1 submitted 16 August, 2024; originally announced August 2024.

    Comments: In this revision, some modification and correction from the published one are made on sentences and formula with blue color

    Journal ref: Stoch Environ Res Risk Assess 38 (2024) 1567-1581

  5. arXiv:2405.21020  [pdf, other

    stat.ME

    Bayesian Estimation of Hierarchical Linear Models from Incomplete Data: Cluster-Level Interaction Effects and Small Sample Sizes

    Authors: Dongho Shin, Yongyun Shin, Nao Hagiwara

    Abstract: We consider Bayesian estimation of a hierarchical linear model (HLM) from partially observed data, assumed to be missing at random, and small sample sizes. A vector of continuous covariates $C$ includes cluster-level partially observed covariates with interaction effects. Due to small sample sizes from 37 patient-physician encounters repeatedly measured at four time points, maximum likelihood esti… ▽ More

    Submitted 30 January, 2025; v1 submitted 31 May, 2024; originally announced May 2024.

  6. arXiv:2312.10072  [pdf, other

    cs.HC cs.AI cs.LG stat.AP

    Assessing the Usability of GutGPT: A Simulation Study of an AI Clinical Decision Support System for Gastrointestinal Bleeding Risk

    Authors: Colleen Chan, Kisung You, Sunny Chung, Mauro Giuffrè, Theo Saarinen, Niroop Rajashekar, Yuan Pu, Yeo Eun Shin, Loren Laine, Ambrose Wong, René Kizilcec, Jasjeet Sekhon, Dennis Shung

    Abstract: Applications of large language models (LLMs) like ChatGPT have potential to enhance clinical decision support through conversational interfaces. However, challenges of human-algorithmic interaction and clinician trust are poorly understood. GutGPT, a LLM for gastrointestinal (GI) bleeding risk prediction and management guidance, was deployed in clinical simulation scenarios alongside the electroni… ▽ More

    Submitted 6 December, 2023; originally announced December 2023.

    Comments: Extended Abstract presented at Machine Learning for Health (ML4H) symposium 2023, December 10, 2023, New Orleans, United States, 11 pages

  7. arXiv:2309.01020  [pdf, other

    math.NA cs.LG stat.ML

    On the training and generalization of deep operator networks

    Authors: Sanghyun Lee, Yeonjong Shin

    Abstract: We present a novel training method for deep operator networks (DeepONets), one of the most popular neural network models for operators. DeepONets are constructed by two sub-networks, namely the branch and trunk networks. Typically, the two sub-networks are trained simultaneously, which amounts to solving a complex optimization problem in a high dimensional space. In addition, the nonconvex and non… ▽ More

    Submitted 2 September, 2023; originally announced September 2023.

  8. arXiv:2308.13564  [pdf, other

    econ.EM cs.LG math.ST stat.CO stat.ML

    SGMM: Stochastic Approximation to Generalized Method of Moments

    Authors: Xiaohong Chen, Sokbae Lee, Yuan Liao, Myung Hwan Seo, Youngki Shin, Myunghyun Song

    Abstract: We introduce a new class of algorithms, Stochastic Generalized Method of Moments (SGMM), for estimation and inference on (overidentified) moment restriction models. Our SGMM is a novel stochastic approximation alternative to the popular Hansen (1982) (offline) GMM, and offers fast and scalable implementation with the ability to handle streaming datasets in real time. We establish the almost sure c… ▽ More

    Submitted 30 October, 2023; v1 submitted 24 August, 2023; originally announced August 2023.

    Comments: 46 pages, 4 tables, 2 figures

  9. arXiv:2209.14502  [pdf, other

    econ.EM stat.CO

    Fast Inference for Quantile Regression with Tens of Millions of Observations

    Authors: Sokbae Lee, Yuan Liao, Myung Hwan Seo, Youngki Shin

    Abstract: Big data analytics has opened new avenues in economic research, but the challenge of analyzing datasets with tens of millions of observations is substantial. Conventional econometric methods based on extreme estimators require large amounts of computing resources and memory, which are often not readily available. In this paper, we focus on linear quantile regression applied to "ultra-large" datase… ▽ More

    Submitted 31 October, 2023; v1 submitted 28 September, 2022; originally announced September 2022.

    Comments: 62 pages, 8 figures

  10. arXiv:2202.06383  [pdf, other

    cs.LG stat.AP

    Surgical Scheduling via Optimization and Machine Learning with Long-Tailed Data

    Authors: Yuan Shi, Saied Mahdian, Jose Blanchet, Peter Glynn, Andrew Y. Shin, David Scheinker

    Abstract: Using data from cardiovascular surgery patients with long and highly variable post-surgical lengths of stay (LOS), we develop a modeling framework to reduce recovery unit congestion. We estimate the LOS and its probability distribution using machine learning models, schedule procedures on a rolling basis using a variety of optimization models, and estimate performance with simulation. The machine… ▽ More

    Submitted 28 November, 2022; v1 submitted 13 February, 2022; originally announced February 2022.

  11. arXiv:2111.07513  [pdf

    cs.LG stat.ML

    A Comparative Study on Basic Elements of Deep Learning Models for Spatial-Temporal Traffic Forecasting

    Authors: Yuyol Shin, Yoonjin Yoon

    Abstract: Traffic forecasting plays a crucial role in intelligent transportation systems. The spatial-temporal complexities in transportation networks make the problem especially challenging. The recently suggested deep learning models share basic elements such as graph convolution, graph attention, recurrent units, and/or attention mechanism. In this study, we designed an in-depth comparative study for fou… ▽ More

    Submitted 22 March, 2022; v1 submitted 14 November, 2021; originally announced November 2021.

    Comments: 14 pages, 4 figures, 3 Tables, This paper is accepted for AAAI-22 Workshop: AI for Transportation

  12. arXiv:2106.03156  [pdf, other

    stat.ML cs.LG econ.EM math.ST

    Fast and Robust Online Inference with Stochastic Gradient Descent via Random Scaling

    Authors: Sokbae Lee, Yuan Liao, Myung Hwan Seo, Youngki Shin

    Abstract: We develop a new method of online inference for a vector of parameters estimated by the Polyak-Ruppert averaging procedure of stochastic gradient descent (SGD) algorithms. We leverage insights from time series regression in econometrics and construct asymptotically pivotal statistics via random scaling. Our approach is fully operational with online data and is rigorously underpinned by a functiona… ▽ More

    Submitted 6 October, 2021; v1 submitted 6 June, 2021; originally announced June 2021.

    Comments: 29 pages, 8 figures, 8 tables

    MSC Class: Primary 62J10; 62M02; secondary 60K35 ACM Class: G.3

    Journal ref: Proceedings of the 36th AAAI Conference on Artificial Intelligence, 36(7), 2022, pp. 7381-7389

  13. arXiv:2105.11025  [pdf, other

    cs.LG stat.ML

    Compressing Heavy-Tailed Weight Matrices for Non-Vacuous Generalization Bounds

    Authors: John Y. Shin

    Abstract: Heavy-tailed distributions have been studied in statistics, random matrix theory, physics, and econometrics as models of correlated systems, among other domains. Further, heavy-tail distributed eigenvalues of the covariance matrix of the weight matrices in neural networks have been shown to empirically correlate with test set accuracy in several works (e.g. arXiv:1901.08276), but a formal relation… ▽ More

    Submitted 23 May, 2021; originally announced May 2021.

  14. Predictive Quantile Regression with Mixed Roots and Increasing Dimensions: The ALQR Approach

    Authors: Rui Fan, Ji Hyung Lee, Youngki Shin

    Abstract: In this paper we propose the adaptive lasso for predictive quantile regression (ALQR). Reflecting empirical findings, we allow predictors to have various degrees of persistence and exhibit different signal strengths. The number of predictors is allowed to grow with the sample size. We study regularity conditions under which stationary, local unit root, and cointegrated predictors are present simul… ▽ More

    Submitted 3 December, 2022; v1 submitted 27 January, 2021; originally announced January 2021.

    Comments: 71 pages, 5 figures, 18 tables

    Journal ref: Journal of Econometrics, Vol 237, No 2, Part C, Article 105372, 2023

  15. arXiv:2010.07604  [pdf, other

    stat.ME cs.AI cs.LG stat.ML

    Sequential Likelihood-Free Inference with Neural Proposal

    Authors: Dongjun Kim, Kyungwoo Song, YoonYeong Kim, Yongjin Shin, Wanmo Kang, Il-Chul Moon, Weonyoung Joo

    Abstract: Bayesian inference without the likelihood evaluation, or likelihood-free inference, has been a key research topic in simulation studies for gaining quantitatively validated simulation models on real-world datasets. As the likelihood evaluation is inaccessible, previous papers train the amortized neural network to estimate the ground-truth posterior for the simulation of interest. Training the netw… ▽ More

    Submitted 4 November, 2022; v1 submitted 15 October, 2020; originally announced October 2020.

  16. Modeling climate extremes using the four-parameter kappa distribution for $r$-largest order statistics

    Authors: Yire Shin, Jeong-Soo Park

    Abstract: Accurate estimation of the T-year return levels of climate extremes using statistical distribution is a critical step in the projection of future climate and in engineering design for disaster response. We show how the estimation of such quantities can be improved by fitting {the four-parameter kappa distribution for $r$-largest order statistics} (rK4D), which was developed in this study. The rK4D… ▽ More

    Submitted 5 December, 2024; v1 submitted 23 July, 2020; originally announced July 2020.

    Comments: In this revision, some modification and correction from the published one are made on sentences and formula with blue color

    Journal ref: Weather and Climate Extremes, 39, 100533 (2023)

  17. Integration of max-stable processes and Bayesian model averaging to predict extreme climatic events in multi-model ensembles

    Authors: Yonggwan Shin, Youngsaeng Lee, Juntae Choi, Jeong-Soo Park

    Abstract: Projections of changes in extreme climate are sometimes predicted by using multi-model ensemble methods such as Bayesian model averaging (BMA) embedded with the generalized extreme value (GEV) distribution. BMA is a popular method for combining the forecasts of individual simulation models by weighted averaging and characterizing the uncertainty induced by simulating the model structure. This meth… ▽ More

    Submitted 19 July, 2020; originally announced July 2020.

    Journal ref: Stochastic Environmental Research and Risk Assessment 33, 2019, 47-57

  18. arXiv:2007.08199  [pdf, other

    cs.LG cs.CV stat.ML

    Learning from Noisy Labels with Deep Neural Networks: A Survey

    Authors: Hwanjun Song, Minseok Kim, Dongmin Park, Yooju Shin, Jae-Gil Lee

    Abstract: Deep learning has achieved remarkable success in numerous domains with help from large amounts of big data. However, the quality of data labels is a concern because of the lack of high-quality labels in many real-world scenarios. As noisy labels severely degrade the generalization performance of deep neural networks, learning from noisy labels (robust training) is becoming an important task in mod… ▽ More

    Submitted 9 March, 2022; v1 submitted 16 July, 2020; originally announced July 2020.

    Comments: Final version published in TNNLS Journal (2022 March)

  19. arXiv:2007.07213  [pdf, other

    cs.LG math.NA stat.ML

    Plateau Phenomenon in Gradient Descent Training of ReLU networks: Explanation, Quantification and Avoidance

    Authors: Mark Ainsworth, Yeonjong Shin

    Abstract: The ability of neural networks to provide `best in class' approximation across a wide range of applications is well-documented. Nevertheless, the powerful expressivity of neural networks comes to naught if one is unable to effectively train (choose) the parameters defining the network. In general, neural networks are trained by gradient descent type optimization methods, or a stochastic variant th… ▽ More

    Submitted 14 July, 2020; originally announced July 2020.

  20. arXiv:2007.01458  [pdf, other

    cs.LG stat.ML

    Confidence-Aware Learning for Deep Neural Networks

    Authors: Jooyoung Moon, Jihyo Kim, Younghak Shin, Sangheum Hwang

    Abstract: Despite the power of deep neural networks for a wide range of tasks, an overconfident prediction issue has limited their practical use in many safety-critical applications. Many recent works have been proposed to mitigate this issue, but most of them require either additional computational costs in training and/or inference phases or customized architectures to output confidence estimates separate… ▽ More

    Submitted 12 August, 2020; v1 submitted 2 July, 2020; originally announced July 2020.

    Comments: ICML 2020. The first two authors contributed equally

  21. Sparse HP Filter: Finding Kinks in the COVID-19 Contact Rate

    Authors: Sokbae Lee, Yuan Liao, Myung Hwan Seo, Youngki Shin

    Abstract: In this paper, we estimate the time-varying COVID-19 contact rate of a Susceptible-Infected-Recovered (SIR) model. Our measurement of the contact rate is constructed using data on actively infected, recovered and deceased cases. We propose a new trend filtering method that is a variant of the Hodrick-Prescott (HP) filter, constrained by the number of possible kinks. We term it the… ▽ More

    Submitted 29 July, 2020; v1 submitted 18 June, 2020; originally announced June 2020.

    Comments: 42 pages, 15 figures, 1 table

    Journal ref: Journal of Econometrics, 220(1), 2021, pp. 158-180

  22. Complete Subset Averaging for Quantile Regressions

    Authors: Ji Hyung Lee, Youngki Shin

    Abstract: We propose a novel conditional quantile prediction method based on complete subset averaging (CSA) for quantile regressions. All models under consideration are potentially misspecified and the dimension of regressors goes to infinity as the sample size increases. Since we average over the complete subsets, the number of models is much larger than the usual model averaging method which adopts sophi… ▽ More

    Submitted 12 July, 2021; v1 submitted 6 March, 2020; originally announced March 2020.

    Comments: 46 pages, 3 figures, 9 tables

  23. arXiv:1911.10979  [pdf, other

    cs.LG cs.CV stat.ML

    Simple yet Effective Way for Improving the Performance of GAN

    Authors: Yong-Goo Shin, Yoon-Jae Yeo, Sung-Jea Ko

    Abstract: In adversarial learning, discriminator often fails to guide the generator successfully since it distinguishes between real and generated images using silly or non-robust features. To alleviate this problem, this brief presents a simple but effective way that improves the performance of generative adversarial network (GAN) without imposing the training overhead or modifying the network architecture… ▽ More

    Submitted 19 January, 2021; v1 submitted 19 November, 2019; originally announced November 2019.

    Comments: Accepted to IEEE transactions on neural networks and learning systems

  24. arXiv:1910.05874  [pdf, other

    cs.LG math.NA stat.ML

    Effects of Depth, Width, and Initialization: A Convergence Analysis of Layer-wise Training for Deep Linear Neural Networks

    Authors: Yeonjong Shin

    Abstract: Deep neural networks have been used in various machine learning applications and achieved tremendous empirical successes. However, training deep neural networks is a challenging task. Many alternatives have been proposed in place of end-to-end back-propagation. Layer-wise training is one of them, which trains a single layer at a time, rather than trains the whole layers simultaneously. In this pap… ▽ More

    Submitted 7 September, 2020; v1 submitted 13 October, 2019; originally announced October 2019.

  25. Incorporating dynamicity of transportation network with multi-weight traffic graph convolutional network for traffic forecasting

    Authors: Yuyol Shin, Yoonjin Yoon

    Abstract: Traffic forecasting problem remains a challenging task in the intelligent transportation system due to its spatio-temporal complexity. Although temporal dependency has been well studied and discussed, spatial dependency is relatively less explored due to its large variations, especially in the urban environment. In this study, a novel graph convolutional network model, Multi-Weight Traffic Graph C… ▽ More

    Submitted 26 May, 2021; v1 submitted 16 September, 2019; originally announced September 2019.

    Comments: 11 pages, 7 figures, Accepted to IEEE Transactions on Intelligent Transportation Systems (2020)

    MSC Class: 68T99

    Journal ref: IEEE Trans. Intell. Transp. Syst., 0 (2020) 1-11

  26. Trainability of ReLU networks and Data-dependent Initialization

    Authors: Yeonjong Shin, George Em Karniadakis

    Abstract: In this paper, we study the trainability of rectified linear unit (ReLU) networks. A ReLU neuron is said to be dead if it only outputs a constant for any input. Two death states of neurons are introduced; tentative and permanent death. A network is then said to be trainable if the number of permanently dead neurons is sufficiently small for a learning task. We refer to the probability of a network… ▽ More

    Submitted 31 March, 2020; v1 submitted 23 July, 2019; originally announced July 2019.

  27. arXiv:1903.06733  [pdf, other

    stat.ML cs.LG math.PR

    Dying ReLU and Initialization: Theory and Numerical Examples

    Authors: Lu Lu, Yeonjong Shin, Yanhui Su, George Em Karniadakis

    Abstract: The dying ReLU refers to the problem when ReLU neurons become inactive and only output 0 for any input. There are many empirical and heuristic explanations of why ReLU neurons die. However, little is known about its theoretical analysis. In this paper, we rigorously prove that a deep ReLU network will eventually die in probability as the depth goes to infinite. Several methods have been proposed t… ▽ More

    Submitted 21 October, 2020; v1 submitted 15 March, 2019; originally announced March 2019.

  28. arXiv:1901.07375  [pdf

    cs.CV cs.LG eess.IV stat.ML

    Extension of Convolutional Neural Network with General Image Processing Kernels

    Authors: Jay Hoon Jung, Yousun Shin, YoungMin Kwon

    Abstract: We applied pre-defined kernels also known as filters or masks developed for image processing to convolution neural network. Instead of letting neural networks find its own kernels, we used 41 different general-purpose kernels of blurring, edge detecting, sharpening, discrete cosine transformation, etc. for the first layer of the convolution neural networks. This architecture, thus named as general… ▽ More

    Submitted 16 January, 2019; originally announced January 2019.

    Comments: 4 pages, 6 figures

    Journal ref: TENCON 2018

  29. arXiv:1811.08083  [pdf, other

    econ.EM stat.ME

    Complete Subset Averaging with Many Instruments

    Authors: Seojeong Lee, Youngki Shin

    Abstract: We propose a two-stage least squares (2SLS) estimator whose first stage is the equal-weighted average over a complete subset with $k$ instruments among $K$ available, which we call the complete subset averaging (CSA) 2SLS. The approximate mean squared error (MSE) is derived as a function of the subset size $k$ by the Nagar (1959) expansion. The subset size is chosen by minimizing the sample counte… ▽ More

    Submitted 26 August, 2020; v1 submitted 20 November, 2018; originally announced November 2018.

    Comments: 56 pages, 3 figures, 10 tables

    Journal ref: Econometrics Journal, 24(2), 2021, pp. 290-314

  30. arXiv:1809.00758  [pdf

    cs.LG cs.CV cs.SD eess.AS stat.ML

    End-to-end Multimodal Emotion and Gender Recognition with Dynamic Joint Loss Weights

    Authors: Myungsu Chae, Tae-Ho Kim, Young Hoon Shin, June-Woo Kim, Soo-Young Lee

    Abstract: Multi-task learning is a method for improving the generalizability of multiple tasks. In order to perform multiple classification tasks with one neural network model, the losses of each task should be combined. Previous studies have mostly focused on multiple prediction tasks using joint loss with static weights for training models, choosing the weights between tasks without making sufficient cons… ▽ More

    Submitted 2 October, 2018; v1 submitted 3 September, 2018; originally announced September 2018.

    Comments: IROS 2018 Workshop on Crossmodal Learning for Intelligent Robotics

    MSC Class: 68T05

  31. arXiv:1802.00912  [pdf, other

    cs.LG cs.CV stat.ML

    Active, Continual Fine Tuning of Convolutional Neural Networks for Reducing Annotation Efforts

    Authors: Zongwei Zhou, Jae Y. Shin, Suryakanth R. Gurudu, Michael B. Gotway, Jianming Liang

    Abstract: The splendid success of convolutional neural networks (CNNs) in computer vision is largely attributable to the availability of massive annotated datasets, such as ImageNet and Places. However, in medical imaging, it is challenging to create such large annotated datasets, as annotating medical images is not only tedious, laborious, and time consuming, but it also demands costly, specialty-oriented… ▽ More

    Submitted 10 April, 2021; v1 submitted 3 February, 2018; originally announced February 2018.

  32. arXiv:1603.03141  [pdf

    q-bio.QM math.OC stat.CO

    Calibrar: an R package for fitting complex ecological models

    Authors: Ricardo Oliveros-Ramos, Yunne-Jai Shin

    Abstract: The fitting or parameter estimation of complex ecological models is a challenging optimisation task, with a notable lack of tools for fitting complex, long runtime or stochastic models. calibrar is an R package that is dedicated to the fitting of complex models to data. It is a generic tool that can be used for any type of model, especially those with non-differentiable objective functions and lon… ▽ More

    Submitted 27 April, 2024; v1 submitted 9 March, 2016; originally announced March 2016.

    Comments: 15 pages

  33. Oracle Estimation of a Change Point in High Dimensional Quantile Regression

    Authors: Sokbae Lee, Yuan Liao, Myung Hwan Seo, Youngki Shin

    Abstract: In this paper, we consider a high-dimensional quantile regression model where the sparsity structure may differ between two sub-populations. We develop $\ell_1$-penalized estimators of both regression coefficients and the threshold parameter. Our penalized estimators not only select covariates but also discriminate between a model with homogeneous sparsity and a model with a change point. As a res… ▽ More

    Submitted 16 December, 2016; v1 submitted 1 March, 2016; originally announced March 2016.

    Comments: 128 pages, 12 figures. A part of this paper was circulated under the title "Structural Change in Sparsity" arXiv:1411.3062

    Journal ref: JASA 113 (2018) 1184-1194

  34. arXiv:1509.06123  [pdf

    q-bio.QM q-bio.PE stat.ME

    A sequential approach to calibrate ecosystem models with multiple time series data

    Authors: Ricardo Oliveros-Ramos, Philippe Verley, Yunne-Jai Shin

    Abstract: Ecosystem approach to fisheries requires a thorough understanding of fishing impacts on ecosystem status and processes as well as predictive tools such as ecosystem models to provide useful information for management. The credibility of such models is essential when used as decision making tools, and model fitting to observed data is one major criterion to assess such credibility. However, more at… ▽ More

    Submitted 21 September, 2015; originally announced September 2015.

    Comments: 33 pages, 4 tables, 13 figures, 2 appendices

  35. arXiv:1411.3062  [pdf, ps, other

    stat.ME

    Structural Change in Sparsity

    Authors: Sokbae Lee, Yuan Liao, Myung Hwan Seo, Youngki Shin

    Abstract: In the high-dimensional sparse modeling literature, it has been crucially assumed that the sparsity structure of the model is homogeneous over the entire population. That is, the identities of important regressors are invariant across the population and across the individuals in the collected sample. In practice, however, the sparsity structure may not always be invariant in the population, due to… ▽ More

    Submitted 19 November, 2014; v1 submitted 11 November, 2014; originally announced November 2014.

    Comments: 65 pages