Skip to main content

Showing 1–18 of 18 results for author: Tsai, C

Searching in archive stat. Search in all archives.
.
  1. arXiv:2505.01557  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Contextures: Representations from Contexts

    Authors: Runtian Zhai, Kai Yang, Che-Ping Tsai, Burak Varici, Zico Kolter, Pradeep Ravikumar

    Abstract: Despite the empirical success of foundation models, we do not have a systematic characterization of the representations that these models learn. In this paper, we establish the contexture theory. It shows that a large class of representation learning methods can be characterized as learning from the association between the input and a context variable. Specifically, we show that many popular metho… ▽ More

    Submitted 2 May, 2025; originally announced May 2025.

    Comments: ICML 2025, longer version. arXiv admin note: substantial text overlap with arXiv:2504.19792

  2. arXiv:2503.11990  [pdf, ps, other

    stat.ME

    Testing Stochastic Block Models Based on Maximum Sampling Entry-Wise Deviations

    Authors: Yujia Wu, Wei Lan, Long Feng, Chih-Ling Tsai

    Abstract: The stochastic block model (SBM) has been widely used to analyze network data. Various goodness-of-fit tests have been proposed to assess the adequacy of model structures. To the best of our knowledge, however, none of the existing approaches are applicable for sparse networks in which the connection probability of any two communities is of order log n/n, and the number of communities is divergent… ▽ More

    Submitted 15 March, 2025; originally announced March 2025.

  3. arXiv:2409.05276  [pdf, ps, other

    stat.ME

    An Eigengap Ratio Test for Determining the Number of Communities in Network Data

    Authors: Yujia Wu, Jingfei Zhang, Wei Lan, Chih-Ling Tsai

    Abstract: To characterize the community structure in network data, researchers have introduced various block-type models, including the stochastic block model, degree-corrected stochastic block model, mixed membership block model, degree-corrected mixed membership block model, and others. A critical step in applying these models effectively is determining the number of communities in the network. However, t… ▽ More

    Submitted 8 September, 2024; originally announced September 2024.

  4. arXiv:2305.13946  [pdf, ps, other

    cs.LG math.OC stat.ML

    Data-Dependent Bounds for Online Portfolio Selection Without Lipschitzness and Smoothness

    Authors: Chung-En Tsai, Ying-Ting Lin, Yen-Huan Li

    Abstract: This work introduces the first small-loss and gradual-variation regret bounds for online portfolio selection, marking the first instances of data-dependent bounds for online convex optimization with non-Lipschitz, non-smooth losses. The algorithms we propose exhibit sublinear regret rates in the worst cases and achieve logarithmic regrets when the data is "easy," with per-iteration time almost lin… ▽ More

    Submitted 4 November, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: 37 pages, typos fixed, NeurIPS 2023

  5. arXiv:2211.12880  [pdf, other

    quant-ph cs.LG math.OC stat.ML

    Faster Stochastic First-Order Method for Maximum-Likelihood Quantum State Tomography

    Authors: Chung-En Tsai, Hao-Chung Cheng, Yen-Huan Li

    Abstract: In maximum-likelihood quantum state tomography, both the sample size and dimension grow exponentially with the number of qubits. It is therefore desirable to develop a stochastic first-order method, just like stochastic gradient descent for modern machine learning, to compute the maximum-likelihood estimate. To this end, we propose an algorithm called stochastic mirror descent with the Burg entrop… ▽ More

    Submitted 23 November, 2022; originally announced November 2022.

    Comments: 11 pages, 1 figure

  6. arXiv:2210.00997  [pdf, ps, other

    stat.ML cs.LG math.OC q-fin.PM quant-ph

    Online Self-Concordant and Relatively Smooth Minimization, With Applications to Online Portfolio Selection and Learning Quantum States

    Authors: Chung-En Tsai, Hao-Chung Cheng, Yen-Huan Li

    Abstract: Consider an online convex optimization problem where the loss functions are self-concordant barriers, smooth relative to a convex function $h$, and possibly non-Lipschitz. We analyze the regret of online mirror descent with $h$. Then, based on the result, we prove the following in a unified manner. Denote by $T$ the time horizon and $d$ the parameter dimension. 1. For online portfolio selection, t… ▽ More

    Submitted 21 September, 2023; v1 submitted 3 October, 2022; originally announced October 2022.

    Comments: 34th Int. Conf. Algorithmic Learning Theory (ALT 2023). A typo in the last equation in the proof of Lemma 10 is corrected

  7. Imputations for High Missing Rate Data in Covariates via Semi-supervised Learning Approach

    Authors: Wei Lan, Xuerong Chen, Tao Zou, Chih-Ling Tsai

    Abstract: Advancements in data collection techniques and the heterogeneity of data resources can yield high percentages of missing observations on variables, such as block-wise missing data. Under missing-data scenarios, traditional methods such as the simple average, $k$-nearest neighbor, multiple, and regression imputations may lead to results that are unstable or unable be computed. Motivated by the conc… ▽ More

    Submitted 15 May, 2022; originally announced May 2022.

    Comments: 1 figure

    Journal ref: Journal of Business & Economic Statistics, 2021

  8. Inward and Outward Network Influence Analysis

    Authors: Yujia Wu, Wei Lan, Tao Zou, Chih-Ling Tsai

    Abstract: Measuring heterogeneous influence across nodes in a network is critical in network analysis. This paper proposes an Inward and Outward Network Influence (IONI) model to assess nodal heterogeneity. Specifically, we allow for two types of influence parameters; one measures the magnitude of influence that each node exerts on others (outward influence), while we introduce a new parameter to quantify t… ▽ More

    Submitted 15 May, 2022; originally announced May 2022.

    Comments: 6 figures

    Journal ref: Journal of Business & Economic Statistics, 2021

  9. arXiv:2205.07294  [pdf, ps, other

    stat.ME

    Mutual Influence Regression Model

    Authors: Xinyan Fan, Wei Lan, Tao Zou, Chih-Ling Tsai

    Abstract: In this article, we propose the mutual influence regression model (MIR) to establish the relationship between the mutual influence matrix of actors and a set of similarity matrices induced by their associated attributes. This model is able to explain the heterogeneous structure of the mutual influence matrix by extending the commonly used spatial autoregressive model while allowing it to change wi… ▽ More

    Submitted 15 May, 2022; originally announced May 2022.

  10. arXiv:2205.07174  [pdf, ps, other

    stat.ME

    Covariance Model with General Linear Structure and Divergent Parameters

    Authors: Xinyan Fan, Wei Lan, Tao Zou, Chih-Ling Tsai

    Abstract: For estimating the large covariance matrix with a limited sample size, we propose the covariance model with general linear structure (CMGL) by employing the general link function to connect the covariance of the continuous response vector to a linear combination of weight matrices. Without assuming the distribution of responses, and allowing the number of parameters associated with weight matrices… ▽ More

    Submitted 14 May, 2022; originally announced May 2022.

  11. arXiv:2111.03223  [pdf, other

    stat.ME

    Quantile index regression

    Authors: Yingying Zhang, Yuefeng Si, Guodong Li, Chil-Ling Tsai

    Abstract: Estimating the structures at high or low quantiles has become an important subject and attracted increasing attention across numerous fields. However, due to data sparsity at tails, it usually is a challenging task to obtain reliable estimation, especially for high-dimensional data. This paper suggests a flexible parametric structure to tails, and this enables us to conduct the estimation at quant… ▽ More

    Submitted 4 November, 2021; originally announced November 2021.

  12. arXiv:2108.11483  [pdf, other

    cs.LG math.OC stat.ML

    Heavy-tailed Streaming Statistical Estimation

    Authors: Che-Ping Tsai, Adarsh Prasad, Sivaraman Balakrishnan, Pradeep Ravikumar

    Abstract: We consider the task of heavy-tailed statistical estimation given streaming $p$-dimensional samples. This could also be viewed as stochastic optimization under heavy-tailed distributions, with an additional $O(p)$ space complexity constraint. We design a clipped stochastic gradient descent algorithm and provide an improved analysis, under a more nuanced condition on the noise of the stochastic gra… ▽ More

    Submitted 25 February, 2022; v1 submitted 25 August, 2021; originally announced August 2021.

  13. arXiv:1909.03434  [pdf, other

    cs.LG cs.CL cs.SD eess.AS stat.ML

    Order-free Learning Alleviating Exposure Bias in Multi-label Classification

    Authors: Che-Ping Tsai, Hung-Yi Lee

    Abstract: Multi-label classification (MLC) assigns multiple labels to each sample. Prior studies show that MLC can be transformed to a sequence prediction problem with a recurrent neural network (RNN) decoder to model the label dependency. However, training a RNN decoder requires a predefined order of labels, which is not directly available in the MLC specification. Besides, RNN thus trained tends to overfi… ▽ More

    Submitted 8 September, 2019; originally announced September 2019.

  14. arXiv:1908.00966  [pdf, other

    cs.LG math.CO stat.ML

    Mixed-Integer Optimization Approach to Learning Association Rules for Unplanned ICU Transfer

    Authors: Chun-An Chou, Qingtao Cao, Shao-Jen Weng, Che-Hung Tsai

    Abstract: After admission to emergency department (ED), patients with critical illnesses are transferred to intensive care unit (ICU) due to unexpected clinical deterioration occurrence. Identifying such unplanned ICU transfers is urgently needed for medical physicians to achieve two-fold goals: improving critical care quality and preventing mortality. A priority task is to understand the crucial rationale… ▽ More

    Submitted 2 August, 2019; originally announced August 2019.

    Journal ref: Artificial Intelligence in Medicine, 2020

  15. arXiv:1811.04689  [pdf, other

    cs.LG stat.ML

    Adversarial Learning of Label Dependency: A Novel Framework for Multi-class Classification

    Authors: Che-Ping Tsai, Hung-Yi Lee

    Abstract: Recent work has shown that exploiting relations between labels improves the performance of multi-label classification. We propose a novel framework based on generative adversarial networks (GANs) to model label dependency. The discriminator learns to model label dependency by discriminating real and generated label sets. To fool the discriminator, the classifier, or generator, learns to generate l… ▽ More

    Submitted 12 November, 2018; originally announced November 2018.

  16. arXiv:1610.10087  [pdf, other

    cs.NE cs.LG stat.ML

    Tensor Switching Networks

    Authors: Chuan-Yung Tsai, Andrew Saxe, David Cox

    Abstract: We present a novel neural network algorithm, the Tensor Switching (TS) network, which generalizes the Rectified Linear Unit (ReLU) nonlinearity to tensor-valued hidden units. The TS network copies its entire input vector to different locations in an expanded representation, with the location determined by its hidden unit activity. In this way, even a simple linear readout from the TS representatio… ▽ More

    Submitted 31 October, 2016; originally announced October 2016.

  17. arXiv:1607.05169  [pdf, ps, other

    stat.ME

    Sparse Estimation of Generalized Linear Models (GLM) via Approximated Information Criteria

    Authors: Xiaogang Su, Juanjuan Fan, Richard A. Levine, Martha E. Nunn, Chih-Ling Tsai

    Abstract: We propose a new sparse estimation method, termed MIC (Minimum approximated Information Criterion), for generalized linear models (GLM) in fixed dimensions. What is essentially involved in MIC is the approximation of the $\ell_0$-norm with a continuous unit dent function. Besides, a reparameterization step is devised to enforce sparsity in parameter estimates while maintaining the smoothness of th… ▽ More

    Submitted 18 July, 2016; originally announced July 2016.

    Comments: 23 pages, 3 figures

    MSC Class: 62J02

    Journal ref: Statistica Sinica, 28: 1561-1581, 2018

  18. arXiv:1209.6487  [pdf, ps, other

    stat.ME

    Quantile correlations and quantile autoregressive modeling

    Authors: Guodong Li, Yang Li, Chih-Ling Tsai

    Abstract: In this paper, we propose two important measures, quantile correlation (QCOR) and quantile partial correlation (QPCOR). We then apply them to quantile autoregressive (QAR) models, and introduce two valuable quantities, the quantile autocorrelation function (QACF) and the quantile partial autocorrelation function (QPACF). This allows us to extend the classical Box-Jenkins approach to quantile autor… ▽ More

    Submitted 28 September, 2012; originally announced September 2012.