Showing 1–43 of 43 results for author: Chang, T

Searching in archive stat.
  1. arXiv:2412.16534  [pdf, other]

    cs.LG stat.ML

    DOFEN: Deep Oblivious Forest ENsemble

    Authors: Kuan-Yu Chen, Ping-Han Chiang, Hsin-Rung Chou, Chih-Sheng Chen, Tien-Hao Chang

    Abstract: Deep Neural Networks (DNNs) have revolutionized artificial intelligence, achieving impressive results on diverse data types, including images, videos, and texts. However, DNNs still lag behind Gradient Boosting Decision Trees (GBDT) on tabular data, a format extensively utilized across various domains. In this paper, we propose DOFEN, short for Deep Oblivious Forest …

    Submitted 24 December, 2024; v1 submitted 21 December, 2024; originally announced December 2024.

    Comments: NeurIPS 2024 (poster); (v2: modify and rearrange sections, propose multihead extension of DOFEN, include new results on tabular benchmark and other benchmarks)

  2. arXiv:2406.18865  [pdf, other]

    cs.LG stat.ML

    From Biased Selective Labels to Pseudo-Labels: An Expectation-Maximization Framework for Learning from Biased Decisions

    Authors: Trenton Chang, Jenna Wiens

    Abstract: Selective labels occur when label observations are subject to a decision-making process; e.g., diagnoses that depend on the administration of laboratory tests. We study a clinically-inspired selective label problem called disparate censorship, where labeling biases vary across subgroups and unlabeled individuals are imputed as "negative" (i.e., no diagnostic test = no illness). Machine learning mo…

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: 39 pages, 33 figures. ICML 2024 conference paper

  3. arXiv:2405.06763  [pdf, other]

    stat.ME

    Post-selection inference for causal effects after causal discovery

    Authors: Ting-Hsuan Chang, Zijian Guo, Daniel Malinsky

    Abstract: Algorithms for constraint-based causal discovery select graphical causal models among a space of possible candidates (e.g., all directed acyclic graphs) by executing a sequence of conditional independence tests. These may be used to inform the estimation of causal effects (e.g., average treatment effects) when there is uncertainty about which covariates ought to be adjusted for, or which variables…

    Submitted 26 July, 2024; v1 submitted 10 May, 2024; originally announced May 2024.

  4. Leveraging Interpolation Models and Error Bounds for Verifiable Scientific Machine Learning

    Authors: Tyler Chang, Andrew Gillette, Romit Maulik

    Abstract: Effective verification and validation techniques for modern scientific machine learning workflows are challenging to devise. Statistical methods are abundant and easily deployed, but often rely on speculative assumptions about the data and methods involved. Error bounds for classical interpolation techniques can provide mathematically rigorous estimates of accuracy, but often are difficult or impr…

    Submitted 7 February, 2025; v1 submitted 4 April, 2024; originally announced April 2024.

    Journal ref: Journal of Computational Physics, Vol. 524, March 2025, 113726

  5. arXiv:2402.09970  [pdf, other]

    cs.LG stat.ML

    Accelerating Parallel Sampling of Diffusion Models

    Authors: Zhiwei Tang, Jiasheng Tang, Hao Luo, Fan Wang, Tsung-Hui Chang

    Abstract: Diffusion models have emerged as state-of-the-art generative models for image generation. However, sampling from diffusion models is usually time-consuming due to the inherent autoregressive nature of their sampling process. In this work, we propose a novel approach that accelerates the sampling of diffusion models by parallelizing the autoregressive process. Specifically, we reformulate the sampl…

    Submitted 27 May, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

    Comments: ICML 2024

  6. arXiv:2402.09941  [pdf, other]

    cs.LG cs.AI stat.ML

    FedLion: Faster Adaptive Federated Optimization with Fewer Communication

    Authors: Zhiwei Tang, Tsung-Hui Chang

    Abstract: In Federated Learning (FL), a framework to train machine learning models across distributed data, well-known algorithms like FedAvg tend to have slow convergence rates, resulting in high communication costs during training. To address this challenge, we introduce FedLion, an adaptive federated optimization algorithm that seamlessly incorporates key elements from the recently proposed centralized a…

    Submitted 15 February, 2024; originally announced February 2024.

    Comments: ICASSP 2024

  7. arXiv:2312.02213  [pdf, other]

    cs.LG cs.AI cs.DB stat.AP

    JarviX: A LLM No code Platform for Tabular Data Analysis and Optimization

    Authors: Shang-Ching Liu, ShengKun Wang, Wenqi Lin, Chung-Wei Hsiung, Yi-Chen Hsieh, Yu-Ping Cheng, Sian-Hong Luo, Tsungyao Chang, Jianwei Zhang

    Abstract: In this study, we introduce JarviX, a sophisticated data analytics framework. JarviX is designed to employ Large Language Models (LLMs) to facilitate an automated guide and execute high-precision data analyses on tabular datasets. This framework emphasizes the significance of varying column types, capitalizing on state-of-the-art LLMs to generate concise data insight summaries, propose relevant an…

    Submitted 3 December, 2023; originally announced December 2023.

  8. arXiv:2305.12043  [pdf, other]

    stat.ME math.OC stat.AP

    SF-SFD: Stochastic Optimization of Fourier Coefficients to Generate Space-Filling Designs

    Authors: Manisha Garg, Tyler Chang, Krishnan Raghavan

    Abstract: Due to the curse of dimensionality, it is often prohibitively expensive to generate deterministic space-filling designs. On the other hand, when using naïve uniform random sampling to generate designs cheaply, design points tend to concentrate in a small region of the design space. Although it is preferable in these cases to utilize quasi-random techniques such as Sobol sequences and Latin hype…

    Submitted 19 December, 2023; v1 submitted 19 May, 2023; originally announced May 2023.

  9. arXiv:2304.09981  [pdf, other]

    stat.ME cs.LG q-bio.QM

    Interpretable (not just posthoc-explainable) heterogeneous survivor bias-corrected treatment effects for assignment of postdischarge interventions to prevent readmissions

    Authors: Hongjing Xia, Joshua C. Chang, Sarah Nowak, Sonya Mahajan, Rohit Mahajan, Ted L. Chang, Carson C. Chow

    Abstract: We used survival analysis to quantify the impact of postdischarge evaluation and management (E/M) services in preventing hospital readmission or death. Our approach avoids a specific pitfall of applying machine learning to this problem, which is an inflated estimate of the effect of interventions, due to survivor bias, where the magnitude of inflation may be conditional on heterogeneous confoun…

    Submitted 3 August, 2023; v1 submitted 19 April, 2023; originally announced April 2023.

    Comments: Submitted

    Journal ref: PMLR 219:884-905, 2023

  10. arXiv:2302.13428  [pdf, ps, other]

    stat.ME

    Methods for Integrating Trials and Non-Experimental Data to Examine Treatment Effect Heterogeneity

    Authors: Carly Lupton Brantner, Ting-Hsuan Chang, Trang Quynh Nguyen, Hwanhee Hong, Leon Di Stefano, Elizabeth A. Stuart

    Abstract: Estimating treatment effects conditional on observed covariates can improve the ability to tailor treatments to particular individuals. Doing so effectively requires dealing with potential confounding, and also enough data to adequately estimate effect moderation. A recent influx of work has looked into estimating treatment effect heterogeneity using data from multiple randomized controlled trials…

    Submitted 28 March, 2023; v1 submitted 26 February, 2023; originally announced February 2023.

  11. arXiv:2302.02589  [pdf, other]

    cs.LG stat.ML

    $z$-SignFedAvg: A Unified Stochastic Sign-based Compression for Federated Learning

    Authors: Zhiwei Tang, Yanmeng Wang, Tsung-Hui Chang

    Abstract: Federated Learning (FL) is a promising privacy-preserving distributed learning paradigm but suffers from high communication cost when training large-scale machine learning models. Sign-based methods, such as SignSGD (Bernstein et al., 2018), have been proposed as a biased gradient compression technique for reducing the communication cost. However, sign-based algorithms could diverge under heter…

    Submitted 6 February, 2023; originally announced February 2023.

  12. arXiv:2209.00076  [pdf, other]

    cs.CY cs.SI stat.AP

    Connecticut Redistricting Analysis

    Authors: Kyle Evans, Katherine T. Chang

    Abstract: Connecticut passed their new state House of Representatives district plan on November 18, 2021 and passed their new state Senate district plan on November 23, 2021. Each passed unanimously in their 9-person bipartisan Reapportionment Commission; however, the process has been criticized for legislators controlling the process and for the negotiations that serve to protect incumbents. This paper inv…

    Submitted 31 August, 2022; originally announced September 2022.

    Comments: 13 pages, 3 tables

  13. arXiv:2208.12814  [pdf, other]

    cs.CY cs.AI cs.LG stat.AP

    Interpretable (not just posthoc-explainable) medical claims modeling for discharge placement to prevent avoidable all-cause readmissions or death

    Authors: Joshua C. Chang, Ted L. Chang, Carson C. Chow, Rohit Mahajan, Sonya Mahajan, Joe Maisog, Shashaank Vattikuti, Hongjing Xia

    Abstract: We developed an inherently interpretable multilevel Bayesian framework for representing variation in regression coefficients that mimics the piecewise linearity of ReLU-activated deep neural networks. We used the framework to formulate a survival model for using medical claims to predict hospital readmission and death that focuses on discharge placement, adjusting for confounding in estimating cau…

    Submitted 29 January, 2023; v1 submitted 28 August, 2022; originally announced August 2022.

    Comments: In review

  14. arXiv:2207.01062  [pdf, other]

    cs.LG eess.SY math.OC stat.ML

    Distributed Online System Identification for LTI Systems Using Reverse Experience Replay

    Authors: Ting-Jui Chang, Shahin Shahrampour

    Abstract: Identification of linear time-invariant (LTI) systems plays an important role in control and reinforcement learning. Both asymptotic and finite-time offline system identification are well-studied in the literature. For online system identification, the idea of stochastic-gradient descent with reverse experience replay (SGD-RER) was recently proposed, where the data sequence is stored in several bu…

    Submitted 15 September, 2022; v1 submitted 3 July, 2022; originally announced July 2022.

  15. arXiv:2206.04615  [pdf, other]

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza, et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur…

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  16. arXiv:2205.14283  [pdf, other]

    stat.ML cs.LG eess.IV eess.SP

    Rethinking Bayesian Learning for Data Analysis: The Art of Prior and Inference in Sparsity-Aware Modeling

    Authors: Lei Cheng, Feng Yin, Sergios Theodoridis, Sotirios Chatzis, Tsung-Hui Chang

    Abstract: Sparse modeling for signal processing and machine learning has been a focus of scientific research for over two decades. Among others, supervised sparsity-aware learning comprises two major paths paved by: a) discriminative methods and b) generative methods. The latter, more widely known as Bayesian methods, enable uncertainty evaluation w.r.t. the performed predictions. Furthermore, they can…

    Submitted 27 May, 2022; originally announced May 2022.

    Comments: 64 pages, 16 figures, 6 tables, 98 references, submitted to IEEE Signal Processing Magazine

  17. A Continual Learning Framework for Adaptive Defect Classification and Inspection

    Authors: Wenbo Sun, Raed Al Kontar, Judy Jin, Tzyy-Shuh Chang

    Abstract: Machine-vision-based defect classification techniques have been widely adopted for automatic quality inspection in manufacturing processes. This article describes a general framework for classifying defects from high volume data batches with efficient inspection of unlabelled samples. The concept is to construct a detector to identify new defect types, send them to the inspection station for label…

    Submitted 16 March, 2022; originally announced March 2022.

    Comments: Journal of Quality Technology (2022)

  18. arXiv:2201.09766  [pdf, other]

    stat.AP

    Design Strategies and Approximation Methods for High-Performance Computing Variability Management

    Authors: Yueyao Wang, Li Xu, Yili Hong, Rong Pan, Tyler Chang, Thomas Lux, Jon Bernard, Layne Watson, Kirk Cameron

    Abstract: Performance variability management is an active research area in high-performance computing (HPC). We focus on input/output (I/O) variability. To study the performance variability, computer scientists often use grid-based designs (GBDs) to collect I/O variability data, and use mathematical approximation methods to build a prediction model. Mathematical approximation models could be biased particul…

    Submitted 24 January, 2022; originally announced January 2022.

    Comments: 29 pages, 6 figures

  19. arXiv:2110.07959  [pdf, other]

    cs.LG cs.IR stat.ML

    Low-rank Matrix Recovery With Unknown Correspondence

    Authors: Zhiwei Tang, Tsung-Hui Chang, Xiaojing Ye, Hongyuan Zha

    Abstract: We study a matrix recovery problem with unknown correspondence: given the observation matrix $M_o=[A,\tilde P B]$, where $\tilde P$ is an unknown permutation matrix, we aim to recover the underlying matrix $M=[A,B]$. Such a problem commonly arises in many applications where heterogeneous data are utilized and the correspondence among them is unknown, e.g., due to privacy concerns. We show that it i…

    Submitted 17 October, 2021; v1 submitted 15 October, 2021; originally announced October 2021.

  20. arXiv:2107.07014  [pdf]

    cs.LG stat.ML

    Hybrid Bayesian Neural Networks with Functional Probabilistic Layers

    Authors: Daniel T. Chang

    Abstract: Bayesian neural networks provide a direct and natural way to extend standard deep neural networks to support probabilistic deep learning through the use of probabilistic layers that, traditionally, encode weight (and bias) uncertainty. In particular, hybrid Bayesian neural networks utilize standard deterministic layers together with a few probabilistic layers judiciously positioned in the networks fo…

    Submitted 14 July, 2021; originally announced July 2021.

  21. arXiv:2106.00120  [pdf]

    cs.LG stat.ML

    Probabilistic Deep Learning with Probabilistic Neural Networks and Deep Probabilistic Models

    Authors: Daniel T. Chang

    Abstract: Probabilistic deep learning is deep learning that accounts for uncertainty, both model uncertainty and data uncertainty. It is based on the use of probabilistic models and deep neural networks. We distinguish two approaches to probabilistic deep learning: probabilistic neural networks and deep probabilistic models. The former employs deep neural networks that utilize probabilistic layers which can…

    Submitted 9 June, 2021; v1 submitted 31 May, 2021; originally announced June 2021.

    Comments: arXiv admin note: text overlap with arXiv:1811.06622

  22. arXiv:2102.13276  [pdf, other]

    stat.ML cs.LG q-bio.PE

    Spectral Top-Down Recovery of Latent Tree Models

    Authors: Yariv Aizenbud, Ariel Jaffe, Meng Wang, Amber Hu, Noah Amsel, Boaz Nadler, Joseph T. Chang, Yuval Kluger

    Abstract: Modeling the distribution of high dimensional data by a latent tree graphical model is a prevalent approach in multiple scientific domains. A common task is to infer the underlying tree structure, given only observations of its terminal nodes. Many algorithms for tree recovery are computationally intensive, which limits their applicability to trees of moderate size. For large trees, a common appro…

    Submitted 7 December, 2021; v1 submitted 25 February, 2021; originally announced February 2021.

  23. arXiv:2012.11678  [pdf]

    stat.AP

    Global Trends and Predictors of Face Mask Usage During the COVID-19 Pandemic

    Authors: Elena Badillo-Goicoechea, Ting-Hsuan Chang, Esther Kim, Sarah LaRocca, Katherine Morris, Xiaoyi Deng, Samantha Chiu, Adrianne Bradford, Andres Garcia, Christoph Kern, Curtiss Cobb, Frauke Kreuter, Elizabeth A. Stuart

    Abstract: Background: Guidelines and recommendations from public health authorities related to face masks have been essential in containing the COVID-19 pandemic. We assessed the prevalence and correlates of mask usage during the pandemic. Methods: We examined a total of 13,723,810 responses to a daily cross-sectional representative online survey in 38 countries, completed from April 23, 2020 to Octobe…

    Submitted 8 January, 2021; v1 submitted 21 December, 2020; originally announced December 2020.

    Comments: 39 pages, 2 main figures, Appendix

  24. arXiv:2012.07915  [pdf, other]

    cs.DC stat.AP

    Prediction of High-Performance Computing Input/Output Variability and Its Application to Optimization for System Configurations

    Authors: Li Xu, Thomas Lux, Tyler Chang, Bo Li, Yili Hong, Layne Watson, Ali Butt, Danfeng Yao, Kirk Cameron

    Abstract: Performance variability is an important measure for a reliable high performance computing (HPC) system. Performance variability is affected by complicated interactions between numerous factors, such as CPU frequency, the number of input/output (IO) threads, and the IO scheduler. In this paper, we focus on HPC IO variability. The prediction of HPC variability is a challenging problem in the enginee…

    Submitted 14 December, 2020; originally announced December 2020.

    Comments: 29 pages, 8 figures

    Journal ref: Quality Engineering, 2021

  25. arXiv:2012.04171  [pdf, other]

    cs.LG q-bio.QM stat.ML

    Sparse encoding for more-interpretable feature-selecting representations in probabilistic matrix factorization

    Authors: Joshua C. Chang, Patrick Fletcher, Jungmin Han, Ted L. Chang, Shashaank Vattikuti, Bart Desmet, Ayah Zirikly, Carson C. Chow

    Abstract: Dimensionality reduction methods for count data are critical to a wide range of applications in medical informatics and other fields where model interpretability is paramount. For such data, hierarchical Poisson matrix factorization (HPF) and other sparse probabilistic non-negative matrix factorization (NMF) methods are considered to be interpretable generative models. They consist of sparse trans…

    Submitted 29 December, 2020; v1 submitted 7 December, 2020; originally announced December 2020.

    Comments: Fixed typo in Eq 2

    Report number: ICLR 2021

  26. arXiv:2006.03912  [pdf, other]

    cs.LG math.OC stat.ML

    Unconstrained Online Optimization: Dynamic Regret Analysis of Strongly Convex and Smooth Problems

    Authors: Ting-Jui Chang, Shahin Shahrampour

    Abstract: The regret bound of dynamic online learning algorithms is often expressed in terms of the variation in the function sequence ($V_T$) and/or the path-length of the minimizer sequence after $T$ rounds. For strongly convex and smooth functions, Zhang et al. establish the squared path-length of the minimizer sequence ($C^*_{2,T}$) as a lower bound on regret. They also show that online gradient desce…

    Submitted 14 August, 2020; v1 submitted 6 June, 2020; originally announced June 2020.

  27. arXiv:2002.12547  [pdf, ps, other]

    stat.ML cs.LG

    Spectral neighbor joining for reconstruction of latent tree models

    Authors: Ariel Jaffe, Noah Amsel, Yariv Aizenbud, Boaz Nadler, Joseph T. Chang, Yuval Kluger

    Abstract: A common assumption in multiple scientific applications is that the distribution of observed data can be modeled by a latent tree graphical model. An important example is phylogenetics, where the tree models the evolutionary lineages of a set of observed organisms. Given a set of independent realizations of the random variables at the leaves of the tree, a key challenge is to infer the underlying…

    Submitted 22 September, 2020; v1 submitted 28 February, 2020; originally announced February 2020.

  28. arXiv:2002.04930  [pdf, other]

    cs.LG cs.DC math.OC stat.ML

    Federated Matrix Factorization: Algorithm Design and Application to Data Clustering

    Authors: Shuai Wang, Tsung-Hui Chang

    Abstract: Recent demands on data privacy have called for federated learning (FL) as a new distributed learning paradigm in massive and heterogeneous networks. Although many FL algorithms have been proposed, few of them have considered the matrix factorization (MF) model, which is known to have a vast number of signal processing and machine learning applications. Different from the existing FL algorithms tha…

    Submitted 30 October, 2020; v1 submitted 12 February, 2020; originally announced February 2020.

  29. arXiv:2002.04753  [pdf, other]

    cs.LG stat.ML

    RFN: A Random-Feature Based Newton Method for Empirical Risk Minimization in Reproducing Kernel Hilbert Spaces

    Authors: Ting-Jui Chang, Shahin Shahrampour

    Abstract: In supervised learning using kernel methods, we often encounter a large-scale finite-sum minimization over a reproducing kernel Hilbert space (RKHS). Large-scale finite-sum problems can be solved using efficient variants of Newton's method, where the Hessian is approximated via sub-samples of data. In RKHS, however, the dependence of the penalty function on the kernel makes standard sub-sampling approac…

    Submitted 6 June, 2022; v1 submitted 11 February, 2020; originally announced February 2020.

  30. arXiv:2002.04235  [pdf, other]

    cs.LG stat.ML

    Learning Structured Communication for Multi-agent Reinforcement Learning

    Authors: Junjie Sheng, Xiangfeng Wang, Bo Jin, Junchi Yan, Wenhao Li, Tsung-Hui Chang, Jun Wang, Hongyuan Zha

    Abstract: This work explores the large-scale multi-agent communication mechanism under a multi-agent reinforcement learning (MARL) setting. We summarize the general categories of topology for communication structures in MARL literature, which are often manually specified. Then we propose a novel framework termed Learning Structured Communication (LSC) by using a more flexible and efficient communication…

    Submitted 11 February, 2020; originally announced February 2020.

  31. arXiv:2001.04786  [pdf, other]

    cs.LG math.OC stat.ML

    Distributed Learning in the Non-Convex World: From Batch to Streaming Data, and Beyond

    Authors: Tsung-Hui Chang, Mingyi Hong, Hoi-To Wai, Xinwei Zhang, Songtao Lu

    Abstract: Distributed learning has become a critical enabler of the massively connected world envisioned by many. This article discusses four key elements of scalable distributed processing and real-time intelligence: problems, data, communication and computation. Our aim is to provide a fresh and unique perspective about how these elements should work together in an effective and coherent manner. In par…

    Submitted 14 January, 2020; originally announced January 2020.

    Comments: Submitted to IEEE Signal Processing Magazine Special Issue on Distributed, Streaming Machine Learning; THC, MH, HTW contributed equally

  32. arXiv:1912.05686  [pdf]

    cs.LG stat.ML

    Bayesian Hyperparameter Optimization with BoTorch, GPyTorch and Ax

    Authors: Daniel T. Chang

    Abstract: Deep learning models are full of hyperparameters, which are set manually before the learning process can start. To find the best configuration for these hyperparameters in such a high dimensional space, with time-consuming and expensive model training / validation, is not a trivial challenge. Bayesian optimization is a powerful tool for the joint optimization of hyperparameters, efficiently tradin…

    Submitted 2 July, 2021; v1 submitted 11 December, 2019; originally announced December 2019.

  33. arXiv:1908.09258  [pdf, other]

    cs.LG stat.ML

    RandNet: deep learning with compressed measurements of images

    Authors: Thomas Chang, Bahareh Tolooshams, Demba Ba

    Abstract: Principal component analysis, dictionary learning, and auto-encoders are all unsupervised methods for learning representations from a large amount of training data. In all these methods, the higher the dimensions of the input data, the longer it takes to learn. We introduce a class of neural networks, termed RandNet, for learning representations using compressed random measurements of data of inte…

    Submitted 25 August, 2019; originally announced August 2019.

    Comments: The first two authors contributed equally to this work

  34. arXiv:1908.08612  [pdf]

    cs.LG stat.ML

    Tiered Graph Autoencoders with PyTorch Geometric for Molecular Graphs

    Authors: Daniel T. Chang

    Abstract: Tiered latent representations and latent spaces for molecular graphs provide a simple but effective way to explicitly represent and utilize groups (e.g., functional groups), which consist of the atom (node) tier, the group tier and the molecule (graph) tier. They can be learned using the tiered graph autoencoder architecture. In this paper we discuss adapting tiered graph autoencoders for use with…

    Submitted 22 August, 2019; originally announced August 2019.

  35. arXiv:1906.01811  [pdf, other]

    cs.AI stat.AP

    The Stanford Acuity Test: A Precise Vision Test Using Bayesian Techniques and a Discovery in Human Visual Response

    Authors: Chris Piech, Ali Malik, Laura M Scott, Robert T Chang, Charles Lin

    Abstract: Chart-based visual acuity measurements are used by billions of people to diagnose and guide treatment of vision impairment. However, the ubiquitous eye exam has no mechanism for reasoning about uncertainty and as such, suffers from a well-documented reproducibility problem. In this paper we make two core contributions. First, we uncover a new parametric probabilistic model of visual acuity respons…

    Submitted 21 November, 2019; v1 submitted 4 June, 2019; originally announced June 2019.

    Comments: Proceedings of the 34th AAAI Conference on Artificial Intelligence, New York, USA. 2020

  36. arXiv:1906.00570  [pdf, other]

    cs.LG stat.ML

    Clustering by Orthogonal NMF Model and Non-Convex Penalty Optimization

    Authors: Shuai Wang, Tsung-Hui Chang, Ying Cui, Jong-Shi Pang

    Abstract: The non-negative matrix factorization (NMF) model with an additional orthogonality constraint on one of the factor matrices, called the orthogonal NMF (ONMF), has been found to be a promising clustering model and can outperform the classical K-means. However, solving the ONMF model is a challenging optimization problem because the coupling of the orthogonality and non-negativity constraints introduces a…

    Submitted 28 July, 2021; v1 submitted 3 June, 2019; originally announced June 2019.

  37. arXiv:1812.11856  [pdf]

    cs.LG stat.ML

    Latent Variable Modeling for Generative Concept Representations and Deep Generative Models

    Authors: Daniel T. Chang

    Abstract: Latent representations are the essence of deep generative models and determine their usefulness and power. For latent representations to be useful as generative concept representations, their latent space must support latent space interpolation, attribute vectors and concept vectors, among other things. We investigate and discuss latent variable modeling, including latent variable models, latent r…

    Submitted 26 December, 2018; originally announced December 2018.

    Comments: arXiv admin note: text overlap with arXiv:1706.00400 by other authors

  38. arXiv:1811.06622  [pdf]

    cs.LG cs.AI stat.ML

    Concept-Oriented Deep Learning: Generative Concept Representations

    Authors: Daniel T. Chang

    Abstract: Generative concept representations have three major advantages over discriminative ones: they can represent uncertainty, they support integration of learning and reasoning, and they are good for unsupervised and semi-supervised learning. We discuss probabilistic and generative deep learning, which generative concept representations are based on, and the use of variational autoencoders and generati…

    Submitted 15 November, 2018; originally announced November 2018.

  39. arXiv:1810.03739  [pdf, other]

    cs.LG stat.ML

    Efficient Two-Step Adversarial Defense for Deep Neural Networks

    Authors: Ting-Jui Chang, Yukun He, Peng Li

    Abstract: In recent years, deep neural networks have demonstrated outstanding performance in many machine learning tasks. However, researchers have discovered that these state-of-the-art models are vulnerable to adversarial examples: legitimate examples altered by small perturbations which are unnoticeable to human eyes. Adversarial training, which augments the training data with adversarial examples during t…

    Submitted 8 October, 2018; originally announced October 2018.

    Comments: 12 pages

  40. arXiv:0904.2229  [pdf]

    stat.ME

    Uncovering shared common genetic risk factors for various aspects of complex disorders captured in multiple traits

    Authors: Summer S. Han, Elena L. Grigorenko, Joseph T. Chang

    Abstract: Identifying shared genetic risk factors for multiple measured traits has been of great interest in studying complex disorders. Marlow's (2003) method for detecting shared gene effects on complex traits has been highly influential in the literature of neurodevelopmental disorders as well as other disorders including obesity and asthma. Although this method has been widely applied and has been reco…

    Submitted 14 April, 2009; originally announced April 2009.

  41. arXiv:0808.2000  [pdf]

    stat.ME

    Reconsidering the asymptotic null distribution of likelihood ratio tests for genetic linkage in multivariate variance components models

    Authors: Summer S. Han, Joseph T. Chang

    Abstract: Accurate knowledge of the null distribution of hypothesis tests is important for valid application of the tests. In previous papers and software, the asymptotic null distribution of likelihood ratio tests for detecting genetic linkage in multivariate variance components models has been stated to be a mixture of chi-square distributions with binomial mixing probabilities. Here we show, by simulat…

    Submitted 13 September, 2008; v1 submitted 14 August, 2008; originally announced August 2008.

    Comments: added a proposed method section, 23 pages with 6 figures, presented in Joint Statistical Meetings in 2008

  42. arXiv:0710.5896  [pdf]

    stat.ML math.ST

    Supervised Machine Learning with a Novel Pointwise Density Estimator

    Authors: Yen-Jen Oyang, Chien-Yu Chen, Darby Tien-Hao Chang, Chih-Peng Wu

    Abstract: This article proposes a novel density estimation based algorithm for carrying out supervised machine learning. The proposed algorithm features O(n) time complexity for generating a classifier, where n is the number of sampling instances in the training dataset. This feature is highly desirable in contemporary applications that involve large and still growing databases. In comparison with the ker…

    Submitted 6 November, 2007; v1 submitted 31 October, 2007; originally announced October 2007.

    Comments: Inclusion of a new "Remarks" section

  43. arXiv:0709.2760  [pdf]

    stat.ML

    Supervised Machine Learning with a Novel Kernel Density Estimator

    Authors: Yen-Jen Oyang, Darby Tien-Hao Chang, Yu-Yen Ou, Hao-Geng Hung, Chih-Peng Wu, Chien-Yu Chen

    Abstract: In recent years, kernel density estimation has been exploited by computer scientists to model machine learning problems. The kernel density estimation based approaches are of interest due to the low time complexity of either O(n) or O(n*log(n)) for constructing a classifier, where n is the number of sampling instances. Concerning design of kernel density estimators, one essential issue is how fa…

    Submitted 16 October, 2007; v1 submitted 18 September, 2007; originally announced September 2007.

    Comments: The new version includes an additional theorem, Theorem 3