Skip to main content

Showing 1–50 of 50 results for author: Chan, H

Searching in archive stat. Search in all archives.
.
  1. arXiv:2505.18280  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Feature Preserving Shrinkage on Bayesian Neural Networks via the R2D2 Prior

    Authors: Tsai Hor Chan, Dora Yan Zhang, Guosheng Yin, Lequan Yu

    Abstract: Bayesian neural networks (BNNs) treat neural network weights as random variables, which aim to provide posterior uncertainty estimates and avoid overfitting by performing inference on the posterior weights. However, the selection of appropriate prior distributions remains a challenging task, and BNNs may suffer from catastrophic inflated variance or poor predictive performance when poor choices ar… ▽ More

    Submitted 23 May, 2025; originally announced May 2025.

    Comments: To appear in TPAMI

  2. arXiv:2410.11124  [pdf, other

    cs.CV cs.LG stat.AP

    Real-Time Localization and Bimodal Point Pattern Analysis of Palms Using UAV Imagery

    Authors: Kangning Cui, Wei Tang, Rongkun Zhu, Manqi Wang, Gregory D. Larsen, Victor P. Pauca, Sarra Alqahtani, Fan Yang, David Segurado, Paul Fine, Jordan Karubian, Raymond H. Chan, Robert J. Plemmons, Jean-Michel Morel, Miles R. Silman

    Abstract: Understanding the spatial distribution of palms within tropical forests is essential for effective ecological monitoring, conservation strategies, and the sustainable integration of natural forest products into local and global supply chains. However, the analysis of remotely sensed data in these environments faces significant challenges, such as overlapping palm and tree crowns, uneven shading ac… ▽ More

    Submitted 14 October, 2024; originally announced October 2024.

    Comments: 25 pages, 8 figures, 5 tables

  3. arXiv:2409.08965  [pdf, other

    stat.ME stat.AP stat.CO

    Dynamic Bayesian Networks with Conditional Dynamics in Edge Addition and Deletion

    Authors: Lupe S. H. Chan, Amanda M. Y. Chu, Mike K. P. So

    Abstract: This study presents a dynamic Bayesian network framework that facilitates intuitive gradual edge changes. We use two conditional dynamics to model the edge addition and deletion, and edge selection separately. Unlike previous research that uses a mixture network approach, which restricts the number of possible edge changes, or structural priors to induce gradual changes, which can lead to unclear… ▽ More

    Submitted 7 May, 2025; v1 submitted 13 September, 2024; originally announced September 2024.

    MSC Class: 62F15 ACM Class: G.3

  4. arXiv:2406.15582  [pdf, other

    stat.ME stat.AP stat.CO

    Graphical copula GARCH modeling with dynamic conditional dependence

    Authors: Lupe Shun Hin Chan, Amanda Man Ying Chu, Mike Ka Pui So

    Abstract: Modeling returns on large portfolios is a challenging problem as the number of parameters in the covariance matrix grows as the square of the size of the portfolio. Traditional correlation models, for example, the dynamic conditional correlation (DCC)-GARCH model, often ignore the nonlinear dependencies in the tail of the return distribution. In this paper, we aim to develop a framework to model t… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

    MSC Class: 62F15 ACM Class: G.3

  5. arXiv:2403.04498  [pdf, other

    physics.ins-det stat.AP

    PCH-EM: A solution to information loss in the photon transfer method

    Authors: Aaron J. Hendrickson, David P. Haefner, Stanley H. Chan, Nicholas R. Shade, Eric R. Fossum

    Abstract: Working from a Poisson-Gaussian noise model, a multi-sample extension of the Photon Counting Histogram Expectation Maximization (PCH-EM) algorithm is derived as a general-purpose alternative to the Photon Transfer (PT) method. This algorithm is derived from the same model, requires the same experimental data, and estimates the same sensor performance parameters as the time-tested PT method, all wh… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: 14 pages, 8 figures

  6. arXiv:2401.05428  [pdf, other

    physics.plasm-ph physics.comp-ph stat.AP

    Effects of multi-dimensionality and energy exchange on electrostatic current-driven plasma instabilities and turbulence

    Authors: Wai Hong Ronald Chan, Kentaro Hara, Iain D. Boyd

    Abstract: Large-amplitude current-driven plasma instabilities, which can transition to the Buneman instability, were observed in one-dimensional (1D) simulations to generate high-energy backstreaming ions. We investigate the saturation of multi-dimensional plasma instabilities and its effects on energetic ion formation. Such ions directly impact spacecraft thruster lifetimes and are associated with magnetic… ▽ More

    Submitted 5 January, 2024; originally announced January 2024.

  7. arXiv:2312.15447  [pdf, other

    cs.CV cs.LG stat.AP

    Superpixel-based and Spatially-regularized Diffusion Learning for Unsupervised Hyperspectral Image Clustering

    Authors: Kangning Cui, Ruoning Li, Sam L. Polk, Yinyi Lin, Hongsheng Zhang, James M. Murphy, Robert J. Plemmons, Raymond H. Chan

    Abstract: Hyperspectral images (HSIs) provide exceptional spatial and spectral resolution of a scene, crucial for various remote sensing applications. However, the high dimensionality, presence of noise and outliers, and the need for precise labels of HSIs present significant challenges to HSIs analysis, motivating the development of performant HSI clustering algorithms. This paper introduces a novel unsupe… ▽ More

    Submitted 24 December, 2023; originally announced December 2023.

    Comments: 27 pages, 9 figures, and 2 tables

  8. arXiv:2311.03763  [pdf, other

    math.ST stat.ME

    Thresholding the higher criticism test statistics for optimality in a heterogeneous setting

    Authors: Hock Peng Chan

    Abstract: Donoho and Kipnis (2022) showed that the the higher criticism (HC) test statistic has a non-Gaussian phase transition but remarked that it is probably not optimal, in the detection of sparse differences between two large frequency tables when the counts are low. The setting can be considered to be heterogeneous, with cells containing larger total counts more able to detect smaller differences. We… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

  9. arXiv:2308.12562  [pdf, other

    cs.LG stat.ML

    Variational Information Pursuit with Large Language and Multimodal Models for Interpretable Predictions

    Authors: Kwan Ho Ryan Chan, Aditya Chattopadhyay, Benjamin David Haeffele, Rene Vidal

    Abstract: Variational Information Pursuit (V-IP) is a framework for making interpretable predictions by design by sequentially selecting a short chain of task-relevant, user-defined and interpretable queries about the data that are most informative for the task. While this allows for built-in interpretability in predictive models, applying V-IP to any task requires data samples with dense concept-labeling b… ▽ More

    Submitted 24 August, 2023; originally announced August 2023.

  10. arXiv:2302.02876  [pdf, other

    cs.LG cs.AI stat.ML

    Variational Information Pursuit for Interpretable Predictions

    Authors: Aditya Chattopadhyay, Kwan Ho Ryan Chan, Benjamin D. Haeffele, Donald Geman, René Vidal

    Abstract: There is a growing interest in the machine learning community in developing predictive algorithms that are "interpretable by design". Towards this end, recent work proposes to make interpretable decisions by sequentially asking interpretable queries about data until a prediction can be made with high confidence based on the answers obtained (the history). To promote short query-answer chains, a gr… ▽ More

    Submitted 15 February, 2023; v1 submitted 6 February, 2023; originally announced February 2023.

    Comments: Code is available at https://github.com/ryanchankh/VariationalInformationPursuit

    Report number: https://openreview.net/forum?id=77lSWa-Tm3Z

  11. arXiv:2210.11056  [pdf, other

    physics.plasm-ph physics.comp-ph stat.AP

    Spectral analysis of multidimensional current-driven plasma instabilities and turbulence in hollow cathode plumes

    Authors: Wai Hong Ronald Chan, Ken Hara, Jonathan M. Wang, Suhas S. Jain, Shahab Mirjalili, Iain D. Boyd

    Abstract: Large-amplitude current-driven instabilities in hollow cathode plumes can generate energetic ions responsible for cathode sputtering and spacecraft degradation. A 2D2V (two dimensions each in configuration [D] and velocity [V] spaces) grid-based Vlasov--Poisson (direct kinetic) solver is used to study their growth and saturation, which comprises four stages: linear growth, quasilinear resonance, n… ▽ More

    Submitted 20 November, 2022; v1 submitted 20 October, 2022; originally announced October 2022.

    Comments: Center for Turbulence Research Proceedings of the Summer Program 2022

  12. arXiv:2210.05127  [pdf

    physics.flu-dyn physics.ao-ph physics.comp-ph stat.AP

    The Dynamics of Drop Breakup in Breaking Waves

    Authors: Wai Hong Ronald Chan

    Abstract: Breaking surface waves generate drops of a broad range of sizes that have a significant influence on regional and global climates, as well as the identification of ship movements. Characterizing these phenomena requires a fundamental understanding of the underlying mechanisms behind drop production. The interscale nature of these mechanisms also influences the development of models that enable cos… ▽ More

    Submitted 10 October, 2022; originally announced October 2022.

    Journal ref: Proceedings of the 34th Symposium on Naval Hydrodynamics, Washington, D.C., 2022

  13. arXiv:2206.09365  [pdf, other

    cs.CV stat.AP

    Semi-supervised Change Detection of Small Water Bodies Using RGB and Multispectral Images in Peruvian Rainforests

    Authors: Kangning Cui, Seda Camalan, Ruoning Li, Victor P. Pauca, Sarra Alqahtani, Robert J. Plemmons, Miles Silman, Evan N. Dethier, David Lutz, Raymond H. Chan

    Abstract: Artisanal and Small-scale Gold Mining (ASGM) is an important source of income for many households, but it can have large social and environmental effects, especially in rainforests of developing countries. The Sentinel-2 satellites collect multispectral images that can be used for the purpose of detecting changes in water extent and quality which indicates the locations of mining sites. This work… ▽ More

    Submitted 19 June, 2022; originally announced June 2022.

    Comments: 8 pages, 5 figures. Accepted to Proceedings of IEEE WHISPERS 2022

  14. arXiv:2204.13497  [pdf, ps, other

    cs.CV cs.LG stat.AP

    Unsupervised Spatial-spectral Hyperspectral Image Reconstruction and Clustering with Diffusion Geometry

    Authors: Kangning Cui, Ruoning Li, Sam L. Polk, James M. Murphy, Robert J. Plemmons, Raymond H. Chan

    Abstract: Hyperspectral images, which store a hundred or more spectral bands of reflectance, have become an important data source in natural and social sciences. Hyperspectral images are often generated in large quantities at a relatively coarse spatial resolution. As such, unsupervised machine learning algorithms incorporating known structure in hyperspectral imagery are needed to analyze these images auto… ▽ More

    Submitted 28 April, 2022; originally announced April 2022.

    Comments: 7 pages, 1 figure

  15. arXiv:2204.09294  [pdf, other

    cs.CV stat.ML

    A 3-stage Spectral-spatial Method for Hyperspectral Image Classification

    Authors: Raymond H. Chan, Ruoning Li

    Abstract: Hyperspectral images often have hundreds of spectral bands of different wavelengths captured by aircraft or satellites that record land coverage. Identifying detailed classes of pixels becomes feasible due to the enhancement in spectral and spatial resolution of hyperspectral images. In this work, we propose a novel framework that utilizes both spatial and spectral information for classifying pixe… ▽ More

    Submitted 20 April, 2022; originally announced April 2022.

    Comments: 18 pages, 9 figures

  16. arXiv:2204.09041  [pdf, other

    cs.CV cs.LG stat.AP

    Unsupervised detection of ash dieback disease (Hymenoscyphus fraxineus) using diffusion-based hyperspectral image clustering

    Authors: Sam L. Polk, Aland H. Y. Chan, Kangning Cui, Robert J. Plemmons, David A. Coomes, James M. Murphy

    Abstract: Ash dieback (Hymenoscyphus fraxineus) is an introduced fungal disease that is causing the widespread death of ash trees across Europe. Remote sensing hyperspectral images encode rich structure that has been exploited for the detection of dieback disease in ash trees using supervised machine learning techniques. However, to understand the state of forest health at landscape-scale, accurate unsuperv… ▽ More

    Submitted 19 April, 2022; originally announced April 2022.

    Comments: (6 pages, 2 figures). Accepted to Proceedings of IEEE IGARSS 2022

  17. arXiv:2203.15619  [pdf, other

    cs.CV stat.ML

    Classification of Hyperspectral Images Using SVM with Shape-adaptive Reconstruction and Smoothed Total Variation

    Authors: Ruoning Li, Kangning Cui, Raymond H. Chan, Robert J. Plemmons

    Abstract: In this work, a novel algorithm called SVM with Shape-adaptive Reconstruction and Smoothed Total Variation (SaR-SVM-STV) is introduced to classify hyperspectral images, which makes full use of spatial and spectral information. The Shape-adaptive Reconstruction (SaR) is introduced to preprocess each pixel based on the Pearson Correlation between pixels in its shape-adaptive (SA) region. Support Vec… ▽ More

    Submitted 14 April, 2022; v1 submitted 29 March, 2022; originally announced March 2022.

    Comments: 6 pages, 3 figures. Accepted to Proceedings of IEEE IGARSS 2022

  18. arXiv:2203.09992  [pdf, other

    cs.CV cs.LG stat.AP

    Unsupervised Diffusion and Volume Maximization-Based Clustering of Hyperspectral Images

    Authors: Sam L. Polk, Kangning Cui, Aland H. Y. Chan, David A. Coomes, Robert J. Plemmons, James M. Murphy

    Abstract: Hyperspectral images taken from aircraft or satellites contain information from hundreds of spectral bands, within which lie latent lower-dimensional structures that can be exploited for classifying vegetation and other materials. A disadvantage of working with hyperspectral images is that, due to an inherent trade-off between spectral and spatial resolution, they have a relatively coarse spatial… ▽ More

    Submitted 19 February, 2023; v1 submitted 18 March, 2022; originally announced March 2022.

    Comments: 28 pages, 11 figures

    Journal ref: Remote Sens. 2023, 15(4), 1053

  19. arXiv:2107.07963  [pdf, other

    stat.ME math.PR math.ST

    Nearly Unstable Integer-Valued ARCH Process and Unit Root Testing

    Authors: Wagner Barreto-Souza, Ngai Hang Chan

    Abstract: This paper introduces a Nearly Unstable INteger-valued AutoRegressive Conditional Heteroskedasticity (NU-INARCH) process for dealing with count time series data. It is proved that a proper normalization of the NU-INARCH process endowed with a Skorohod topology weakly converges to a Cox-Ingersoll-Ross diffusion. The asymptotic distribution of the conditional least squares estimator of the correlati… ▽ More

    Submitted 16 July, 2021; originally announced July 2021.

    Comments: Paper submitted for publication

  20. arXiv:2105.10446  [pdf, other

    cs.LG cs.CV cs.IT stat.ML

    ReduNet: A White-box Deep Network from the Principle of Maximizing Rate Reduction

    Authors: Kwan Ho Ryan Chan, Yaodong Yu, Chong You, Haozhi Qi, John Wright, Yi Ma

    Abstract: This work attempts to provide a plausible theoretical framework that aims to interpret modern deep (convolutional) networks from the principles of data compression and discriminative representation. We argue that for high-dimensional multi-class data, the optimal linear discriminative representation maximizes the coding rate difference between the whole dataset and the average of all the subsets.… ▽ More

    Submitted 28 November, 2021; v1 submitted 21 May, 2021; originally announced May 2021.

    Comments: This paper integrates previous two manuscripts: arXiv:2006.08558 and arXiv:2010.14765, with significantly improved organization, presentation, and new results; V2 polishes writing and adds citation; V3 polishes writing, adds citation and experiments

  21. arXiv:2103.07600  [pdf, other

    cs.LG cs.CV stat.ML

    Student-Teacher Learning from Clean Inputs to Noisy Inputs

    Authors: Guanzhe Hong, Zhiyuan Mao, Xiaojun Lin, Stanley H. Chan

    Abstract: Feature-based student-teacher learning, a training method that encourages the student's hidden features to mimic those of the teacher network, is empirically successful in transferring the knowledge from a pre-trained teacher network to the student network. Furthermore, recent empirical results demonstrate that, the teacher's features can boost the student network's generalization even when the st… ▽ More

    Submitted 12 March, 2021; originally announced March 2021.

    Comments: Published at the Conference on Computer Vision and Pattern Recognition (CVPR 2021)

  22. arXiv:2010.14765  [pdf, other

    cs.LG cs.IT math.OC stat.ML

    Deep Networks from the Principle of Rate Reduction

    Authors: Kwan Ho Ryan Chan, Yaodong Yu, Chong You, Haozhi Qi, John Wright, Yi Ma

    Abstract: This work attempts to interpret modern deep (convolutional) networks from the principles of rate reduction and (shift) invariant classification. We show that the basic iterative gradient ascent scheme for optimizing the rate reduction of learned features naturally leads to a multi-layer deep network, one iteration per layer. The layered architectures, linear and nonlinear operators, and even param… ▽ More

    Submitted 27 October, 2020; originally announced October 2020.

  23. arXiv:2010.04438  [pdf, other

    cs.CL cs.LG stat.ML

    Multichannel Generative Language Model: Learning All Possible Factorizations Within and Across Channels

    Authors: Harris Chan, Jamie Kiros, William Chan

    Abstract: A channel corresponds to a viewpoint or transformation of an underlying meaning. A pair of parallel sentences in English and French express the same underlying meaning, but through two separate channels corresponding to their languages. In this work, we present the Multichannel Generative Language Model (MGLM). MGLM is a generative joint distribution model over channels. MGLM marginalizes over all… ▽ More

    Submitted 9 October, 2020; originally announced October 2020.

    Comments: 10 pages (+3 appendix), 11 figures, 5 tables. Accepted to Findings of EMNLP 2020

  24. arXiv:2007.02832  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    Maximum Entropy Gain Exploration for Long Horizon Multi-goal Reinforcement Learning

    Authors: Silviu Pitis, Harris Chan, Stephen Zhao, Bradly Stadie, Jimmy Ba

    Abstract: What goals should a multi-goal reinforcement learning agent pursue during training in long-horizon tasks? When the desired (test time) goal distribution is too distant to offer a useful learning signal, we argue that the agent should not pursue unobtainable goals. Instead, it should set its own intrinsic goals that maximize the entropy of the historical achieved goal distribution. We propose to op… ▽ More

    Submitted 6 July, 2020; originally announced July 2020.

    Comments: 12 pages (+12 appendix). Published as a conference paper at ICML 2020. Code available at https://github.com/spitis/mrl

  25. arXiv:2006.11408  [pdf, other

    cs.CV cs.LG stat.ML

    Quasi-conformal Geometry based Local Deformation Analysis of Lateral Cephalogram for Childhood OSA Classification

    Authors: Hei-Long Chan, Hoi-Man Yuen, Chun-Ting Au, Kate Ching-Ching Chan, Albert Martin Li, Lok-Ming Lui

    Abstract: Craniofacial profile is one of the anatomical causes of obstructive sleep apnea(OSA). By medical research, cephalometry provides information on patients' skeletal structures and soft tissues. In this work, a novel approach to cephalometric analysis using quasi-conformal geometry based local deformation information was proposed for OSA classification. Our study was a retrospective analysis based on… ▽ More

    Submitted 31 May, 2020; originally announced June 2020.

  26. arXiv:2006.08558  [pdf, other

    cs.LG cs.CV cs.IT stat.ML

    Learning Diverse and Discriminative Representations via the Principle of Maximal Coding Rate Reduction

    Authors: Yaodong Yu, Kwan Ho Ryan Chan, Chong You, Chaobing Song, Yi Ma

    Abstract: To learn intrinsic low-dimensional structures from high-dimensional data that most discriminate between classes, we propose the principle of Maximal Coding Rate Reduction ($\text{MCR}^2$), an information-theoretic measure that maximizes the coding rate difference between the whole dataset and the sum of each individual class. We clarify its relationships with most existing frameworks such as cross… ▽ More

    Submitted 15 June, 2020; originally announced June 2020.

  27. arXiv:2005.09627  [pdf, other

    cs.LG stat.ML

    One Size Fits All: Can We Train One Denoiser for All Noise Levels?

    Authors: Abhiram Gnansambandam, Stanley H. Chan

    Abstract: When training an estimator such as a neural network for tasks like image denoising, it is often preferred to train one estimator and apply it to all noise levels. The de facto training protocol to achieve this goal is to train the estimator with noisy samples whose noise levels are uniformly distributed across the range of interest. However, why should we allocate the samples uniformly? Can we hav… ▽ More

    Submitted 16 July, 2020; v1 submitted 19 May, 2020; originally announced May 2020.

    Comments: Published in the 37th International Conference on Machine Learning (ICML) 2020

  28. arXiv:2004.14774  [pdf, other

    cs.CV cs.LG cs.RO eess.IV stat.ML

    IROS 2019 Lifelong Robotic Vision Challenge -- Lifelong Object Recognition Report

    Authors: Qi She, Fan Feng, Qi Liu, Rosa H. M. Chan, Xinyue Hao, Chuanlin Lan, Qihan Yang, Vincenzo Lomonaco, German I. Parisi, Heechul Bae, Eoin Brophy, Baoquan Chen, Gabriele Graffieti, Vidit Goel, Hyonyoung Han, Sathursan Kanagarajah, Somesh Kumar, Siew-Kei Lam, Tin Lun Lam, Liang Ma, Davide Maltoni, Lorenzo Pellegrini, Duvindu Piyasena, Shiliang Pu, Debdoot Sheet , et al. (11 additional authors not shown)

    Abstract: This report summarizes IROS 2019-Lifelong Robotic Vision Competition (Lifelong Object Recognition Challenge) with methods and results from the top $8$ finalists (out of over~$150$ teams). The competition dataset (L)ifel(O)ng (R)obotic V(IS)ion (OpenLORIS) - Object Recognition (OpenLORIS-object) is designed for driving lifelong/continual learning research and application in robotic vision domain, w… ▽ More

    Submitted 26 April, 2020; originally announced April 2020.

    Comments: 9 pages, 11 figures, 3 tables, accepted into IEEE Robotics and Automation Magazine. arXiv admin note: text overlap with arXiv:1911.06487

  29. arXiv:2003.10229  [pdf, other

    eess.IV cs.LG stat.ML

    QC-SPHRAM: Quasi-conformal Spherical Harmonics Based Geometric Distortions on Hippocampal Surfaces for Early Detection of the Alzheimer's Disease

    Authors: Anthony Hei-Long Chan, Yishan Luo, Lin Shi, Ronald Lok-Ming Lui

    Abstract: We propose a disease classification model, called the QC-SPHARM, for the early detection of the Alzheimer's Disease (AD). The proposed QC-SPHARM can distinguish between normal control (NC) subjects and AD patients, as well as between amnestic mild cognitive impairment (aMCI) patients having high possibility progressing into AD and those who do not. Using the spherical harmonics (SPHARM) based regi… ▽ More

    Submitted 19 March, 2020; originally announced March 2020.

  30. arXiv:2002.05825  [pdf, other

    cs.LG stat.ML

    An Inductive Bias for Distances: Neural Nets that Respect the Triangle Inequality

    Authors: Silviu Pitis, Harris Chan, Kiarash Jamali, Jimmy Ba

    Abstract: Distances are pervasive in machine learning. They serve as similarity measures, loss functions, and learning targets; it is said that a good distance measure solves a task. When defining distances, the triangle inequality has proven to be a useful constraint, both theoretically--to prove convergence and optimality guarantees--and empirically--as an inductive bias. Deep metric learning architecture… ▽ More

    Submitted 6 July, 2020; v1 submitted 13 February, 2020; originally announced February 2020.

    Comments: 11 pages (+18 appendix). Published as a conference paper at ICLR 2020. https://openreview.net/forum?id=HJeiDpVFPr

  31. arXiv:1911.06487  [pdf, other

    cs.CV cs.LG cs.RO stat.ML

    OpenLORIS-Object: A Robotic Vision Dataset and Benchmark for Lifelong Deep Learning

    Authors: Qi She, Fan Feng, Xinyue Hao, Qihan Yang, Chuanlin Lan, Vincenzo Lomonaco, Xuesong Shi, Zhengwei Wang, Yao Guo, Yimin Zhang, Fei Qiao, Rosa H. M. Chan

    Abstract: The recent breakthroughs in computer vision have benefited from the availability of large representative datasets (e.g. ImageNet and COCO) for training. Yet, robotic vision poses unique challenges for applying visual algorithms developed from these standard computer vision datasets due to their implicit assumption over non-varying distributions for a fixed set of tasks. Fully retraining models eac… ▽ More

    Submitted 6 March, 2020; v1 submitted 15 November, 2019; originally announced November 2019.

    Comments: 7 pages, 7 figures, 4 tables

  32. arXiv:1902.08234  [pdf, other

    cs.LG stat.ML

    An Empirical Study of Large-Batch Stochastic Gradient Descent with Structured Covariance Noise

    Authors: Yeming Wen, Kevin Luk, Maxime Gazeau, Guodong Zhang, Harris Chan, Jimmy Ba

    Abstract: The choice of batch-size in a stochastic optimization algorithm plays a substantial role for both optimization and generalization. Increasing the batch-size used typically improves optimization but degrades generalization. To address the problem of improving generalization while maintaining optimal convergence in large-batch training, we propose to add covariance noise to the gradients. We demonst… ▽ More

    Submitted 28 February, 2020; v1 submitted 21 February, 2019; originally announced February 2019.

    Journal ref: The 23rd International Conference on Artificial Intelligence and Statistics, 2020

  33. arXiv:1902.04546  [pdf, other

    cs.LG cs.AI cs.NE stat.ML

    ACTRCE: Augmenting Experience via Teacher's Advice For Multi-Goal Reinforcement Learning

    Authors: Harris Chan, Yuhuai Wu, Jamie Kiros, Sanja Fidler, Jimmy Ba

    Abstract: Sparse reward is one of the most challenging problems in reinforcement learning (RL). Hindsight Experience Replay (HER) attempts to address this issue by converting a failed experience to a successful one by relabeling the goals. Despite its effectiveness, HER has limited applicability because it lacks a compact and universal goal representation. We present Augmenting experienCe via TeacheR's advi… ▽ More

    Submitted 12 February, 2019; originally announced February 2019.

  34. arXiv:1809.02652  [pdf, other

    cs.LG cs.CV stat.ML

    Are You Sure You Want To Do That? Classification with Verification

    Authors: Harris Chan, Atef Chaudhury, Kevin Shen

    Abstract: Classification systems typically act in isolation, meaning they are required to implicitly memorize the characteristics of all candidate classes in order to classify. The cost of this is increased memory usage and poor sample efficiency. We propose a model which instead verifies using reference images during the classification process, reducing the burden of memorization. The model uses iterative… ▽ More

    Submitted 12 September, 2018; v1 submitted 7 September, 2018; originally announced September 2018.

    Comments: 9 pages, 5 figures, preprint

  35. arXiv:1808.01280  [pdf

    cs.NE cs.LG cs.SI stat.ML

    Geared Rotationally Identical and Invariant Convolutional Neural Network Systems

    Authors: ShihChung B. Lo, Ph. D., Matthew T. Freedman, M. D., Seong K. Mun, Ph. D., Heang-Ping Chan, Ph. D

    Abstract: Theorems and techniques to form different types of transformationally invariant processing and to produce the same output quantitatively based on either transformationally invariant operators or symmetric operations have recently been introduced by the authors. In this study, we further propose to compose a geared rotationally identical CNN system (GRI-CNN) with a small step angle by connecting ne… ▽ More

    Submitted 10 August, 2018; v1 submitted 2 August, 2018; originally announced August 2018.

    Comments: 14 pages, 6 figures, 8 tables

  36. arXiv:1805.11793  [pdf, ps, other

    stat.ML cs.LG stat.CO

    Infinite Arms Bandit: Optimality via Confidence Bounds

    Authors: Hock Peng Chan, Shouri Hu

    Abstract: Berry et al. (1997) initiated the development of the infinite arms bandit problem. They derived a regret lower bound of all allocation strategies for Bernoulli rewards with uniform priors, and proposed strategies based on success runs. Bonald and Proutière (2013) proposed a two-target algorithm that achieves the regret lower bound, and extended optimality to Bernoulli rewards with general priors.… ▽ More

    Submitted 21 June, 2020; v1 submitted 29 May, 2018; originally announced May 2018.

    Comments: Fourth version

  37. arXiv:1712.02488  [pdf, other

    stat.ML cs.LG

    Cost-sensitive detection with variational autoencoders for environmental acoustic sensing

    Authors: Yunpeng Li, Ivan Kiskin, Davide Zilli, Marianne Sinka, Henry Chan, Kathy Willis, Stephen Roberts

    Abstract: Environmental acoustic sensing involves the retrieval and processing of audio signals to better understand our surroundings. While large-scale acoustic data make manual analysis infeasible, they provide a suitable playground for machine learning approaches. Most existing machine learning techniques developed for environmental acoustic sensing do not provide flexible control of the trade-off betwee… ▽ More

    Submitted 6 December, 2017; originally announced December 2017.

    Comments: Presented at the NIPS 2017 Workshop on Machine Learning for Audio Signal Processing

  38. arXiv:1711.06346  [pdf, other

    stat.ML cs.CY

    Mosquito detection with low-cost smartphones: data acquisition for malaria research

    Authors: Yunpeng Li, Davide Zilli, Henry Chan, Ivan Kiskin, Marianne Sinka, Stephen Roberts, Kathy Willis

    Abstract: Mosquitoes are a major vector for malaria, causing hundreds of thousands of deaths in the developing world each year. Not only is the prevention of mosquito bites of paramount importance to the reduction of malaria transmission cases, but understanding in more forensic detail the interplay between malaria, mosquito vectors, vegetation, standing water and human populations is crucial to the deploym… ▽ More

    Submitted 5 December, 2017; v1 submitted 16 November, 2017; originally announced November 2017.

    Comments: Presented at NIPS 2017 Workshop on Machine Learning for the Developing World

  39. arXiv:1605.02869  [pdf, other

    q-bio.QM eess.SP q-bio.NC stat.ML

    An Efficient and Flexible Spike Train Model via Empirical Bayes

    Authors: Qi She, Xiaoli Wu, Beth Jelfs, Adam S. Charles, Rosa H. M. Chan

    Abstract: Accurate statistical models of neural spike responses can characterize the information carried by neural populations. But the limited samples of spike counts during recording usually result in model overfitting. Besides, current models assume spike counts to be Poisson-distributed, which ignores the fact that many neurons demonstrate over-dispersed spiking behaviour. Although the Negative Binomial… ▽ More

    Submitted 27 April, 2021; v1 submitted 10 May, 2016; originally announced May 2016.

    Comments: 16 pages, 20 figures, 3 tables

    Journal ref: IEEE Trans. Signal Processing 69 (2021) 3236-3251

  40. Adaptive Image Denoising by Mixture Adaptation

    Authors: Enming Luo, Stanley H. Chan, Truong Q. Nguyen

    Abstract: We propose an adaptive learning procedure to learn patch-based image priors for image denoising. The new algorithm, called the Expectation-Maximization (EM) adaptation, takes a generic prior learned from a generic external database and adapts it to the noisy image to generate a specific prior. Different from existing methods that combine internal and external statistics in ad-hoc ways, the propose… ▽ More

    Submitted 24 June, 2016; v1 submitted 18 January, 2016; originally announced January 2016.

    Comments: 15 pages

  41. arXiv:1502.07190  [pdf, other

    stat.ML cs.LG

    Topic-adjusted visibility metric for scientific articles

    Authors: Linda S. L. Tan, Aik Hui Chan, Tian Zheng

    Abstract: Measuring the impact of scientific articles is important for evaluating the research output of individual scientists, academic institutions and journals. While citations are raw data for constructing impact measures, there exist biases and potential issues if factors affecting citation patterns are not properly accounted for. In this work, we address the problem of field variation and introduce an… ▽ More

    Submitted 16 October, 2015; v1 submitted 25 February, 2015; originally announced February 2015.

    Journal ref: Annals of Applied Statistics, Volume 10, Number 1 (2016), 1-31

  42. arXiv:1411.6400   

    stat.ML cs.LG

    Mutual Information-Based Unsupervised Feature Transformation for Heterogeneous Feature Subset Selection

    Authors: Min Wei, Tommy W. S. Chow, Rosa H. M. Chan

    Abstract: Conventional mutual information (MI) based feature selection (FS) methods are unable to handle heterogeneous feature subset selection properly because of data format differences or estimation methods of MI between feature subset and class label. A way to solve this problem is feature transformation (FT). In this study, a novel unsupervised feature transformation (UFT) which can transform non-numer… ▽ More

    Submitted 29 March, 2015; v1 submitted 24 November, 2014; originally announced November 2014.

    Comments: This paper has been withdrawn by the author due to the number of datasets and classifiers are not sufficient to support the claim. Need more simulation work

  43. Adaptive Image Denoising by Targeted Databases

    Authors: Enming Luo, Stanley H. Chan, Truong Q. Nguyen

    Abstract: We propose a data-dependent denoising procedure to restore noisy images. Different from existing denoising algorithms which search for patches from either the noisy image or a generic database, the new algorithm finds patches from a database that contains only relevant patches. We formulate the denoising problem as an optimal filter design problem and make two contributions. First, we determine th… ▽ More

    Submitted 3 November, 2014; v1 submitted 30 June, 2014; originally announced July 2014.

    Comments: 15 pages, 13 figures, 2 tables, journal

  44. arXiv:1402.1888  [pdf, ps, other

    stat.ME

    A Consistent Histogram Estimator for Exchangeable Graph Models

    Authors: Stanley H. Chan, Edoardo M. Airoldi

    Abstract: Exchangeable graph models (ExGM) subsume a number of popular network models. The mathematical object that characterizes an ExGM is termed a graphon. Finding scalable estimators of graphons, provably consistent, remains an open issue. In this paper, we propose a histogram estimator of a graphon that is provably consistent and numerically efficient. The proposed estimator is based on a sorting-and-s… ▽ More

    Submitted 11 February, 2014; v1 submitted 8 February, 2014; originally announced February 2014.

    Comments: 28 pages, 5 figures

  45. Monte Carlo non local means: Random sampling for large-scale image filtering

    Authors: Stanley H. Chan, Todd Zickler, Yue M. Lu

    Abstract: We propose a randomized version of the non-local means (NLM) algorithm for large-scale image filtering. The new algorithm, called Monte Carlo non-local means (MCNLM), speeds up the classical NLM by computing a small subset of image patch distances, which are randomly selected according to a designed sampling pattern. We make two contributions. First, we analyze the performance of the MCNLM algorit… ▽ More

    Submitted 14 May, 2014; v1 submitted 27 December, 2013; originally announced December 2013.

    Comments: submitted for publication

  46. arXiv:1311.1731  [pdf, ps, other

    stat.ME cs.LG cs.SI physics.data-an stat.ML

    Stochastic blockmodel approximation of a graphon: Theory and consistent estimation

    Authors: Edoardo M Airoldi, Thiago B Costa, Stanley H Chan

    Abstract: Non-parametric approaches for analyzing network data based on exchangeable graph models (ExGM) have recently gained interest. The key object that defines an ExGM is often referred to as a graphon. This non-parametric perspective on network modeling poses challenging questions on how to make inference on the graphon underlying observed network data. In this paper, we propose a computationally effic… ▽ More

    Submitted 7 November, 2013; v1 submitted 7 November, 2013; originally announced November 2013.

    Comments: 20 pages, 4 figures, 2 algorithms. Neural Information Processing Systems (NIPS), 2013

  47. arXiv:1212.5729  [pdf, ps, other

    stat.AP

    Multiscale Adaptive Inference on Conditional Moment Inequalities

    Authors: Timothy B. Armstrong, Hock Peng Chan

    Abstract: This paper considers inference for conditional moment inequality models using a multiscale statistic. We derive the asymptotic distribution of this test statistic and use the result to propose feasible critical values that have a simple analytic formula, and to prove the asymptotic validity of a modified bootstrap procedure. The asymptotic distribution is extreme value, and the proof uses new tech… ▽ More

    Submitted 8 December, 2015; v1 submitted 22 December, 2012; originally announced December 2012.

  48. arXiv:1212.2470  [pdf

    cs.LG cs.AI stat.ML

    Reasoning about Bayesian Network Classifiers

    Authors: Hei Chan, Adnan Darwiche

    Abstract: Bayesian network classifiers are used in many fields, and one common class of classifiers are naive Bayes classifiers. In this paper, we introduce an approach for reasoning about Bayesian network classifiers in which we explicitly convert them into Ordered Decision Diagrams (ODDs), which are then used to reason about the properties of these classifiers. Specifically, we present an algorithm for co… ▽ More

    Submitted 19 October, 2012; originally announced December 2012.

    Comments: Appears in Proceedings of the Nineteenth Conference on Uncertainty in Artificial Intelligence (UAI2003)

    Report number: UAI-P-2003-PG-107-115

  49. arXiv:1107.4344  [pdf, other

    stat.ME

    Detection with the scan and the average likelihood ratio

    Authors: Hock Peng Chan, Guenther Walther

    Abstract: We investigate the performance of the scan (maximum likelihood ratio statistic) and of the average likelihood ratio statistic in the problem of detecting a deterministic signal with unknown spatial extent in the prototypical univariate sampled data model with white Gaussian noise. Our results show that the scan statistic, a popular tool for detection problems, is optimal only for the detection of… ▽ More

    Submitted 25 February, 2014; v1 submitted 21 July, 2011; originally announced July 2011.

    Journal ref: Statistica Sinica 23 (2013), 409-428

  50. arXiv:0811.4447  [pdf, ps, other

    stat.AP q-bio.QM stat.CO

    Importance Sampling of Word Patterns in DNA and Protein Sequences

    Authors: Hock Peng Chan, Nancy R. Zhang, Louis H. Y. Chen

    Abstract: Monte Carlo methods can provide accurate p-value estimates of word counting test statistics and are easy to implement. They are especially attractive when an asymptotic theory is absent or when either the search sequence or the word pattern is too short for the application of asymptotic formulae. Naive direct Monte Carlo is undesirable for the estimation of small probabilities because the associ… ▽ More

    Submitted 26 November, 2008; originally announced November 2008.