Showing 1–13 of 13 results for author: Chu, T

Searching in archive stat.

  1. arXiv:2411.03360  [pdf, other]

    cs.LG stat.AP

    Pedestrian Volume Prediction Using a Diffusion Convolutional Gated Recurrent Unit Model

    Authors: Yiwei Dong, Tingjin Chu, Lele Zhang, Hadi Ghaderi, Hanfang Yang

    Abstract: Effective models for analysing and predicting pedestrian flow are important to ensure the safety of both pedestrians and other road users. These tools also play a key role in optimising infrastructure design and geometry and supporting the economic utility of interconnected communities. The implementation of city-wide automatic pedestrian counting systems provides researchers with invaluable data,…

    Submitted 4 November, 2024; originally announced November 2024.

  2. arXiv:2406.13725  [pdf, ps, other]

    cs.LG cs.AI stat.ML

    Tree-Sliced Wasserstein Distance: A Geometric Perspective

    Authors: Viet-Hoang Tran, Trang Pham, Tho Tran, Minh Khoi Nguyen Nhat, Thanh Chu, Tam Le, Tan M. Nguyen

    Abstract: Many variants of Optimal Transport (OT) have been developed to address its heavy computation. Among them, notably, Sliced Wasserstein (SW) is widely used for application domains by projecting the OT problem onto one-dimensional lines, and leveraging the closed-form expression of the univariate OT to reduce the computational burden. However, projecting measures onto low-dimensional spaces can lead…

    Submitted 9 June, 2025; v1 submitted 19 June, 2024; originally announced June 2024.

    Comments: Accepted to ICML 2025

  3. arXiv:2402.17595  [pdf, other]

    cs.LG cs.AI cs.NE stat.ML

    Implicit Regularization via Spectral Neural Networks and Non-linear Matrix Sensing

    Authors: Hong T. M. Chu, Subhro Ghosh, Chi Thanh Lam, Soumendu Sundar Mukherjee

    Abstract: The phenomenon of implicit regularization has attracted interest in recent years as a fundamental aspect of the remarkable generalizing ability of neural networks. In a nutshell, it entails that gradient descent dynamics in many neural nets, even without any explicit regularizer in the loss function, converges to the solution of a regularized learning problem. However, known results attempting to…

    Submitted 27 February, 2024; originally announced February 2024.

  4. arXiv:2402.14029  [pdf, other]

    cs.LG cs.AI stat.ML

    Partially Frozen Random Networks Contain Compact Strong Lottery Tickets

    Authors: Hikari Otsuka, Daiki Chijiwa, Ángel López García-Arias, Yasuyuki Okoshi, Kazushi Kawamura, Thiem Van Chu, Daichi Fujiki, Susumu Takeuchi, Masato Motomura

    Abstract: Randomly initialized dense networks contain subnetworks that achieve high accuracy without weight learning--strong lottery tickets (SLTs). Recently, Gadhikar et al. (2023) demonstrated that SLTs could also be found within a randomly pruned source network. This phenomenon can be exploited to further compress the small memory size required by SLTs. However, their method is limited to SLTs that are e…

    Submitted 8 February, 2025; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: Accepted at TMLR

  5. arXiv:2312.03236  [pdf, other]

    cs.LG cs.AI stat.ML

    Multicoated and Folded Graph Neural Networks with Strong Lottery Tickets

    Authors: Jiale Yan, Hiroaki Ito, Ángel López García-Arias, Yasuyuki Okoshi, Hikari Otsuka, Kazushi Kawamura, Thiem Van Chu, Masato Motomura

    Abstract: The Strong Lottery Ticket Hypothesis (SLTH) demonstrates the existence of high-performing subnetworks within a randomly initialized model, discoverable through pruning a convolutional neural network (CNN) without any weight training. A recent study, called Untrained GNNs Tickets (UGT), expanded SLTH from CNNs to shallow graph neural networks (GNNs). However, discrepancies persist when comparing ba…

    Submitted 5 December, 2023; originally announced December 2023.

    Comments: 9 pages, accepted in the Second Learning on Graphs Conference (LoG 2023)

    Journal ref: Proceedings of the Second Learning on Graphs Conference (LoG 2023), PMLR 231

  6. arXiv:2011.02466  [pdf, ps, other]

    cs.DS cs.CC cs.LG stat.ML

    Algorithms and Hardness for Linear Algebra on Geometric Graphs

    Authors: Josh Alman, Timothy Chu, Aaron Schild, Zhao Song

    Abstract: For a function $\mathsf{K} : \mathbb{R}^{d} \times \mathbb{R}^{d} \to \mathbb{R}_{\geq 0}$, and a set $P = \{ x_1, \ldots, x_n\} \subset \mathbb{R}^d$ of $n$ points, the $\mathsf{K}$ graph $G_P$ of $P$ is the complete graph on $n$ nodes where the weight between nodes $i$ and $j$ is given by $\mathsf{K}(x_i, x_j)$. In this paper, we initiate the study of when efficient spectral graph theory is poss…

    Submitted 4 November, 2020; originally announced November 2020.

    Comments: FOCS 2020

  7. arXiv:2006.10897  [pdf, other]

    cs.LG cs.MA stat.ML

    Efficient Ridesharing Dispatch Using Multi-Agent Reinforcement Learning

    Authors: Oscar de Lima, Hansal Shah, Ting-Sheng Chu, Brian Fogelson

    Abstract: With the advent of ride-sharing services, there is a huge increase in the number of people who rely on them for various needs. Most of the earlier approaches tackling this issue required handcrafted functions for estimating travel times and passenger waiting times. Traditional Reinforcement Learning (RL) based methods attempting to solve the ridesharing problem are unable to accurately model the c…

    Submitted 18 June, 2020; originally announced June 2020.

  8. arXiv:2004.09589  [pdf, other]

    cs.LG cs.DM stat.ML

    Weighted Cheeger and Buser Inequalities, with Applications to Clustering and Cutting Probability Densities

    Authors: Timothy Chu, Gary L. Miller, Noel J. Walkington, Alex L. Wang

    Abstract: In this paper, we show how sparse or isoperimetric cuts of a probability density function relate to Cheeger cuts of its principal eigenfunction, for appropriate definitions of `sparse cut' and `principal eigenfunction'. We construct these appropriate definitions of sparse cut and principal eigenfunction in the probability density setting. Then, we prove Cheeger and Buser type inequalities simila…

    Submitted 6 May, 2020; v1 submitted 20 April, 2020; originally announced April 2020.

  9. arXiv:2004.01339  [pdf, other]

    cs.LG stat.ML

    Multi-agent Reinforcement Learning for Networked System Control

    Authors: Tianshu Chu, Sandeep Chinchali, Sachin Katti

    Abstract: This paper considers multi-agent reinforcement learning (MARL) in networked system control. Specifically, each agent learns a decentralized control policy based on local observations and messages from connected neighbors. We formulate such a networked MARL (NMARL) problem as a spatiotemporal Markov decision process and introduce a spatial discount factor to stabilize the training of each local age…

    Submitted 23 April, 2020; v1 submitted 2 April, 2020; originally announced April 2020.

    Comments: ICLR 2020

  10. arXiv:1903.04527  [pdf, other]

    cs.LG stat.ML

    Multi-Agent Deep Reinforcement Learning for Large-scale Traffic Signal Control

    Authors: Tianshu Chu, Jie Wang, Lara Codecà, Zhaojian Li

    Abstract: Reinforcement learning (RL) is a promising data-driven approach for adaptive traffic signal control (ATSC) in complex urban traffic networks, and deep neural networks further enhance its learning power. However, centralized RL is infeasible for large-scale ATSC due to the extremely high dimension of the joint action space. Multi-agent RL (MARL) overcomes the scalability issue by distributing the g…

    Submitted 11 March, 2019; originally announced March 2019.

  11. arXiv:1706.05544  [pdf]

    stat.ML cs.LG

    Rgtsvm: Support Vector Machines on a GPU in R

    Authors: Zhong Wang, Tinyi Chu, Lauren A Choate, Charles G Danko

    Abstract: Rgtsvm provides a fast and flexible support vector machine (SVM) implementation for the R language. The distinguishing feature of Rgtsvm is that support vector classification and support vector regression tasks are implemented on a graphical processing unit (GPU), allowing the libraries to scale to millions of examples with >100-fold improvement in performance over existing implementations. Nevert…

    Submitted 17 June, 2017; originally announced June 2017.

    Comments: 6 pages, 1 figure

  12. arXiv:1301.2261  [pdf]

    stat.ME cs.AI stat.AP

    Semi-Instrumental Variables: A Test for Instrument Admissibility

    Authors: Tianjiao Chu, Richard Scheines, Peter L. Spirtes

    Abstract: In a causal graphical model, an instrument for a variable X and its effect Y is a random variable that is a cause of X and independent of all the causes of Y except X (Pearl (1995), Spirtes et al. (2000)). Instrumental variables can be used to estimate how the distribution of an effect will respond to a manipulation of its causes, even in the presence of unmeasured common causes (confounders). In…

    Submitted 10 January, 2013; originally announced January 2013.

    Comments: Appears in Proceedings of the Seventeenth Conference on Uncertainty in Artificial Intelligence (UAI2001)

    Report number: UAI-P-2001-PG-83-90

  13. arXiv:1109.0320  [pdf, ps, other]

    stat.ME math.ST

    Penalized maximum likelihood estimation and variable selection in geostatistics

    Authors: Tingjin Chu, Jun Zhu, Haonan Wang

    Abstract: We consider the problem of selecting covariates in spatial linear models with Gaussian process errors. Penalized maximum likelihood estimation (PMLE) that enables simultaneous variable selection and parameter estimation is developed and, for ease of computation, PMLE is approximated by one-step sparse estimation (OSE). To further improve computational efficiency, particularly with large sample siz…

    Submitted 23 February, 2012; v1 submitted 1 September, 2011; originally announced September 2011.

    Comments: Published at http://dx.doi.org/10.1214/11-AOS919 in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS919

    Journal ref: Annals of Statistics 2011, Vol. 39, No. 5, 2607-2625