Skip to main content

Showing 1–32 of 32 results for author: Ong, C S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.13864  [pdf, other

    stat.ML cs.DM cs.LG

    Graphon Mixtures

    Authors: Sevvandi Kandanaarachchi, Cheng Soon Ong

    Abstract: Social networks have a small number of large hubs, and a large number of small dense communities. We propose a generative model that captures both hub and dense structures. Based on recent results about graphons on line graphs, our model is a graphon mixture, enabling us to generate sequences of graphs where each graph is a combination of sparse and dense graphs. We propose a new condition on spar… ▽ More

    Submitted 19 May, 2025; originally announced May 2025.

  2. arXiv:2505.04104  [pdf, ps, other

    cs.LG cs.AI cs.CY

    Position: We Need Responsible, Application-Driven (RAD) AI Research

    Authors: Sarah Hartman, Cheng Soon Ong, Julia Powles, Petra Kuhnert

    Abstract: This position paper argues that achieving meaningful scientific and societal advances with artificial intelligence (AI) requires a responsible, application-driven approach (RAD) to AI research. As AI is increasingly integrated into society, AI researchers must engage with the specific contexts where AI is being applied. This includes being responsive to ethical and legal considerations, technical… ▽ More

    Submitted 9 June, 2025; v1 submitted 6 May, 2025; originally announced May 2025.

    Comments: 12 pages, 1 figure, Camera Ready version with updated formatting, references, and minor fixes, Accepted to Proceedings of the 41 st International Conference on Machine Learning, Vancouver, Canada. PMLR 267, 2025

    ACM Class: I.2.0; K.4.1; J.4

  3. arXiv:2503.21128  [pdf, other

    stat.ML cs.LG

    Squared families: Searching beyond regular probability models

    Authors: Russell Tsuchida, Jiawei Liu, Cheng Soon Ong, Dino Sejdinovic

    Abstract: We introduce squared families, which are families of probability densities obtained by squaring a linear transformation of a statistic. Squared families are singular, however their singularity can easily be handled so that they form regular models. After handling the singularity, squared families possess many convenient properties. Their Fisher information is a conformal transformation of the Hess… ▽ More

    Submitted 26 March, 2025; originally announced March 2025.

    Comments: 43 pages. Preprint

  4. arXiv:2502.14298  [pdf, other

    cs.LG stat.ML

    Generalization Certificates for Adversarially Robust Bayesian Linear Regression

    Authors: Mahalakshmi Sabanayagam, Russell Tsuchida, Cheng Soon Ong, Debarghya Ghoshdastidar

    Abstract: Adversarial robustness of machine learning models is critical to ensuring reliable performance under data perturbations. Recent progress has been on point estimators, and this paper considers distributional predictors. First, using the link between exponential families and Bregman divergences, we formulate an adversarial Bregman divergence loss as an adversarial negative log-likelihood. Using the… ▽ More

    Submitted 20 February, 2025; originally announced February 2025.

    Comments: Under review

  5. arXiv:2412.13559  [pdf, other

    cs.LG

    Indirect Query Bayesian Optimization with Integrated Feedback

    Authors: Mengyan Zhang, Shahine Bouabid, Cheng Soon Ong, Seth Flaxman, Dino Sejdinovic

    Abstract: We develop the framework of Indirect Query Bayesian Optimization (IQBO), a new class of Bayesian optimization problems where the integrated feedback is given via a conditional expectation of the unknown function $f$ to be optimized. The underlying conditional distribution can be unknown and learned from data. The goal is to find the global optimum of $f$ by adaptively querying and observing in the… ▽ More

    Submitted 18 December, 2024; originally announced December 2024.

    Comments: Preliminary work. Under review

  6. arXiv:2411.02414  [pdf, other

    cs.CY cs.LG

    Fairness Evaluation with Item Response Theory

    Authors: Ziqi Xu, Sevvandi Kandanaarachchi, Cheng Soon Ong, Eirini Ntoutsi

    Abstract: Item Response Theory (IRT) has been widely used in educational psychometrics to assess student ability, as well as the difficulty and discrimination of test questions. In this context, discrimination specifically refers to how effectively a question distinguishes between students of different ability levels, and it does not carry any connotation related to fairness. In recent years, IRT has been s… ▽ More

    Submitted 20 October, 2024; originally announced November 2024.

  7. arXiv:2409.06142  [pdf, other

    stat.ML cs.LG

    Variational Search Distributions

    Authors: Daniel M. Steinberg, Rafael Oliveira, Cheng Soon Ong, Edwin V. Bonilla

    Abstract: We develop VSD, a method for conditioning a generative model of discrete, combinatorial designs on a rare desired class by efficiently evaluating a black-box (e.g. experiment, simulation) in a batch sequential manner. We call this task active generation; we formalize active generation's requirements and desiderata, and formulate a solution via variational inference. VSD uses off-the-shelf gradient… ▽ More

    Submitted 26 April, 2025; v1 submitted 9 September, 2024; originally announced September 2024.

    Comments: Accepted as a poster in the thirteenth International Conference on Learning Representations (ICLR), 2025

    ACM Class: G.3; G.2.1; I.2.6

  8. arXiv:2409.01656  [pdf, ps, other

    stat.ML cs.DM cs.LG math.CO

    Graphons of Line Graphs

    Authors: Sevvandi Kandanaarachchi, Cheng Soon Ong

    Abstract: We consider the problem of estimating graph limits, known as graphons, from observations of sequences of sparse finite graphs. In this paper we show a simple method that can shed light on a subset of sparse graphs. The method involves mapping the original graphs to their line graphs. We show that graphs satisfying a particular property, which we call the square-degree property are sparse, but give… ▽ More

    Submitted 4 July, 2025; v1 submitted 3 September, 2024; originally announced September 2024.

  9. arXiv:2402.09608  [pdf, other

    cs.LG stat.ML

    Exact, Fast and Expressive Poisson Point Processes via Squared Neural Families

    Authors: Russell Tsuchida, Cheng Soon Ong, Dino Sejdinovic

    Abstract: We introduce squared neural Poisson point processes (SNEPPPs) by parameterising the intensity function by the squared norm of a two layer neural network. When the hidden layer is fixed and the second layer has a single neuron, our approach resembles previous uses of squared Gaussian process or kernel methods, but allowing the hidden layer to be learnt allows for additional flexibility. In many cas… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

    Comments: AAAI 2024 camera ready submission

  10. arXiv:2307.04993  [pdf, other

    cs.LG astro-ph.CO astro-ph.GA astro-ph.IM

    Uncertainty Quantification of the Virial Black Hole Mass with Conformal Prediction

    Authors: Suk Yee Yong, Cheng Soon Ong

    Abstract: Precise measurements of the black hole mass are essential to gain insight on the black hole and host galaxy co-evolution. A direct measure of the black hole mass is often restricted to nearest galaxies and instead, an indirect method using the single-epoch virial black hole mass estimation is used for objects at high redshifts. However, this method is subjected to biases and uncertainties as it is… ▽ More

    Submitted 10 July, 2023; originally announced July 2023.

    Comments: Accepted for publication in MNRAS. 15 pages, 11 figures, 2 tables

  11. arXiv:2305.13552  [pdf, other

    cs.LG cs.AI stat.ML

    Squared Neural Families: A New Class of Tractable Density Models

    Authors: Russell Tsuchida, Cheng Soon Ong, Dino Sejdinovic

    Abstract: Flexible models for probability distributions are an essential ingredient in many machine learning tasks. We develop and investigate a new class of probability distributions, which we call a Squared Neural Family (SNEFY), formed by squaring the 2-norm of a neural network and normalising it with respect to a base measure. Following the reasoning similar to the well established connections between i… ▽ More

    Submitted 25 October, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: Spotlight award at NeurIPS 2023

  12. arXiv:2211.05943  [pdf, other

    cs.LG stat.ML

    Deep equilibrium models as estimators for continuous latent variables

    Authors: Russell Tsuchida, Cheng Soon Ong

    Abstract: Principal Component Analysis (PCA) and its exponential family extensions have three components: observations, latents and parameters of a linear transformation. We consider a generalised setting where the canonical parameters of the exponential family are a nonlinear transformation of the latents. We show explicit relationships between particular neural network architectures and the corresponding… ▽ More

    Submitted 10 November, 2022; originally announced November 2022.

    Comments: 25 pages

  13. arXiv:2206.14648  [pdf, other

    cs.IR cs.LG

    Two-Stage Neural Contextual Bandits for Personalised News Recommendation

    Authors: Mengyan Zhang, Thanh Nguyen-Tang, Fangzhao Wu, Zhenyu He, Xing Xie, Cheng Soon Ong

    Abstract: We consider the problem of personalised news recommendation where each user consumes news in a sequential fashion. Existing personalised news recommendation methods focus on exploiting user interests and ignores exploration in recommendation, which leads to biased feedback loops and hurt recommendation quality in the long term. We build on contextual bandits recommendation strategies which natural… ▽ More

    Submitted 26 June, 2022; originally announced June 2022.

  14. arXiv:2112.13029  [pdf, other

    cs.LG stat.ML

    Gaussian Process Bandits with Aggregated Feedback

    Authors: Mengyan Zhang, Russell Tsuchida, Cheng Soon Ong

    Abstract: We consider the continuum-armed bandits problem, under a novel setting of recommending the best arms within a fixed budget under aggregated feedback. This is motivated by applications where the precise rewards are impossible or expensive to obtain, while an aggregated reward or feedback, such as the average over a subset, is available. We constrain the set of reward functions by assuming that they… ▽ More

    Submitted 24 December, 2021; originally announced December 2021.

    Comments: to be published in 36th AAAI Conference on Artificial Intelligence (2022)

  15. arXiv:2111.13802  [pdf, other

    cs.LG cs.CE

    Factorized Fourier Neural Operators

    Authors: Alasdair Tran, Alexander Mathews, Lexing Xie, Cheng Soon Ong

    Abstract: We propose the Factorized Fourier Neural Operator (F-FNO), a learning-based approach for simulating partial differential equations (PDEs). Starting from a recently proposed Fourier representation of flow fields, the F-FNO bridges the performance gap between pure machine learning approaches to that of the best numerical or hybrid solvers. This is achieved with new representations - separable spectr… ▽ More

    Submitted 2 March, 2023; v1 submitted 26 November, 2021; originally announced November 2021.

    Comments: Published in The Eleventh International Conference on Learning Representations (2023). Code is available at https://github.com/alasdairtran/fourierflow

  16. Radflow: A Recurrent, Aggregated, and Decomposable Model for Networks of Time Series

    Authors: Alasdair Tran, Alexander Mathews, Cheng Soon Ong, Lexing Xie

    Abstract: We propose a new model for networks of time series that influence each other. Graph structures among time series are found in diverse domains, such as web traffic influenced by hyperlinks, product sales influenced by recommendation, or urban transport volume influenced by road networks and weather. There has been recent progress in graph modeling and in time series forecasting, respectively, but a… ▽ More

    Submitted 14 February, 2021; originally announced February 2021.

    Comments: Published in The Web Conference 2021. Code is available at https://github.com/alasdairtran/radflow

    Journal ref: Proceedings of The Web Conference 2021 (WWW '21)

  17. arXiv:2010.11568  [pdf, other

    cs.LG

    Quantile Bandits for Best Arms Identification

    Authors: Mengyan Zhang, Cheng Soon Ong

    Abstract: We consider a variant of the best arm identification task in stochastic multi-armed bandits. Motivated by risk-averse decision-making problems, our goal is to identify a set of $m$ arms with the highest $τ$-quantile values within a fixed budget. We prove asymmetric two-sided concentration inequalities for order statistics and quantiles of random variables that have non-decreasing hazard rate, whic… ▽ More

    Submitted 21 February, 2023; v1 submitted 22 October, 2020; originally announced October 2020.

    Comments: Proceedings of the 38th International Conference on Machine Learning, 2021; Post-publication update in Appendix E

  18. arXiv:1901.11311  [pdf, other

    cs.LG stat.ML

    New Tricks for Estimating Gradients of Expectations

    Authors: Christian J. Walder, Paul Roussel, Richard Nock, Cheng Soon Ong, Masashi Sugiyama

    Abstract: We introduce a family of pairwise stochastic gradient estimators for gradients of expectations, which are related to the log-derivative trick, but involve pairwise interactions between samples. The simplest example of our new estimator, dubbed the fundamental trick estimator, is shown to arise from either a) introducing and approximating an integral representation based on the fundamental theorem… ▽ More

    Submitted 19 April, 2022; v1 submitted 31 January, 2019; originally announced January 2019.

  19. arXiv:1901.06125  [pdf, other

    cs.IR cs.LG stat.ML

    Cold-start Playlist Recommendation with Multitask Learning

    Authors: Dawei Chen, Cheng Soon Ong, Aditya Krishna Menon

    Abstract: Playlist recommendation involves producing a set of songs that a user might enjoy. We investigate this problem in three cold-start scenarios: (i) cold playlists, where we recommend songs to form new personalised playlists for an existing user; (ii) cold users, where we recommend songs to form new playlists for a new user; and (iii) cold songs, where we recommend newly released songs to extend user… ▽ More

    Submitted 18 January, 2019; originally announced January 2019.

    Comments: 15 pages

    MSC Class: 68T05

  20. arXiv:1806.02977  [pdf, other

    cs.LG stat.ML

    Monge blunts Bayes: Hardness Results for Adversarial Training

    Authors: Zac Cranko, Aditya Krishna Menon, Richard Nock, Cheng Soon Ong, Zhan Shi, Christian Walder

    Abstract: The last few years have seen a staggering number of empirical studies of the robustness of neural networks in a model of adversarial perturbations of their inputs. Most rely on an adversary which carries out local modifications within prescribed balls. None however has so far questioned the broader picture: how to frame a resource-bounded adversary so that it can be severely detrimental to learnin… ▽ More

    Submitted 7 May, 2019; v1 submitted 8 June, 2018; originally announced June 2018.

    ACM Class: I.2.6

  21. arXiv:1806.01488  [pdf, ps, other

    cs.LG cs.AI stat.ML

    A Primer on Causal Analysis

    Authors: Finnian Lattimore, Cheng Soon Ong

    Abstract: We provide a conceptual map to navigate causal analysis problems. Focusing on the case of discrete random variables, we consider the case of causal effect estimation from observational data. The presented approaches apply also to continuous variables, but the issue of estimation becomes more complex. We then introduce the four schools of thought for causal analysis

    Submitted 5 June, 2018; originally announced June 2018.

    Comments: Parts of this document are copied verbatim from Finnian Lattimore's PhD thesis, ANU 2018

  22. arXiv:1710.04394  [pdf, other

    cs.LG

    Provably Fair Representations

    Authors: Daniel McNamara, Cheng Soon Ong, Robert C. Williamson

    Abstract: Machine learning systems are increasingly used to make decisions about people's lives, such as whether to give someone a loan or whether to interview someone for a job. This has led to considerable interest in making such machine learning systems fair. One approach is to transform the input data used by the algorithm. This can be achieved by passing each input data point through a representation f… ▽ More

    Submitted 12 October, 2017; originally announced October 2017.

  23. arXiv:1708.05165  [pdf, other

    cs.LG

    Revisiting revisits in trajectory recommendation

    Authors: Aditya Krishna Menon, Dawei Chen, Lexing Xie, Cheng Soon Ong

    Abstract: Trajectory recommendation is the problem of recommending a sequence of places in a city for a tourist to visit. It is strongly desirable for the recommended sequence to avoid loops, as tourists typically would not wish to revisit the same location. Given some learned model that scores sequences, how can we then find the highest-scoring sequence that is loop-free? This paper studies this problem, w… ▽ More

    Submitted 17 August, 2017; originally announced August 2017.

    Comments: 6 pages

    MSC Class: 68T05

  24. arXiv:1707.01627  [pdf, other

    cs.HC

    PathRec: Visual Analysis of Travel Route Recommendations

    Authors: Dawei Chen, Dongwoo Kim, Lexing Xie, Minjeong Shin, Aditya Krishna Menon, Cheng Soon Ong, Iman Avazpour, John Grundy

    Abstract: We present an interactive visualisation tool for recommending travel trajectories. This system is based on new machine learning formulations and algorithms for the sequence recommendation problem. The system starts from a map-based overview, taking an interactive query as starting point. It then breaks down contributions from different geographical and user behavior features, and those from indivi… ▽ More

    Submitted 18 July, 2017; v1 submitted 5 July, 2017; originally announced July 2017.

    Comments: 3 pages with appendix

    MSC Class: 68T05; 68U35

  25. arXiv:1706.09067  [pdf, other

    cs.IR

    Structured Recommendation

    Authors: Dawei Chen, Lexing Xie, Aditya Krishna Menon, Cheng Soon Ong

    Abstract: Current recommender systems largely focus on static, unstructured content. In many scenarios, we would like to recommend content that has structure, such as a trajectory of points-of-interests in a city, or a playlist of songs. Dubbed Structured Recommendation, this problem differs from the typical structured prediction problem in that there are multiple correct answers for a given input. Motivate… ▽ More

    Submitted 27 June, 2017; originally announced June 2017.

    Comments: 18 pages

    MSC Class: 68T05

  26. arXiv:1611.03125  [pdf, other

    cs.LG

    A Modular Theory of Feature Learning

    Authors: Daniel McNamara, Cheng Soon Ong, Robert C. Williamson

    Abstract: Learning representations of data, and in particular learning features for a subsequent prediction task, has been a fruitful area of research delivering impressive empirical results in recent years. However, relatively little is understood about what makes a representation `good'. We propose the idea of a risk gap induced by representation learning for a given prediction context, which measures the… ▽ More

    Submitted 9 November, 2016; originally announced November 2016.

  27. arXiv:1609.06831  [pdf, other

    cs.LG stat.ML

    Hawkes Processes with Stochastic Excitations

    Authors: Young Lee, Kar Wai Lim, Cheng Soon Ong

    Abstract: We propose an extension to Hawkes processes by treating the levels of self-excitation as a stochastic differential equation. Our new point process allows better approximation in application domains where events and intensities accelerate each other with correlated levels of contagion. We generalize a recent algorithm for simulating draws from Hawkes processes whose levels of excitation are stochas… ▽ More

    Submitted 22 September, 2016; originally announced September 2016.

    Comments: Copy of ICML paper

    Journal ref: Proceedings of The 33rd International Conference on Machine Learning (ICML), pp. 79-88. JMLR. 2016

  28. Learning Points and Routes to Recommend Trajectories

    Authors: Dawei Chen, Cheng Soon Ong, Lexing Xie

    Abstract: The problem of recommending tours to travellers is an important and broadly studied area. Suggested solutions include various approaches of points-of-interest (POI) recommendation and route planning. We consider the task of recommending a sequence of POIs, that simultaneously uses information about POIs and routes. Our approach unifies the treatment of various sources of information by representin… ▽ More

    Submitted 25 August, 2016; originally announced August 2016.

  29. arXiv:1608.05921  [pdf, other

    stat.ML cs.AI cs.LG

    Probabilistic Knowledge Graph Construction: Compositional and Incremental Approaches

    Authors: Dongwoo Kim, Lexing Xie, Cheng Soon Ong

    Abstract: Knowledge graph construction consists of two tasks: extracting information from external resources (knowledge population) and inferring missing information through a statistical analysis on the extracted information (knowledge completion). In many cases, insufficient external resources in the knowledge population hinder the subsequent statistical inference. The gap between these two processes can… ▽ More

    Submitted 5 September, 2016; v1 submitted 21 August, 2016; originally announced August 2016.

    Comments: The 25th ACM International Conference on Information and Knowledge Management (CIKM 2016)

  30. arXiv:1607.00360  [pdf, other

    cs.LG stat.ML

    A scaled Bregman theorem with applications

    Authors: Richard Nock, Aditya Krishna Menon, Cheng Soon Ong

    Abstract: Bregman divergences play a central role in the design and analysis of a range of machine learning algorithms. This paper explores the use of Bregman divergences to establish reductions between such algorithms and their analyses. We present a new scaled isodistortion theorem involving Bregman divergences (scaled Bregman theorem for short) which shows that certain "Bregman distortions'" (employing a… ▽ More

    Submitted 1 July, 2016; originally announced July 2016.

  31. arXiv:1410.4391  [pdf, other

    stat.ML cs.LG

    Multivariate Spearman's rho for aggregating ranks using copulas

    Authors: Justin Bedo, Cheng Soon Ong

    Abstract: We study the problem of rank aggregation: given a set of ranked lists, we want to form a consensus ranking. Furthermore, we consider the case of extreme lists: i.e., only the rank of the best or worst elements are known. We impute missing ranks by the average value and generalise Spearman's ρto extreme ranks. Our main contribution is the derivation of a non-parametric estimator for rank aggregatio… ▽ More

    Submitted 1 December, 2016; v1 submitted 16 October, 2014; originally announced October 2014.

    Journal ref: Journal of Machine Learning Research, 17(201):1-30, 2016

  32. arXiv:1402.6013  [pdf, ps, other

    cs.LG cs.DL

    Open science in machine learning

    Authors: Joaquin Vanschoren, Mikio L. Braun, Cheng Soon Ong

    Abstract: We present OpenML and mldata, open science platforms that provides easy access to machine learning data, software and results to encourage further study and application. They go beyond the more traditional repositories for data sets and software packages in that they allow researchers to also easily share the results they obtained in experiments and to compare their solutions with those of others.

    Submitted 24 February, 2014; originally announced February 2014.