Showing 1–17 of 17 results for author: Zhong, M

Searching in archive stat.
  1. arXiv:2503.01077  [pdf, other]

    stat.ML cs.LG math.NA

    Learning Stochastic Dynamical Systems with Structured Noise

    Authors: Ziheng Guo, James Greene, Ming Zhong

    Abstract: Stochastic differential equations (SDEs) are a ubiquitous modeling framework that finds applications in physics, biology, engineering, social science, and finance. Due to the availability of large-scale data sets, there is growing interest in learning mechanistic models from observations with stochastic noise. In this work, we present a nonparametric framework to learn both the drift and diffusion…

    Submitted 2 March, 2025; originally announced March 2025.

  2. arXiv:2406.08335  [pdf, other]

    cs.LG cs.AI cs.DB stat.CO

    A Survey of Pipeline Tools for Data Engineering

    Authors: Anthony Mbata, Yaji Sripada, Mingjun Zhong

    Abstract: Currently, a variety of pipeline tools are available for use in data engineering. Data scientists can use these tools to resolve data wrangling issues associated with data and accomplish some data engineering tasks from data ingestion through data preparation to utilization as input for machine learning (ML). Some of these tools have essential built-in components or can be combined with other tool…

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 18 pages, 7 figures

  3. arXiv:2010.03729  [pdf, other]

    stat.ML cs.LG math.DS math.ST

    Learning Theory for Inferring Interaction Kernels in Second-Order Interacting Agent Systems

    Authors: Jason Miller, Sui Tang, Ming Zhong, Mauro Maggioni

    Abstract: Modeling the complex interactions of systems of particles or agents is a fundamental scientific and mathematical problem that is studied in diverse fields, ranging from physics and biology, to economics and machine learning. In this work, we describe a very general second-order, heterogeneous, multivariable, interacting agent model, with an environment, that encompasses a wide variety of known sys…

    Submitted 7 October, 2020; originally announced October 2020.

    Comments: 68 pages

    MSC Class: 62Gxx; 37Nxx; 68Txx

  4. arXiv:1912.11123  [pdf, other]

    cs.LG math.DS nlin.AO stat.ML

    Data-driven Discovery of Emergent Behaviors in Collective Dynamics

    Authors: Mauro Maggioni, Jason Miller, Ming Zhong

    Abstract: Particle- and agent-based systems are a ubiquitous modeling tool in many disciplines. We consider the fundamental problem of inferring interaction kernels from observations of agent-based dynamical systems given observations of trajectories, in particular for collective dynamical systems exhibiting emergent behaviors with complicated interaction kernels, in a nonparametric fashion, and for kernels…

    Submitted 30 March, 2020; v1 submitted 23 December, 2019; originally announced December 2019.

  5. arXiv:1910.01578  [pdf, other]

    cs.LG stat.ML

    GDP: Generalized Device Placement for Dataflow Graphs

    Authors: Yanqi Zhou, Sudip Roy, Amirali Abdolrashidi, Daniel Wong, Peter C. Ma, Qiumin Xu, Ming Zhong, Hanxiao Liu, Anna Goldie, Azalia Mirhoseini, James Laudon

    Abstract: Runtime and scalability of large neural networks can be significantly affected by the placement of operations in their dataflow graphs on suitable devices. With increasingly complex neural network architectures and heterogeneous device characteristics, finding a reasonable placement is extremely challenging even for domain experts. Most existing automated device placement approaches are impractica…

    Submitted 28 September, 2019; originally announced October 2019.

  6. arXiv:1909.12301  [pdf, other]

    cs.IR cs.LG stat.ML

    DBRec: Dual-Bridging Recommendation via Discovering Latent Groups

    Authors: Jingwei Ma, Jiahui Wen, Mingyang Zhong, Liangchen Liu, Chaojie Li, Weitong Chen, Yin Yang, Honghui Tu, Xue Li

    Abstract: In recommender systems, the user-item interaction data is usually sparse and not sufficient for learning comprehensive user/item representations for recommendation. To address this problem, we propose a novel dual-bridging recommendation model (DBRec). DBRec performs latent user/item group discovery simultaneously with collaborative filtering, and interacts group information with users/items for b…

    Submitted 16 October, 2019; v1 submitted 26 September, 2019; originally announced September 2019.

    Comments: 10 pages, 16 figures, The 28th ACM International Conference on Information and Knowledge Management (CIKM '19)

  7. arXiv:1907.04710  [pdf, other]

    cs.LG stat.ML

    Trust-Region Variational Inference with Gaussian Mixture Models

    Authors: Oleg Arenz, Mingjun Zhong, Gerhard Neumann

    Abstract: Many methods for machine learning rely on approximate inference from intractable probability distributions. Variational inference approximates such distributions by tractable models that can be subsequently used for approximate inference. Learning sufficiently accurate approximations requires a rich model family and careful exploration of the relevant modes of the target distribution. We propose a…

    Submitted 4 August, 2020; v1 submitted 10 July, 2019; originally announced July 2019.

    Journal ref: Journal of Machine Learning Research. 21(163):1-60, 2020

  8. arXiv:1902.08835  [pdf, other]

    cs.LG stat.ML

    Transfer Learning for Non-Intrusive Load Monitoring

    Authors: Michele D'Incecco, Stefano Squartini, Mingjun Zhong

    Abstract: Non-intrusive load monitoring (NILM) is a technique to recover source appliances from only the recorded mains in a household. NILM is unidentifiable and thus a challenge problem because the inferred power value of an appliance given only the mains could not be unique. To mitigate the unidentifiable problem, various methods incorporating domain knowledge into NILM have been proposed and shown effec…

    Submitted 13 September, 2019; v1 submitted 23 February, 2019; originally announced February 2019.

    Comments: 10 pages, 12 Figures

    Journal ref: IEEE Transactions on Smart Grid, 2019

  9. Nonparametric inference of interaction laws in systems of agents from trajectory data

    Authors: Fei Lu, Mauro Maggioni, Sui Tang, Ming Zhong

    Abstract: Inferring the laws of interaction between particles and agents in complex dynamical systems from observational data is a fundamental challenge in a wide variety of disciplines. We propose a non-parametric statistical learning approach to estimate the governing laws of distance-based interactions, with no reference or assumption about their analytical form, from data consisting trajectories of inte…

    Submitted 23 March, 2019; v1 submitted 14 December, 2018; originally announced December 2018.

  10. arXiv:1810.09253  [pdf]

    eess.SP cs.LG stat.ML

    Classification of normal/abnormal heart sound recordings based on multi-domain features and back propagation neural network

    Authors: Hong Tang, Huaming Chen, Ting Li, Mingjun Zhong

    Abstract: This paper aims to classify a single PCG recording as normal or abnormal for computer-aided diagnosis. The proposed framework for this challenge has four steps: preprocessing, feature extraction, training and validation. In the preprocessing step, a recording is segmented into four states, i.e., the first heart sound, systolic interval, the second heart sound, and diastolic interval by the Springe…

    Submitted 17 October, 2018; originally announced October 2018.

    Comments: 4 pages

    Journal ref: 2016 Computing in Cardiology Conference (CinC), IEEE, Vancouver, BC, 2016, pp. 593-596

  11. arXiv:1806.00159  [pdf, other]

    stat.ML cs.LG

    Neural Control Variates for Variance Reduction

    Authors: Ruosi Wan, Mingjun Zhong, Haoyi Xiong, Zhanxing Zhu

    Abstract: In statistics and machine learning, approximation of an intractable integration is often achieved by using the unbiased Monte Carlo estimator, but the variances of the estimation are generally high in many applications. Control variates approaches are well-known to reduce the variance of the estimation. These control variates are typically constructed by employing predefined parametric functions o…

    Submitted 15 October, 2019; v1 submitted 31 May, 2018; originally announced June 2018.

    Comments: Published as a conference paper at ECML PKDD 2019

  12. arXiv:1612.09106  [pdf, other]

    stat.AP cs.LG

    Sequence-to-point learning with neural networks for nonintrusive load monitoring

    Authors: Chaoyun Zhang, Mingjun Zhong, Zongzuo Wang, Nigel Goddard, Charles Sutton

    Abstract: Energy disaggregation (a.k.a nonintrusive load monitoring, NILM), a single-channel blind source separation problem, aims to decompose the mains which records the whole house electricity consumption into appliance-wise readings. This problem is difficult because it is inherently unidentifiable. Recent approaches have shown that the identifiability problem could be reduced by introducing domain know…

    Submitted 18 September, 2017; v1 submitted 29 December, 2016; originally announced December 2016.

    Comments: 8 pages, 3 figures

    Journal ref: The Thirty-Second AAAI Conference on Artificial Intelligence (AAAI-18), 2018

  13. arXiv:1510.09130  [pdf, other]

    stat.ML cs.AI stat.AP stat.ME

    Latent Bayesian melding for integrating individual and population models

    Authors: Mingjun Zhong, Nigel Goddard, Charles Sutton

    Abstract: In many statistical problems, a more coarse-grained model may be suitable for population-level behaviour, whereas a more detailed model is appropriate for accurate modelling of individual behaviour. This raises the question of how to integrate both types of models. Methods such as posterior regularization follow the idea of generalized moment matching, in that they allow matching expectations betw…

    Submitted 30 October, 2015; originally announced October 2015.

    Comments: 11 pages, Advances in Neural Information Processing Systems (NIPS), 2015. (Spotlight Presentation)

  14. arXiv:1502.00231  [pdf, ps, other]

    cs.LG stat.ML

    Feature Selection with Redundancy-complementariness Dispersion

    Authors: Zhijun Chen, Chaozhong Wu, Yishi Zhang, Zhen Huang, Bin Ran, Ming Zhong, Nengchao Lyu

    Abstract: Feature selection has attracted significant attention in data mining and machine learning in the past decades. Many existing feature selection methods eliminate redundancy by measuring pairwise inter-correlation of features, whereas the complementariness of features and higher inter-correlation among more than two features are ignored. In this study, a modification item concerning the complementar…

    Submitted 1 February, 2015; originally announced February 2015.

    Comments: 28 pages, 13 figures, 7 tables

    MSC Class: 68T10; 94A17; 62B10; 68U35 ACM Class: I.5.2; H.1.1

  15. arXiv:1406.7665  [pdf, other]

    stat.AP

    Interleaved Factorial Non-Homogeneous Hidden Markov Models for Energy Disaggregation

    Authors: Mingjun Zhong, Nigel Goddard, Charles Sutton

    Abstract: To reduce energy demand in households it is useful to know which electrical appliances are in use at what times. Monitoring individual appliances is costly and intrusive, whereas data on overall household electricity use is more easily obtained. In this paper, we consider the energy disaggregation problem where a household's electricity consumption is disaggregated into the component appliances. T…

    Submitted 30 June, 2014; originally announced June 2014.

    Comments: 5 pages, 1 figure, conference, The NIPS workshop on Machine Learning for Sustainability, Lake Tahoe, NV, USA, 2013

  16. arXiv:1210.3456  [pdf, other]

    stat.AP cs.LG q-bio.GN q-bio.MN stat.ML

    Bayesian Analysis for miRNA and mRNA Interactions Using Expression Data

    Authors: Mingjun Zhong, Rong Liu, Bo Liu

    Abstract: MicroRNAs (miRNAs) are small RNA molecules composed of 19-22 nt, which play important regulatory roles in post-transcriptional gene regulation by inhibiting the translation of the mRNA into proteins or otherwise cleaving the target mRNA. Inferring miRNA targets provides useful information for understanding the roles of miRNA in biological processes that are potentially involved in complex diseases…

    Submitted 30 June, 2014; v1 submitted 12 October, 2012; originally announced October 2012.

    Comments: 21 pages, 11 figures, 8 tables

  17. arXiv:1206.4666  [pdf]

    stat.CO cs.LG stat.ME

    A Bayesian Approach to Approximate Joint Diagonalization of Square Matrices

    Authors: Mingjun Zhong, Mark Girolami

    Abstract: We present a Bayesian scheme for the approximate diagonalisation of several square matrices which are not necessarily symmetric. A Gibbs sampler is derived to simulate samples of the common eigenvectors and the eigenvalues for these matrices. Several synthetic examples are used to illustrate the performance of the proposed Gibbs sampler and we then provide comparisons to several other joint diagon…

    Submitted 18 June, 2012; originally announced June 2012.

    Comments: ICML2012