Skip to main content

Showing 1–39 of 39 results for author: Guo, D

Searching in archive stat. Search in all archives.
.
  1. arXiv:2504.16667  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Representation Learning via Non-Contrastive Mutual Information

    Authors: Zhaohan Daniel Guo, Bernardo Avila Pires, Khimya Khetarpal, Dale Schuurmans, Bo Dai

    Abstract: Labeling data is often very time consuming and expensive, leaving us with a majority of unlabeled data. Self-supervised representation learning methods such as SimCLR (Chen et al., 2020) or BYOL (Grill et al., 2020) have been very successful at learning meaningful latent representations from unlabeled image data, resulting in much more general and transferable representations for downstream tasks.… ▽ More

    Submitted 23 April, 2025; originally announced April 2025.

    ACM Class: I.2.6; I.2.10

  2. arXiv:2503.04453  [pdf

    stat.ML cs.LG physics.med-ph

    Reproducibility Assessment of Magnetic Resonance Spectroscopy of Pregenual Anterior Cingulate Cortex across Sessions and Vendors via the Cloud Computing Platform CloudBrain-MRS

    Authors: Runhan Chen, Meijin Lin, Jianshu Chen, Liangjie Lin, Jiazheng Wang, Xiaoqing Li, Jianhua Wang, Xu Huang, Ling Qian, Shaoxing Liu, Yuan Long, Di Guo, Xiaobo Qu, Haiwei Han

    Abstract: Given the need to elucidate the mechanisms underlying illnesses and their treatment, as well as the lack of harmonization of acquisition and post-processing protocols among different magnetic resonance system vendors, this work is to determine if metabolite concentrations obtained from different sessions, machine models and even different vendors of 3 T scanners can be highly reproducible and be p… ▽ More

    Submitted 6 March, 2025; originally announced March 2025.

  3. arXiv:2501.07035  [pdf, other

    stat.CO

    Parallel ADMM Algorithm with Gaussian Back Substitution for High-Dimensional Quantile Regression and Classification

    Authors: Xiaofei Wu, Dingzi Guo, Rongmei Liang, Zhimin Zhang

    Abstract: In the field of high-dimensional data analysis, modeling methods based on quantile loss function are highly regarded due to their ability to provide a comprehensive statistical perspective and effective handling of heterogeneous data. In recent years, many studies have focused on using the parallel alternating direction method of multipliers (P-ADMM) to solve high-dimensional quantile regression a… ▽ More

    Submitted 12 January, 2025; originally announced January 2025.

  4. arXiv:2403.08635  [pdf, other

    cs.LG cs.AI stat.ML

    Human Alignment of Large Language Models through Online Preference Optimisation

    Authors: Daniele Calandriello, Daniel Guo, Remi Munos, Mark Rowland, Yunhao Tang, Bernardo Avila Pires, Pierre Harvey Richemond, Charline Le Lan, Michal Valko, Tianqi Liu, Rishabh Joshi, Zeyu Zheng, Bilal Piot

    Abstract: Ensuring alignment of language models' outputs with human preferences is critical to guarantee a useful, safe, and pleasant user experience. Thus, human alignment has been extensively studied recently and several methods such as Reinforcement Learning from Human Feedback (RLHF), Direct Policy Optimisation (DPO) and Sequence Likelihood Calibration (SLiC) have emerged. In this paper, our contributio… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

  5. arXiv:2312.00886  [pdf, other

    stat.ML cs.AI cs.GT cs.LG cs.MA

    Nash Learning from Human Feedback

    Authors: Rémi Munos, Michal Valko, Daniele Calandriello, Mohammad Gheshlaghi Azar, Mark Rowland, Zhaohan Daniel Guo, Yunhao Tang, Matthieu Geist, Thomas Mesnard, Andrea Michi, Marco Selvi, Sertan Girgin, Nikola Momchev, Olivier Bachem, Daniel J. Mankowitz, Doina Precup, Bilal Piot

    Abstract: Reinforcement learning from human feedback (RLHF) has emerged as the main paradigm for aligning large language models (LLMs) with human preferences. Typically, RLHF involves the initial step of learning a reward model from human feedback, often expressed as preferences between pairs of text generations produced by a pre-trained LLM. Subsequently, the LLM's policy is fine-tuned by optimizing it to… ▽ More

    Submitted 11 June, 2024; v1 submitted 1 December, 2023; originally announced December 2023.

  6. arXiv:2310.12036  [pdf, other

    cs.AI cs.LG stat.ML

    A General Theoretical Paradigm to Understand Learning from Human Preferences

    Authors: Mohammad Gheshlaghi Azar, Mark Rowland, Bilal Piot, Daniel Guo, Daniele Calandriello, Michal Valko, Rémi Munos

    Abstract: The prevalent deployment of learning from human preferences through reinforcement learning (RLHF) relies on two important approximations: the first assumes that pairwise preferences can be substituted with pointwise rewards. The second assumes that a reward model trained on these pointwise rewards can generalize from collected data to out-of-distribution data sampled by the policy. Recently, Direc… ▽ More

    Submitted 21 November, 2023; v1 submitted 18 October, 2023; originally announced October 2023.

  7. arXiv:2206.08332  [pdf, other

    cs.LG cs.AI stat.ML

    BYOL-Explore: Exploration by Bootstrapped Prediction

    Authors: Zhaohan Daniel Guo, Shantanu Thakoor, Miruna Pîslar, Bernardo Avila Pires, Florent Altché, Corentin Tallec, Alaa Saade, Daniele Calandriello, Jean-Bastien Grill, Yunhao Tang, Michal Valko, Rémi Munos, Mohammad Gheshlaghi Azar, Bilal Piot

    Abstract: We present BYOL-Explore, a conceptually simple yet general approach for curiosity-driven exploration in visually-complex environments. BYOL-Explore learns a world representation, the world dynamics, and an exploration policy all-together by optimizing a single prediction loss in the latent space with no additional auxiliary objective. We show that BYOL-Explore is effective in DM-HARD-8, a challeng… ▽ More

    Submitted 16 June, 2022; originally announced June 2022.

  8. arXiv:2203.01570  [pdf, other

    cs.LG stat.ME stat.ML

    Representing Mixtures of Word Embeddings with Mixtures of Topic Embeddings

    Authors: Dongsheng Wang, Dandan Guo, He Zhao, Huangjie Zheng, Korawat Tanwisuth, Bo Chen, Mingyuan Zhou

    Abstract: A topic model is often formulated as a generative model that explains how each word of a document is generated given a set of topics and document-specific topic proportions. It is focused on capturing the word co-occurrences in a document and hence often suffers from poor performance in analyzing short documents. In addition, its parameter estimation often relies on approximate posterior inference… ▽ More

    Submitted 14 March, 2022; v1 submitted 3 March, 2022; originally announced March 2022.

    Comments: Proceedings of ICLR, 2022

  9. arXiv:2201.12609  [pdf, other

    cs.RO cs.LG stat.ML

    ApolloRL: a Reinforcement Learning Platform for Autonomous Driving

    Authors: Fei Gao, Peng Geng, Jiaqi Guo, Yuan Liu, Dingfeng Guo, Yabo Su, Jie Zhou, Xiao Wei, Jin Li, Xu Liu

    Abstract: We introduce ApolloRL, an open platform for research in reinforcement learning for autonomous driving. The platform provides a complete closed-loop pipeline with training, simulation, and evaluation components. It comes with 300 hours of real-world data in driving scenarios and popular baselines such as Proximal Policy Optimization (PPO) and Soft Actor-Critic (SAC) agents. We elaborate in this pap… ▽ More

    Submitted 29 January, 2022; originally announced January 2022.

  10. arXiv:2110.09140  [pdf, other

    cs.LG stat.ML

    Learning Prototype-oriented Set Representations for Meta-Learning

    Authors: Dandan Guo, Long Tian, Minghe Zhang, Mingyuan Zhou, Hongyuan Zha

    Abstract: Learning from set-structured data is a fundamental problem that has recently attracted increasing attention, where a series of summary networks are introduced to deal with the set input. In fact, many meta-learning problems can be treated as set-input tasks. Most existing summary networks aim to design different architectures for the input set in order to enforce permutation invariance. However, s… ▽ More

    Submitted 7 March, 2023; v1 submitted 18 October, 2021; originally announced October 2021.

  11. Matching Visual Features to Hierarchical Semantic Topics for Image Paragraph Captioning

    Authors: Dandan Guo, Ruiying Lu, Bo Chen, Zequn Zeng, Mingyuan Zhou

    Abstract: Observing a set of images and their corresponding paragraph-captions, a challenging task is to learn how to produce a semantically coherent paragraph to describe the visual content of an image. Inspired by recent successes in integrating semantic topics into this task, this paper develops a plug-and-play hierarchical-topic-guided image paragraph generation framework, which couples a visual extract… ▽ More

    Submitted 25 July, 2022; v1 submitted 10 May, 2021; originally announced May 2021.

  12. Variational Temporal Deep Generative Model for Radar HRRP Target Recognition

    Authors: Dandan Guo, Bo Chen, Wenchao Chen, Chaojie Wang, Hongwei Liu, Mingyuan Zhou

    Abstract: We develop a recurrent gamma belief network (rGBN) for radar automatic target recognition (RATR) based on high-resolution range profile (HRRP), which characterizes the temporal dependence across the range cells of HRRP. The proposed rGBN adopts a hierarchy of gamma distributions to build its temporal deep generative model. For scalable training and fast out-of-sample prediction, we propose the hyb… ▽ More

    Submitted 27 September, 2020; originally announced September 2020.

  13. arXiv:2009.06681  [pdf, other

    eess.SP cs.IT stat.ML

    Deep Actor-Critic Learning for Distributed Power Control in Wireless Mobile Networks

    Authors: Yasar Sinan Nasir, Dongning Guo

    Abstract: Deep reinforcement learning offers a model-free alternative to supervised deep learning and classical optimization for solving the transmit power control problem in wireless networks. The multi-agent deep reinforcement learning approach considers each transmitter as an individual learning agent that determines its transmit power level by observing the local wireless environment. Following a certai… ▽ More

    Submitted 14 September, 2020; originally announced September 2020.

    Comments: 5 pages, 4 figures, to appear in the 54th Annual IEEE Asilomar Conference on Signals, Systems, and Computers, Nov 2020. This is an invited paper to the session Reinforcement Learning and Bandits for Communication Systems. To reproduce the results please see https://github.com/sinannasir/Power-Control-asilomar

  14. arXiv:2009.06228  [pdf, other

    cs.LG cs.CR stat.ML

    SAPAG: A Self-Adaptive Privacy Attack From Gradients

    Authors: Yijue Wang, Jieren Deng, Dan Guo, Chenghong Wang, Xianrui Meng, Hang Liu, Caiwen Ding, Sanguthevar Rajasekaran

    Abstract: Distributed learning such as federated learning or collaborative learning enables model training on decentralized data from users and only collects local gradients, where data is processed close to its sources for data privacy. The nature of not centralizing the training data addresses the privacy issue of privacy-sensitive data. Recent studies show that a third party can reconstruct the true trai… ▽ More

    Submitted 14 September, 2020; originally announced September 2020.

  15. arXiv:2008.00727  [pdf

    cs.LG cs.AI cs.NE stat.ML

    Deep Bayesian Bandits: Exploring in Online Personalized Recommendations

    Authors: Dalin Guo, Sofia Ira Ktena, Ferenc Huszar, Pranay Kumar Myana, Wenzhe Shi, Alykhan Tejani

    Abstract: Recommender systems trained in a continuous learning fashion are plagued by the feedback loop problem, also known as algorithmic bias. This causes a newly trained model to act greedily and favor items that have already been engaged by users. This behavior is particularly harmful in personalised ads recommendations, as it can also cause new campaigns to remain unexplored. Exploration aims to addres… ▽ More

    Submitted 3 August, 2020; originally announced August 2020.

  16. arXiv:2006.08804  [pdf, other

    cs.LG stat.AP stat.CO stat.ML

    Deep Autoencoding Topic Model with Scalable Hybrid Bayesian Inference

    Authors: Hao Zhang, Bo Chen, Yulai Cong, Dandan Guo, Hongwei Liu, Mingyuan Zhou

    Abstract: To build a flexible and interpretable model for document analysis, we develop deep autoencoding topic model (DATM) that uses a hierarchy of gamma distributions to construct its multi-stochastic-layer generative network. In order to provide scalable posterior inference for the parameters of the generative network, we develop topic-layer-adaptive stochastic gradient Riemannian MCMC that jointly lear… ▽ More

    Submitted 15 June, 2020; originally announced June 2020.

    Comments: To appear in IEEE Transactions on Pattern Analysis and Machine Intelligence. arXiv admin note: text overlap with arXiv:1803.01328

  17. arXiv:2006.07733  [pdf, other

    cs.LG cs.CV stat.ML

    Bootstrap your own latent: A new approach to self-supervised Learning

    Authors: Jean-Bastien Grill, Florian Strub, Florent Altché, Corentin Tallec, Pierre H. Richemond, Elena Buchatskaya, Carl Doersch, Bernardo Avila Pires, Zhaohan Daniel Guo, Mohammad Gheshlaghi Azar, Bilal Piot, Koray Kavukcuoglu, Rémi Munos, Michal Valko

    Abstract: We introduce Bootstrap Your Own Latent (BYOL), a new approach to self-supervised image representation learning. BYOL relies on two neural networks, referred to as online and target networks, that interact and learn from each other. From an augmented view of an image, we train the online network to predict the target network representation of the same image under a different augmented view. At the… ▽ More

    Submitted 10 September, 2020; v1 submitted 13 June, 2020; originally announced June 2020.

  18. arXiv:2003.13350  [pdf, other

    cs.LG stat.ML

    Agent57: Outperforming the Atari Human Benchmark

    Authors: Adrià Puigdomènech Badia, Bilal Piot, Steven Kapturowski, Pablo Sprechmann, Alex Vitvitskyi, Daniel Guo, Charles Blundell

    Abstract: Atari games have been a long-standing benchmark in the reinforcement learning (RL) community for the past decade. This benchmark was proposed to test general competency of RL algorithms. Previous work has achieved good average performance by doing outstandingly well on many games of the set, but very poorly in several of the most challenging games. We propose Agent57, the first deep RL agent that… ▽ More

    Submitted 30 March, 2020; originally announced March 2020.

  19. arXiv:2002.06038  [pdf, other

    cs.LG stat.ML

    Never Give Up: Learning Directed Exploration Strategies

    Authors: Adrià Puigdomènech Badia, Pablo Sprechmann, Alex Vitvitskyi, Daniel Guo, Bilal Piot, Steven Kapturowski, Olivier Tieleman, Martín Arjovsky, Alexander Pritzel, Andew Bolt, Charles Blundell

    Abstract: We propose a reinforcement learning agent to solve hard exploration games by learning a range of directed exploratory policies. We construct an episodic memory-based intrinsic reward using k-nearest neighbors over the agent's recent experience to train the directed exploratory policies, thereby encouraging the agent to repeatedly revisit all states in its environment. A self-supervised inverse dyn… ▽ More

    Submitted 14 February, 2020; originally announced February 2020.

    Comments: Published as a conference paper in ICLR 2020

  20. arXiv:1912.10337  [pdf, other

    cs.CL cs.LG stat.ME stat.ML

    Recurrent Hierarchical Topic-Guided RNN for Language Generation

    Authors: Dandan Guo, Bo Chen, Ruiying Lu, Mingyuan Zhou

    Abstract: To simultaneously capture syntax and global semantics from a text corpus, we propose a new larger-context recurrent neural network (RNN) based language model, which extracts recurrent hierarchical semantic structure via a dynamic deep topic model to guide natural language generation. Moving beyond a conventional RNN-based language model that ignores long-range word dependencies and sentence order,… ▽ More

    Submitted 27 June, 2020; v1 submitted 21 December, 2019; originally announced December 2019.

    Comments: ICML 2020

  21. arXiv:1909.05885  [pdf, other

    cs.CL cs.LG stat.ML

    Analyzing machine-learned representations: A natural language case study

    Authors: Ishita Dasgupta, Demi Guo, Samuel J. Gershman, Noah D. Goodman

    Abstract: As modern deep networks become more complex, and get closer to human-like capabilities in certain domains, the question arises of how the representations and decision rules they learn compare to the ones in humans. In this work, we study representations of sentences in one such artificial system for natural language processing. We first present a diagnostic test dataset to examine the degree of ab… ▽ More

    Submitted 12 September, 2019; originally announced September 2019.

    Comments: This article supersedes a previous article arXiv:1802.04302

  22. arXiv:1908.01384  [pdf, other

    cs.LG cs.CV stat.ML

    Simultaneous Clustering and Optimization for Evolving Datasets

    Authors: Yawei Zhao, En Zhu, Xinwang Liu, Chang Tang, Deke Guo, Jianping Yin

    Abstract: Simultaneous clustering and optimization (SCO) has recently drawn much attention due to its wide range of practical applications. Many methods have been previously proposed to solve this problem and obtain the optimal model. However, when a dataset evolves over time, those existing methods have to update the model frequently to guarantee accuracy; such updating is computationally infeasible. In th… ▽ More

    Submitted 4 August, 2019; originally announced August 2019.

    Comments: Accepted by IEEE Transactions on Knowledge and Data Engineering (TKDE)

  23. arXiv:1906.07805  [pdf, other

    cs.LG cs.AI stat.ML

    Directed Exploration for Reinforcement Learning

    Authors: Zhaohan Daniel Guo, Emma Brunskill

    Abstract: Efficient exploration is necessary to achieve good sample efficiency for reinforcement learning in general. From small, tabular settings such as gridworlds to large, continuous and sparse reward settings such as robotic object manipulation tasks, exploration through adding an uncertainty bonus to the reward function has been shown to be effective when the uncertainty is able to accurately drive ex… ▽ More

    Submitted 18 June, 2019; originally announced June 2019.

  24. arXiv:1811.06407  [pdf, other

    cs.LG stat.ML

    Neural Predictive Belief Representations

    Authors: Zhaohan Daniel Guo, Mohammad Gheshlaghi Azar, Bilal Piot, Bernardo A. Pires, Rémi Munos

    Abstract: Unsupervised representation learning has succeeded with excellent results in many applications. It is an especially powerful tool to learn a good representation of environments with partial or noisy observations. In partially observable domains it is important for the representation to encode a belief state, a sufficient statistic of the observations seen so far. In this paper, we investigate whet… ▽ More

    Submitted 19 August, 2019; v1 submitted 15 November, 2018; originally announced November 2018.

  25. arXiv:1810.11209  [pdf, other

    stat.ML cs.LG stat.CO stat.ME

    Deep Poisson gamma dynamical systems

    Authors: Dandan Guo, Bo Chen, Hao Zhang, Mingyuan Zhou

    Abstract: We develop deep Poisson-gamma dynamical systems (DPGDS) to model sequentially observed multivariate count data, improving previously proposed models by not only mining deep hierarchical latent structure from the data, but also capturing both first-order and long-range temporal dependencies. Using sophisticated but simple-to-implement data augmentation techniques, we derived closed-form Gibbs sampl… ▽ More

    Submitted 31 December, 2018; v1 submitted 26 October, 2018; originally announced October 2018.

    Comments: NeurIPS 2018

  26. arXiv:1808.00490  [pdf, other

    eess.SP cs.IT stat.ML

    Multi-Agent Deep Reinforcement Learning for Dynamic Power Allocation in Wireless Networks

    Authors: Yasar Sinan Nasir, Dongning Guo

    Abstract: This work demonstrates the potential of deep reinforcement learning techniques for transmit power control in wireless networks. Existing techniques typically find near-optimal power allocations by solving a challenging optimization problem. Most of these algorithms are not scalable to large networks in real-world scenarios because of their computational complexity and instantaneous cross-cell chan… ▽ More

    Submitted 8 April, 2019; v1 submitted 1 August, 2018; originally announced August 2018.

    Comments: 12 pages, 7 figures, submitted. v2: the updated title, in addition to improved readability. v3: revised

  27. arXiv:1807.03756  [pdf, other

    stat.ML cs.CL cs.LG

    Latent Alignment and Variational Attention

    Authors: Yuntian Deng, Yoon Kim, Justin Chiu, Demi Guo, Alexander M. Rush

    Abstract: Neural attention has become central to many state-of-the-art models in natural language processing and related domains. Attention networks are an easy-to-train and effective method for softly simulating alignment; however, the approach does not marginalize over latent alignments in a probabilistic sense. This property makes it difficult to compare attention to other alignment approaches, to compos… ▽ More

    Submitted 7 November, 2018; v1 submitted 10 July, 2018; originally announced July 2018.

    Comments: accepted by NIPS 2018

  28. arXiv:1806.02507  [pdf, other

    cs.LG stat.ML

    Large scale classification in deep neural network with Label Mapping

    Authors: Qizhi Zhang, Kuang-Chih Lee, Hongying Bao, Yuan You, Wenjie Li, Dongbai Guo

    Abstract: In recent years, deep neural network is widely used in machine learning. The multi-class classification problem is a class of important problem in machine learning. However, in order to solve those types of multi-class classification problems effectively, the required network size should have hyper-linear growth with respect to the number of classes. Therefore, it is infeasible to solve the multi-… ▽ More

    Submitted 7 June, 2018; originally announced June 2018.

  29. arXiv:1805.01049  [pdf, other

    cs.LG stat.ML

    Large-Scale Unsupervised Deep Representation Learning for Brain Structure

    Authors: Ayush Jaiswal, Dong Guo, Cauligi S. Raghavendra, Paul Thompson

    Abstract: Machine Learning (ML) is increasingly being used for computer aided diagnosis of brain related disorders based on structural magnetic resonance imaging (MRI) data. Most of such work employs biologically and medically meaningful hand-crafted features calculated from different regions of the brain. The construction of such highly specialized features requires a considerable amount of time, manual ov… ▽ More

    Submitted 2 May, 2018; originally announced May 2018.

  30. arXiv:1803.01328  [pdf, other

    stat.ML stat.AP stat.CO

    WHAI: Weibull Hybrid Autoencoding Inference for Deep Topic Modeling

    Authors: Hao Zhang, Bo Chen, Dandan Guo, Mingyuan Zhou

    Abstract: To train an inference network jointly with a deep generative topic model, making it both scalable to big corpora and fast in out-of-sample prediction, we develop Weibull hybrid autoencoding inference (WHAI) for deep latent Dirichlet allocation, which infers posterior samples via a hybrid of stochastic-gradient MCMC and autoencoding variational Bayes. The generative network of WHAI has a hierarchy… ▽ More

    Submitted 25 April, 2020; v1 submitted 4 March, 2018; originally announced March 2018.

    Comments: ICLR 2018

  31. arXiv:1802.04302  [pdf, other

    cs.CL stat.ML

    Evaluating Compositionality in Sentence Embeddings

    Authors: Ishita Dasgupta, Demi Guo, Andreas Stuhlmüller, Samuel J. Gershman, Noah D. Goodman

    Abstract: An important challenge for human-like AI is compositional semantics. Recent research has attempted to address this by using deep neural networks to learn vector space embeddings of sentences, which then serve as input to other tasks. We present a new dataset for one such task, `natural language inference' (NLI), that cannot be solved using only word-level knowledge and requires some compositionali… ▽ More

    Submitted 17 May, 2018; v1 submitted 12 February, 2018; originally announced February 2018.

  32. arXiv:1710.03035  [pdf, other

    cs.LG stat.ML

    Unifying Local and Global Change Detection in Dynamic Networks

    Authors: Wenzhe Li, Dong Guo, Greg Ver Steeg, Aram Galstyan

    Abstract: Many real-world networks are complex dynamical systems, where both local (e.g., changing node attributes) and global (e.g., changing network topology) processes unfold over time. Local dynamics may provoke global changes in the network, and the ability to detect such effects could have profound implications for a number of real-world problems. Most existing techniques focus individually on either… ▽ More

    Submitted 9 October, 2017; originally announced October 2017.

  33. arXiv:1709.10335  [pdf

    stat.AP

    Redefine the correlation coefficient by experiment methods

    Authors: Xiatong Cai, Guangpeng Pei, Yuen Zhu, Donggang Guo, Hua Li

    Abstract: With the establishment of global biological monitor network and development of remote sensing technology, data won't be a limitation, but the variance brought by spatial heterogeneous and fractal will influence correlation coefficient significantly with the enlarged sample scale. Those impede us to find more intrinsic principle in ecology. Ecology is based on experiment, and the experiment methods… ▽ More

    Submitted 29 September, 2017; originally announced September 2017.

    Comments: 19pages, 1figure

  34. arXiv:1703.03454  [pdf, other

    cs.LG stat.ML

    Sample Efficient Feature Selection for Factored MDPs

    Authors: Zhaohan Daniel Guo, Emma Brunskill

    Abstract: In reinforcement learning, the state of the real world is often represented by feature vectors. However, not all of the features may be pertinent for solving the current task. We propose Feature Selection Explore and Exploit (FS-EE), an algorithm that automatically selects the necessary features while learning a Factored Markov Decision Process, and prove that under mild assumptions, its sample co… ▽ More

    Submitted 9 March, 2017; originally announced March 2017.

  35. arXiv:1701.03577  [pdf, ps, other

    stat.ML cs.AI cs.CL cs.LG

    Kernel Approximation Methods for Speech Recognition

    Authors: Avner May, Alireza Bagheri Garakani, Zhiyun Lu, Dong Guo, Kuan Liu, Aurélien Bellet, Linxi Fan, Michael Collins, Daniel Hsu, Brian Kingsbury, Michael Picheny, Fei Sha

    Abstract: We study large-scale kernel methods for acoustic modeling in speech recognition and compare their performance to deep neural networks (DNNs). We perform experiments on four speech recognition datasets, including the TIMIT and Broadcast News benchmark tasks, and compare these two types of models on frame-level performance metrics (accuracy, cross-entropy), as well as on recognition metrics (word/ch… ▽ More

    Submitted 13 January, 2017; originally announced January 2017.

  36. arXiv:1605.08062  [pdf, other

    cs.LG cs.AI stat.ML

    A PAC RL Algorithm for Episodic POMDPs

    Authors: Zhaohan Daniel Guo, Shayan Doroudi, Emma Brunskill

    Abstract: Many interesting real world domains involve reinforcement learning (RL) in partially observable environments. Efficient learning in such domains is important, but existing sample complexity bounds for partially observable RL are at least exponential in the episode length. We give, to our knowledge, the first partially observable RL algorithm with a polynomial bound on the number of episodes on whi… ▽ More

    Submitted 1 June, 2016; v1 submitted 25 May, 2016; originally announced May 2016.

    Journal ref: Proceedings of the 19th International Conference on Artificial Intelligence and Statistics, pp. 510-518, 2016

  37. arXiv:1604.02100  [pdf, other

    stat.ML cs.IT math.NA math.SP physics.med-ph

    Hankel Matrix Nuclear Norm Regularized Tensor Completion for $N$-dimensional Exponential Signals

    Authors: Jiaxi Ying, Hengfa Lu, Qingtao Wei, Jian-Feng Cai, Di Guo, Jihui Wu, Zhong Chen, Xiaobo Qu

    Abstract: Signals are generally modeled as a superposition of exponential functions in spectroscopy of chemistry, biology and medical imaging. For fast data acquisition or other inevitable reasons, however, only a small amount of samples may be acquired and thus how to recover the full signal becomes an active research topic. But existing approaches can not efficiently recover $N$-dimensional exponential si… ▽ More

    Submitted 31 March, 2017; v1 submitted 6 April, 2016; originally announced April 2016.

    Comments: 15 pages, 12 figures

  38. arXiv:1603.05800  [pdf, ps, other

    cs.LG stat.ML

    A Comparison between Deep Neural Nets and Kernel Acoustic Models for Speech Recognition

    Authors: Zhiyun Lu, Dong Guo, Alireza Bagheri Garakani, Kuan Liu, Avner May, Aurelien Bellet, Linxi Fan, Michael Collins, Brian Kingsbury, Michael Picheny, Fei Sha

    Abstract: We study large-scale kernel methods for acoustic modeling and compare to DNNs on performance metrics related to both acoustic modeling and recognition. Measuring perplexity and frame-level classification accuracy, kernel-based acoustic models are as effective as their DNN counterparts. However, on token-error-rates DNN models can be significantly better. We have discovered that this might be attri… ▽ More

    Submitted 18 March, 2016; originally announced March 2016.

    Comments: arXiv admin note: text overlap with arXiv:1411.4000

  39. arXiv:1411.4000  [pdf, other

    cs.LG cs.AI stat.ML

    How to Scale Up Kernel Methods to Be As Good As Deep Neural Nets

    Authors: Zhiyun Lu, Avner May, Kuan Liu, Alireza Bagheri Garakani, Dong Guo, Aurélien Bellet, Linxi Fan, Michael Collins, Brian Kingsbury, Michael Picheny, Fei Sha

    Abstract: The computational complexity of kernel methods has often been a major barrier for applying them to large-scale learning problems. We argue that this barrier can be effectively overcome. In particular, we develop methods to scale up kernel models to successfully tackle large-scale learning problems that are so far only approachable by deep learning architectures. Based on the seminal work by Rahimi… ▽ More

    Submitted 17 June, 2015; v1 submitted 14 November, 2014; originally announced November 2014.