The Limits of Transfer Reinforcement Learning with Latent Low-rank Structure

Authors: Tyler Sam, Yudong Chen, Christina Lee Yu

Abstract: Many reinforcement learning (RL) algorithms are too costly to use in practice due to the large sizes $S, A$ of the problem's state and action space. To resolve this issue, we study transfer RL with latent low rank structure. We consider the problem of transferring a latent low rank representation when the source and target MDPs have transition kernels with Tucker rank $(S, d, A)$, $(S, S, d)$, $(d, S, A)$, or $(d, d, d)$. In each setting, we introduce the transfer-ability coefficient $α$ that measures the difficulty of representational transfer. Our algorithm learns latent representations in each source MDP and then exploits the linear structure to remove the dependence on $S$, $A$, or $SA$ in the target MDP regret bound. We complement our positive results with information theoretic lower bounds that show our algorithms (excluding the $(d, d, d)$ setting) are minimax-optimal with respect to $α$.

Submitted 28 October, 2024; originally announced October 2024.

arXiv:2409.03980 [pdf, other]

Entry-Specific Matrix Estimation under Arbitrary Sampling Patterns through the Lens of Network Flows

Authors: Yudong Chen, Xumei Xi, Christina Lee Yu

Abstract: Matrix completion tackles the task of predicting missing values in a low-rank matrix based on a sparse set of observed entries. It is often assumed that the observation pattern is generated uniformly at random or has a very specific structure tuned to a given algorithm. There is still a gap in our understanding when it comes to arbitrary sampling patterns. Given an arbitrary sampling pattern, we introduce a matrix completion algorithm based on network flows in the bipartite graph induced by the observation pattern. For additive matrices, the particular flow we used is the electrical flow and we establish error upper bounds customized to each entry as a function of the observation set, along with matching minimax lower bounds. Our results show that the minimax squared error for recovery of a particular entry in the matrix is proportional to the effective resistance of the corresponding edge in the graph. Furthermore, we show that our estimator is equivalent to the least squares estimator. We apply our estimator to the two-way fixed effects model and show that it enables us to accurately infer individual causal effects and the unit-specific and time-specific confounders. For rank-$1$ matrices, we use edge-disjoint paths to form an estimator that achieves minimax optimal estimation when the sampling is sufficiently dense.
Our discovery introduces a new family of estimators parametrized by network flows, which provide a fine-grained and intuitive understanding of the impact of the given sampling pattern on the relative difficulty of estimation at an entry-specific level. This graph-based approach allows us to quantify the inherent complexity of matrix completion for individual entries, rather than relying solely on global measures of performance.
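
As a small worked illustration of the effective-resistance statement above (not taken from the paper): assume an additive matrix $M_{ij} = a_i + b_j$, observations $Y = M + \text{noise}$ with independent zero-mean noise of variance $\sigma^2$, and suppose the only observed entries connecting row $i$ to column $j$ are $(i, j')$, $(i', j')$, and $(i', j)$. The symbols $a$, $b$, $Y$, $\sigma$, $i'$, $j'$ are assumptions introduced for this sketch.

```latex
% Additive structure lets the unobserved entry be written along the path
% i -- j' -- i' -- j in the bipartite observation graph:
M_{ij} \;=\; M_{ij'} - M_{i'j'} + M_{i'j}
       \;=\; (a_i + b_{j'}) - (a_{i'} + b_{j'}) + (a_{i'} + b_j)
       \;=\; a_i + b_j .

% Plugging in the noisy observations gives an unbiased estimate whose
% variance adds up over the three edges of the path:
\hat{M}_{ij} \;=\; Y_{ij'} - Y_{i'j'} + Y_{i'j},
\qquad
\operatorname{Var}\bigl(\hat{M}_{ij}\bigr) \;=\; 3\sigma^2 .
```

Three unit resistors in series have effective resistance 3, so in this single-path case the variance equals $\sigma^2$ times the effective resistance between row $i$ and column $j$, which is in line with the proportionality the abstract describes.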

Submitted 5 September, 2024; originally announced September 2024.

Journal ref: Innovations in Theoretical Computer Science (ITCS), 2025

arXiv:2408.01460 [pdf, other]

LocalValueBench: A Collaboratively Built and Extensible Benchmark for Evaluating Localized Value Alignment and Ethical Safety in Large Language Models

Authors: Gwenyth Isobel Meadows, Nicholas Wai Long Lau, Eva Adelina Susanto, Chi Lok Yu, Aditya Paul

Abstract: The proliferation of large language models (LLMs) requires robust evaluation of their alignment with local values and ethical standards, especially as existing benchmarks often reflect the cultural, legal, and ideological values of their creators. LocalValueBench, introduced in this paper, is an extensible benchmark designed to assess LLMs' adherence to Australian values, and provides a framework for regulators worldwide to develop their own LLM benchmarks for local value alignment. Employing a novel typology for ethical reasoning and an interrogation approach, we curated comprehensive questions and utilized prompt engineering strategies to probe LLMs' value alignment. Our evaluation criteria quantified deviations from local values, ensuring a rigorous assessment process. Comparative analysis of three commercial LLMs by USA vendors revealed significant insights into their effectiveness and limitations, demonstrating the critical importance of value alignment. This study offers valuable tools and methodologies for regulators to create tailored benchmarks, highlighting avenues for future research to enhance ethical AI development.

Submitted 27 July, 2024; originally announced August 2024.

arXiv:2408.01459 [pdf, other]

AgentPeerTalk: Empowering Students through Agentic-AI-Driven Discernment of Bullying and Joking in Peer Interactions in Schools

Authors: Aditya Paul, Chi Lok Yu, Eva Adelina Susanto, Nicholas Wai Long Lau, Gwenyth Isobel Meadows

Abstract: Addressing school bullying effectively and promptly is crucial for the mental health of students. This study examined the potential of large language models (LLMs) to empower students by discerning between bullying and joking in school peer interactions. We employed ChatGPT-4, Gemini 1.5 Pro, and Claude 3 Opus, evaluating their effectiveness through human review. Our results revealed that not all LLMs were suitable for an agentic approach, with ChatGPT-4 showing the most promise. We observed variations in LLM outputs, possibly influenced by political overcorrectness, context window limitations, and pre-existing bias in their training data. ChatGPT-4 excelled in context-specific accuracy after implementing the agentic approach, highlighting its potential to provide continuous, real-time support to vulnerable students. This study underlines the significant social impact of using agentic AI in educational settings, offering a new avenue for reducing the negative consequences of bullying and enhancing student well-being.

Submitted 27 July, 2024; originally announced August 2024.

arXiv:2405.19327 [pdf, other]

MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model Series

Authors: Ge Zhang, Scott Qu, Jiaheng Liu, Chenchen Zhang, Chenghua Lin, Chou Leuang Yu, Danny Pan, Esther Cheng, Jie Liu, Qunshu Lin, Raven Yuan, Tuney Zheng, Wei Pang, Xinrun Du, Yiming Liang, Yinghao Ma, Yizhi Li, Ziyang Ma, Bill Lin, Emmanouil Benetos, Huan Yang, Junting Zhou, Kaijing Ma, Minghao Liu, Morry Niu, et al. (20 additional authors not shown)

Abstract: Large Language Models (LLMs) have made great strides in recent years to achieve unprecedented performance across different tasks. However, due to commercial interest, the most competitive models like GPT, Gemini, and Claude have been gated behind proprietary interfaces without disclosing the training details. Recently, many institutions have open-sourced several strong LLMs like LLaMA-3, comparable to existing closed-source LLMs. However, only the model's weights are provided with most details (e.g., intermediate checkpoints, pre-training corpus, and training code, etc.) being undisclosed. To improve the transparency of LLMs, the research community has formed to open-source truly open LLMs (e.g., Pythia, Amber, OLMo), where more details (e.g., pre-training corpus and training code) are being provided. These models have greatly advanced the scientific study of these large models including their strengths, weaknesses, biases and risks. However, we observe that the existing truly open LLMs on reasoning, knowledge, and coding tasks are still inferior to existing state-of-the-art LLMs with similar model sizes. To this end, we open-source MAP-Neo, a highly capable and transparent bilingual language model with 7B parameters trained from scratch on 4.5T high-quality tokens. Our MAP-Neo is the first fully open-sourced bilingual LLM with comparable performance compared to existing state-of-the-art LLMs.
Moreover, we open-source all details to reproduce our MAP-Neo, where the cleaned pre-training corpus, data cleaning pipeline, checkpoints, and well-optimized training/evaluation framework are provided. Finally, we hope our MAP-Neo will enhance and strengthen the open research community and inspire more innovations and creativities to facilitate the further improvements of LLMs.

Submitted 10 July, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

Comments: https://map-neo.github.io/

arXiv:2405.07979 [pdf, other]

Low-order outcomes and clustered designs: combining design and analysis for causal inference under network interference

Authors: Matthew Eichhorn, Samir Khan, Johan Ugander, Christina Lee Yu

Abstract: Variance reduction for causal inference in the presence of network interference is often achieved through either outcome modeling, which is typically analyzed under unit-randomized Bernoulli designs, or clustered experimental designs, which are typically analyzed without strong parametric assumptions. In this work, we study the intersection of these two approaches and consider the problem of estimation in low-order outcome models using data from a general experimental design. Our contributions are threefold. First, we present an estimator of the total treatment effect (also called the global average treatment effect) in a low-degree outcome model when the data are collected under general experimental designs, generalizing previous results for Bernoulli designs. We refer to this estimator as the pseudoinverse estimator and give bounds on its bias and variance in terms of properties of the experimental design. Second, we evaluate these bounds for the case of cluster randomized designs with both Bernoulli and complete randomization. For clustered Bernoulli randomization, we find that our estimator is always unbiased and that its variance scales like the smaller of the variance obtained from a low-order assumption and the variance obtained from cluster randomization, showing that combining these variance reduction strategies is preferable to using either individually. For clustered complete randomization, we find a notable bias-variance trade-off mediated by specific features of the clustering.
Third, when choosing a clustered experimental design, our bounds can be used to select a clustering from a set of candidate clusterings. Across a range of graphs and clustering algorithms, we show that our method consistently selects clusterings that perform well on a range of response models, suggesting that our bounds are useful to practitioners.

Submitted 11 July, 2024; v1 submitted 13 May, 2024; originally announced May 2024.

arXiv:2405.05119 [pdf, other]

Analysis of Two-Stage Rollout Designs with Clustering for Causal Inference under Network Interference

Authors: Mayleen Cortez-Rodriguez, Matthew Eichhorn, Christina Lee Yu

Abstract: Estimating causal effects under interference is pertinent to many real-world settings. Recent work with low-order potential outcomes models uses a rollout design to obtain unbiased estimators that require no interference network information. However, the required extrapolation can lead to prohibitively high variance. To address this, we propose a two-stage experiment that selects a sub-population in the first stage and restricts treatment rollout to this sub-population in the second stage. We explore the role of clustering in the first stage by analyzing the bias and variance of a polynomial interpolation-style estimator under this experimental design. Bias increases with the number of edges cut in the clustering of the interference network, but variance depends on qualities of the clustering that relate to homophily and covariate balance. There is a tension between clustering objectives that minimize the number of cut edges versus those that maximize covariate balance across clusters. Through simulations, we explore a bias-variance trade-off and compare the performance of the estimator under different clustering strategies.

Submitted 10 February, 2025; v1 submitted 8 May, 2024; originally announced May 2024.

Comments: 29 pages, 5 Tables, 14 figures, accepted to AIStats 2025

MSC Class: 62K99 (Primary); 62P30 (Secondary)

arXiv:2403.00184 [pdf, other]

doi 10.1109/ISIT54713.2023.10206880

Entry-Specific Bounds for Low-Rank Matrix Completion under Highly Non-Uniform Sampling

Authors: Xumei Xi, Christina Lee Yu, Yudong Chen

Abstract: Low-rank matrix completion concerns the problem of estimating unobserved entries in a matrix using a sparse set of observed entries. We consider the non-uniform setting where the observed entries are sampled with highly varying probabilities, potentially with different asymptotic scalings. We show that under structured sampling probabilities, it is often better and sometimes optimal to run estimation algorithms on a smaller submatrix rather than the entire matrix. In particular, we prove error upper bounds customized to each entry, which match the minimax lower bounds under certain conditions. Our bounds characterize the hardness of estimating each entry as a function of the localized sampling probabilities. We provide numerical experiments that confirm our theoretical findings.

Submitted 29 February, 2024; originally announced March 2024.

arXiv:2402.17720 [pdf, other]

The SMART approach to instance-optimal online learning

Authors: Siddhartha Banerjee, Alankrita Bhatt, Christina Lee Yu

Abstract: We devise an online learning algorithm -- titled Switching via Monotone Adapted Regret Traces (SMART) -- that adapts to the data and achieves regret that is instance optimal, i.e., simultaneously competitive on every input sequence compared to the performance of the follow-the-leader (FTL) policy and the worst case guarantee of any other input policy. We show that the regret of the SMART policy on any input sequence is within a multiplicative factor $e/(e-1) \approx 1.58$ of the smaller of: 1) the regret obtained by FTL on the sequence, and 2) the upper bound on regret guaranteed by the given worst-case policy. This implies a strictly stronger guarantee than typical "best-of-both-worlds" bounds as the guarantee holds for every input sequence regardless of how it is generated. SMART is simple to implement as it begins by playing FTL and switches at most once during the time horizon to the worst-case algorithm. Our approach and results follow from an operational reduction of instance optimal online learning to competitive analysis for the ski-rental problem. We complement our competitive ratio upper bounds with a fundamental lower bound showing that over all input sequences, no algorithm can get better than a $1.43$-fraction of the minimum regret achieved by FTL and the minimax-optimal policy. We also present a modification of SMART that combines FTL with a "small-loss" algorithm to achieve instance optimality between the regret of FTL and the small loss regret bound.
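
The "play FTL, then switch at most once" structure described above can be sketched as follows. This is a simplified illustration and not the SMART algorithm itself: the abstract does not state the switching rule, so the fixed deterministic `threshold`, the choice of Hedge as the worst-case policy, the learning rate `eta`, and the function name `switch_once` are all assumptions made for this sketch.

```python
import math


def switch_once(loss_rounds, threshold, eta=0.5):
    """Play follow-the-leader until its running regret reaches `threshold`,
    then play Hedge (started fresh) for all remaining rounds.

    loss_rounds: list of per-round loss vectors, one loss in [0, 1] per expert.
    Returns (total loss incurred, 1-based round after which the switch happened or None).
    """
    n = len(loss_rounds[0])
    cum = [0.0] * n        # cumulative loss of each expert so far
    ftl_loss = 0.0         # loss incurred while playing FTL
    hedge_w = [1.0] * n    # Hedge weights, only updated after the switch
    switched_at = None
    total = 0.0
    for t, losses in enumerate(loss_rounds):
        if switched_at is None:
            # FTL: pick the expert with the smallest cumulative loss (lowest index on ties).
            k = min(range(n), key=lambda e: cum[e])
            incurred = losses[k]
            ftl_loss += incurred
        else:
            # Hedge: expected loss under the normalized exponential weights.
            z = sum(hedge_w)
            incurred = sum(w / z * l for w, l in zip(hedge_w, losses))
            hedge_w = [w * math.exp(-eta * l) for w, l in zip(hedge_w, losses)]
        total += incurred
        cum = [c + l for c, l in zip(cum, losses)]
        # Switch (at most once) when FTL's regret against the best expert hits the threshold.
        if switched_at is None and ftl_loss - min(cum) >= threshold:
            switched_at = t + 1
    return total, switched_at


# An alternating sequence on which FTL keeps chasing the wrong expert.
hard = [[0.5, 0.0], [0.0, 1.0], [1.0, 0.0], [0.0, 1.0], [1.0, 0.0], [0.0, 1.0]]
print(switch_once(hard, threshold=2.0))
```

The threshold plays the role of the purchase price in ski rental: FTL regret is "rent" paid round by round, and switching commits to the worst-case policy's guarantee. How the threshold should be chosen to obtain the $e/(e-1)$ factor is the subject of the paper and is not reproduced here.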

Submitted 27 February, 2024; originally announced February 2024.

arXiv:2312.15574 [pdf, other]

Clustered Switchback Designs for Experimentation Under Spatio-temporal Interference

Authors: Su Jia, Nathan Kallus, Christina Lee Yu

Abstract: We consider experimentation in the presence of non-stationarity, inter-unit (spatial) interference, and carry-over effects (temporal interference), where we wish to estimate the global average treatment effect (GATE), the difference between average outcomes having exposed all units at all times to treatment or to control. We suppose spatial interference is described by a graph, where a unit's outcome depends on its neighborhood's treatments, and that temporal interference is described by an MDP, where the transition kernel under either treatment (action) satisfies a rapid mixing condition. We propose a clustered switchback design, where units are grouped into clusters and time steps are grouped into blocks, and each whole cluster-block combination is assigned a single random treatment. Under this design, we show that for graphs that admit good clustering, a truncated Horvitz-Thompson estimator achieves a $\tilde O(1/NT)$ mean squared error (MSE), matching the lower bound up to logarithmic terms for sparse graphs. Our results simultaneously generalize the results from \citet{hu2022switchback,ugander2013graph} and \citet{leung2022rate}. Simulation studies validate the favorable performance of our approach.
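
The assignment step of a clustered switchback design, as described above, can be sketched in a few lines. This covers only the randomization (one draw per cluster-block pair), not the truncated Horvitz-Thompson estimator. The function name, the treatment probability `p`, and the fixed-length blocks are assumptions for illustration; the abstract does not specify them.

```python
import random


def clustered_switchback(cluster_of, num_steps, block_len, p=0.5, seed=0):
    """Return a units-by-time 0/1 treatment matrix.

    cluster_of: cluster label for each unit (list, one entry per unit).
    Every (cluster, time block) pair gets a single Bernoulli(p) draw that is
    shared by all units in the cluster and all time steps in the block.
    """
    rng = random.Random(seed)
    clusters = sorted(set(cluster_of))
    num_blocks = -(-num_steps // block_len)  # ceiling division
    draw = {
        (c, b): int(rng.random() < p)
        for c in clusters
        for b in range(num_blocks)
    }
    return [
        [draw[(cluster_of[i], t // block_len)] for t in range(num_steps)]
        for i in range(len(cluster_of))
    ]


# Five units in two clusters, six time steps in blocks of three.
W = clustered_switchback([0, 0, 1, 1, 1], num_steps=6, block_len=3)
for row in W:
    print(row)
```

Rows belonging to the same cluster come out identical, and each row is constant within a block, which is the defining property of the design.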

Submitted 26 March, 2025; v1 submitted 24 December, 2023; originally announced December 2023.

arXiv:2305.15621 [pdf, ps, other]

Matrix Estimation for Offline Reinforcement Learning with Low-Rank Structure

Authors: Xumei Xi, Christina Lee Yu, Yudong Chen

Abstract: We consider offline Reinforcement Learning (RL), where the agent does not interact with the environment and must rely on offline data collected using a behavior policy. Previous works provide policy evaluation guarantees when the target policy to be evaluated is covered by the behavior policy, that is, state-action pairs visited by the target policy must also be visited by the behavior policy. We show that when the MDP has a latent low-rank structure, this coverage condition can be relaxed. Building on the connection to weighted matrix completion with non-uniform observations, we propose an offline policy evaluation algorithm that leverages the low-rank structure to estimate the values of uncovered state-action pairs. Our algorithm does not require a known feature representation, and our finite-sample error bound involves a novel discrepancy measure quantifying the discrepancy between the behavior and target policies in the spectral space. We provide concrete examples where our algorithm achieves accurate estimation while existing coverage conditions are not satisfied. Building on the above evaluation algorithm, we further design an offline policy optimization algorithm and provide non-asymptotic performance guarantees.

Submitted 24 May, 2023; originally announced May 2023.

arXiv:2305.11259 [pdf, other]

The Asymptotics of the Expected Betti Numbers of Preferential Attachment Clique Complexes

Authors: Chunyin Siu, Gennady Samorodnitsky, Christina Lee Yu, Rongyi He

Abstract: The preferential attachment model is a natural and popular random graph model for a growing network that contains very well-connected "hubs". We study the higher-order connectivity of such a network by investigating the topological properties of its clique complex. We concentrate on the expected Betti numbers, a sequence of topological invariants of the complex related to the numbers of holes of different dimensions. We determine the asymptotic growth rates of the expected Betti numbers, and prove that the expected Betti number at dimension 1 grows linearly fast, while those at higher dimensions grow sublinearly fast. Our theoretical results are illustrated by simulations. (Changes are made in this version to generalize Proposition 14 and to streamline proofs. These changes are shown in blue.)

Submitted 11 June, 2024; v1 submitted 18 May, 2023; originally announced May 2023.

Comments: 28 pages, 8 figures; changes in v2: stylistic changes to improve readability, no change in the mathematical contents; changes in v3: Proposition 14 slightly generalized, proofs streamlined, changes highlighted in blue

MSC Class: 05C82; 60C05; 05E45; 55U10; 55N31; 62R40

arXiv:2210.11355 [pdf, other]

Network Synthetic Interventions: A Causal Framework for Panel Data Under Network Interference

Authors: Anish Agarwal, Sarah H. Cen, Devavrat Shah, Christina Lee Yu

Abstract: We propose a generalization of the synthetic controls and synthetic interventions methodology to incorporate network interference. We consider the estimation of unit-specific potential outcomes from panel data in the presence of spillover across units and unobserved confounding. Key to our approach is a novel latent factor model that takes into account network interference and generalizes the factor models typically used in panel data settings. We propose an estimator, Network Synthetic Interventions (NSI), and show that it consistently estimates the mean outcomes for a unit under an arbitrary set of counterfactual treatments for the network. We further establish that the estimator is asymptotically normal. We furnish two validity tests for whether the NSI estimator reliably generalizes to produce accurate counterfactual estimates. We provide a novel graph-based experiment design that guarantees the NSI estimator produces accurate counterfactual estimates, and also analyze the sample complexity of the proposed design. We conclude with simulations that corroborate our theoretical findings.

Submitted 11 October, 2023; v1 submitted 20 October, 2022; originally announced October 2022.

Comments: 49 pages, 6 figures

arXiv:2210.00025 [pdf, other]

Artificial Replay: A Meta-Algorithm for Harnessing Historical Data in Bandits

Authors: Siddhartha Banerjee, Sean R. Sinclair, Milind Tambe, Lily Xu, Christina Lee Yu

Abstract: Most real-world deployments of bandit algorithms exist somewhere in between the offline and online set-up, where some historical data is available upfront and additional data is collected dynamically online. How best to incorporate historical data to "warm start" bandit algorithms is an open question: naively initializing reward estimates using all historical samples can suffer from spurious data and imbalanced data coverage, leading to data inefficiency (amount of historical data used) - particularly for continuous action spaces. To address these challenges, we propose ArtificialReplay, a meta-algorithm for incorporating historical data into any arbitrary base bandit algorithm. We show that ArtificialReplay uses only a fraction of the historical data compared to a full warm-start approach, while still achieving identical regret for base algorithms that satisfy independence of irrelevant data (IIData), a novel and broadly applicable property that we introduce. We complement these theoretical results with experiments on K-armed bandits and continuous combinatorial bandits, on which we model green security domains using real poaching data. Our results show the practical benefits of ArtificialReplay for improving data efficiency, including for base algorithms that do not satisfy IIData.

Submitted 19 March, 2025; v1 submitted 30 September, 2022; originally announced October 2022.

Comments: 55 pages (30 pages main paper), 9 figures

arXiv:2208.05553 [pdf, other]

doi 10.1515/jci-2022-0051

Exploiting Neighborhood Interference with Low Order Interactions under Unit Randomized Design

Authors: Mayleen Cortez-Rodriguez, Matthew Eichhorn, Christina Lee Yu

Abstract: Network interference, where the outcome of an individual is affected by the treatment assignment of those in their social network, is pervasive in real-world settings. However, it poses a challenge to estimating causal effects. We consider the task of estimating the total treatment effect (TTE), or the difference between the average outcomes of the population when everyone is treated versus when no one is, under network interference. Under a Bernoulli randomized design, we provide an unbiased estimator for the TTE when network interference effects are constrained to low order interactions among neighbors of an individual. We make no assumptions on the graph other than bounded degree, allowing for well-connected networks that may not be easily clustered. We derive a bound on the variance of our estimator and show in simulated experiments that it performs well compared with standard estimators for the TTE. We also derive a minimax lower bound on the mean squared error of our estimator which suggests that the difficulty of estimation can be characterized by the degree of interactions in the potential outcomes model. We also prove that our estimator is asymptotically normal under boundedness conditions on the network degree and potential outcomes model. Central to our contribution is a new framework for balancing model flexibility and statistical complexity as captured by this low order interactions structure.

Submitted 5 February, 2024; v1 submitted 10 August, 2022; originally announced August 2022.

Comments: 42 pages including citations and appendix, 2 figures (total of 12 subfigures)

MSC Class: 62K99; 91D30; 60F05

Journal ref: Journal of Causal Inference, vol. 11, no. 1, 2023, pp. 20220051

arXiv:2206.03569 [pdf, other]

Overcoming the Long Horizon Barrier for Sample-Efficient Reinforcement Learning with Latent Low-Rank Structure

Authors: Tyler Sam, Yudong Chen, Christina Lee Yu

Abstract: The practicality of reinforcement learning algorithms has been limited due to poor scaling with respect to the problem size, as the sample complexity of learning an $ε$-optimal policy is $\tilde{Ω}\left(|S||A|H^3 / ε^2\right)$ over worst case instances of an MDP with state space $S$, action space $A$, and horizon $H$. We consider a class of MDPs for which the associated optimal $Q^*$ function is low rank, where the latent features are unknown. While one would hope to achieve linear sample complexity in $|S|$ and $|A|$ due to the low rank structure, we show that without imposing further assumptions beyond low rank of $Q^*$, if one is constrained to estimate the $Q$ function using only observations from a subset of entries, there is a worst case instance in which one must incur a sample complexity exponential in the horizon $H$ to learn a near optimal policy. We subsequently show that under stronger low rank structural assumptions, given access to a generative model, Low Rank Monte Carlo Policy Iteration (LR-MCPI) and Low Rank Empirical Value Iteration (LR-EVI) achieve the desired sample complexity of $\tilde{O}\left((|S|+|A|)\mathrm{poly}(d,H)/ε^2\right)$ for a rank $d$ setting, which is minimax optimal with respect to the scaling of $|S|, |A|$, and $ε$. In contrast to literature on linear and low-rank MDPs, we do not require a known feature mapping, our algorithm is computationally simple, and our results hold for long time horizons. Our results provide insights on the minimal low-rank structural assumptions required on the MDP with respect to the transition kernel versus the optimal action-value function.
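The $(|S|+|A|)$ scaling mentioned in this abstract can be made concrete with a toy case. The sketch below is not LR-MCPI or LR-EVI. It assumes a noise-free, exactly rank-1 matrix and a nonzero pivot entry, and the function name `complete_rank_one` is hypothetical. It shows that one anchor row and one anchor column, that is $|S|+|A|-1$ observed entries, determine every other entry.

```python
def complete_rank_one(row_anchor, col_anchor, pivot):
    """Fill in a rank-1 matrix Q from one observed row and one observed column.

    row_anchor[a] = Q[s0][a] for every action a   (assumed observed exactly)
    col_anchor[s] = Q[s][a0] for every state s    (assumed observed exactly)
    pivot         = Q[s0][a0], which must be nonzero

    For a rank-1 matrix, Q[s][a] = Q[s][a0] * Q[s0][a] / Q[s0][a0].
    """
    if pivot == 0:
        raise ValueError("pivot entry must be nonzero")
    return [[c * r / pivot for r in row_anchor] for c in col_anchor]


# Toy ground truth: Q[s][a] = u[s] * v[a] with 3 states and 2 actions.
u = [1.0, 2.0, 4.0]
v = [2.0, 3.0]
Q_true = [[us * va for va in v] for us in u]

# Observe only row s0 = 0 and column a0 = 0 (4 of the 6 entries).
row_anchor = Q_true[0]
col_anchor = [Q_true[s][0] for s in range(len(u))]
Q_hat = complete_rank_one(row_anchor, col_anchor, Q_true[0][0])
```

With noisy observations or rank above one, this formula is no longer exact, and the choice of anchor entries matters.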

Submitted 9 June, 2023; v1 submitted 7 June, 2022; originally announced June 2022.

arXiv:2205.14552 [pdf, other]

Staggered Rollout Designs Enable Causal Inference Under Interference Without Network Knowledge

Authors: Mayleen Cortez, Matthew Eichhorn, Christina Lee Yu

Abstract: Randomized experiments are widely used to estimate causal effects across a variety of domains. However, classical causal inference approaches rely on critical independence assumptions that are violated by network interference, when the treatment of one individual influences the outcomes of others. All existing approaches require at least approximate knowledge of the network, which may be unavailable and costly to collect. We consider the task of estimating the total treatment effect (TTE), or the average difference between the outcomes when the whole population is treated versus when the whole population is untreated. By leveraging a staggered rollout design, in which treatment is incrementally given to random subsets of individuals, we derive unbiased estimators for TTE that do not rely on any prior structural knowledge of the network, as long as the network interference effects are constrained to low-degree interactions among neighbors of an individual. We derive bounds on the variance of the estimators, and we show in experiments that our estimator performs well against baselines on simulated data. Central to our theoretical contribution is a connection between staggered rollout observations and polynomial extrapolation.
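The link to polynomial extrapolation can be shown with a small noise-free sketch. It assumes the expected average outcome is a polynomial of low degree in the treatment probability $p$, so observing it at a few rollout stages is enough to extrapolate to $p = 1$. The response function `mean_outcome`, the stage probabilities, and the helper name are hypothetical, and this is not the estimator from the paper.

```python
from fractions import Fraction


def lagrange_extrapolate(xs, ys, x):
    """Evaluate at x the unique polynomial of degree < len(xs) through (xs[i], ys[i])."""
    total = Fraction(0)
    for i, (xi, yi) in enumerate(zip(xs, ys)):
        weight = Fraction(1)
        for j, xj in enumerate(xs):
            if j != i:
                weight *= Fraction(x - xj) / Fraction(xi - xj)
        total += weight * yi
    return total


def mean_outcome(p):
    """Hypothetical degree-2 response: average outcome when each unit is treated w.p. p."""
    return 1 + 2 * p + 3 * p * p


# Three rollout stages suffice for a degree-2 polynomial.
stages = [Fraction(0), Fraction(1, 4), Fraction(1, 2)]
observed = [mean_outcome(p) for p in stages]

# TTE = (outcome with everyone treated) - (outcome with no one treated).
tte_estimate = lagrange_extrapolate(stages, observed, Fraction(1)) - observed[0]
```

Exact rational arithmetic keeps the illustration free of rounding error. With real, noisy observations the extrapolation weights amplify variance, which is presumably why the abstract reports variance bounds.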

Submitted 14 October, 2022; v1 submitted 28 May, 2022; originally announced May 2022.

Comments: 28 pages, 6 figures, accepted to Thirty-sixth Conference on Neural Information Processing Systems (NeurIPS 2022)

arXiv:2205.12803 [pdf, other]

doi 10.1073/pnas.2208975119

Estimating Total Treatment Effect in Randomized Experiments with Unknown Network Structure

Authors: Christina Lee Yu, Edoardo M Airoldi, Christian Borgs, Jennifer T Chayes

Abstract: Randomized experiments are widely used to estimate the causal effects of a proposed treatment in many areas of science, from medicine and healthcare to the physical and biological sciences, from the social sciences to engineering, to public policy and to the technology industry at large. Here, we consider situations where classical methods for estimating the total treatment effect on a target population are considerably biased due to confounding network effects, i.e., the fact that the treatment of an individual may impact their neighbors' outcomes, an issue referred to as network interference or as non-individualized treatment response. A key challenge in these situations is that the network is often unknown, and difficult, or costly, to measure. In this paper, we characterize the limitations in estimating the total treatment effect without knowledge of the network that drives interference, assuming a potential outcomes model with heterogeneous additive network effects. This model encompasses a broad class of network interference sources, including spillover, peer effects, and contagion. Within this framework, we show that, surprisingly, given access to average historical baseline measurements prior to the experiment, we can develop a simple estimator and efficient randomized design that outputs an unbiased estimate with low variance. Our solution does not require knowledge of the underlying network structure, and it comes with statistical guarantees for a broad class of models. We believe our results are poised to impact current randomized experimentation strategies due to their ease of interpretation and implementation, alongside their provable theoretical insights under heterogeneous network effects.

Submitted 24 September, 2022; v1 submitted 25 May, 2022; originally announced May 2022.

arXiv:2204.07821 [pdf, other]

doi 10.1007/s41468-024-00166-9

Detection of Small Holes by the Scale-Invariant Robust Density-Aware Distance (RDAD) Filtration

Authors: Chunyin Siu, Gennady Samorodnitsky, Christina Lee Yu, Andrey Yao

Abstract: A novel topological-data-analytical (TDA) method is proposed to distinguish, from noise, small holes surrounded by high-density regions of a probability density function. The proposed method is robust against additive noise and outliers. Traditional TDA tools, like those based on the distance filtration, often struggle to distinguish small features from noise, because both have short persistences. An alternative filtration, called the Robust Density-Aware Distance (RDAD) filtration, is proposed to prolong the persistences of small holes of high-density regions. This is achieved by weighting the distance function by the density in the sense of Bell et al. The concept of distance-to-measure is incorporated to enhance stability and mitigate noise. The persistence-prolonging property and robustness of the proposed filtration are rigorously established, and numerical experiments are presented to demonstrate the proposed filtration's utility in identifying small holes.

Submitted 30 March, 2024; v1 submitted 16 April, 2022; originally announced April 2022.

Comments: 39 pages, 38 figs, J Appl. and Comput. Topology (2024). GitHub: [github.com/c-siu/RDAD]. Published version: [rdcu.be/dCXLa]. Diff of v2/3: added publication info, NO post-submission improvements (Cor2-3 rephrased and proven, setup of Sec4.1 explained, complexity computed in Sec6.1, Thm5 simplified, comparison with DTM in Sec1,8, streamlining), so no change in pdf. Diff of v1/2: more thms, more discussion on conformality, fewer egs

MSC Class: 62R40; 55N31; 52R40; 68T09

arXiv:2110.15843 [pdf, other]

doi 10.1287/opre.2022.2396

Adaptive Discretization in Online Reinforcement Learning

Authors: Sean R. Sinclair, Siddhartha Banerjee, Christina Lee Yu

Abstract: Discretization-based approaches to solving online reinforcement learning problems have been studied extensively in practice on applications ranging from resource allocation to cache management. Two major questions in designing discretization-based algorithms are how to create the discretization and when to refine it. While there have been several experimental results investigating heuristic solutions to these questions, there has been little theoretical treatment. In this paper we provide a unified theoretical analysis of tree-based hierarchical partitioning methods for online reinforcement learning, providing model-free and model-based algorithms. We show how our algorithms are able to take advantage of inherent structure of the problem by providing guarantees that scale with respect to the 'zooming dimension' instead of the ambient dimension, an instance-dependent quantity measuring the benignness of the optimal $Q_h^\star$ function. Many applications in computing systems and operations research require algorithms that compete on three facets: low sample complexity, mild storage requirements, and low computational burden. Our algorithms are easily adapted to operating constraints, and our theory provides explicit bounds across each of the three facets. This motivates its use in practical applications as our approach automatically adapts to underlying problem structure even when very little is known a priori about the system.

Submitted 10 October, 2022; v1 submitted 29 October, 2021; originally announced October 2021.

Comments: 77 pages, 7 figures. arXiv admin note: text overlap with arXiv:2007.00717

MSC Class: 68Q32 ACM Class: I.2.6

arXiv:2110.13969 [pdf, other]

Nonparametric Matrix Estimation with One-Sided Covariates

Authors: Christina Lee Yu

Abstract: Consider the task of matrix estimation in which a dataset $X \in \mathbb{R}^{n\times m}$ is observed with sparsity $p$, and we would like to estimate $\mathbb{E}[X]$, where $\mathbb{E}[X_{ui}] = f(α_u, β_i)$ for some Hölder smooth function $f$. We consider the setting where the row covariates $α$ are unobserved yet the column covariates $β$ are observed. We provide an algorithm and accompanying analysis which shows that our algorithm improves upon naively estimating each row separately when the number of rows is not too small. Furthermore when the matrix is moderately proportioned, our algorithm achieves the minimax optimal nonparametric rate of an oracle algorithm that knows the row covariates. In simulated experiments we show our algorithm outperforms other baselines in low data regimes.

Submitted 26 October, 2021; originally announced October 2021.

arXiv:2105.05308 [pdf, other]

Sequential Fair Allocation: Achieving the Optimal Envy-Efficiency Tradeoff Curve

Authors: Sean R. Sinclair, Gauri Jain, Siddhartha Banerjee, Christina Lee Yu

Abstract: We consider the problem of dividing limited resources to individuals arriving over $T$ rounds. Each round has a random number of individuals arrive, and individuals can be characterized by their type (i.e. preferences over the different resources). A standard notion of 'fairness' in this setting is that an allocation simultaneously satisfy envy-freeness and efficiency. The former is an individual guarantee, requiring that each agent prefers their own allocation over the allocation of any other; in contrast, efficiency is a global property, requiring that the allocations clear the available resources. For divisible resources, when the number of individuals of each type is known upfront, the above desiderata are simultaneously achievable for a large class of utility functions. However, in an online setting when the number of individuals of each type is only revealed round by round, no policy can guarantee these desiderata simultaneously, and hence the best one can do is to try and allocate so as to approximately satisfy the two properties. We show that in the online setting, the two desired properties (envy-freeness and efficiency) are in direct contention, in that any algorithm achieving additive counterfactual envy-freeness up to a factor of $L_T$ necessarily suffers an efficiency loss of at least $1 / L_T$. We complement this uncertainty principle with a simple algorithm, HopeGuardrail, which allocates resources based on an adaptive threshold policy and is able to achieve any fairness-efficiency point on this frontier. In simulation results, our algorithm provides allocations close to the optimal fair solution in hindsight, motivating its use in practical applications as the algorithm is able to adapt to any desired fairness efficiency trade-off.

Submitted 29 September, 2022; v1 submitted 11 May, 2021; originally announced May 2021.

Comments: 42 pages, 5 figures

MSC Class: 91B32

arXiv:2011.14382 [pdf, other]

Sequential Fair Allocation of Limited Resources under Stochastic Demands

Authors: Sean R. Sinclair, Gauri Jain, Siddhartha Banerjee, Christina Lee Yu

Abstract: We consider the problem of dividing limited resources between a set of agents arriving sequentially with unknown (stochastic) utilities. Our goal is to find a fair allocation - one that is simultaneously Pareto-efficient and envy-free. When all utilities are known upfront, the above desiderata are simultaneously achievable (and efficiently computable) for a large class of utility functions. In a sequential setting, however, no policy can guarantee these desiderata simultaneously for all possible utility realizations. A natural online fair allocation objective is to minimize the deviation of each agent's final allocation from their fair allocation in hindsight. This translates into simultaneous guarantees for both Pareto-efficiency and envy-freeness. However, the resulting dynamic program has state-space which is exponential in the number of agents. We propose a simple policy, HopeOnline, that instead aims to 'match' the ex-post fair allocation vector using the current available resources and 'predicted' histogram of future utilities. We demonstrate the effectiveness of our policy compared to other heuristics on a dataset inspired by mobile food-bank allocations.

Submitted 9 July, 2022; v1 submitted 29 November, 2020; originally announced November 2020.

Comments: See arXiv:2105.05308 for an updated version. 36 pages, 6 figures

MSC Class: 91B32

arXiv:2007.00736 [pdf, other]

Tensor Estimation with Nearly Linear Samples Given Weak Side Information

Authors: Christina Lee Yu, Xumei Xi

Abstract: Tensor completion exhibits an interesting computational-statistical gap in terms of the number of samples needed to perform tensor estimation. While there are only $Θ(tn)$ degrees of freedom in a $t$-order tensor with $n^t$ entries, the best known polynomial time algorithm requires $O(n^{t/2})$ samples in order to guarantee consistent estimation. In this paper, we show that weak side information is sufficient to reduce the sample complexity to $O(n)$. The side information consists of a weight vector for each of the modes which is not orthogonal to any of the latent factors along that mode; this is significantly weaker than assuming noisy knowledge of the subspaces. We provide an algorithm that utilizes this side information to produce a consistent estimator with $O(n^{1+κ})$ samples for any small constant $κ> 0$. We also provide experiments on both synthetic and real-world datasets that validate our theoretical insights.

Submitted 19 October, 2024; v1 submitted 1 July, 2020; originally announced July 2020.

arXiv:2007.00717 [pdf, other]

Adaptive Discretization for Model-Based Reinforcement Learning

Authors: Sean R. Sinclair, Tianyu Wang, Gauri Jain, Siddhartha Banerjee, Christina Lee Yu

Abstract: We introduce the technique of adaptive discretization to design an efficient model-based episodic reinforcement learning algorithm in large (potentially continuous) state-action spaces. Our algorithm is based on optimistic one-step value iteration extended to maintain an adaptive discretization of the space. From a theoretical perspective we provide worst-case regret bounds for our algorithm which are competitive compared to the state-of-the-art model-based algorithms. Moreover, our bounds are obtained via a modular proof technique which can potentially extend to incorporate additional structure on the problem. From an implementation standpoint, our algorithm has much lower storage and computational requirements due to maintaining a more efficient partition of the state and action spaces. We illustrate this via experiments on several canonical control problems, which show that our algorithm empirically performs significantly better than fixed discretization in terms of both faster convergence and lower memory usage. Interestingly, we observe empirically that while fixed-discretization model-based algorithms vastly outperform their model-free counterparts, the two achieve comparable performance with adaptive discretization.

Submitted 23 October, 2020; v1 submitted 1 July, 2020; originally announced July 2020.

Comments: 50 pages, 7 figures

MSC Class: 68Q32 ACM Class: I.2.6

arXiv:1910.08151 [pdf, other]

doi 10.1145/3366703

Adaptive Discretization for Episodic Reinforcement Learning in Metric Spaces

Authors: Sean R. Sinclair, Siddhartha Banerjee, Christina Lee Yu

Abstract: We present an efficient algorithm for model-free episodic reinforcement learning on large (potentially continuous) state-action spaces. Our algorithm is based on a novel $Q$-learning policy with adaptive data-driven discretization. The central idea is to maintain a finer partition of the state-action space in regions which are frequently visited in historical trajectories, and have higher payoff estimates. We demonstrate how our adaptive partitions take advantage of the shape of the optimal $Q$-function and the joint space, without sacrificing the worst-case performance. In particular, we recover the regret guarantees of prior algorithms for continuous state-action spaces, which additionally require either an optimal discretization as input, and/or access to a simulation oracle. Moreover, experiments demonstrate how our algorithm automatically adapts to the underlying structure of the problem, resulting in much better performance compared both to heuristics and $Q$-learning with uniform discretization.

Submitted 31 October, 2019; v1 submitted 17 October, 2019; originally announced October 2019.

Comments: 46 pages, 15 figures

MSC Class: 68Q32 ACM Class: I.2.6

arXiv:1908.01241 [pdf, other]

doi 10.1109/TIT.2023.3237231

Robust Max Entrywise Error Bounds for Tensor Estimation from Sparse Observations via Similarity Based Collaborative Filtering

Authors: Devavrat Shah, Christina Lee Yu

Abstract: Consider the task of estimating a 3-order $n \times n \times n$ tensor from noisy observations of randomly chosen entries in the sparse regime. We introduce a similarity based collaborative filtering algorithm for estimating a tensor from sparse observations and argue that it achieves sample complexity that nearly matches the conjectured computationally efficient lower bound on the sample complexity for the setting of low-rank tensors. Our algorithm uses the matrix obtained from the flattened tensor to compute similarity, and estimates the tensor entries using a nearest neighbor estimator. We prove that the algorithm recovers a finite rank tensor with maximum entry-wise error (MEE) and mean-squared-error (MSE) decaying to $0$ as long as each entry is observed independently with probability $p = Ω(n^{-3/2 + κ})$ for any arbitrarily small $κ> 0$. More generally, we establish robustness of the estimator, showing that when arbitrary noise bounded by $\varepsilon \geq 0$ is added to each observation, the estimation error with respect to MEE and MSE degrades by $\text{poly}(\varepsilon)$. Consequently, even if the tensor may not have finite rank but can be approximated within $\varepsilon \geq 0$ by a finite rank tensor, then the estimation error converges to $\text{poly}(\varepsilon)$. Our analysis sheds insight into the conjectured sample complexity lower bound, showing that it matches the connectivity threshold of the graph used by our algorithm for estimating similarity between coordinates.

Submitted 17 January, 2023; v1 submitted 3 August, 2019; originally announced August 2019.

arXiv:1908.01228 [pdf, other]

Nonparametric Contextual Bandits in an Unknown Metric Space

Authors: Nirandika Wanigasekara, Christina Lee Yu

Abstract: Consider a nonparametric contextual multi-arm bandit problem where each arm $a \in [K]$ is associated to a nonparametric reward function $f_a: [0,1] \to \mathbb{R}$ mapping from contexts to the expected reward. Suppose that there is a large set of arms, yet there is a simple but unknown structure amongst the arm reward functions, e.g. finite types or smooth with respect to an unknown metric space. We present a novel algorithm which learns data-driven similarities amongst the arms, in order to implement adaptive partitioning of the context-arm space for more efficient learning. We provide regret bounds along with simulations that highlight the algorithm's dependence on the local geometry of the reward functions.

Submitted 3 August, 2019; originally announced August 2019.

arXiv:1712.00710 [pdf, ps, other]

Iterative Collaborative Filtering for Sparse Matrix Estimation

Authors: Christian Borgs, Jennifer Chayes, Devavrat Shah, Christina Lee Yu

Abstract: We consider sparse matrix estimation where the goal is to estimate an $n\times n$ matrix from noisy observations of a small subset of its entries. We analyze the estimation error of the popularly utilized collaborative filtering algorithm for the sparse regime. Specifically, we propose a novel iterative variant of the algorithm, adapted to handle the setting of sparse observations. We establish that as long as the fraction of entries observed at random scales as $\frac{\log^{1+κ}(n)}{n}$ for any fixed $κ> 0$, the estimation error with respect to the $\max$-norm decays to $0$ as $n\to\infty$ assuming the underlying matrix of interest has constant rank $r$. Our result is robust to model mis-specification in that if the underlying matrix is approximately rank $r$, then the estimation error decays to the approximate error with respect to the $\max$-norm. In the process, we establish the algorithm's ability to handle arbitrary bounded noise in the observations.

Submitted 10 September, 2021; v1 submitted 3 December, 2017; originally announced December 2017.

arXiv:1705.04867 [pdf, other]

doi 10.1109/TIT.2019.2950299

Nearest Neighbors for Matrix Estimation Interpreted as Blind Regression for Latent Variable Model

Authors: Yihua Li, Devavrat Shah, Dogyoon Song, Christina Lee Yu

Abstract: We consider the setup of nonparametric {\em blind regression} for estimating the entries of a large $m \times n$ matrix, when provided with a small, random fraction of noisy measurements. We assume that all rows $u \in [m]$ and columns $i \in [n]$ of the matrix are associated to latent features $x_{\text{row}}(u)$ and $x_{\text{col}}(i)$ respectively, and the $(u,i)$-th entry of the matrix, $A(u, i)$ is equal to $f(x_{\text{row}}(u), x_{\text{col}}(i))$ for a latent function $f$. Given noisy observations of a small, random subset of the matrix entries, our goal is to estimate the unobserved entries of the matrix as well as to "de-noise" the observed entries. As the main result of this work, we introduce a nearest-neighbor-based estimation algorithm, and establish its consistency when the underlying latent function $f$ is Lipschitz, the underlying latent space is a bounded diameter Polish space, and the random fraction of observed entries in the matrix is at least $\max \left( m^{-1 + δ}, n^{-1/2 + δ} \right)$, for any $δ> 0$. As an important byproduct, our analysis sheds light on the performance of the classical collaborative filtering algorithm for matrix completion, which has been widely utilized in practice. Experiments with the MovieLens and Netflix datasets suggest that our algorithm provides a principled improvement over basic collaborative filtering and is competitive with matrix factorization methods. Our algorithm has a natural extension to the setting of tensor completion via flattening the tensor to matrix. When applied to the setting of image in-painting, which is a $3$-order tensor, we find that our approach is competitive with respect to state-of-art tensor completion algorithms across benchmark images.
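The nearest-neighbor idea in this abstract can be sketched in a few lines. This is a simplified row-based variant, not the algorithm analyzed in the paper. The function name `nn_estimate`, the mean-squared-difference similarity, and the parameters `k` and `min_overlap` are all assumptions made for illustration. Missing entries are marked with `None`.

```python
def nn_estimate(M, u, i, k=2, min_overlap=2):
    """Estimate M[u][i] by averaging column i over the k rows most similar to row u.

    Similarity between rows u and v is the mean squared difference over the
    columns (other than i) that both rows have observed. Rows with fewer than
    min_overlap commonly observed columns are skipped.
    Returns None when no row qualifies as a neighbor.
    """
    candidates = []
    for v, row in enumerate(M):
        if v == u or row[i] is None:
            continue
        common = [j for j in range(len(row))
                  if j != i and M[u][j] is not None and row[j] is not None]
        if len(common) < min_overlap:
            continue
        dist = sum((M[u][j] - row[j]) ** 2 for j in common) / len(common)
        candidates.append((dist, v))
    if not candidates:
        return None
    candidates.sort()  # closest rows first
    nearest = candidates[:k]
    return sum(M[v][i] for _, v in nearest) / len(nearest)


# Toy ratings matrix: rows 0, 1 and 3 share one taste profile, row 2 differs.
ratings = [
    [5.0, 4.0, None, 1.0],
    [5.0, 4.0, 2.0, 1.0],
    [1.0, 2.0, 5.0, 4.0],
    [5.0, 4.0, 2.0, None],
]
estimate = nn_estimate(ratings, u=0, i=2, k=2)
```

Here rows 1 and 3 agree with row 0 on every shared column, so they are chosen as neighbors and the dissimilar row 2 is ignored.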

Submitted 31 October, 2019; v1 submitted 13 May, 2017; originally announced May 2017.

Comments: 27 pages, 3 figures. To appear in IEEE Transactions on Information Theory

MSC Class: 62G08; 62G99

arXiv:1703.08085 [pdf, ps, other]

Reducing Crowdsourcing to Graphon Estimation, Statistically

Authors: Devavrat Shah, Christina Lee Yu

Abstract: Inferring the correct answers to binary tasks based on multiple noisy answers in an unsupervised manner has emerged as the canonical question for micro-task crowdsourcing or more generally aggregating opinions. In graphon estimation, one is interested in estimating edge intensities or probabilities between nodes using a single snapshot of a graph realization. In the recent literature, there has been exciting development within both of these topics. In the context of crowdsourcing, the key intellectual challenge is to understand whether a given task can be more accurately denoised by aggregating answers collected from other different tasks. In the context of graphon estimation, precise information limits and estimation algorithms remain of interest. In this paper, we utilize a statistical reduction from crowdsourcing to graphon estimation to advance the state-of-art for both of these challenges. We use concepts from graphon estimation to design an algorithm that achieves better performance than the {\em majority voting} scheme for a setup that goes beyond the {\em rank one} models considered in the literature. We use known explicit lower bounds for crowdsourcing to provide refined lower bounds for graphon estimation.

Submitted 26 July, 2019; v1 submitted 23 March, 2017; originally announced March 2017.

arXiv:1508.03227 [pdf]

doi 10.1039/c5nr04273a

Formation of Long Single Quantum Dots in High Quality InSb Nanowires Grown by Molecular Beam Epitaxy

Authors: Dingxun Fan, Sen Li, N. Kang, Philippe Caroff, L. B. Wang, Y. Q. Huang, M. T. Deng, C. L. Yu, H. Q. Xu

Abstract: We report on realization and transport spectroscopy study of single quantum dots (QDs) made from InSb nanowires grown by molecular beam epitaxy (MBE). The nanowires employed are 50-80 nm in diameter and the QDs are defined in the nanowires between the source and drain contacts on a Si/SiO$_2$ substrate. We show that highly tunable QD devices can be realized with the MBE-grown InSb nanowires and the gate-to-dot capacitance extracted in the many-electron regimes is scaled linearly with the longitudinal dot size, demonstrating that the devices are of single InSb nanowire QDs even with a longitudinal size of ~700 nm. In the few-electron regime, the quantum levels in the QDs are resolved and the Landé g-factors extracted for the quantum levels from the magnetotransport measurements are found to be strongly level-dependent and fluctuated in a range of 18-48. A spin-orbit coupling strength is extracted from the magnetic field evolutions of a ground state and its neighboring excited state in an InSb nanowire QD and is on the order of ~300 $μ$eV. Our results establish that the MBE-grown InSb nanowires are of high crystal quality and are promising for the use in constructing novel quantum devices, such as entangled spin qubits, one-dimensional Wigner crystals and topological quantum computing devices.

Submitted 13 August, 2015; originally announced August 2015.

Comments: 19 pages, 5 figures

Journal ref: Nanoscale 7, 14822 (2015)

arXiv:1411.2647 [pdf, other]

Asynchronous Approximation of a Single Component of the Solution to a Linear System

Authors: Asuman Ozdaglar, Devavrat Shah, Christina Lee Yu

Abstract: We present a distributed asynchronous algorithm for approximating a single component of the solution to a system of linear equations $Ax = b$, where $A$ is a positive definite real matrix, and $b \in \mathbb{R}^n$. This is equivalent to solving for $x_i$ in $x = Gx + z$ for some $G$ and $z$ such that the spectral radius of $G$ is less than 1. Our algorithm relies on the Neumann series characterization of the component $x_i$, and is based on residual updates. We analyze our algorithm within the context of a cloud computation model, in which the computation is split into small update tasks performed by small processors with shared access to a distributed file system. We prove a robust asymptotic convergence result when the spectral radius $ρ(|G|) < 1$, regardless of the precise order and frequency in which the update tasks are performed. We provide convergence rate bounds which depend on the order of update tasks performed, analyzing both deterministic update rules via counting weighted random walks, as well as probabilistic update rules via concentration bounds. The probabilistic analysis requires analyzing the product of random matrices which are drawn from distributions that are time and path dependent. We specifically consider the setting where $n$ is large, yet $G$ is sparse, e.g., each row has at most $d$ nonzero entries. This is motivated by applications in which $G$ is derived from the edge structure of an underlying graph.
Our results prove that if the local neighborhood of the graph does not grow too quickly as a function of $n$, our algorithm can provide significant reduction in computation cost as opposed to any algorithm which computes the global solution vector $x$. Our algorithm obtains an $ε\|x\|_2$ additive approximation for $x_i$ in constant time with respect to the size of the matrix when the maximum row sparsity $d = O(1)$ and $1/(1-\|G\|_2) = O(1)$.

Submitted 21 January, 2019; v1 submitted 10 November, 2014; originally announced November 2014.

Report number: MIT LIDS Report 3172

arXiv:1406.4435 [pdf, other]

doi 10.1038/srep07261

Parity independence of the zero-bias conductance peak in a nanowire based topological superconductor-quantum dot hybrid device

Authors: M. T. Deng, C. L. Yu, G. Y. Huang, M. Larsson, P. Caroff, H. Q. Xu

Abstract: We explore the signatures of Majorana fermions in a nanowire based topological superconductor-quantum dot-topological superconductor hybrid device by charge transport measurements. The device is made from an epitaxially grown InSb nanowire with two superconductor Nb contacts on a Si/SiO$_2$ substrate. At low temperatures, a quantum dot is formed in the segment of the InSb nanowire between the two Nb contacts and the two Nb contacted segments of the InSb nanowire show superconductivity due to the proximity effect. At zero magnetic field, well defined Coulomb diamonds and the Kondo effect are observed in the charge stability diagram measurements in the Coulomb blockade regime of the quantum dot. Under the application of a finite, sufficiently strong magnetic field, a zero-bias conductance peak structure is observed in the same Coulomb blockade regime. It is found that the zero-bias conductance peak is present in many consecutive Coulomb diamonds, irrespective of the even-odd parity of the quasi-particle occupation number in the quantum dot. In addition, we find that the zero-bias conductance peak is in most cases accompanied by two differential conductance peaks, forming a triple-peak structure, and the separation between the two side peaks in bias voltage shows oscillations closely correlated to the background Coulomb conductance oscillations of the device.
The observed zero-bias conductance peak and the associated triple-peak structure are in line with the signatures of Majorana fermion physics in a nanowire based topological superconductor-quantum dot-topological superconductor system, in which the two Majorana bound states adjacent to the quantum dot are hybridized into a pair of quasi-particle states with finite energies and the other two Majorana bound states remain as the zero-energy modes located at the two ends of the entire InSb nanowire.

Submitted 12 June, 2014; originally announced June 2014.

Comments: 6 pages, 4 figures

Journal ref: Sci. Rep. 4, 7261 (2014)

arXiv:1404.3239 [pdf, other]

Generalized Method of Moments Estimator Based On Semiparametric Quantile Regression Imputation

Authors: Senniang Chen, Cindy L Yu

Abstract: In this article, we consider an imputation method to handle missing response values based on semiparametric quantile regression estimation. In the proposed method, the missing response values are generated using the estimated conditional quantile regression function at given values of covariates. We adopt the generalized method of moments for estimation of parameters defined through a general estimation equation. We demonstrate that the proposed estimator, which combines both semiparametric quantile regression imputation and generalized method of moments, has competitive edge against some of the most widely used parametric and non-parametric imputation estimators. The consistency and the asymptotic normality of our estimator are established and variance estimation is provided. Results from a limited simulation study and an empirical study are presented to show the adequacy of the proposed method.

Submitted 11 April, 2014; originally announced April 2014.

arXiv:1302.0122 [pdf, ps, other]

doi 10.3150/11-BEJ400

Parameter estimation and model testing for Markov processes via conditional characteristic functions

Authors: Song X. Chen, Liang Peng, Cindy L. Yu

Abstract: Markov processes are used in a wide range of disciplines, including finance. The transition densities of these processes are often unknown. However, the conditional characteristic functions are more likely to be available, especially for Lévy-driven processes. We propose an empirical likelihood approach, for both parameter estimation and model specification testing, based on the conditional characteristic function for processes with either continuous or discontinuous sample paths. Theoretical properties of the empirical likelihood estimator for parameters and a smoothed empirical likelihood ratio test for a parametric specification of the process are provided. Simulations and empirical case studies are carried out to confirm the effectiveness of the proposed estimator and test.

Submitted 1 February, 2013; originally announced February 2013.

Comments: Published at http://dx.doi.org/10.3150/11-BEJ400 in the Bernoulli (http://isi.cbs.nl/bernoulli/) by the International Statistical Institute/Bernoulli Society (http://isi.cbs.nl/BS/bshome.htm)

Report number: IMS-BEJ-BEJ400

Journal ref: Bernoulli 2013, Vol. 19, No. 1, 228-251

arXiv:1204.4130 [pdf, ps, other]

doi 10.1021/nl303758w

Observation of Majorana Fermions in a Nb-InSb Nanowire-Nb Hybrid Quantum Device

Authors: M. T. Deng, C. L. Yu, G. Y. Huang, M. Larsson, P. Caroff, H. Q. Xu

Abstract: We report on the observation of excitation of Majorana fermions in a Nb-InSb nanowire quantum dot-Nb hybrid system. The InSb nanowire quantum dot is formed between the two Nb contacts by weak Schottky barriers and is thus in the regime of strong couplings to the contacts. Due to the proximity effect, the InSb nanowire segments covered by superconductor Nb contacts turn to superconductors with a superconducting energy gap $Δ^*$. Under an applied magnetic field larger than a critical value for which the Zeeman energy in the InSb nanowire is $E_z\sim Δ^*$, the entire InSb nanowire is found to be in a nontrivial topological superconductor phase, supporting a pair of Majorana fermions, and Cooper pairs can transport between the superconductor Nb contacts via the Majorana fermion states. This transport process will be suppressed when the applied magnetic field becomes larger than a second critical value at which the transition to a trivial topological superconductor phase occurs in the system. This physical scenario has been observed in our experiment. We have found that the measured zero-bias conductance for our hybrid device shows a conductance plateau in a range of the applied magnetic field in quasi-particle Coulomb blockade regions.

Submitted 18 April, 2012; originally announced April 2012.

Comments: 7 pages, 4 figures, supplementary materials of 3 pages and 1 figure

Journal ref: Nano Lett. 12, 6414-6419 (2012)

arXiv:1202.0829 [pdf]

Tailoring Light-Matter Interaction with a Nanoscale Plasmon Resonator

Authors: Nathalie P. de Leon, Brendan J. Shields, Chun L. Yu, Dirk Englund, Alexey V. Akimov, Mikhail D. Lukin, Hongkun Park

Abstract: We propose and demonstrate a new approach for achieving strong light-matter interactions with quantum emitters. Our approach makes use of a plasmon resonator composed of defect-free, highly crystalline silver nanowires surrounded by patterned dielectric distributed Bragg reflectors (DBRs). These resonators have an effective mode volume (Veff) two orders of magnitude below the diffraction limit and quality factor (Q) approaching 100, enabling enhancement of spontaneous emission rates by a factor exceeding 75 at the cavity resonance. We also show that these resonators can be used to convert a broadband quantum emitter to a narrowband single-photon source with color-selective emission enhancement.

Submitted 3 February, 2012; originally announced February 2012.

arXiv:0904.3484 [pdf]

doi 10.1038/nphys1284

Near-field Electrical Detection of Optical Plasmons and Single Plasmon Sources

Authors: Abram L. Falk, Frank H. L. Koppens, Chun L. Yu, Kibum Kang, Nathalie de Leon Snapp, Alexey V. Akimov, Moon-Ho Jo, Mikhail D. Lukin, Hongkun Park

Abstract: Photonic circuits can be much faster than their electronic counterparts, but they are difficult to miniaturize below the optical wavelength scale. Nanoscale photonic circuits based on surface plasmon polaritons (SPs) are a promising solution to this problem because they can localize light below the diffraction limit. However, there is a general tradeoff between the localization of an SP and the efficiency with which it can be detected with conventional far-field optics. Here we describe a new all-electrical SP detection technique based on the near-field coupling between guided plasmons and a nanowire field-effect transistor. We use the technique to electrically detect the plasmon emission from an individual colloidal quantum dot coupled to an SP waveguide. Our detectors are both nanoscale and highly efficient (0.1 electrons/plasmon), and a plasmonic gating effect can be used to amplify the signal even higher (up to 50 electrons/plasmon). These results enable new on-chip optical sensing applications and are a key step towards "dark" optoplasmonic nanocircuits in which SPs can be generated, manipulated, and detected without involving far-field radiation.

Submitted 22 April, 2009; originally announced April 2009.

Comments: manuscript followed by supplementary information

arXiv:0809.3475 [pdf]

doi 10.1021/nl802797y

Electrically Driven Light Emission from Individual CdSe Nanowires

Authors: Yong-Joo Doh, Kristin N. Maher, Lian Ouyang, Chun L. Yu, Hongkun Park, Jiwoong Park

Abstract: We report electroluminescence (EL) measurements carried out on three-terminal devices incorporating individual n-type CdSe nanowires. Simultaneous optical and electrical measurements reveal that EL occurs near the contact between the nanowire and a positively biased electrode or drain. The surface potential profile, obtained by using Kelvin probe microscopy, shows an abrupt potential drop near the position of the EL spot, while the band profile obtained from scanning photocurrent microscopy indicates the existence of an n-type Schottky barrier at the interface. These observations indicate that light emission occurs through a hole leakage or an inelastic scattering induced by the rapid potential drop at the nanowire-electrode interface.

Submitted 19 September, 2008; originally announced September 2008.

Comments: 12 pages, 4 figures

Journal ref: Nano Letters vol.8, No.12, 4552-4556 (2008)

Showing 1–40 of 40 results for author: Yu, C L