Showing 1–46 of 46 results for author: Vijayaraghavan, A

  1. arXiv:2504.03603  [pdf, other]

    cs.AI cs.LG

    Towards deployment-centric multimodal AI beyond vision and language

    Authors: Xianyuan Liu, Jiayang Zhang, Shuo Zhou, Thijs L. van der Plas, Avish Vijayaraghavan, Anastasiia Grishina, Mengdie Zhuang, Daniel Schofield, Christopher Tomlinson, Yuhan Wang, Ruizhe Li, Louisa van Zeeland, Sina Tabakhi, Cyndie Demeocq, Xiang Li, Arunav Das, Orlando Timmerman, Thomas Baldwin-McDonald, Jinge Wu, Peizhen Bai, Zahraa Al Sahili, Omnia Alwazzan, Thao N. Do, Mohammod N. I. Suvon, Angeline Wang, et al. (23 additional authors not shown)

    Abstract: Multimodal artificial intelligence (AI) integrates diverse types of data via machine learning to improve understanding, prediction, and decision-making across disciplines such as healthcare, science, and engineering. However, most multimodal AI advances focus on models for vision and language data, while their deployability remains a key challenge. We advocate a deployment-centric workflow that in…

    Submitted 4 April, 2025; originally announced April 2025.

  2. arXiv:2504.02723  [pdf, other]

    cs.DS cs.LG math.ST stat.ML

    Computing High-dimensional Confidence Sets for Arbitrary Distributions

    Authors: Chao Gao, Liren Shan, Vaidehi Srinivas, Aravindan Vijayaraghavan

    Abstract: We study the problem of learning a high-density region of an arbitrary distribution over $\mathbb{R}^d$. Given a target coverage parameter $δ$, and sample access to an arbitrary distribution $D$, we want to output a confidence set $S \subset \mathbb{R}^d$ such that $S$ achieves $δ$ coverage of $D$, i.e., $\mathbb{P}_{y \sim D} \left[ y \in S \right] \ge δ$, and the volume of $S$ is as small as pos…

    Submitted 12 May, 2025; v1 submitted 3 April, 2025; originally announced April 2025.

    Comments: Improves volume approximation factor from $\exp(\tilde{O}(d^{2/3}))$ to $\exp(\tilde{O}(d^{1/2}))$, along with other minor edits. To appear in COLT 2025

  3. arXiv:2502.16658  [pdf, other]

    cs.LG stat.ML

    Volume Optimality in Conformal Prediction with Structured Prediction Sets

    Authors: Chao Gao, Liren Shan, Vaidehi Srinivas, Aravindan Vijayaraghavan

    Abstract: Conformal Prediction is a widely studied technique to construct prediction sets of future observations. Most conformal prediction methods focus on achieving the necessary coverage guarantees, but do not provide formal guarantees on the size (volume) of the prediction sets. We first prove an impossibility of volume optimality where any distribution-free method can only find a trivial solution. We t…

    Submitted 23 February, 2025; originally announced February 2025.

    Comments: 41 pages, 19 figures, 2 tables

  4. arXiv:2411.14349  [pdf, ps, other]

    cs.LG cs.DS stat.ML

    Agnostic Learning of Arbitrary ReLU Activation under Gaussian Marginals

    Authors: Anxin Guo, Aravindan Vijayaraghavan

    Abstract: We consider the problem of learning an arbitrarily-biased ReLU activation (or neuron) over Gaussian marginals with the squared loss objective. Despite the ReLU neuron being the basic building block of modern neural networks, we still do not understand the basic algorithmic question of whether one arbitrary ReLU neuron is learnable in the non-realizable setting. In particular, all existing polynomi…

    Submitted 22 November, 2024; v1 submitted 21 November, 2024; originally announced November 2024.

  5. arXiv:2405.16043  [pdf, other]

    cs.LG cs.CL stat.ML

    Theoretical Analysis of Weak-to-Strong Generalization

    Authors: Hunter Lang, David Sontag, Aravindan Vijayaraghavan

    Abstract: Strong student models can learn from weaker teachers: when trained on the predictions of a weaker model, a strong pretrained student can learn to correct the weak model's errors and generalize to examples where the teacher is not confident, even when these examples are excluded from training. This enables learning from cheap, incomplete, and possibly incorrect label information, such as coarse log…

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: 36 pages, 3 figures

  6. arXiv:2405.15084  [pdf, other]

    cs.DS cs.LG stat.ML

    Efficient Certificates of Anti-Concentration Beyond Gaussians

    Authors: Ainesh Bakshi, Pravesh Kothari, Goutham Rajendran, Madhur Tulsiani, Aravindan Vijayaraghavan

    Abstract: A set of high dimensional points $X=\{x_1, x_2,\ldots, x_n\} \subset R^d$ in isotropic position is said to be $δ$-anti concentrated if for every direction $v$, the fraction of points in $X$ satisfying $|\langle x_i,v \rangle |\leq δ$ is at most $O(δ)$. Motivated by applications to list-decodable learning and clustering, recent works have considered the problem of constructing efficient certificate…

    Submitted 28 October, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

    Comments: updated exposition; added certifiable hypercontractivity of degree-two polynomials for any Poincaré distribution

  7. arXiv:2405.01517  [pdf, other]

    cs.DS

    New Tools for Smoothed Analysis: Least Singular Value Bounds for Random Matrices with Dependent Entries

    Authors: Aditya Bhaskara, Eric Evert, Vaidehi Srinivas, Aravindan Vijayaraghavan

    Abstract: We develop new techniques for proving lower bounds on the least singular value of random matrices with limited randomness. The matrices we consider have entries that are given by polynomials of a few underlying base random variables. This setting captures a core technical challenge for obtaining smoothed analysis guarantees in many algorithmic settings. Least singular value bounds often involve sh…

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: To appear in STOC 2024

  8. arXiv:2401.17952  [pdf, ps, other]

    cs.CY cs.DS cs.IR

    Error-Tolerant E-Discovery Protocols

    Authors: Jinshuo Dong, Jason D. Hartline, Liren Shan, Aravindan Vijayaraghavan

    Abstract: We consider the multi-party classification problem introduced by Dong, Hartline, and Vijayaraghavan (2022) in the context of electronic discovery (e-discovery). Based on a request for production from the requesting party, the responding party is required to provide documents that are responsive to the request except for those that are legally privileged. Our goal is to find a protocol that verifie…

    Submitted 31 January, 2024; originally announced January 2024.

    Comments: 28 pages, 6 figures, CSLAW 2024

  9. arXiv:2308.10160  [pdf, other]

    cs.DS

    Higher-Order Cheeger Inequality for Partitioning with Buffers

    Authors: Konstantin Makarychev, Yury Makarychev, Liren Shan, Aravindan Vijayaraghavan

    Abstract: We prove a new generalization of the higher-order Cheeger inequality for partitioning with buffers. Consider a graph $G=(V,E)$. The buffered expansion of a set $S \subseteq V$ with a buffer $B \subseteq V \setminus S$ is the edge expansion of $S$ after removing all the edges from set $S$ to its buffer $B$. An $\varepsilon$-buffered $k$-partitioning is a partitioning of a graph into disjoint compon…

    Submitted 20 August, 2023; originally announced August 2023.

    Comments: 45 pages

  10. arXiv:2307.00660  [pdf, ps, other]

    cs.AI cs.CY

    Minimum Levels of Interpretability for Artificial Moral Agents

    Authors: Avish Vijayaraghavan, Cosmin Badea

    Abstract: As artificial intelligence (AI) models continue to scale up, they are becoming more capable and integrated into various forms of decision-making systems. For models involved in moral decision-making, also known as artificial moral agents (AMA), interpretability provides a way to trust and understand the agent's internal reasoning mechanisms for effective use and error correction. In this paper, we…

    Submitted 2 July, 2023; originally announced July 2023.

  11. arXiv:2212.03851  [pdf, ps, other]

    cs.DS cs.LG math.AG quant-ph

    Computing linear sections of varieties: quantum entanglement, tensor decompositions and beyond

    Authors: Nathaniel Johnston, Benjamin Lovitz, Aravindan Vijayaraghavan

    Abstract: We study the problem of finding elements in the intersection of an arbitrary conic variety in $\mathbb{F}^n$ with a given linear subspace (where $\mathbb{F}$ can be the real or complex field). This problem captures a rich family of algorithmic problems under different choices of the variety. The special case of the variety consisting of rank-1 matrices already has strong connections to central pro…

    Submitted 7 May, 2023; v1 submitted 7 December, 2022; originally announced December 2022.

    Comments: 39 pages. V3: Simplified some arguments and notation. Comments welcome!

  12. arXiv:2211.12389  [pdf, other]

    math.OC cs.DS

    The Burer-Monteiro SDP method can fail even above the Barvinok-Pataki bound

    Authors: Liam O'Carroll, Vaidehi Srinivas, Aravindan Vijayaraghavan

    Abstract: The most widely used technique for solving large-scale semidefinite programs (SDPs) in practice is the non-convex Burer-Monteiro method, which explicitly maintains a low-rank SDP solution for memory efficiency. There has been much recent interest in obtaining a better theoretical understanding of the Burer-Monteiro method. When the maximum allowed rank $p$ of the SDP solution is above the Barvinok…

    Submitted 22 November, 2022; originally announced November 2022.

  13. arXiv:2209.02690  [pdf, ps, other]

    cs.CR cs.CY cs.DS cs.LG stat.ML

    Classification Protocols with Minimal Disclosure

    Authors: Jinshuo Dong, Jason Hartline, Aravindan Vijayaraghavan

    Abstract: We consider multi-party protocols for classification that are motivated by applications such as e-discovery in court proceedings. We identify a protocol that guarantees that the requesting party receives all responsive documents and the sending party discloses the minimal amount of non-responsive documents necessary to prove that all responsive documents have been received. This protocol can be em…

    Submitted 6 September, 2022; originally announced September 2022.

    Journal ref: In Proceedings of the 2022 Symposium on Computer Science and Law (CSLAW '22), November 1-2, 2022, Washington, DC, USA. ACM, New York, NY, USA, 10 pages

  14. arXiv:2208.02711  [pdf, ps, other]

    cs.LG cs.DS stat.ML

    Agnostic Learning of General ReLU Activation Using Gradient Descent

    Authors: Pranjal Awasthi, Alex Tang, Aravindan Vijayaraghavan

    Abstract: We provide a convergence analysis of gradient descent for the problem of agnostically learning a single ReLU function with moderate bias under Gaussian distributions. Unlike prior work that studies the setting of zero bias, we consider the more challenging scenario when the bias of the ReLU function is non-zero. Our main result establishes that starting from random initialization, in a polynomial…

    Submitted 3 November, 2024; v1 submitted 4 August, 2022; originally announced August 2022.

    Comments: 28 pages

  15. arXiv:2206.02914  [pdf, other]

    stat.ML cs.AI cs.LG

    Training Subset Selection for Weak Supervision

    Authors: Hunter Lang, Aravindan Vijayaraghavan, David Sontag

    Abstract: Existing weak supervision approaches use all the data covered by weak signals to train a classifier. We show both theoretically and empirically that this is not always optimal. Intuitively, there is a tradeoff between the amount of weakly-labeled data and the precision of the weak labels. We explore this tradeoff by combining pretrained data representations with the cut statistic (Muhlenbach et al…

    Submitted 6 March, 2023; v1 submitted 6 June, 2022; originally announced June 2022.

    Comments: NeurIPS 2022

  16. arXiv:2107.10209  [pdf, ps, other]

    cs.LG cs.DS stat.ML

    Efficient Algorithms for Learning Depth-2 Neural Networks with General ReLU Activations

    Authors: Pranjal Awasthi, Alex Tang, Aravindan Vijayaraghavan

    Abstract: We present polynomial time and sample efficient algorithms for learning an unknown depth-2 feedforward neural network with general ReLU activations, under mild non-degeneracy assumptions. In particular, we consider learning an unknown network of the form $f(x) = {a}^{\mathsf{T}}σ({W}^\mathsf{T}x+b)$, where $x$ is drawn from the Gaussian distribution, and $σ(t) := \max(t,0)$ is the ReLU activation.…

    Submitted 1 August, 2021; v1 submitted 21 July, 2021; originally announced July 2021.

    Comments: 45 pages (including appendix). This version fixes an error in the previous version of the paper

  17. arXiv:2103.00034  [pdf, other]

    stat.ML cs.LG

    Beyond Perturbation Stability: LP Recovery Guarantees for MAP Inference on Noisy Stable Instances

    Authors: Hunter Lang, Aravind Reddy, David Sontag, Aravindan Vijayaraghavan

    Abstract: Several works have shown that perturbation stable instances of the MAP inference problem in Potts models can be solved exactly using a natural linear programming (LP) relaxation. However, most of these works give few (or no) guarantees for the LP solutions on instances that do not satisfy the relatively strict perturbation stability definitions. In this work, we go beyond these stability results b…

    Submitted 26 February, 2021; originally announced March 2021.

    Comments: 25 pages, 2 figures, 2 tables. To appear in AISTATS 2021

  18. arXiv:2011.03639  [pdf, other]

    stat.ML cs.AI cs.DS cs.LG

    Graph cuts always find a global optimum for Potts models (with a catch)

    Authors: Hunter Lang, David Sontag, Aravindan Vijayaraghavan

    Abstract: We prove that the $α$-expansion algorithm for MAP inference always returns a globally optimal assignment for Markov Random Fields with Potts pairwise potentials, with a catch: the returned assignment is only guaranteed to be optimal for an instance within a small perturbation of the original problem instance. In other words, all local minima with respect to expansion moves are global minima to sli…

    Submitted 14 June, 2021; v1 submitted 6 November, 2020; originally announced November 2020.

    Comments: Published at ICML 2021. 18 pages, 2 figures

  19. arXiv:2010.02841  [pdf, ps, other]

    cs.DS

    Learning a mixture of two subspaces over finite fields

    Authors: Aidao Chen, Anindya De, Aravindan Vijayaraghavan

    Abstract: We study the problem of learning a mixture of two subspaces over $\mathbb{F}_2^n$. The goal is to recover the individual subspaces, given samples from a (weighted) mixture of samples drawn uniformly from the two subspaces $A_0$ and $A_1$. This problem is computationally challenging, as it captures the notorious problem of "learning parities with noise" in the degenerate setting when…

    Submitted 15 February, 2021; v1 submitted 6 October, 2020; originally announced October 2020.

  20. arXiv:2007.15589  [pdf, ps, other]

    cs.DS cs.LG

    Efficient Tensor Decomposition

    Authors: Aravindan Vijayaraghavan

    Abstract: This chapter studies the problem of decomposing a tensor into a sum of constituent rank one tensors. While tensor decompositions are very useful in designing learning algorithms and data analysis, they are NP-hard in the worst-case. We will see how to design efficient algorithms with provable guarantees under mild assumptions, and using beyond worst-case frameworks like smoothed analysis.

    Submitted 30 July, 2020; originally announced July 2020.

    Comments: Chapter 19 of the book "Beyond the Worst-Case Analysis of Algorithms", edited by Tim Roughgarden and published by Cambridge University Press (2020). We hope to occasionally update the survey here to include discussions of new results and advances

  21. arXiv:2007.06555  [pdf, other]

    cs.LG cs.DS stat.ML

    Adversarial robustness via robust low rank representations

    Authors: Pranjal Awasthi, Himanshu Jain, Ankit Singh Rawat, Aravindan Vijayaraghavan

    Abstract: Adversarial robustness measures the susceptibility of a classifier to imperceptible perturbations made to the inputs at test time. In this work we highlight the benefits of natural low rank representations that often exist for real data such as images, for training neural networks with certified robustness guarantees. Our first contribution is for certified robustness to perturbations measured i…

    Submitted 1 August, 2020; v1 submitted 13 July, 2020; originally announced July 2020.

    Comments: fixed a bug in the proof of Proposition B.2

  22. arXiv:2006.00602  [pdf, ps, other]

    cs.LG cs.DS stat.ML

    Estimating Principal Components under Adversarial Perturbations

    Authors: Pranjal Awasthi, Xue Chen, Aravindan Vijayaraghavan

    Abstract: Robustness is a key requirement for widespread deployment of machine learning algorithms, and has received much attention in both statistics and computer science. We study a natural model of robustness for high-dimensional statistical estimation problems that we call the adversarial perturbation model. An adversary can perturb every sample arbitrarily up to a specified magnitude $δ$ measured in so…

    Submitted 1 June, 2020; v1 submitted 31 May, 2020; originally announced June 2020.

    Comments: It is to appear at COLT 2020

  23. arXiv:2004.10776  [pdf, other]

    cs.DS cs.DC

    Scheduling Precedence-Constrained Jobs on Related Machines with Communication Delay

    Authors: Biswaroop Maiti, Rajmohan Rajaraman, David Stalfa, Zoya Svitkina, Aravindan Vijayaraghavan

    Abstract: We consider the problem of scheduling $n$ precedence-constrained jobs on $m$ uniformly-related machines in the presence of an arbitrary, fixed communication delay $ρ$. We consider a model that allows job duplication, i.e. processing of the same job on multiple machines, which, as we show, can reduce the length of a schedule (i.e., its makespan) by a logarithmic factor. Our main result is an…

    Submitted 22 April, 2020; originally announced April 2020.

  24. arXiv:1911.13268  [pdf, other]

    cs.DS cs.LG math.ST stat.ML

    Adversarially Robust Low Dimensional Representations

    Authors: Pranjal Awasthi, Vaggos Chatziafratis, Xue Chen, Aravindan Vijayaraghavan

    Abstract: Many machine learning systems are vulnerable to small perturbations made to inputs either at test time or at training time. This has received much recent interest on the empirical front due to applications where reliability and security are critical. However, theoretical understanding of algorithms that are robust to adversarial perturbations is limited. In this work we focus on Principal Compon…

    Submitted 13 August, 2021; v1 submitted 29 November, 2019; originally announced November 2019.

    Comments: 68 pages including references

  25. arXiv:1911.04681  [pdf, other]

    cs.LG cs.DS stat.ML

    On Robustness to Adversarial Examples and Polynomial Optimization

    Authors: Pranjal Awasthi, Abhratanu Dutta, Aravindan Vijayaraghavan

    Abstract: We study the design of computationally efficient algorithms with provable guarantees, that are robust to adversarial (test time) perturbations. While there has been a proliferation of recent work on this topic due to its connections to test time robustness of deep networks, there is limited theoretical understanding of several basic questions like (i) when and how can one design provably robust l…

    Submitted 12 November, 2019; originally announced November 2019.

    Comments: To appear at NeurIPS2019. 30 pages

  26. arXiv:1811.12361  [pdf, ps, other]

    cs.DS cs.LG stat.ML

    Smoothed Analysis in Unsupervised Learning via Decoupling

    Authors: Aditya Bhaskara, Aidao Chen, Aidan Perreault, Aravindan Vijayaraghavan

    Abstract: Smoothed analysis is a powerful paradigm in overcoming worst-case intractability in unsupervised learning and high-dimensional data analysis. While polynomial time smoothed analysis guarantees have been obtained for worst-case intractable problems like tensor decompositions and learning mixtures of Gaussians, such guarantees have been hard to obtain for several other important problems in unsuperv…

    Submitted 23 April, 2019; v1 submitted 29 November, 2018; originally announced November 2018.

    Comments: 44 pages

  27. arXiv:1810.05305  [pdf, other]

    stat.ML cs.AI cs.LG

    Block Stability for MAP Inference

    Authors: Hunter Lang, David Sontag, Aravindan Vijayaraghavan

    Abstract: To understand the empirical success of approximate MAP inference, recent work (Lang et al., 2018) has shown that some popular approximation algorithms perform very well when the input instance is stable. The simplest stability condition assumes that the MAP solution does not change at all when some of the pairwise potentials are (adversarially) perturbed. Unfortunately, this strong condition does…

    Submitted 12 November, 2020; v1 submitted 11 October, 2018; originally announced October 2018.

  28. arXiv:1804.08603  [pdf, ps, other]

    cs.LG cs.DS stat.ML

    Towards Learning Sparsely Used Dictionaries with Arbitrary Supports

    Authors: Pranjal Awasthi, Aravindan Vijayaraghavan

    Abstract: Dictionary learning is a popular approach for inferring a hidden basis or dictionary in which data has a sparse representation. Data generated from the dictionary A (an n by m matrix, with m > n in the over-complete setting) is given by Y = AX where X is a matrix whose columns have supports chosen from a distribution over k-sparse vectors, and the non-zero values chosen from a symmetric distributi…

    Submitted 8 May, 2018; v1 submitted 23 April, 2018; originally announced April 2018.

    Comments: 72 pages, fixed minor typos, and added a new reference in related work

  29. arXiv:1712.01241  [pdf, other]

    cs.LG cs.DS

    Clustering Stable Instances of Euclidean k-means

    Authors: Abhratanu Dutta, Aravindan Vijayaraghavan, Alex Wang

    Abstract: The Euclidean k-means problem is arguably the most widely-studied clustering problem in machine learning. While the k-means objective is NP-hard in the worst-case, practitioners have enjoyed remarkable success in applying heuristics like Lloyd's algorithm for this problem. To address this disconnect, we study the following question: what properties of real-world instances will enable us to design…

    Submitted 4 December, 2017; originally announced December 2017.

    Comments: 23 pages, 2 figures, appearing in NIPS 2017

  30. arXiv:1711.08841  [pdf, ps, other]

    cs.DS cs.LG

    Clustering Semi-Random Mixtures of Gaussians

    Authors: Pranjal Awasthi, Aravindan Vijayaraghavan

    Abstract: Gaussian mixture models (GMM) are the most widely used statistical model for the $k$-means clustering problem and form a popular framework for clustering in machine learning and data analysis. In this paper, we propose a natural semi-random model for $k$-means clustering that generalizes the Gaussian mixture model, and that we believe will be useful in identifying robust algorithms. In our model,…

    Submitted 23 November, 2017; originally announced November 2017.

  31. arXiv:1711.02195  [pdf, ps, other]

    stat.ML cs.AI cs.DS cs.LG

    Optimality of Approximate Inference Algorithms on Stable Instances

    Authors: Hunter Lang, David Sontag, Aravindan Vijayaraghavan

    Abstract: Approximate algorithms for structured prediction problems---such as LP relaxations and the popular alpha-expansion algorithm (Boykov et al. 2001)---typically far exceed their theoretical performance guarantees on real-world instances. These algorithms often find solutions that are very close to optimal. The goal of this paper is to partially explain the performance of alpha-expansion and an LP rel…

    Submitted 23 April, 2018; v1 submitted 6 November, 2017; originally announced November 2017.

    Comments: 13 pages, 2 figures

  32. arXiv:1710.11592  [pdf, ps, other]

    cs.DS cs.LG math.ST

    On Learning Mixtures of Well-Separated Gaussians

    Authors: Oded Regev, Aravindan Vijayaraghavan

    Abstract: We consider the problem of efficiently learning mixtures of a large number of spherical Gaussians, when the components of the mixture are well separated. In the most basic form of this problem, we are given samples from a uniform mixture of $k$ standard spherical Gaussians, and the goal is to estimate the means up to accuracy $δ$ using $poly(k,d, 1/δ)$ samples. In this work, we study the followi…

    Submitted 31 October, 2017; originally announced October 2017.

    Comments: Appeared in FOCS 2017. 55 pages, 1 figure

  33. arXiv:1511.03229  [pdf, ps, other]

    cs.DS cs.LG math.ST

    Learning Communities in the Presence of Errors

    Authors: Konstantin Makarychev, Yury Makarychev, Aravindan Vijayaraghavan

    Abstract: We study the problem of learning communities in the presence of modeling errors and give robust recovery algorithms for the Stochastic Block Model (SBM). This model, which is also known as the Planted Partition Model, is widely used for community detection and graph partitioning in various fields, including machine learning, statistics, and social sciences. Many algorithms exist for learning commu…

    Submitted 24 June, 2016; v1 submitted 10 November, 2015; originally announced November 2015.

    Comments: 34 pages. Appearing in the Conference on Learning Theory (COLT)'16

  34. arXiv:1505.03424  [pdf, other]

    cs.CC cs.DS

    Beating the random assignment on constraint satisfaction problems of bounded degree

    Authors: Boaz Barak, Ankur Moitra, Ryan O'Donnell, Prasad Raghavendra, Oded Regev, David Steurer, Luca Trevisan, Aravindan Vijayaraghavan, David Witmer, John Wright

    Abstract: We show that for any odd $k$ and any instance of the Max-kXOR constraint satisfaction problem, there is an efficient algorithm that finds an assignment satisfying at least a $\frac{1}{2} + Ω(1/\sqrt{D})$ fraction of constraints, where $D$ is a bound on the number of constraints that each variable occurs in. This improves both qualitatively and quantitatively on the recent work of Farhi, Goldstone,…

    Submitted 11 August, 2015; v1 submitted 13 May, 2015; originally announced May 2015.

    Comments: 14 pages, 1 figure

  35. arXiv:1410.8750  [pdf, ps, other]

    cs.LG

    Learning Mixtures of Ranking Models

    Authors: Pranjal Awasthi, Avrim Blum, Or Sheffet, Aravindan Vijayaraghavan

    Abstract: This work concerns learning probabilistic models for ranking data in a heterogeneous population. The specific problem we study is learning the parameters of a Mallows Mixture Model. Despite being widely studied, current heuristics for this problem do not have theoretical guarantees and can get stuck in bad local optima. We present the first polynomial time algorithm which provably learns the param…

    Submitted 31 October, 2014; originally announced October 2014.

  36. arXiv:1406.5667  [pdf, ps, other]

    cs.DS cs.LG

    Correlation Clustering with Noisy Partial Information

    Authors: Konstantin Makarychev, Yury Makarychev, Aravindan Vijayaraghavan

    Abstract: In this paper, we propose and study a semi-random model for the Correlation Clustering problem on arbitrary graphs G. We give two approximation algorithms for Correlation Clustering instances from this model. The first algorithm finds a solution of value $(1+ δ) optcost + O_δ(n\log^3 n)$ with high probability, where $optcost$ is the value of the optimal solution (for every $δ> 0$). The second algo…

    Submitted 12 May, 2015; v1 submitted 21 June, 2014; originally announced June 2014.

    Comments: To appear at Conference on Learning Theory (COLT) 2015. Substantial changes from previous version, including a new section on recovery of the ground truth clustering. 20 pages

  37. arXiv:1406.5665  [pdf, ps, other]

    cs.DS cs.LG

    Constant Factor Approximation for Balanced Cut in the PIE model

    Authors: Konstantin Makarychev, Yury Makarychev, Aravindan Vijayaraghavan

    Abstract: We propose and study a new semi-random semi-adversarial model for Balanced Cut, a planted model with permutation-invariant random edges (PIE). Our model is much more general than planted models considered previously. Consider a set of vertices V partitioned into two clusters $L$ and $R$ of equal size. Let $G$ be an arbitrary graph on $V$ with no edges between $L$ and $R$. Let $E_{random}$ be a set…

    Submitted 21 June, 2014; originally announced June 2014.

    Comments: Full version of the paper at the 46th ACM Symposium on the Theory of Computing (STOC 2014). 32 pages

  38. arXiv:1311.3651  [pdf, ps, other]

    cs.DS cs.LG stat.ML

    Smoothed Analysis of Tensor Decompositions

    Authors: Aditya Bhaskara, Moses Charikar, Ankur Moitra, Aravindan Vijayaraghavan

    Abstract: Low rank tensor decompositions are a powerful tool for learning generative models, and uniqueness results give them a significant advantage over matrix decomposition methods. However, tensors pose significant algorithmic challenges and tensor analogs of much of the matrix algebra toolkit are unlikely to exist because of hardness results. Efficient decomposition in the overcomplete case (where ran…

    Submitted 20 January, 2014; v1 submitted 14 November, 2013; originally announced November 2013.

    Comments: 32 pages (including appendix)

  39. arXiv:1305.1681  [pdf, ps, other]

    cs.DS

    Bilu-Linial Stable Instances of Max Cut and Minimum Multiway Cut

    Authors: Konstantin Makarychev, Yury Makarychev, Aravindan Vijayaraghavan

    Abstract: We investigate the notion of stability proposed by Bilu and Linial. We obtain an exact polynomial-time algorithm for $γ$-stable Max Cut instances with $γ\geq c\sqrt{\log n}\log\log n$ for some absolute constant $c > 0$. Our algorithm is robust: it never returns an incorrect answer; if the instance is $γ$-stable, it finds the maximum cut, otherwise, it either finds the maximum cut or certifies that…

    Submitted 11 November, 2013; v1 submitted 7 May, 2013; originally announced May 2013.

    Comments: 24 pages

  40. arXiv:1304.8087  [pdf, other]

    cs.DS cs.LG math.ST

    Uniqueness of Tensor Decompositions with Applications to Polynomial Identifiability

    Authors: Aditya Bhaskara, Moses Charikar, Aravindan Vijayaraghavan

    Abstract: We give a robust version of the celebrated result of Kruskal on the uniqueness of tensor decompositions: we prove that given a tensor whose decomposition satisfies a robust form of Kruskal's rank condition, it is possible to approximately recover the decomposition if the tensor is known up to a sufficiently small (inverse polynomial) error. Kruskal's theorem has found many applications in provin…

    Submitted 30 April, 2013; originally announced April 2013.

    Comments: 51 pages, 2 figures

  41. arXiv:1205.2234  [pdf, ps, other]

    cs.DS cs.CC

    Approximation Algorithms for Semi-random Graph Partitioning Problems

    Authors: Konstantin Makarychev, Yury Makarychev, Aravindan Vijayaraghavan

    Abstract: In this paper, we propose and study a new semi-random model for graph partitioning problems. We believe that it captures many properties of real--world instances. The model is more flexible than the semi-random model of Feige and Kilian and planted random model of Bui, Chaudhuri, Leighton and Sipser. We develop a general framework for solving semi-random instances and apply it to several problem…

    Submitted 10 May, 2012; originally announced May 2012.

    Comments: To appear at the 44th ACM Symposium on Theory of Computing (STOC 2012)

  42. arXiv:1112.3611  [pdf, other]

    cs.DS cs.CC

    Approximation Algorithms and Hardness of the k-Route Cut Problem

    Authors: Julia Chuzhoy, Yury Makarychev, Aravindan Vijayaraghavan, Yuan Zhou

    Abstract: We study the k-route cut problem: given an undirected edge-weighted graph G=(V,E), a collection {(s_1,t_1),(s_2,t_2),...,(s_r,t_r)} of source-sink pairs, and an integer connectivity requirement k, the goal is to find a minimum-weight subset E' of edges to remove, such that the connectivity of every pair (s_i, t_i) falls below k. Specifically, in the edge-connectivity version, EC-kRC, the requireme…

    Submitted 15 December, 2011; v1 submitted 15 December, 2011; originally announced December 2011.

    Comments: To appear in the Symposium on Discrete Algorithms (SODA) 2012. 44 pages

  43. arXiv:1110.1360  [pdf, other]

    cs.DS cs.CC

    Polynomial integrality gaps for strong SDP relaxations of Densest k-subgraph

    Authors: Aditya Bhaskara, Moses Charikar, Venkatesan Guruswami, Aravindan Vijayaraghavan, Yuan Zhou

    Abstract: The densest k-subgraph (DkS) problem (i.e. find a size k subgraph with maximum number of edges), is one of the notorious problems in approximation algorithms. There is a significant gap between known upper and lower bounds for DkS: the current best algorithm gives an ~ O(n^{1/4}) approximation, while even showing a small constant factor hardness requires significantly stronger assumptions than P !…

    Submitted 6 October, 2011; originally announced October 2011.

    Comments: 26 pages, 1 figure. To appear in Symposium on Discrete Algorithms (SODA) 2012

  44. arXiv:1101.1710  [pdf, ps, other]

    cs.CC cs.DS

    On Quadratic Programming with a Ratio Objective

    Authors: Aditya Bhaskara, Moses Charikar, Rajsekar Manokaran, Aravindan Vijayaraghavan

    Abstract: Quadratic Programming (QP) is the well-studied problem of maximizing over {-1,1} values the quadratic form \sum_{i \ne j} a_{ij} x_i x_j. QP captures many known combinatorial optimization problems, and assuming the unique games conjecture, semidefinite programming techniques give optimal approximation algorithms. We extend this body of work by initiating the study of Quadratic Programming problems…

    Submitted 5 December, 2011; v1 submitted 10 January, 2011; originally announced January 2011.

  45. arXiv:1001.2891  [pdf, ps, other]

    cs.DS

    Detecting High Log-Densities -- an O(n^1/4) Approximation for Densest k-Subgraph

    Authors: Aditya Bhaskara, Moses Charikar, Eden Chlamtac, Uriel Feige, Aravindan Vijayaraghavan

    Abstract: In the Densest k-Subgraph problem, given a graph G and a parameter k, one needs to find a subgraph of G induced on k vertices that contains the largest number of edges. There is a significant gap between the best known upper and lower bounds for this problem. It is NP-hard, and does not have a PTAS unless NP has subexponential time algorithms. On the other hand, the current best known algorithm…

    Submitted 17 January, 2010; originally announced January 2010.

    Comments: 23 pages

  46. arXiv:1001.2613  [pdf, ps, other]

    cs.DS cs.CC

    Approximating Matrix p-norms

    Authors: Aditya Bhaskara, Aravindan Vijayaraghavan

    Abstract: We consider the problem of computing the q->p norm of a matrix A, which is defined for p,q \ge 1, as |A|_{q->p} = max_{x !=0 } |Ax|_p / |x|_q. This is in general a non-convex optimization problem, and is a natural generalization of the well-studied question of computing singular values (this corresponds to p=q=2). Different settings of parameters give rise to a variety of known interesting problem…

    Submitted 2 May, 2010; v1 submitted 15 January, 2010; originally announced January 2010.

    Comments: 25 pages