Skip to main content

Showing 1–22 of 22 results for author: Crammer, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2105.14095  [pdf, ps, other

    cs.LG cs.CL stat.ML

    Weighted Training for Cross-Task Learning

    Authors: Shuxiao Chen, Koby Crammer, Hangfeng He, Dan Roth, Weijie J. Su

    Abstract: In this paper, we introduce Target-Aware Weighted Training (TAWT), a weighted training algorithm for cross-task learning based on minimizing a representation-based task distance between the source and target tasks. We show that TAWT is easy to implement, is computationally efficient, requires little hyperparameter tuning, and enjoys non-asymptotic learning-theoretic guarantees. The effectiveness o… ▽ More

    Submitted 1 March, 2022; v1 submitted 28 May, 2021; originally announced May 2021.

    Comments: Published as a conference paper at ICLR 2022

  2. arXiv:1906.05591  [pdf, other

    cs.LG eess.SY stat.ML

    Finite Sample Analysis Of Dynamic Regression Parameter Learning

    Authors: Mark Kozdoba, Edward Moroshko, Shie Mannor, Koby Crammer

    Abstract: We consider the dynamic linear regression problem, where the predictor vector may vary with time. This problem can be modeled as a linear dynamical system, with non-constant observation operator, where the parameters that need to be learned are the variance of both the process noise and the observation noise. While variance estimation for dynamic regression is a natural problem, with a variety of… ▽ More

    Submitted 11 October, 2022; v1 submitted 13 June, 2019; originally announced June 2019.

    Journal ref: NeurIPS 2022

  3. arXiv:1812.07010  [pdf, other

    cs.LG cs.CV stat.ML

    Multi Instance Learning For Unbalanced Data

    Authors: Mark Kozdoba, Edward Moroshko, Lior Shani, Takuya Takagi, Takashi Katoh, Shie Mannor, Koby Crammer

    Abstract: In the context of Multi Instance Learning, we analyze the Single Instance (SI) learning objective. We show that when the data is unbalanced and the family of classifiers is sufficiently rich, the SI method is a useful learning algorithm. In particular, we show that larger data imbalance, a quality that is typically perceived as negative, in fact implies a better resilience of the algorithm to the… ▽ More

    Submitted 17 December, 2018; originally announced December 2018.

  4. arXiv:1803.10415  [pdf, other

    cs.LG stat.ML

    A Better Resource Allocation Algorithm with Semi-Bandit Feedback

    Authors: Yuval Dagan, Koby Crammer

    Abstract: We study a sequential resource allocation problem between a fixed number of arms. On each iteration the algorithm distributes a resource among the arms in order to maximize the expected success rate. Allocating more of the resource to a given arm increases the probability that it succeeds, yet with a cut-off. We follow Lattimore et al. (2014) and assume that the probability increases linearly unti… ▽ More

    Submitted 28 March, 2018; originally announced March 2018.

  5. arXiv:1803.03319  [pdf, other

    cs.LG stat.ML

    Efficient Loss-Based Decoding on Graphs For Extreme Classification

    Authors: Itay Evron, Edward Moroshko, Koby Crammer

    Abstract: In extreme classification problems, learning algorithms are required to map instances to labels from an extremely large label set. We build on a recent extreme classification framework with logarithmic time and space, and on a general approach for error correcting output coding (ECOC) with loss-based decoding, and introduce a flexible and efficient approach accompanied by theoretical bounds. Our f… ▽ More

    Submitted 8 November, 2018; v1 submitted 8 March, 2018; originally announced March 2018.

    Journal ref: Advances in Neural Information Processing Systems 32 (2018), 7232-7243

  6. arXiv:1702.07274  [pdf, other

    stat.ML cs.LG

    Rotting Bandits

    Authors: Nir Levine, Koby Crammer, Shie Mannor

    Abstract: The Multi-Armed Bandits (MAB) framework highlights the tension between acquiring new knowledge (Exploration) and leveraging available knowledge (Exploitation). In the classical MAB problem, a decision maker must choose an arm at each time step, upon which she receives a reward. The decision maker's objective is to maximize her cumulative expected reward over the time horizon. The MAB problem has b… ▽ More

    Submitted 2 November, 2017; v1 submitted 23 February, 2017; originally announced February 2017.

  7. arXiv:1602.00309  [pdf, other

    cs.LG

    Bandits meet Computer Architecture: Designing a Smartly-allocated Cache

    Authors: Yonatan Glassner, Koby Crammer

    Abstract: In many embedded systems, such as imaging sys- tems, the system has a single designated purpose, and same threads are executed repeatedly. Profiling thread behavior, allows the system to allocate each thread its resources in a way that improves overall system performance. We study an online resource al- locationproblem,wherearesourcemanagersimulta- neously allocates resources (exploration), learns… ▽ More

    Submitted 31 January, 2016; originally announced February 2016.

  8. Learn on Source, Refine on Target:A Model Transfer Learning Framework with Random Forests

    Authors: Noam Segev, Maayan Harel, Shie Mannor, Koby Crammer, Ran El-Yaniv

    Abstract: We propose novel model transfer-learning methods that refine a decision forest model M learned within a "source" domain using a training set sampled from a "target" domain, assumed to be a variation of the source. We present two random forest transfer algorithms. The first algorithm searches greedily for locally optimal modifications of each tree structure by trying to locally expand or reduce the… ▽ More

    Submitted 8 November, 2015; v1 submitted 4 November, 2015; originally announced November 2015.

    Comments: 2 columns, 14 pages, TPAMI submitted

    Journal ref: IEEE transactions on pattern analysis and machine intelligence 39 (2017) 1811-1824

  9. arXiv:1510.08974  [pdf, other

    cs.LG stat.ML

    CONQUER: Confusion Queried Online Bandit Learning

    Authors: Daniel Barsky, Koby Crammer

    Abstract: We present a new recommendation setting for picking out two items from a given set to be highlighted to a user, based on contextual input. These two items are presented to a user who chooses one of them, possibly stochastically, with a bias that favours the item with the higher value. We propose a second-order algorithm framework that members of it use uses relative upper-confidence bounds to trad… ▽ More

    Submitted 30 October, 2015; originally announced October 2015.

  10. arXiv:1505.07067  [pdf, other

    stat.ML cs.LG

    Belief Flows of Robust Online Learning

    Authors: Pedro A. Ortega, Koby Crammer, Daniel D. Lee

    Abstract: This paper introduces a new probabilistic model for online learning which dynamically incorporates information from stochastic gradients of an arbitrary loss function. Similar to probabilistic filtering, the model maintains a Gaussian belief over the optimal weight parameters. Unlike traditional Bayesian updates, the model incorporates a small number of gradient evaluations at locations chosen usi… ▽ More

    Submitted 26 May, 2015; originally announced May 2015.

    Comments: Appears in Workshop on Information Theory and Applications (ITA), February 2015

  11. arXiv:1411.4503  [pdf, other

    cs.LG stat.ML

    Outlier-Robust Convex Segmentation

    Authors: Itamar Katz, Koby Crammer

    Abstract: We derive a convex optimization problem for the task of segmenting sequential data, which explicitly treats presence of outliers. We describe two algorithms for solving this problem, one exact and one a top-down novel approach, and we derive a consistency results for the case of two segments and no outliers. Robustness to outliers is evaluated on two real-world tasks related to speech segmentation… ▽ More

    Submitted 18 November, 2014; v1 submitted 17 November, 2014; originally announced November 2014.

    Comments: * Accepted to AAAI-15, this version includes the appendix/supplementary material referenced in the AAAI-15 submission, as well as color figures * This version include some minor typos correction

  12. arXiv:1406.3840  [pdf, ps, other

    cs.LG

    Optimal Resource Allocation with Semi-Bandit Feedback

    Authors: Tor Lattimore, Koby Crammer, Csaba Szepesvári

    Abstract: We study a sequential resource allocation problem involving a fixed number of recurring jobs. At each time-step the manager should distribute available resources among the jobs in order to maximise the expected number of completed jobs. Allocating more resources to a given job increases the probability that it completes, but with a cut-off. Specifically, we assume a linear model where the probabil… ▽ More

    Submitted 15 June, 2014; originally announced June 2014.

    Comments: 12 pages

  13. arXiv:1402.4084  [pdf, other

    cs.LG

    Selective Sampling with Drift

    Authors: Edward Moroshko, Koby Crammer

    Abstract: Recently there has been much work on selective sampling, an online active learning setting, in which algorithms work in rounds. On each round an algorithm receives an input and makes a prediction. Then, it can decide whether to query a label, and if so to update its model, otherwise the input is discarded. Most of this work is focused on the stationary case, where it is assumed that there is a fix… ▽ More

    Submitted 17 February, 2014; originally announced February 2014.

  14. arXiv:1304.3708  [pdf, ps, other

    cs.LG stat.ML

    Advice-Efficient Prediction with Expert Advice

    Authors: Yevgeny Seldin, Peter Bartlett, Koby Crammer

    Abstract: Advice-efficient prediction with expert advice (in analogy to label-efficient prediction) is a variant of prediction with expert advice game, where on each round of the game we are allowed to ask for advice of a limited number $M$ out of $N$ experts. This setting is especially interesting when asking for advice of every expert on every round is expensive. We present an algorithm for advice-efficie… ▽ More

    Submitted 12 April, 2013; originally announced April 2013.

  15. arXiv:1304.2994  [pdf, other

    cs.LG

    A Generalized Online Mirror Descent with Applications to Classification and Regression

    Authors: Francesco Orabona, Koby Crammer, Nicolò Cesa-Bianchi

    Abstract: Online learning algorithms are fast, memory-efficient, easy to implement, and applicable to many prediction problems, including classification, regression, and ranking. Several online algorithms were proposed in the past few decades, some based on additive updates, like the Perceptron, and some on multiplicative updates, like Winnow. A unifying perspective on the design and the analysis of online… ▽ More

    Submitted 13 July, 2014; v1 submitted 10 April, 2013; originally announced April 2013.

    Journal ref: Machine Learning June 2015, Volume 99, Issue 3, pp 411-435

  16. arXiv:1303.3754  [pdf, other

    cs.LG

    A Last-Step Regression Algorithm for Non-Stationary Online Learning

    Authors: Edward Moroshko, Koby Crammer

    Abstract: The goal of a learner in standard online learning is to maintain an average loss close to the loss of the best-performing single function in some class. In many real-world problems, such as rating or ranking items, there is no single best target function during the runtime of the algorithm, instead the best (local) target function is drifting over time. We develop a novel last-step minmax optimal… ▽ More

    Submitted 15 March, 2013; originally announced March 2013.

    Comments: arXiv admin note: substantial text overlap with arXiv:1303.0140

  17. arXiv:1303.0140  [pdf, other

    cs.LG stat.ML

    Second-Order Non-Stationary Online Learning for Regression

    Authors: Nina Vaits, Edward Moroshko, Koby Crammer

    Abstract: The goal of a learner, in standard online learning, is to have the cumulative loss not much larger compared with the best-performing function from some fixed class. Numerous algorithms were shown to have this gap arbitrarily close to zero, compared with the best function that is chosen off-line. Nevertheless, many real-world applications, such as adaptive filtering, are non-stationary in nature, a… ▽ More

    Submitted 1 March, 2013; originally announced March 2013.

  18. arXiv:1301.6058  [pdf, other

    cs.LG

    Weighted Last-Step Min-Max Algorithm with Improved Sub-Logarithmic Regret

    Authors: Edward Moroshko, Koby Crammer

    Abstract: In online learning the performance of an algorithm is typically compared to the performance of a fixed function from some class, with a quantity called regret. Forster proposed a last-step min-max algorithm which was somewhat simpler than the algorithm of Vovk, yet with the same regret. In fact the algorithm he analyzed assumed that the choices of the adversary are bounded, yielding artificially o… ▽ More

    Submitted 25 January, 2013; originally announced January 2013.

  19. arXiv:1209.6329  [pdf, other

    cs.LG

    More Is Better: Large Scale Partially-supervised Sentiment Classification - Appendix

    Authors: Yoav Haimovitch, Koby Crammer, Shie Mannor

    Abstract: We describe a bootstrapping algorithm to learn from partially labeled data, and the results of an empirical study for using it to improve performance of sentiment classification using up to 15 million unlabeled Amazon product reviews. Our experiments cover semi-supervised learning, domain adaptation and weakly supervised learning. In some cases our methods were able to reduce test error by more th… ▽ More

    Submitted 27 September, 2012; originally announced September 2012.

    Comments: This is the appendix to the paper "More Is Better: Large Scale Partially-supervised Sentiment Classification" accepted to ACML 2012

  20. arXiv:1206.6815  [pdf

    cs.LG stat.ML

    Discriminative Learning via Semidefinite Probabilistic Models

    Authors: Koby Crammer, Amir Globerson

    Abstract: Discriminative linear models are a popular tool in machine learning. These can be generally divided into two types: The first is linear classifiers, such as support vector machines, which are well studied and provide state-of-the-art results. One shortcoming of these models is that their output (known as the 'margin') is not calibrated, and cannot be translated naturally into a distribution over t… ▽ More

    Submitted 27 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the Twenty-Second Conference on Uncertainty in Artificial Intelligence (UAI2006)

    Report number: UAI-P-2006-PG-98-105

  21. arXiv:1206.4639  [pdf

    cs.LG cs.AI

    Adaptive Regularization for Weight Matrices

    Authors: Koby Crammer, Gal Chechik

    Abstract: Algorithms for learning distributions over weight-vectors, such as AROW were recently shown empirically to achieve state-of-the-art performance at various problems, with strong theoretical guaranties. Extending these algorithms to matrix models pose challenges since the number of free parameters in the covariance of the distribution scales as $n^4$ with the dimension $n$ of the matrix, and $n$ ten… ▽ More

    Submitted 18 June, 2012; originally announced June 2012.

    Comments: ICML2012

  22. arXiv:1111.1386  [pdf, other

    cs.LG

    Confidence Estimation in Structured Prediction

    Authors: Avihai Mejer, Koby Crammer

    Abstract: Structured classification tasks such as sequence labeling and dependency parsing have seen much interest by the Natural Language Processing and the machine learning communities. Several online learning algorithms were adapted for structured tasks such as Perceptron, Passive- Aggressive and the recently introduced Confidence-Weighted learning . These online algorithms are easy to implement, fast to… ▽ More

    Submitted 6 November, 2011; originally announced November 2011.