Showing 1–15 of 15 results for author: Bertinetto, L

  1. arXiv:2207.02088

    cs.CV

    SiamMask: A Framework for Fast Online Object Tracking and Segmentation

    Authors: Weiming Hu, Qiang Wang, Li Zhang, Luca Bertinetto, Philip H. S. Torr

    Abstract: In this paper we introduce SiamMask, a framework to perform both visual object tracking and video object segmentation, in real-time, with the same simple method. We improve the offline training procedure of popular fully-convolutional Siamese approaches by augmenting their losses with a binary segmentation task. Once the offline training is completed, SiamMask only requires a single bounding box f…

    Submitted 5 July, 2022; originally announced July 2022.

    Comments: 17 pages, Accepted by TPAMI 2022. arXiv admin note: substantial text overlap with arXiv:1812.05050

    Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) 2022

  2. arXiv:2203.08725

    cs.LG cs.CR cs.CV

    Attacking deep networks with surrogate-based adversarial black-box methods is easy

    Authors: Nicholas A. Lord, Romain Mueller, Luca Bertinetto

    Abstract: A recent line of work on black-box adversarial attacks has revived the use of transfer from surrogate models by integrating it into query-based search. However, we find that existing approaches of this type underperform their potential, and can be overly complicated besides. Here, we provide a short and simple algorithm which achieves state-of-the-art results through a search which uses the surrog…

    Submitted 16 March, 2022; originally announced March 2022.

    Comments: ICLR 2022

  3. arXiv:2201.05718

    cs.CV

    Parameter-free Online Test-time Adaptation

    Authors: Malik Boudiaf, Romain Mueller, Ismail Ben Ayed, Luca Bertinetto

    Abstract: Training state-of-the-art vision models has become prohibitively expensive for researchers and practitioners. For the sake of accessibility and resource reuse, it is important to focus on adapting these models to a variety of downstream scenarios. An interesting and practical paradigm is online test-time adaptation, according to which training data is inaccessible, no labelled data from the test d…

    Submitted 4 April, 2022; v1 submitted 14 January, 2022; originally announced January 2022.

    Comments: CVPR 2022 (oral). Code available at https://github.com/fiveai/LAME

  4. arXiv:2107.02156

    cs.CV cs.AI

    Do Different Tracking Tasks Require Different Appearance Models?

    Authors: Zhongdao Wang, Hengshuang Zhao, Ya-Li Li, Shengjin Wang, Philip H. S. Torr, Luca Bertinetto

    Abstract: Tracking objects of interest in a video is one of the most popular and widely applicable problems in computer vision. However, with the years, a Cambrian explosion of use cases and benchmarks has fragmented the problem in a multitude of different experimental setups. As a consequence, the literature has fragmented too, and now novel approaches proposed by the community are usually specialised to f…

    Submitted 1 December, 2021; v1 submitted 5 July, 2021; originally announced July 2021.

    Comments: To appear at NeurIPS 2021

  5. arXiv:2012.09831

    cs.LG cs.CV

    On Episodes, Prototypical Networks, and Few-shot Learning

    Authors: Steinar Laenen, Luca Bertinetto

    Abstract: Episodic learning is a popular practice among researchers and practitioners interested in few-shot learning. It consists of organising training in a series of learning problems (or episodes), each divided into a small training and validation subset to mimic the circumstances encountered during evaluation. But is this always necessary? In this paper, we investigate the usefulness of episodic learni…

    Submitted 30 November, 2021; v1 submitted 17 December, 2020; originally announced December 2020.

    Comments: 18 pages. To appear at NeurIPS 2021. A preliminary version of this work appeared as an oral presentation at the NeurIPS 2020 meta-learning workshop

  6. arXiv:1912.09393

    cs.CV cs.LG

    Making Better Mistakes: Leveraging Class Hierarchies with Deep Networks

    Authors: Luca Bertinetto, Romain Mueller, Konstantinos Tertikas, Sina Samangooei, Nicholas A. Lord

    Abstract: Deep neural networks have improved image classification dramatically over the past decade, but have done so by focusing on performance measures that treat all classes other than the ground truth as equally wrong. This has led to a situation in which mistakes are less likely to be made than before, but are equally likely to be absurd or catastrophic when they do occur. Past works have recognised an…

    Submitted 12 June, 2020; v1 submitted 19 December, 2019; originally announced December 2019.

    Comments: To appear at CVPR 2020. Code available at https://github.com/fiveai/making-better-mistakes

  7. arXiv:1910.10895

    cs.CV

    Anchor Diffusion for Unsupervised Video Object Segmentation

    Authors: Zhao Yang, Qiang Wang, Luca Bertinetto, Weiming Hu, Song Bai, Philip H. S. Torr

    Abstract: Unsupervised video object segmentation has often been tackled by methods based on recurrent neural networks and optical flow. Despite their complexity, these kinds of approaches tend to favour short-term temporal dependencies and are thus prone to accumulating inaccuracies, which cause drift over time. Moreover, simple (static) image segmentation models, alone, can perform competitively against th…

    Submitted 23 October, 2019; originally announced October 2019.

    Comments: To appear in ICCV 2019

  8. arXiv:1906.08744

    cs.CV cs.LG cs.RO

    Let's Take This Online: Adapting Scene Coordinate Regression Network Predictions for Online RGB-D Camera Relocalisation

    Authors: Tommaso Cavallari, Luca Bertinetto, Jishnu Mukhoti, Philip Torr, Stuart Golodetz

    Abstract: Many applications require a camera to be relocalised online, without expensive offline training on the target scene. Whilst both keyframe and sparse keypoint matching methods can be used online, the former often fail away from the training trajectory, and the latter can struggle in textureless regions. By contrast, scene coordinate regression (SCoRe) methods generalise to novel poses and can lever…

    Submitted 20 June, 2019; originally announced June 2019.

    Comments: Tommaso Cavallari and Stuart Golodetz contributed equally to this paper

  9. arXiv:1812.05050

    cs.CV

    Fast Online Object Tracking and Segmentation: A Unifying Approach

    Authors: Qiang Wang, Li Zhang, Luca Bertinetto, Weiming Hu, Philip H. S. Torr

    Abstract: In this paper we illustrate how to perform both visual object tracking and semi-supervised video object segmentation, in real-time, with a single simple approach. Our method, dubbed SiamMask, improves the offline training procedure of popular fully-convolutional Siamese approaches for object tracking by augmenting their loss with a binary segmentation task. Once trained, SiamMask solely relies on…

    Submitted 4 May, 2019; v1 submitted 12 December, 2018; originally announced December 2018.

    Comments: CVPR 2019 camera ready. Code available at https://github.com/foolwood/SiamMask

  10. arXiv:1805.08136

    cs.CV cs.LG stat.ML

    Meta-learning with differentiable closed-form solvers

    Authors: Luca Bertinetto, João F. Henriques, Philip H. S. Torr, Andrea Vedaldi

    Abstract: Adapting deep networks to new concepts from a few examples is challenging, due to the high computational requirements of standard fine-tuning procedures. Most work on few-shot learning has thus focused on simple learning techniques for adaptation, such as nearest neighbours or gradient descent. Nonetheless, the machine learning literature contains a wealth of methods that learn non-deep models ver…

    Submitted 24 July, 2019; v1 submitted 21 May, 2018; originally announced May 2018.

    Comments: Published at ICLR'19. Code and data available at http://www.robots.ox.ac.uk/~luca/r2d2.html

  11. arXiv:1803.09502

    cs.CV

    Long-term Tracking in the Wild: A Benchmark

    Authors: Jack Valmadre, Luca Bertinetto, João F. Henriques, Ran Tao, Andrea Vedaldi, Arnold Smeulders, Philip Torr, Efstratios Gavves

    Abstract: We introduce the OxUvA dataset and benchmark for evaluating single-object tracking algorithms. Benchmarks have enabled great strides in the field of object tracking by defining standardized evaluations on large sets of diverse videos. However, these works have focused exclusively on sequences that are just tens of seconds in length and in which the target is always visible. Consequently, most rese…

    Submitted 10 August, 2018; v1 submitted 26 March, 2018; originally announced March 2018.

    Comments: To appear at ECCV 2018

  12. arXiv:1704.06036

    cs.CV cs.LG

    End-to-end representation learning for Correlation Filter based tracking

    Authors: Jack Valmadre, Luca Bertinetto, João F. Henriques, Andrea Vedaldi, Philip H. S. Torr

    Abstract: The Correlation Filter is an algorithm that trains a linear template to discriminate between images and their translations. It is well suited to object tracking because its formulation in the Fourier domain provides a fast solution, enabling the detector to be re-trained once per frame. Previous works that use the Correlation Filter, however, have adopted features that were either manually designe…

    Submitted 20 April, 2017; originally announced April 2017.

    Comments: To appear at CVPR 2017

  13. arXiv:1606.09549

    cs.CV

    Fully-Convolutional Siamese Networks for Object Tracking

    Authors: Luca Bertinetto, Jack Valmadre, João F. Henriques, Andrea Vedaldi, Philip H. S. Torr

    Abstract: The problem of arbitrary object tracking has traditionally been tackled by learning a model of the object's appearance exclusively online, using as sole training data the video itself. Despite the success of these methods, their online-only approach inherently limits the richness of the model they can learn. Recently, several attempts have been made to exploit the expressive power of deep convolut…

    Submitted 1 December, 2021; v1 submitted 30 June, 2016; originally announced June 2016.

    Comments: The first two authors contributed equally, and are listed in alphabetical order. Code available at http://www.robots.ox.ac.uk/~luca/siamese-fc.html

  14. arXiv:1606.05233

    cs.CV cs.LG

    Learning feed-forward one-shot learners

    Authors: Luca Bertinetto, João F. Henriques, Jack Valmadre, Philip H. S. Torr, Andrea Vedaldi

    Abstract: One-shot learning is usually tackled by using generative models or discriminative embeddings. Discriminative methods based on deep learning, which are very effective in other learning scenarios, are ill-suited for one-shot learning as they need large amounts of training data. In this paper, we propose a method to learn the parameters of a deep model in one shot. We construct the learner as a secon…

    Submitted 16 June, 2016; originally announced June 2016.

    Comments: The first three authors contributed equally, and are listed in alphabetical order

  15. arXiv:1512.01355

    cs.CV

    Staple: Complementary Learners for Real-Time Tracking

    Authors: Luca Bertinetto, Jack Valmadre, Stuart Golodetz, Ondrej Miksik, Philip Torr

    Abstract: Correlation Filter-based trackers have recently achieved excellent performance, showing great robustness to challenging situations exhibiting motion blur and illumination changes. However, since the model that they learn depends strongly on the spatial layout of the tracked object, they are notoriously sensitive to deformation. Models based on colour statistics have complementary traits: they cope…

    Submitted 13 April, 2016; v1 submitted 4 December, 2015; originally announced December 2015.

    Comments: To appear in CVPR 2016