Showing 1–20 of 20 results for author: Cavazza, J

Searching in archive cs.
  1. arXiv:2204.10312  [pdf, other]

    cs.CV

    Unsupervised Human Action Recognition with Skeletal Graph Laplacian and Self-Supervised Viewpoints Invariance

    Authors: Giancarlo Paoletti, Jacopo Cavazza, Cigdem Beyan, Alessio Del Bue

    Abstract: This paper presents a novel end-to-end method for the problem of skeleton-based unsupervised human action recognition. We propose a new architecture with a convolutional autoencoder that uses graph Laplacian regularization to model the skeletal geometry across the temporal dynamics of actions. Our approach is robust towards viewpoint variations by including a self-supervised gradient reverse layer…

    Submitted 21 April, 2022; originally announced April 2022.

    Journal ref: The 32nd British Machine Vision Conference (BMVC) 2021

  2. arXiv:2201.00577  [pdf, other]

    cs.CV

    Semantically Grounded Visual Embeddings for Zero-Shot Learning

    Authors: Shah Nawaz, Jacopo Cavazza, Alessio Del Bue

    Abstract: Zero-shot learning methods rely on fixed visual and semantic embeddings, extracted from independent vision and language models, both pre-trained for other large-scale tasks. This is a weakness of current zero-shot learning frameworks as such disjoint embeddings fail to adequately associate visual and textual information to their shared semantic content. Therefore, we propose to learn semantically…

    Submitted 10 April, 2022; v1 submitted 3 January, 2022; originally announced January 2022.

    Comments: Accepted at CVPRW

  3. arXiv:2103.12437  [pdf, other]

    cs.CV

    Learning without Seeing nor Knowing: Towards Open Zero-Shot Learning

    Authors: Federico Marmoreo, Julio Ivan Davila Carrazco, Vittorio Murino, Jacopo Cavazza

    Abstract: In Generalized Zero-Shot Learning (GZSL), unseen categories (for which no visual data are available at training time) can be predicted by leveraging their class embeddings (e.g., a list of attributes describing them) together with a complementary pool of seen classes (paired with both visual data and class embeddings). Although GZSL is arguably challenging, we posit that knowing in advance the clas…

    Submitted 14 September, 2021; v1 submitted 23 March, 2021; originally announced March 2021.

  4. arXiv:2103.11112  [pdf, other]

    cs.CV

    Classifier Crafting: Turn Your ConvNet into a Zero-Shot Learner!

    Authors: Jacopo Cavazza

    Abstract: In Zero-shot learning (ZSL), we classify unseen categories using textual descriptions about their expected appearance when observed (class embeddings) and a disjoint pool of seen classes, for which annotated visual data are accessible. We tackle ZSL by casting a "vanilla" convolutional neural network (e.g. AlexNet, ResNet-101, DenseNet-201 or DarkNet-53) into a zero-shot learner. We do so by craft…

    Submitted 20 March, 2021; originally announced March 2021.

    Comments: 8 pages (excluding references), 9 figures

  5. arXiv:2102.03266  [pdf, other]

    cs.CV

    Transductive Zero-Shot Learning by Decoupled Feature Generation

    Authors: Federico Marmoreo, Jacopo Cavazza, Vittorio Murino

    Abstract: In this paper, we address zero-shot learning (ZSL), the problem of recognizing categories for which no labeled visual data are available during training. We focus on the transductive setting, in which unlabeled visual data from unseen classes are available. State-of-the-art paradigms in ZSL typically exploit generative adversarial networks to synthesize visual features from semantic attributes. We…

    Submitted 14 September, 2021; v1 submitted 5 February, 2021; originally announced February 2021.

    Comments: Published at the IEEE/CVF Winter Conference on Computer Vision (WACV) 2021

  6. arXiv:2010.08428  [pdf, other]

    cs.SD eess.AS

    Are Multiple Cross-Correlation Identities better than just Two? Improving the Estimate of Time Differences-of-Arrivals from Blind Audio Signals

    Authors: Danilo Greco, Jacopo Cavazza, Alessio Del Bue

    Abstract: Given an unknown audio source, the estimation of time differences-of-arrivals (TDOAs) can be efficiently and robustly solved using blind channel identification and exploiting the cross-correlation identity (CCI). Prior "blind" works have improved the estimate of TDOAs by means of different algorithmic solutions and optimization strategies, while always sticking to the case N = 2 microphones. But w…

    Submitted 16 October, 2020; originally announced October 2020.

  7. Subspace Clustering for Action Recognition with Covariance Representations and Temporal Pruning

    Authors: Giancarlo Paoletti, Jacopo Cavazza, Cigdem Beyan, Alessio Del Bue

    Abstract: This paper tackles the problem of human action recognition, defined as classifying which action is displayed in a trimmed sequence, from skeletal data. Although state-of-the-art approaches designed for this application are all supervised, in this paper we pursue a more challenging direction: solving the problem with unsupervised learning. To this end, we propose a novel subspace clustering method, w…

    Submitted 21 June, 2020; originally announced June 2020.

    Journal ref: 25th International Conference on Pattern Recognition (ICPR) 2020

  8. arXiv:2003.06430  [pdf, other]

    cs.CV

    Learning Unbiased Representations via Mutual Information Backpropagation

    Authors: Ruggero Ragonesi, Riccardo Volpi, Jacopo Cavazza, Vittorio Murino

    Abstract: We are interested in learning data-driven representations that can generalize well, even when trained on inherently biased data. In particular, we face the case where some attributes (bias) of the data, if learned by the model, can severely compromise its generalization properties. We tackle this problem through the lens of information theory, leveraging recent findings for a differentiable estima…

    Submitted 13 March, 2020; originally announced March 2020.

    Comments: Code publicly available at https://github.com/rugrag/learn-unbiased

  9. arXiv:1711.10290  [pdf, other]

    cs.CV

    Scalable and Compact 3D Action Recognition with Approximated RBF Kernel Machines

    Authors: Jacopo Cavazza, Pietro Morerio, Vittorio Murino

    Abstract: Despite the recent deep learning (DL) revolution, kernel machines remain powerful methods for action recognition. DL has brought the use of large datasets, and this is typically a problem for kernel approaches, which do not scale up efficiently due to kernel Gram matrices. Nevertheless, kernel methods are still attractive and more generally applicable since they can equally manage differen…

    Submitted 28 November, 2017; originally announced November 2017.

  10. arXiv:1711.10288  [pdf, other]

    cs.CV

    Minimal-Entropy Correlation Alignment for Unsupervised Deep Domain Adaptation

    Authors: Pietro Morerio, Jacopo Cavazza, Vittorio Murino

    Abstract: In this work, we face the problem of unsupervised domain adaptation with a novel deep learning approach which leverages our finding that entropy minimization is induced by the optimal alignment of second order statistics between source and target domains. We formally demonstrate this hypothesis and, aiming at achieving an optimal alignment in practical cases, we adopt a more principled strategy…

    Submitted 28 November, 2017; originally announced November 2017.

  11. arXiv:1710.05092  [pdf, other]

    cs.LG stat.ML

    Dropout as a Low-Rank Regularizer for Matrix Factorization

    Authors: Jacopo Cavazza, Pietro Morerio, Benjamin Haeffele, Connor Lane, Vittorio Murino, Rene Vidal

    Abstract: Regularization for matrix factorization (MF) and approximation problems has been carried out in many different ways. Due to its popularity in deep learning, dropout has also been applied to this class of problems. Despite its solid empirical performance, the theoretical properties of dropout as a regularizer remain quite elusive for this class of problems. In this paper, we present a theoretical…

    Submitted 13 October, 2017; originally announced October 2017.

  12. arXiv:1710.03487  [pdf, other]

    cs.LG stat.ML

    An Analysis of Dropout for Matrix Factorization

    Authors: Jacopo Cavazza, Connor Lane, Benjamin D. Haeffele, Vittorio Murino, René Vidal

    Abstract: Dropout is a simple yet effective algorithm for regularizing neural networks by randomly dropping out units through Bernoulli multiplicative noise, and for some restricted problem classes, such as linear or logistic regression, several theoretical studies have demonstrated the equivalence between dropout and a fully deterministic optimization problem with data-dependent Tikhonov regularization. Th…

    Submitted 10 October, 2017; originally announced October 2017.

  13. arXiv:1709.01695  [pdf, other]

    cs.CV

    A Compact Kernel Approximation for 3D Action Recognition

    Authors: Jacopo Cavazza, Pietro Morerio, Vittorio Murino

    Abstract: 3D action recognition was shown to benefit from a covariance representation of the input data (joint 3D positions). A kernel machine fed with such features is an effective paradigm for 3D action recognition, yielding state-of-the-art results. Yet, the whole framework is affected by the well-known scalability issue. In fact, in general, the kernel function has to be evaluated for all pairs of insta…

    Submitted 4 October, 2017; v1 submitted 6 September, 2017; originally announced September 2017.

    Comments: Best paper award special mention at the 19th edition of the GIRPR International Conference on Image Analysis and Processing (ICIAP) 2017

  14. What Will I Do Next? The Intention from Motion Experiment

    Authors: Andrea Zunino, Jacopo Cavazza, Atesh Koul, Andrea Cavallo, Cristina Becchio, Vittorio Murino

    Abstract: In computer vision, video-based approaches have been widely explored for the early classification and the prediction of actions or activities. However, it remains unclear whether this modality (as compared to 3D kinematics) can still be reliable for the prediction of human intentions, defined as the overarching goal embedded in an action sequence. Since the same action can be performed with differ…

    Submitted 3 August, 2017; originally announced August 2017.

    Comments: 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops

  15. When Kernel Methods meet Feature Learning: Log-Covariance Network for Action Recognition from Skeletal Data

    Authors: Jacopo Cavazza, Pietro Morerio, Vittorio Murino

    Abstract: Human action recognition from skeletal data is a hot research topic and important in many open domain applications of computer vision, thanks to recently introduced 3D sensors. In the literature, naive methods simply transfer off-the-shelf techniques from video to the skeletal representation. However, the current state-of-the-art is contended between two different paradigms: kernel-based methods an…

    Submitted 3 August, 2017; originally announced August 2017.

    Comments: 2017 IEEE Computer Vision and Pattern Recognition (CVPR) Workshops

  16. arXiv:1703.06229  [pdf, other]

    cs.NE cs.LG stat.ML

    Curriculum Dropout

    Authors: Pietro Morerio, Jacopo Cavazza, Riccardo Volpi, Rene Vidal, Vittorio Murino

    Abstract: Dropout is a very effective way of regularizing neural networks. Stochastically "dropping out" units with a certain probability discourages over-specific co-adaptations of feature detectors, preventing overfitting and improving network generalization. Besides, Dropout can be interpreted as an approximate model aggregation technique, where an exponential number of smaller networks are averaged in o…

    Submitted 3 August, 2017; v1 submitted 17 March, 2017; originally announced March 2017.

    Comments: Accepted at ICCV (International Conference on Computer Vision) 2017

  17. arXiv:1606.01568  [pdf, other]

    cs.LG cs.CV

    Active Regression with Adaptive Huber Loss

    Authors: Jacopo Cavazza, Vittorio Murino

    Abstract: This paper addresses the scalar regression problem through a novel solution to exactly optimize the Huber loss in a general semi-supervised setting, which combines multi-view learning and manifold regularization. We propose a principled algorithm to 1) avoid computationally expensive iterative schemes while 2) adapting the Huber loss threshold in a data-driven fashion and 3) actively balancing the…

    Submitted 26 June, 2016; v1 submitted 5 June, 2016; originally announced June 2016.

  18. arXiv:1605.09526  [pdf, other]

    cs.CV

    Predicting Human Intentions from Motion Only: A 2D+3D Fusion Approach

    Authors: Andrea Zunino, Jacopo Cavazza, Atesh Koul, Andrea Cavallo, Cristina Becchio, Vittorio Murino

    Abstract: In this paper, we address the new problem of the prediction of human intents. There is neuro-psychological evidence that actions performed by humans are anticipated by peculiar motor acts which are discriminative of the type of action to be performed afterwards. In other words, an actual intent can be forecast by looking at the kinematics of the immediately preceding movement. To prove it in a…

    Submitted 6 September, 2017; v1 submitted 31 May, 2016; originally announced May 2016.

    Comments: accepted as poster at the 25th ACM Multimedia (ACM MM) 2017, Mountain View, California, USA

  19. arXiv:1605.00392  [pdf, other]

    cs.CV

    Revisiting Human Action Recognition: Personalization vs. Generalization

    Authors: Andrea Zunino, Jacopo Cavazza, Vittorio Murino

    Abstract: By thoroughly revisiting the classic human action recognition paradigm, this paper aims at proposing a new approach for the design of effective action classification systems. Taking publicly available three-dimensional (MoCap) action/activity datasets as a testbed, we analyzed and validated different training/testing strategies. In particular, considering that each human action in the datasets is pe…

    Submitted 2 May, 2016; originally announced May 2016.

  20. arXiv:1604.06582  [pdf, other]

    cs.CV

    Kernelized Covariance for Action Recognition

    Authors: Jacopo Cavazza, Andrea Zunino, Marco San Biagio, Vittorio Murino

    Abstract: In this paper we aim at increasing the descriptive power of the covariance matrix, which is limited to capturing only linear mutual dependencies between variables. We present a rigorous and principled mathematical pipeline to recover the kernel trick for computing the covariance matrix, enhancing it to model more complex, non-linear relationships conveyed by the raw data. To this end, we propose Kernelized…

    Submitted 2 September, 2016; v1 submitted 22 April, 2016; originally announced April 2016.

    Comments: Accepted paper at the 23rd International Conference on Pattern Recognition (ICPR), Cancun, Mexico, 2016