Skip to main content

Showing 1–38 of 38 results for author: Azizpour, H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.11544  [pdf, ps, other

    cs.CV

    Leveraging Satellite Image Time Series for Accurate Extreme Event Detection

    Authors: Heng Fang, Hossein Azizpour

    Abstract: Climate change is leading to an increase in extreme weather events, causing significant environmental damage and loss of life. Early detection of such events is essential for improving disaster response. In this work, we propose SITS-Extreme, a novel framework that leverages satellite image time series to detect extreme events by incorporating multiple pre-disaster observations. This approach effe… ▽ More

    Submitted 13 June, 2025; originally announced June 2025.

    Comments: Accepted to the WACV 2025 Workshop on GeoCV. Code, datasets, and model checkpoints available at: https://github.com/hfangcat/SITS-ExtremeEvents

  2. arXiv:2506.00214  [pdf, ps, other

    physics.flu-dyn cs.AI

    Diff-SPORT: Diffusion-based Sensor Placement Optimization and Reconstruction of Turbulent flows in urban environments

    Authors: Abhijeet Vishwasrao, Sai Bharath Chandra Gutha, Andres Cremades, Klas Wijk, Aakash Patil, Catherine Gorle, Beverley J McKeon, Hossein Azizpour, Ricardo Vinuesa

    Abstract: Rapid urbanization demands accurate and efficient monitoring of turbulent wind patterns to support air quality, climate resilience and infrastructure design. Traditional sparse reconstruction and sensor placement strategies face major accuracy degradations under practical constraints. Here, we introduce Diff-SPORT, a diffusion-based framework for high-fidelity flow reconstruction and optimal senso… ▽ More

    Submitted 30 May, 2025; originally announced June 2025.

  3. arXiv:2410.11433  [pdf, other

    cs.LG eess.SY

    Hessian-Informed Flow Matching

    Authors: Christopher Iliffe Sprague, Arne Elofsson, Hossein Azizpour

    Abstract: Modeling complex systems that evolve toward equilibrium distributions is important in various physical applications, including molecular dynamics and robotic control. These systems often follow the stochastic gradient descent of an underlying energy function, converging to stationary distributions around energy minima. The local covariance of these distributions is shaped by the energy landscape's… ▽ More

    Submitted 15 October, 2024; originally announced October 2024.

    Comments: In submission

  4. arXiv:2409.20253  [pdf, other

    cs.CV

    Medical Image Segmentation with SAM-generated Annotations

    Authors: Iira Häkkinen, Iaroslav Melekhov, Erik Englesson, Hossein Azizpour, Juho Kannala

    Abstract: The field of medical image segmentation is hindered by the scarcity of large, publicly available annotated datasets. Not all datasets are made public for privacy reasons, and creating annotations for a large dataset is time-consuming and expensive, as it requires specialized expertise to accurately identify regions of interest (ROIs) within the images. To address these challenges, we evaluate the… ▽ More

    Submitted 30 September, 2024; originally announced September 2024.

    Comments: Accepted to the European Conference on Computer Vision (ECCVW) Workshops 2024

  5. arXiv:2407.20784  [pdf, other

    cs.CV cs.LG

    Inverse Problems with Diffusion Models: A MAP Estimation Perspective

    Authors: Sai Bharath Chandra Gutha, Ricardo Vinuesa, Hossein Azizpour

    Abstract: Inverse problems have many applications in science and engineering. In Computer vision, several image restoration tasks such as inpainting, deblurring, and super-resolution can be formally modeled as inverse problems. Recently, methods have been developed for solving inverse problems that only leverage a pre-trained unconditional diffusion model and do not require additional task-specific training… ▽ More

    Submitted 18 September, 2024; v1 submitted 27 July, 2024; originally announced July 2024.

  6. arXiv:2407.16058  [pdf, other

    cs.LG stat.ML

    Revisiting Score Function Estimators for $k$-Subset Sampling

    Authors: Klas Wijk, Ricardo Vinuesa, Hossein Azizpour

    Abstract: Are score function estimators an underestimated approach to learning with $k$-subset sampling? Sampling $k$-subsets is a fundamental operation in many machine learning tasks that is not amenable to differentiable parametrization, impeding gradient-based optimization. Prior work has focused on relaxed sampling or pathwise gradient estimators. Inspired by the success of score function estimators in… ▽ More

    Submitted 16 August, 2024; v1 submitted 22 July, 2024; originally announced July 2024.

    Comments: ICML 2024 Workshop on Differentiable Almost Everything: Differentiable Relaxations, Algorithms, Operators, and Simulators

  7. arXiv:2406.17458  [pdf, ps, other

    cs.CV

    Continuous Urban Change Detection from Satellite Image Time Series with Temporal Feature Refinement and Multi-Task Integration

    Authors: Sebastian Hafner, Heng Fang, Hossein Azizpour, Yifang Ban

    Abstract: Urbanization advances at unprecedented rates, leading to negative environmental and societal impacts. Remote sensing can help mitigate these effects by supporting sustainable development strategies with accurate information on urban growth. Deep learning-based methods have achieved promising urban change detection results from optical satellite image pairs using convolutional neural networks (Conv… ▽ More

    Submitted 9 June, 2025; v1 submitted 25 June, 2024; originally announced June 2024.

    Comments: Accepted to IEEE Transactions on Geoscience and Remote Sensing, Code will be available at https://github.com/SebastianHafner/ContUrbanCD.git

  8. arXiv:2405.04161  [pdf, other

    cs.LG cs.AI

    Decoding complexity: how machine learning is redefining scientific discovery

    Authors: Ricardo Vinuesa, Paola Cinnella, Jean Rabault, Hossein Azizpour, Stefan Bauer, Bingni W. Brunton, Arne Elofsson, Elias Jarlebring, Hedvig Kjellstrom, Stefano Markidis, David Marlevi, Javier Garcia-Martinez, Steven L. Brunton

    Abstract: As modern scientific instruments generate vast amounts of data and the volume of information in the scientific literature continues to grow, machine learning (ML) has become an essential tool for organising, analysing, and interpreting these complex datasets. This paper explores the transformative role of ML in accelerating breakthroughs across a range of scientific disciplines. By presenting key… ▽ More

    Submitted 25 April, 2025; v1 submitted 7 May, 2024; originally announced May 2024.

  9. arXiv:2403.00563  [pdf, other

    cs.LG stat.ML

    Indirectly Parameterized Concrete Autoencoders

    Authors: Alfred Nilsson, Klas Wijk, Sai bharath chandra Gutha, Erik Englesson, Alexandra Hotti, Carlo Saccardi, Oskar Kviman, Jens Lagergren, Ricardo Vinuesa, Hossein Azizpour

    Abstract: Feature selection is a crucial task in settings where data is high-dimensional or acquiring the full set of features is costly. Recent developments in neural network-based embedded feature selection show promising results across a wide range of applications. Concrete Autoencoders (CAEs), considered state-of-the-art in embedded feature selection, may struggle to achieve stable joint optimization, h… ▽ More

    Submitted 16 August, 2024; v1 submitted 1 March, 2024; originally announced March 2024.

    Comments: ICML 2024

  10. arXiv:2402.05774  [pdf, other

    cs.LG cs.AI eess.SY

    Stable Autonomous Flow Matching

    Authors: Christopher Iliffe Sprague, Arne Elofsson, Hossein Azizpour

    Abstract: In contexts where data samples represent a physically stable state, it is often assumed that the data points represent the local minima of an energy landscape. In control theory, it is well-known that energy can serve as an effective Lyapunov function. Despite this, connections between control theory and generative models in the literature are sparse, even though there are several machine learning… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

    Comments: In submission

  11. arXiv:2307.15063  [pdf, other

    cs.CV

    To Adapt or Not to Adapt? Real-Time Adaptation for Semantic Segmentation

    Authors: Marc Botet Colomer, Pier Luigi Dovesi, Theodoros Panagiotakopoulos, Joao Frederico Carvalho, Linus Härenstam-Nielsen, Hossein Azizpour, Hedvig Kjellström, Daniel Cremers, Matteo Poggi

    Abstract: The goal of Online Domain Adaptation for semantic segmentation is to handle unforeseeable domain changes that occur during deployment, like sudden weather events. However, the high computational costs associated with brute-force adaptation make this paradigm unfeasible for real-world applications. In this paper we propose HAMLET, a Hardware-Aware Modular Least Expensive Training framework for real… ▽ More

    Submitted 7 August, 2023; v1 submitted 27 July, 2023; originally announced July 2023.

    Comments: ICCV 2023. The first two authors contributed equally. Project page: https://marcbotet.github.io/hamlet-web/

  12. arXiv:2304.02849  [pdf, other

    cs.LG cs.CV stat.ML

    Logistic-Normal Likelihoods for Heteroscedastic Label Noise

    Authors: Erik Englesson, Amir Mehrpanah, Hossein Azizpour

    Abstract: A natural way of estimating heteroscedastic label noise in regression is to model the observed (potentially noisy) target as a sample from a normal distribution, whose parameters can be learned by minimizing the negative log-likelihood. This formulation has desirable loss attenuation properties, as it reduces the contribution of high-error examples. Intuitively, this behavior can improve robustnes… ▽ More

    Submitted 14 August, 2023; v1 submitted 5 April, 2023; originally announced April 2023.

  13. arXiv:2301.12309  [pdf, other

    cs.LG

    On the Lipschitz Constant of Deep Networks and Double Descent

    Authors: Matteo Gamba, Hossein Azizpour, Mårten Björkman

    Abstract: Existing bounds on the generalization error of deep networks assume some form of smooth or bounded dependence on the input variable, falling short of investigating the mechanisms controlling such factors in practice. In this work, we present an extensive experimental study of the empirical Lipschitz constant of deep networks undergoing double descent, and highlight non-monotonic trends strongly co… ▽ More

    Submitted 14 November, 2023; v1 submitted 28 January, 2023; originally announced January 2023.

  14. arXiv:2212.03675  [pdf, other

    cs.CV

    Unsupervised Flood Detection on SAR Time Series

    Authors: Ritu Yadav, Andrea Nascetti, Hossein Azizpour, Yifang Ban

    Abstract: Human civilization has an increasingly powerful influence on the earth system. Affected by climate change and land-use change, natural disasters such as flooding have been increasing in recent years. Earth observations are an invaluable source for assessing and mitigating negative impacts. Detecting changes from Earth observation data is one way to monitor the possible impact. Effective and reliab… ▽ More

    Submitted 7 December, 2022; originally announced December 2022.

  15. arXiv:2210.09919  [pdf, other

    cs.CV cs.LG

    Dense FixMatch: a simple semi-supervised learning method for pixel-wise prediction tasks

    Authors: Miquel Martí i Rabadán, Alessandro Pieropan, Hossein Azizpour, Atsuto Maki

    Abstract: We propose Dense FixMatch, a simple method for online semi-supervised learning of dense and structured prediction tasks combining pseudo-labeling and consistency regularization via strong data augmentation. We enable the application of FixMatch in semi-supervised learning problems beyond image classification by adding a matching operation on the pseudo-labels. This allows us to still use the full… ▽ More

    Submitted 18 October, 2022; originally announced October 2022.

  16. arXiv:2209.10080  [pdf, other

    cs.LG stat.ML

    Deep Double Descent via Smooth Interpolation

    Authors: Matteo Gamba, Erik Englesson, Mårten Björkman, Hossein Azizpour

    Abstract: The ability of overparameterized deep networks to interpolate noisy data, while at the same time showing good generalization performance, has been recently characterized in terms of the double descent curve for the test error. Common intuition from polynomial regression suggests that overparameterized networks are able to sharply interpolate noisy data, without considerably deviating from the grou… ▽ More

    Submitted 8 April, 2023; v1 submitted 20 September, 2022; originally announced September 2022.

  17. arXiv:2208.07220  [pdf, other

    cs.CV cs.LG

    PatchDropout: Economizing Vision Transformers Using Patch Dropout

    Authors: Yue Liu, Christos Matsoukas, Fredrik Strand, Hossein Azizpour, Kevin Smith

    Abstract: Vision transformers have demonstrated the potential to outperform CNNs in a variety of vision tasks. But the computational and memory requirements of these models prohibit their use in many applications, especially those that depend on high-resolution images, such as medical image classification. Efforts to train ViTs more efficiently are overly complicated, necessitating architectural changes or… ▽ More

    Submitted 4 October, 2022; v1 submitted 10 August, 2022; originally announced August 2022.

    Comments: IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)

  18. arXiv:2203.05997  [pdf, other

    cs.CV

    Towards Self-Supervised Learning of Global and Object-Centric Representations

    Authors: Federico Baldassarre, Hossein Azizpour

    Abstract: Self-supervision allows learning meaningful representations of natural images, which usually contain one central object. How well does it transfer to multi-entity scenes? We discuss key aspects of learning structured object-centric representations with self-supervision and validate our insights through several experiments on the CLEVR dataset. Regarding the architecture, we confirm the importance… ▽ More

    Submitted 13 April, 2022; v1 submitted 11 March, 2022; originally announced March 2022.

    Comments: Published at the ICLR 2022 workshop on Objects, Structure and Causality. Code, datasets, and notebooks are available at https://github.com/baldassarreFe/iclr-osc-22

  19. arXiv:2202.11749  [pdf, other

    cs.LG

    Are All Linear Regions Created Equal?

    Authors: Matteo Gamba, Adrian Chmielewski-Anders, Josephine Sullivan, Hossein Azizpour, Mårten Björkman

    Abstract: The number of linear regions has been studied as a proxy of complexity for ReLU networks. However, the empirical success of network compression techniques like pruning and knowledge distillation, suggest that in the overparameterized setting, linear regions density might fail to capture the effective nonlinearity. In this work, we propose an efficient algorithm for discovering linear regions and u… ▽ More

    Submitted 23 February, 2022; originally announced February 2022.

  20. arXiv:2202.07424  [pdf

    cs.CY

    The potential of artificial intelligence for achieving healthy and sustainable societies

    Authors: B. Sirmacek, S. Gupta, F. Mallor, H. Azizpour, Y. Ban, H. Eivazi, H. Fang, F. Golzar, I. Leite, G. I. Melsion, K. Smith, F. Fuso Nerini, R. Vinuesa

    Abstract: In this chapter we extend earlier work (Vinuesa et al., Nature Communications 11, 2020) on the potential of artificial intelligence (AI) to achieve the 17 Sustainable Development Goals (SDGs) proposed by the United Nations (UN) for the 2030 Agenda. The present contribution focuses on three SDGs related to healthy and sustainable societies, i.e. SDG 3 (on good health), SDG 11 (on sustainable cities… ▽ More

    Submitted 11 February, 2022; originally announced February 2022.

  21. arXiv:2201.00604  [pdf, other

    cs.LG cs.CV

    An analysis of over-sampling labeled data in semi-supervised learning with FixMatch

    Authors: Miquel Martí i Rabadán, Sebastian Bujwid, Alessandro Pieropan, Hossein Azizpour, Atsuto Maki

    Abstract: Most semi-supervised learning methods over-sample labeled data when constructing training mini-batches. This paper studies whether this common practice improves learning and how. We compare it to an alternative setting where each mini-batch is uniformly sampled from all the training data, labeled or not, which greatly reduces direct supervision from true labels in typical low-label regimes. Howeve… ▽ More

    Submitted 8 April, 2022; v1 submitted 3 January, 2022; originally announced January 2022.

    Comments: 10 pages, 3 figures. Published at NLDL 2022

    Journal ref: Vol. 3 (2022): Proceedings of the Northern Lights Deep Learning Workshop 2022

  22. arXiv:2112.01330  [pdf, other

    cs.CV cs.LG

    CSAW-M: An Ordinal Classification Dataset for Benchmarking Mammographic Masking of Cancer

    Authors: Moein Sorkhei, Yue Liu, Hossein Azizpour, Edward Azavedo, Karin Dembrower, Dimitra Ntoula, Athanasios Zouzos, Fredrik Strand, Kevin Smith

    Abstract: Interval and large invasive breast cancers, which are associated with worse prognosis than other cancers, are usually detected at a late stage due to false negative assessments of screening mammograms. The missed screening-time detection is commonly caused by the tumor being obscured by its surrounding breast tissues, a phenomenon called masking. To study and benchmark mammographic masking of canc… ▽ More

    Submitted 2 December, 2021; originally announced December 2021.

    Comments: 35th Conference on Neural Information Processing Systems (NeurIPS 2021) Track on Datasets and Benchmarks

  23. arXiv:2110.01242  [pdf, other

    cs.LG cs.CV stat.ML

    Consistency Regularization Can Improve Robustness to Label Noise

    Authors: Erik Englesson, Hossein Azizpour

    Abstract: Consistency regularization is a commonly-used technique for semi-supervised and self-supervised learning. It is an auxiliary objective function that encourages the prediction of the network to be similar in the vicinity of the observed training samples. Hendrycks et al. (2020) have recently shown such regularization naturally brings test-time robustness to corrupted data and helps with calibration… ▽ More

    Submitted 4 October, 2021; originally announced October 2021.

    Comments: Presented at the ICML 2021 Workshop on Uncertainty and Robustness in Deep Learning. arXiv admin note: text overlap with arXiv:2105.04522

  24. arXiv:2105.04522  [pdf, other

    cs.LG cs.CV stat.ML

    Generalized Jensen-Shannon Divergence Loss for Learning with Noisy Labels

    Authors: Erik Englesson, Hossein Azizpour

    Abstract: Prior works have found it beneficial to combine provably noise-robust loss functions e.g., mean absolute error (MAE) with standard categorical loss function e.g. cross entropy (CE) to improve their learnability. Here, we propose to use Jensen-Shannon divergence as a noise-robust loss function and show that it interestingly interpolate between CE and MAE with a controllable mixing parameter. Furthe… ▽ More

    Submitted 29 October, 2021; v1 submitted 10 May, 2021; originally announced May 2021.

    Comments: Neural Information Processing Systems (NeurIPS 2021)

  25. arXiv:2007.05791  [pdf, other

    eess.IV cs.CV cs.LG

    Decoupling Inherent Risk and Early Cancer Signs in Image-based Breast Cancer Risk Models

    Authors: Yue Liu, Hossein Azizpour, Fredrik Strand, Kevin Smith

    Abstract: The ability to accurately estimate risk of developing breast cancer would be invaluable for clinical decision-making. One promising new approach is to integrate image-based risk models based on deep neural networks. However, one must take care when using such models, as selection of training data influences the patterns the network will learn to identify. With this in mind, we trained networks usi… ▽ More

    Submitted 16 September, 2020; v1 submitted 11 July, 2020; originally announced July 2020.

    Comments: Accepted by MICCAI 2020 (Medical Image Computing and Computer Assisted Intervention)

  26. arXiv:2006.09562  [pdf, other

    cs.CV

    Explanation-based Weakly-supervised Learning of Visual Relations with Graph Networks

    Authors: Federico Baldassarre, Kevin Smith, Josephine Sullivan, Hossein Azizpour

    Abstract: Visual relationship detection is fundamental for holistic image understanding. However, the localization and classification of (subject, predicate, object) triplets remain challenging tasks, due to the combinatorial explosion of possible relationships, their long-tailed distribution in natural images, and an expensive annotation process. This paper introduces a novel weakly-supervised method for v… ▽ More

    Submitted 17 July, 2020; v1 submitted 16 June, 2020; originally announced June 2020.

    Comments: Published at the European Conference on Computer Vision, ECCV 2020 (Poster)

  27. arXiv:2005.02762  [pdf, other

    physics.flu-dyn cs.LG physics.comp-ph

    Recurrent neural networks and Koopman-based frameworks for temporal predictions in a low-order model of turbulence

    Authors: Hamidreza Eivazi, Luca Guastoni, Philipp Schlatter, Hossein Azizpour, Ricardo Vinuesa

    Abstract: The capabilities of recurrent neural networks and Koopman-based frameworks are assessed in the prediction of temporal dynamics of the low-order model of near-wall turbulence by Moehlis et al. (New J. Phys. 6, 56, 2004). Our results show that it is possible to obtain excellent reproductions of the long-term statistics and the dynamic behavior of the chaotic system with properly trained long-short-t… ▽ More

    Submitted 14 April, 2021; v1 submitted 1 May, 2020; originally announced May 2020.

    Comments: International Journal of Heat and Fluid Flow. arXiv admin note: substantial text overlap with arXiv:2002.01222

  28. arXiv:2003.07797  [pdf, other

    cs.CV cs.LG

    Hyperplane Arrangements of Trained ConvNets Are Biased

    Authors: Matteo Gamba, Stefan Carlsson, Hossein Azizpour, Mårten Björkman

    Abstract: We investigate the geometric properties of the functions learned by trained ConvNets in the preactivation space of their convolutional layers, by performing an empirical study of hyperplane arrangements induced by a convolutional layer. We introduce statistics over the weights of a trained network to study local arrangements and relate them to the training dynamics. We observe that trained ConvNet… ▽ More

    Submitted 14 April, 2023; v1 submitted 17 March, 2020; originally announced March 2020.

  29. arXiv:2002.01222  [pdf, ps, other

    physics.flu-dyn cs.LG physics.comp-ph

    On the use of recurrent neural networks for predictions of turbulent flows

    Authors: Luca Guastoni, Prem A. Srinivasan, Hossein Azizpour, Philipp Schlatter, Ricardo Vinuesa

    Abstract: In this paper, the prediction capabilities of recurrent neural networks are assessed in the low-order model of near-wall turbulence by Moehlis {\it et al.} (New J. Phys. {\bf 6}, 56, 2004). Our results show that it is possible to obtain excellent predictions of the turbulence statistics and the dynamic behavior of the flow with properly trained long short-term memory (LSTM) networks, leading to re… ▽ More

    Submitted 4 February, 2020; originally announced February 2020.

    Comments: Conference paper presented at 11th International Symposium on Turbulence and Shear Flow Phenomena (TSFP11) at Southampton, UK, July 30 to August 2, 2019

  30. arXiv:1906.05419  [pdf, other

    cs.LG stat.ML

    Efficient Evaluation-Time Uncertainty Estimation by Improved Distillation

    Authors: Erik Englesson, Hossein Azizpour

    Abstract: In this work we aim to obtain computationally-efficient uncertainty estimates with deep networks. For this, we propose a modified knowledge distillation procedure that achieves state-of-the-art uncertainty estimates both for in and out-of-distribution samples. Our contributions include a) demonstrating and adapting to distillation's regularization effect b) proposing a novel target teacher distrib… ▽ More

    Submitted 12 June, 2019; originally announced June 2019.

    Comments: Submitted at the ICML 2019 Workshop on Uncertainty & Robustness in Deep Learning(poster & spotlight talk)

  31. arXiv:1905.13686  [pdf, other

    cs.LG cs.AI stat.ML

    Explainability Techniques for Graph Convolutional Networks

    Authors: Federico Baldassarre, Hossein Azizpour

    Abstract: Graph Networks are used to make decisions in potentially complex scenarios but it is usually not obvious how or why they made them. In this work, we study the explainability of Graph Network decisions using two main classes of techniques, gradient-based and decomposition-based, on a toy dataset and a chemistry task. Our study sets the ground for future development as well as application to real-wo… ▽ More

    Submitted 31 May, 2019; originally announced May 2019.

    Comments: Accepted at the ICML 2019 Workshop "Learning and Reasoning with Graph-Structured Representations" (poster + spotlight talk)

  32. The role of artificial intelligence in achieving the Sustainable Development Goals

    Authors: Ricardo Vinuesa, Hossein Azizpour, Iolanda Leite, Madeline Balaam, Virginia Dignum, Sami Domisch, Anna Felländer, Simone Langhans, Max Tegmark, Francesco Fuso Nerini

    Abstract: The emergence of artificial intelligence (AI) and its progressively wider impact on many sectors across the society requires an assessment of its effect on sustainable development. Here we analyze published evidence of positive or negative impacts of AI on the achievement of each of the 17 goals and 169 targets of the 2030 Agenda for Sustainable Development. We find that AI can support the achieve… ▽ More

    Submitted 30 April, 2019; originally announced May 2019.

  33. arXiv:1812.01710  [pdf, other

    cs.CV cs.LG stat.ML

    GANtruth - an unpaired image-to-image translation method for driving scenarios

    Authors: Sebastian Bujwid, Miquel Martí, Hossein Azizpour, Alessandro Pieropan

    Abstract: Synthetic image translation has significant potentials in autonomous transportation systems. That is due to the expense of data collection and annotation as well as the unmanageable diversity of real-words situations. The main issue with unpaired image-to-image translation is the ill-posed nature of the problem. In this work, we propose a novel method for constraining the output space of unpaired… ▽ More

    Submitted 26 November, 2018; originally announced December 2018.

    Comments: 32nd Conference on Neural Information Processing Systems (NeurIPS), Machine Learning for Intelligent Transportation Systems Workshop, Montréal, Canada. 2018

  34. arXiv:1507.02144  [pdf, other

    cs.CV

    Spotlight the Negatives: A Generalized Discriminative Latent Model

    Authors: Hossein Azizpour, Mostafa Arefiyan, Sobhan Naderi Parizi, Stefan Carlsson

    Abstract: Discriminative latent variable models (LVM) are frequently applied to various visual recognition tasks. In these systems the latent (hidden) variables provide a formalism for modeling structured variation of visual features. Conventionally, latent variables are de- fined on the variation of the foreground (positive) class. In this work we augment LVMs to include negative latent variables correspon… ▽ More

    Submitted 8 July, 2015; originally announced July 2015.

    Comments: Published in proceedings of BMVC 2015

  35. arXiv:1411.6509  [pdf, other

    cs.CV

    Persistent Evidence of Local Image Properties in Generic ConvNets

    Authors: Ali Sharif Razavian, Hossein Azizpour, Atsuto Maki, Josephine Sullivan, Carl Henrik Ek, Stefan Carlsson

    Abstract: Supervised training of a convolutional network for object classification should make explicit any information related to the class of objects and disregard any auxiliary information associated with the capture of the image or the variation within the object class. Does this happen in practice? Although this seems to pertain to the very final layers in the network, if we look at earlier layers we f… ▽ More

    Submitted 24 November, 2014; originally announced November 2014.

  36. arXiv:1406.5774  [pdf, other

    cs.CV

    Factors of Transferability for a Generic ConvNet Representation

    Authors: Hossein Azizpour, Ali Sharif Razavian, Josephine Sullivan, Atsuto Maki, Stefan Carlsson

    Abstract: Evidence is mounting that Convolutional Networks (ConvNets) are the most effective representation learning method for visual recognition tasks. In the common scenario, a ConvNet is trained on a large labeled dataset (source) and the feed-forward units activation of the trained network, at a certain layer of the network, is used as a generic representation of an input image for a task with relative… ▽ More

    Submitted 15 July, 2015; v1 submitted 22 June, 2014; originally announced June 2014.

    Comments: Extended version of the workshop paper with more experiments and updated text and title. Original CVPR15 DeepVision workshop paper title: "From Generic to Specific Deep Representations for Visual Recognition"

  37. arXiv:1405.5732  [pdf, other

    cs.CV

    Self-tuned Visual Subclass Learning with Shared Samples An Incremental Approach

    Authors: Hossein Azizpour, Stefan Carlsson

    Abstract: Computer vision tasks are traditionally defined and evaluated using semantic categories. However, it is known to the field that semantic classes do not necessarily correspond to a unique visual class (e.g. inside and outside of a car). Furthermore, many of the feasible learning techniques at hand cannot model a visual class which appears consistent to the human eye. These problems have motivated t… ▽ More

    Submitted 26 May, 2014; v1 submitted 22 May, 2014; originally announced May 2014.

    Comments: Updated ICCV 2013 submission

  38. arXiv:1403.6382  [pdf, other

    cs.CV

    CNN Features off-the-shelf: an Astounding Baseline for Recognition

    Authors: Ali Sharif Razavian, Hossein Azizpour, Josephine Sullivan, Stefan Carlsson

    Abstract: Recent results indicate that the generic descriptors extracted from the convolutional neural networks are very powerful. This paper adds to the mounting evidence that this is indeed the case. We report on a series of experiments conducted for different recognition tasks using the publicly available code and model of the \overfeat network which was trained to perform object classification on ILSVRC… ▽ More

    Submitted 12 May, 2014; v1 submitted 23 March, 2014; originally announced March 2014.

    Comments: version 3 revisions: 1)Added results using feature processing and data augmentation 2)Referring to most recent efforts of using CNN for different visual recognition tasks 3) updated text/caption