Skip to main content

Showing 1–17 of 17 results for author: Sari, E

.
  1. arXiv:2505.13174  [pdf, ps, other

    cs.CV

    FlowCut: Unsupervised Video Instance Segmentation via Temporal Mask Matching

    Authors: Alp Eren Sari, Paolo Favaro

    Abstract: We propose FlowCut, a simple and capable method for unsupervised video instance segmentation consisting of a three-stage framework to construct a high-quality video dataset with pseudo labels. To our knowledge, our work is the first attempt to curate a video dataset with pseudo-labels for unsupervised video instance segmentation. In the first stage, we generate pseudo-instance masks by exploiting… ▽ More

    Submitted 19 May, 2025; originally announced May 2025.

  2. arXiv:2404.03392  [pdf, other

    cs.CV

    Boosting Unsupervised Segmentation Learning

    Authors: Alp Eren Sari, Francesco Locatello, Paolo Favaro

    Abstract: We present two practical improvement techniques for unsupervised segmentation learning. These techniques address limitations in the resolution and accuracy of predicted segmentation maps of recent state-of-the-art methods. Firstly, we leverage image post-processing techniques such as guided filtering to refine the output masks, improving accuracy while avoiding substantial computational costs. Sec… ▽ More

    Submitted 28 November, 2024; v1 submitted 4 April, 2024; originally announced April 2024.

    Comments: Accepted to NeurIPS 2024 Workshop: Self-Supervised Learning - Theory and Practice

  3. arXiv:2301.09264  [pdf, ps, other

    cs.LG cs.CV cs.NE

    Efficient Training Under Limited Resources

    Authors: Mahdi Zolnouri, Dounia Lakhmiri, Christophe Tribes, Eyyüb Sari, Sébastien Le Digabel

    Abstract: Training time budget and size of the dataset are among the factors affecting the performance of a Deep Neural Network (DNN). This paper shows that Neural Architecture Search (NAS), Hyper Parameters Optimization (HPO), and Data Augmentation help DNNs perform much better while these two factors are limited. However, searching for an optimal architecture and the best hyperparameter values besides a g… ▽ More

    Submitted 22 January, 2023; originally announced January 2023.

  4. arXiv:2212.11791  [pdf, other

    cs.LG cs.CL

    Training Integer-Only Deep Recurrent Neural Networks

    Authors: Vahid Partovi Nia, Eyyüb Sari, Vanessa Courville, Masoud Asgharian

    Abstract: Recurrent neural networks (RNN) are the backbone of many text and speech applications. These architectures are typically made up of several computationally complex components such as; non-linear activation functions, normalization, bi-directional dependence and attention. In order to maintain good accuracy, these components are frequently run using full-precision floating-point computation, making… ▽ More

    Submitted 22 December, 2022; originally announced December 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2109.09828

    ACM Class: I.2.6

  5. arXiv:2110.13220  [pdf, other

    cs.LG stat.ML

    Demystifying and Generalizing BinaryConnect

    Authors: Tim Dockhorn, Yaoliang Yu, Eyyüb Sari, Mahdi Zolnouri, Vahid Partovi Nia

    Abstract: BinaryConnect (BC) and its many variations have become the de facto standard for neural network quantization. However, our understanding of the inner workings of BC is still quite limited. We attempt to close this gap in four different aspects: (a) we show that existing quantization algorithms, including post-training quantization, are surprisingly similar to each other; (b) we argue for proximal… ▽ More

    Submitted 25 October, 2021; originally announced October 2021.

    Comments: NeurIPS 2021

  6. arXiv:2109.09828  [pdf, other

    cs.LG cs.NE

    iRNN: Integer-only Recurrent Neural Network

    Authors: Eyyüb Sari, Vanessa Courville, Vahid Partovi Nia

    Abstract: Recurrent neural networks (RNN) are used in many real-world text and speech applications. They include complex modules such as recurrence, exponential-based activation, gate interaction, unfoldable normalization, bi-directional dependence, and attention. The interaction between these elements prevents running them on integer-only operations without a significant performance drop. Deploying RNNs th… ▽ More

    Submitted 14 February, 2022; v1 submitted 20 September, 2021; originally announced September 2021.

  7. arXiv:2107.06197  [pdf, other

    cs.LG cs.CV

    Generative Adversarial Learning via Kernel Density Discrimination

    Authors: Abdelhak Lemkhenter, Adam Bielski, Alp Eren Sari, Paolo Favaro

    Abstract: We introduce Kernel Density Discrimination GAN (KDD GAN), a novel method for generative adversarial learning. KDD GAN formulates the training as a likelihood ratio optimization problem where the data distributions are written explicitly via (local) Kernel Density Estimates (KDE). This is inspired by the recent progress in contrastive learning and its relation to KDE. We define the KDEs directly in… ▽ More

    Submitted 13 July, 2021; originally announced July 2021.

  8. arXiv:2004.14214  [pdf, ps, other

    cs.LG stat.ML

    Batch Normalization in Quantized Networks

    Authors: Eyyüb Sari, Vahid Partovi Nia

    Abstract: Implementation of quantized neural networks on computing hardware leads to considerable speed up and memory saving. However, quantized deep networks are difficult to train and batch~normalization (BatchNorm) layer plays an important role in training full-precision and quantized networks. Most studies on BatchNorm are focused on full-precision networks, and there is little research in understanding… ▽ More

    Submitted 29 April, 2020; originally announced April 2020.

  9. arXiv:1911.09968  [pdf, other

    cs.CV cs.LG eess.IV

    SelfVIO: Self-Supervised Deep Monocular Visual-Inertial Odometry and Depth Estimation

    Authors: Yasin Almalioglu, Mehmet Turan, Alp Eren Sari, Muhamad Risqi U. Saputra, Pedro P. B. de Gusmão, Andrew Markham, Niki Trigoni

    Abstract: In the last decade, numerous supervised deep learning approaches requiring large amounts of labeled data have been proposed for visual-inertial odometry (VIO) and depth map estimation. To overcome the data limitation, self-supervised learning has emerged as a promising alternative, exploiting constraints such as geometric and photometric consistency in the scene. In this study, we introduce a nove… ▽ More

    Submitted 23 July, 2020; v1 submitted 22 November, 2019; originally announced November 2019.

    Comments: 15 pages, submitted to The IEEE Transactions on Robotics (T-RO) journal, under review

  10. arXiv:1909.12205  [pdf, other

    cs.LG cs.CV stat.ML

    Adaptive Binary-Ternary Quantization

    Authors: Ryan Razani, Grégoire Morin, Vahid Partovi Nia, Eyyüb Sari

    Abstract: Neural network models are resource hungry. It is difficult to deploy such deep networks on devices with limited resources, like smart wearables, cellphones, drones, and autonomous vehicles. Low bit quantization such as binary and ternary quantization is a common approach to alleviate this resource requirements. Ternary quantization provides a more flexible model and outperforms binary quantization… ▽ More

    Submitted 13 September, 2021; v1 submitted 26 September, 2019; originally announced September 2019.

    MSC Class: 97R40

    Journal ref: CVPR 2021 BiVision Workshop

  11. arXiv:1909.09139  [pdf, ps, other

    cs.LG stat.ML

    How Does Batch Normalization Help Binary Training?

    Authors: Eyyüb Sari, Mouloud Belbahri, Vahid Partovi Nia

    Abstract: Binary Neural Networks (BNNs) are difficult to train, and suffer from drop of accuracy. It appears in practice that BNNs fail to train in the absence of Batch Normalization (BatchNorm) layer. We find the main role of BatchNorm is to avoid exploding gradients in the case of BNNs. This finding suggests that the common initialization methods developed for full-precision networks are irrelevant to BNN… ▽ More

    Submitted 29 April, 2020; v1 submitted 18 September, 2019; originally announced September 2019.

  12. arXiv:1909.04567  [pdf, other

    cs.LG stat.ML

    Differentiable Mask for Pruning Convolutional and Recurrent Networks

    Authors: Ramchalam Kinattinkara Ramakrishnan, Eyyüb Sari, Vahid Partovi Nia

    Abstract: Pruning is one of the most effective model reduction techniques. Deep networks require massive computation and such models need to be compressed to bring them on edge devices. Most existing pruning techniques are focused on vision-based models like convolutional networks, while text-based models are still evolving. The emergence of multi-modal multi-task learning calls for a general method that wo… ▽ More

    Submitted 29 April, 2020; v1 submitted 10 September, 2019; originally announced September 2019.

  13. arXiv:1901.06414  [pdf, other

    stat.ML cs.LG

    Foothill: A Quasiconvex Regularization for Edge Computing of Deep Neural Networks

    Authors: Mouloud Belbahri, Eyyüb Sari, Sajad Darabi, Vahid Partovi Nia

    Abstract: Deep neural networks (DNNs) have demonstrated success for many supervised learning tasks, ranging from voice recognition, object detection, to image classification. However, their increasing complexity might yield poor generalization error that make them hard to be deployed on edge devices. Quantization is an effective approach to compress DNNs in order to meet these constraints. Using a quasiconv… ▽ More

    Submitted 23 May, 2019; v1 submitted 18 January, 2019; originally announced January 2019.

    Comments: Accepted in 16th International Conference of Image Analysis and Recognition (ICIAR 2019)

  14. Thinning CsPb2Br5 Perovskite Down to Monolayers: Cs-dependent Stability

    Authors: Fadil Iyikanat, Emre Sari, Hasan Sahin

    Abstract: Using first-principles density functional theory calculations, we systematically investigate the structural, electronic and vibrational properties of bulk and potential single-layer structures of perovskite-like CsPb2Br5 crystal. It is found that while Cs atoms have no effect on the electronic structure, their presence is essential for the formation of stable CsPb2Br5 crystals. Calculated vibratio… ▽ More

    Submitted 24 October, 2017; originally announced October 2017.

    Comments: 18 pages, 5 figures

    Journal ref: Phys. Rev. B 96, 155442, 2017

  15. arXiv:1709.06041  [pdf, other

    cs.RO

    Endo-VMFuseNet: Deep Visual-Magnetic Sensor Fusion Approach for Uncalibrated, Unsynchronized and Asymmetric Endoscopic Capsule Robot Localization Data

    Authors: Mehmet Turan, Yasin Almalioglu, Hunter Gilbert, Alp Eren Sari, Ufuk Soylu, Metin Sitti

    Abstract: In the last decade, researchers and medical device companies have made major advances towards transforming passive capsule endoscopes into active medical robots. One of the major challenges is to endow capsule robots with accurate perception of the environment inside the human body, which will provide necessary information and enable improved medical procedures. We extend the success of deep learn… ▽ More

    Submitted 22 September, 2017; v1 submitted 18 September, 2017; originally announced September 2017.

    Comments: Submitted to ICRA 2018

  16. arXiv:1708.00885   

    cs.HC cs.MA

    Proc. of the 9th Workshop on Semantic Ambient Media Experiences (SAME'2016/2): Visualisation, Emerging Media, and User-Experience: International Series on Information Systems and Management in Creative eMedia (CreMedia)

    Authors: Artur Lugmayr, Richard Seale, Andrew Woods, Eunice Sari, Adi Tedjasaputra

    Abstract: The 9th Semantic Ambient Media Experience (SAME) proceedings where based on the academic contributions to a two day workshop that was held at Curtin University, Perth, WA, Australia. The symposium was held to discuss visualisation, emerging media, and user-experience from various angles. The papers of this workshop are freely available through http://www.ambientmediaassociation.org/Journal under o… ▽ More

    Submitted 28 July, 2017; originally announced August 2017.

    Journal ref: Proc. of the 9th Workshop on Semantic Ambient Media Experiences, Visualisation, Emerging Media, and User-Experience, International Series on Information Systems and Management in Creative eMedia (CreMedia), No. 2016/2, 2016

  17. arXiv:1707.08949   

    cs.HC

    Proceedings of the 8th Workshop on Semantic Ambient Media Experiences (SAME 2016): Smart Cities for Better Living with HCI and UX (SEACHI), International Series on Information Systems and Management in Creative eMedia (CreMedia)

    Authors: Eunice Sari, Adi Tedjasaputra, Do Yi Luen Ellen, Henry Duh, Artur Lugmayr

    Abstract: Digital and interactive technologies are becoming increasingly embedded in everyday lives of people around the world. Application of technologies such as real-time, context-aware, and interactive technologies; augmented and immersive realities; social media; and location-based services has been particularly evident in urban environments where technological and sociocultural infrastructures enable… ▽ More

    Submitted 28 July, 2017; v1 submitted 27 July, 2017; originally announced July 2017.

    Journal ref: Eunice Sari, et. al., Proc. of the 8th Workshop on Semantic Ambient Media Experiences: Smart Cities for Better Living with HCI and UX, Int. SERIES on Information Systems and Management in Creative eMedia (CreMedia), n. 2016/1, 2017