Showing 1–8 of 8 results for author: Khorrami, P

Searching in archive cs.
  1. arXiv:2504.08149  [pdf, other]

    cs.CV

    LoRAX: LoRA eXpandable Networks for Continual Synthetic Image Attribution

    Authors: Danielle Sullivan-Pao, Nicole Tian, Pooya Khorrami

    Abstract: As generative AI image technologies become more widespread and advanced, there is a growing need for strong attribution models. These models are crucial for verifying the authenticity of images and identifying the architecture of their originating generative models, key to maintaining media integrity. However, attribution models struggle to generalize to unseen models, and traditional fine-tuning m…

    Submitted 10 April, 2025; originally announced April 2025.

    Journal ref: Proceedings of the Media Authenticity in the Age of Artificial Intelligence Workshop at the 35th British Machine Vision Conference (BMVC), Glasgow, UK, 2024

  2. arXiv:2112.11929  [pdf, other]

    cs.CV cs.LG

    Meta-Learning and Self-Supervised Pretraining for Real World Image Translation

    Authors: Ileana Rugina, Rumen Dangovski, Mark Veillette, Pooya Khorrami, Brian Cheung, Olga Simek, Marin Soljačić

    Abstract: Recent advances in deep learning, in particular enabled by hardware advances and big data, have provided impressive results across a wide range of computational problems such as computer vision, natural language, or reinforcement learning. Many of these improvements are however constrained to problems with large-scale curated data-sets which require a lot of human labor to gather. Additionally, th…

    Submitted 22 December, 2021; originally announced December 2021.

    Comments: 10 pages, 8 figures, 2 tables

  3. arXiv:1704.06001  [pdf, other]

    cs.LG cs.CV stat.ML

    Fast Generation for Convolutional Autoregressive Models

    Authors: Prajit Ramachandran, Tom Le Paine, Pooya Khorrami, Mohammad Babaeizadeh, Shiyu Chang, Yang Zhang, Mark A. Hasegawa-Johnson, Roy H. Campbell, Thomas S. Huang

    Abstract: Convolutional autoregressive models have recently demonstrated state-of-the-art performance on a number of generation tasks. While fast, parallel training methods have been crucial for their success, generation is typically implemented in a naïve fashion where redundant computations are unnecessarily repeated. This results in slow generation, making such models infeasible for production environmen…

    Submitted 20 April, 2017; originally announced April 2017.

    Comments: Accepted at ICLR 2017 Workshop

  4. arXiv:1611.09482  [pdf, other]

    cs.SD cs.DS cs.LG

    Fast Wavenet Generation Algorithm

    Authors: Tom Le Paine, Pooya Khorrami, Shiyu Chang, Yang Zhang, Prajit Ramachandran, Mark A. Hasegawa-Johnson, Thomas S. Huang

    Abstract: This paper presents an efficient implementation of the Wavenet generation process called Fast Wavenet. Compared to a naive implementation that has complexity O(2^L) (L denotes the number of layers in the network), our proposed approach removes redundant convolution operations by caching previous calculations, thereby reducing the complexity to O(L) time. Timing experiments show significant advanta…

    Submitted 28 November, 2016; originally announced November 2016.

    Comments: Technical Report

  5. arXiv:1602.08465  [pdf, other]

    cs.CV

    Seq-NMS for Video Object Detection

    Authors: Wei Han, Pooya Khorrami, Tom Le Paine, Prajit Ramachandran, Mohammad Babaeizadeh, Honghui Shi, Jianan Li, Shuicheng Yan, Thomas S. Huang

    Abstract: Video object detection is challenging because objects that are easily detected in one frame may be difficult to detect in another frame within the same clip. Recently, there have been major advances for doing object detection in a single image. These methods typically contain three phases: (i) object proposal generation (ii) object classification and (iii) post-processing. We propose a modificatio…

    Submitted 22 August, 2016; v1 submitted 26 February, 2016; originally announced February 2016.

    Comments: Technical Report for Imagenet VID Competition 2015

  6. arXiv:1602.07377  [pdf, other]

    cs.CV

    How Deep Neural Networks Can Improve Emotion Recognition on Video Data

    Authors: Pooya Khorrami, Tom Le Paine, Kevin Brady, Charlie Dagli, Thomas S. Huang

    Abstract: We consider the task of dimensional emotion recognition on video data using deep learning. While several previous methods have shown the benefits of training temporal neural network models such as recurrent neural networks (RNNs) on hand-crafted features, few works have considered combining convolutional neural networks (CNNs) with RNNs. In this work, we present a system that performs emotion reco…

    Submitted 9 January, 2017; v1 submitted 23 February, 2016; originally announced February 2016.

    Comments: Accepted at ICIP 2016. Fixed typo in Experiments section

  7. arXiv:1510.02969  [pdf, other]

    cs.CV cs.LG cs.NE

    Do Deep Neural Networks Learn Facial Action Units When Doing Expression Recognition?

    Authors: Pooya Khorrami, Tom Le Paine, Thomas S. Huang

    Abstract: Despite being the appearance-based classifier of choice in recent years, relatively few works have examined how much convolutional neural networks (CNNs) can improve performance on accepted expression recognition benchmarks and, more importantly, examine what it is they actually learn. In this work, not only do we show that CNNs can achieve strong performance, but we also introduce an approach to…

    Submitted 15 March, 2017; v1 submitted 10 October, 2015; originally announced October 2015.

    Comments: Accepted at ICCV 2015 CV4AC Workshop. Corrected numbers in Tables 2 and 3

  8. arXiv:1412.6597  [pdf, other]

    cs.CV cs.LG cs.NE

    An Analysis of Unsupervised Pre-training in Light of Recent Advances

    Authors: Tom Le Paine, Pooya Khorrami, Wei Han, Thomas S. Huang

    Abstract: Convolutional neural networks perform well on object recognition because of a number of recent advances: rectified linear units (ReLUs), data augmentation, dropout, and large labelled datasets. Unsupervised data has been proposed as another way to improve performance. Unfortunately, unsupervised pre-training is not used by state-of-the-art methods leading to the following question: Is unsupervised…

    Submitted 10 April, 2015; v1 submitted 19 December, 2014; originally announced December 2014.

    Comments: Accepted as a workshop contribution to ICLR 2015