
Showing 1–9 of 9 results for author: Samangouei, P

Searching in archive cs.
  1. arXiv:2405.11697  [pdf, other]

    cs.CY

    AMMeBa: A Large-Scale Survey and Dataset of Media-Based Misinformation In-The-Wild

    Authors: Nicholas Dufour, Arkanath Pathak, Pouya Samangouei, Nikki Hariri, Shashi Deshetti, Andrew Dudfield, Christopher Guess, Pablo Hernández Escayola, Bobby Tran, Mevan Babakar, Christoph Bregler

    Abstract: The prevalence and harms of online misinformation is a perennial concern for internet platforms, institutions and society at large. Over time, information shared online has become more media-heavy and misinformation has readily adapted to these new modalities. The rise of generative AI-based tools, which provide widely-accessible methods for synthesizing realistic audio, images, video and human-li…

    Submitted 21 May, 2024; v1 submitted 19 May, 2024; originally announced May 2024.

    Comments: Grammar, spelling corrections. Minor rewording and clarification of one sentence. 24 pages, 31 figures

  2. arXiv:2403.05530  [pdf, other]

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love, et al. (1112 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February…

    Submitted 16 December, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  3. arXiv:2401.01970  [pdf, other]

    cs.CV cs.AI

    FMGS: Foundation Model Embedded 3D Gaussian Splatting for Holistic 3D Scene Understanding

    Authors: Xingxing Zuo, Pouya Samangouei, Yunwen Zhou, Yan Di, Mingyang Li

    Abstract: Precisely perceiving the geometric and semantic properties of real-world 3D objects is crucial for the continued evolution of augmented reality and robotic applications. To this end, we present Foundation Model Embedded Gaussian Splatting (FMGS), which incorporates vision-language embeddings of foundation models into 3D Gaussian Splatting (GS). The key contribution of this work is an efficient met…

    Submitted 3 May, 2024; v1 submitted 3 January, 2024; originally announced January 2024.

    Comments: Project page: https://xingxingzuo.github.io/fmgs

  4. arXiv:1911.10291  [pdf, other]

    cs.LG cs.CV stat.ML

    Invert and Defend: Model-based Approximate Inversion of Generative Adversarial Networks for Secure Inference

    Authors: Wei-An Lin, Yogesh Balaji, Pouya Samangouei, Rama Chellappa

    Abstract: Inferring the latent variable generating a given test sample is a challenging problem in Generative Adversarial Networks (GANs). In this paper, we propose InvGAN - a novel framework for solving the inference problem in GANs, which involves training an encoder network capable of inverting a pre-trained generator network without access to any training data. Under mild assumptions, we theoretically s…

    Submitted 22 November, 2019; originally announced November 2019.

  5. arXiv:1805.06605  [pdf, other]

    cs.CV cs.LG stat.ML

    Defense-GAN: Protecting Classifiers Against Adversarial Attacks Using Generative Models

    Authors: Pouya Samangouei, Maya Kabkab, Rama Chellappa

    Abstract: In recent years, deep neural network approaches have been widely adopted for machine learning tasks, including classification. However, they were shown to be vulnerable to adversarial perturbations: carefully crafted small perturbations can cause misclassification of legitimate images. We propose Defense-GAN, a new framework leveraging the expressive capability of generative models to defend deep…

    Submitted 17 May, 2018; v1 submitted 17 May, 2018; originally announced May 2018.

    Comments: Published as a conference paper at the Sixth International Conference on Learning Representations (ICLR 2018)

  6. arXiv:1803.05258  [pdf, other]

    cs.CV

    Face-MagNet: Magnifying Feature Maps to Detect Small Faces

    Authors: Pouya Samangouei, Mahyar Najibi, Larry Davis, Rama Chellappa

    Abstract: In this paper, we introduce the Face Magnifier Network (Face-MagNet), a face detector based on the Faster-RCNN framework which enables the flow of discriminative information of small scale faces to the classifier without any skip or residual connections. To achieve this, Face-MagNet deploys a set of ConvTranspose, also known as deconvolution, layers in the Region Proposal Network (RPN) and anothe…

    Submitted 14 March, 2018; originally announced March 2018.

    Comments: Accepted in WACV18

  7. arXiv:1802.01284  [pdf, other]

    cs.LG stat.ML

    Task-Aware Compressed Sensing with Generative Adversarial Networks

    Authors: Maya Kabkab, Pouya Samangouei, Rama Chellappa

    Abstract: In recent years, neural network approaches have been widely adopted for machine learning tasks, with applications in computer vision. More recently, unsupervised generative models based on neural networks have been successfully applied to model data distributions via low-dimensional latent spaces. In this paper, we use Generative Adversarial Networks (GANs) to impose structure in compressed sensin…

    Submitted 5 February, 2018; originally announced February 2018.

    Comments: Accepted for publication at the Thirty-Second AAAI Conference on Artificial Intelligence (AAAI-18)

  8. arXiv:1708.03979  [pdf, other]

    cs.CV

    SSH: Single Stage Headless Face Detector

    Authors: Mahyar Najibi, Pouya Samangouei, Rama Chellappa, Larry Davis

    Abstract: We introduce the Single Stage Headless (SSH) face detector. Unlike two stage proposal-classification detectors, SSH detects faces in a single stage directly from the early convolutional layers in a classification network. SSH is headless. That is, it is able to achieve state-of-the-art results while removing the "head" of its underlying classification network -- i.e. all fully connected layers in…

    Submitted 17 October, 2017; v1 submitted 13 August, 2017; originally announced August 2017.

    Comments: International Conference on Computer Vision (ICCV) 2017

  9. arXiv:1604.08865  [pdf, other]

    cs.CV

    Convolutional Neural Networks for Attribute-based Active Authentication on Mobile Devices

    Authors: Pouya Samangouei, Rama Chellappa

    Abstract: We present a Deep Convolutional Neural Network (DCNN) architecture for the task of continuous authentication on mobile devices. To deal with the limited resources of these devices, we reduce the complexity of the networks by learning intermediate features such as gender and hair color instead of identities. We present a multi-task, part-based DCNN architecture for attribute detection that performs…

    Submitted 8 July, 2016; v1 submitted 29 April, 2016; originally announced April 2016.

    Comments: Accepted in BTAS 2016