Skip to main content

Showing 1–5 of 5 results for author: Mofayezi, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.07932  [pdf, ps, other

    cs.GR cs.CV cs.LG

    Squeeze3D: Your 3D Generation Model is Secretly an Extreme Neural Compressor

    Authors: Rishit Dagli, Yushi Guan, Sankeerth Durvasula, Mohammadreza Mofayezi, Nandita Vijaykumar

    Abstract: We propose Squeeze3D, a novel framework that leverages implicit prior knowledge learnt by existing pre-trained 3D generative models to compress 3D data at extremely high compression ratios. Our approach bridges the latent spaces between a pre-trained encoder and a pre-trained generation model through trainable mapping networks. Any 3D model represented as a mesh, point cloud, or a radiance field i… ▽ More

    Submitted 9 June, 2025; originally announced June 2025.

  2. arXiv:2501.14249  [pdf, other

    cs.LG cs.AI cs.CL

    Humanity's Last Exam

    Authors: Long Phan, Alice Gatti, Ziwen Han, Nathaniel Li, Josephina Hu, Hugh Zhang, Chen Bo Calvin Zhang, Mohamed Shaaban, John Ling, Sean Shi, Michael Choi, Anish Agrawal, Arnav Chopra, Adam Khoja, Ryan Kim, Richard Ren, Jason Hausenloy, Oliver Zhang, Mantas Mazeika, Dmitry Dodonov, Tung Nguyen, Jaeho Lee, Daron Anderson, Mikhail Doroshenko, Alun Cennyth Stokes , et al. (1084 additional authors not shown)

    Abstract: Benchmarks are important tools for tracking the rapid advancements in large language model (LLM) capabilities. However, benchmarks are not keeping pace in difficulty: LLMs now achieve over 90\% accuracy on popular benchmarks like MMLU, limiting informed measurement of state-of-the-art LLM capabilities. In response, we introduce Humanity's Last Exam (HLE), a multi-modal benchmark at the frontier of… ▽ More

    Submitted 19 April, 2025; v1 submitted 24 January, 2025; originally announced January 2025.

    Comments: 29 pages, 6 figures

  3. arXiv:2402.02369  [pdf, other

    cs.CV cs.CL cs.MM

    M$^3$Face: A Unified Multi-Modal Multilingual Framework for Human Face Generation and Editing

    Authors: Mohammadreza Mofayezi, Reza Alipour, Mohammad Ali Kakavand, Ehsaneddin Asgari

    Abstract: Human face generation and editing represent an essential task in the era of computer vision and the digital world. Recent studies have shown remarkable progress in multi-modal face generation and editing, for instance, using face segmentation to guide image generation. However, it may be challenging for some users to create these conditioning modalities manually. Thus, we introduce M3Face, a unifi… ▽ More

    Submitted 4 February, 2024; originally announced February 2024.

  4. arXiv:2304.02963  [pdf, other

    cs.CV

    Benchmarking Robustness to Text-Guided Corruptions

    Authors: Mohammadreza Mofayezi, Yasamin Medghalchi

    Abstract: This study investigates the robustness of image classifiers to text-guided corruptions. We utilize diffusion models to edit images to different domains. Unlike other works that use synthetic or hand-picked data for benchmarking, we use diffusion models as they are generative models capable of learning to edit images while preserving their semantic content. Thus, the corruptions will be more realis… ▽ More

    Submitted 31 July, 2023; v1 submitted 6 April, 2023; originally announced April 2023.

    Comments: Accepted to CVPRW 2023

  5. arXiv:2210.05669  [pdf, other

    cs.CV cs.HC cs.RO

    A generic diffusion-based approach for 3D human pose prediction in the wild

    Authors: Saeed Saadatnejad, Ali Rasekh, Mohammadreza Mofayezi, Yasamin Medghalchi, Sara Rajabzadeh, Taylor Mordan, Alexandre Alahi

    Abstract: Predicting 3D human poses in real-world scenarios, also known as human pose forecasting, is inevitably subject to noisy inputs arising from inaccurate 3D pose estimations and occlusions. To address these challenges, we propose a diffusion-based approach that can predict given noisy observations. We frame the prediction task as a denoising problem, where both observation and prediction are consider… ▽ More

    Submitted 15 March, 2023; v1 submitted 11 October, 2022; originally announced October 2022.

    Comments: Accepted to ICRA 2023