Showing 1–20 of 20 results for author: Ghahremani, M

  1. arXiv:2505.22046   

    cs.CV

    LatentMove: Towards Complex Human Movement Video Generation

    Authors: Ashkan Taghipour, Morteza Ghahremani, Mohammed Bennamoun, Farid Boussaid, Aref Miri Rekavandi, Zinuo Li, Qiuhong Ke, Hamid Laga

    Abstract: Image-to-video (I2V) generation seeks to produce realistic motion sequences from a single reference image. Although recent methods exhibit strong temporal consistency, they often struggle when dealing with complex, non-repetitive human movements, leading to unnatural deformations. To tackle this issue, we present LatentMove, a DiT-based framework specifically tailored for highly dynamic human anim…

    Submitted 27 June, 2025; v1 submitted 28 May, 2025; originally announced May 2025.

    Comments: The authors are withdrawing this paper due to major issues in the experiments and methodology. To prevent citation of this outdated and flawed version, we have decided to remove it while we work on a substantial revision. Thank you

  2. arXiv:2505.21698  [pdf, ps, other]

    cs.CV

    MedBridge: Bridging Foundation Vision-Language Models to Medical Image Diagnosis

    Authors: Yitong Li, Morteza Ghahremani, Christian Wachinger

    Abstract: Recent vision-language foundation models deliver state-of-the-art results on natural image classification but falter on medical images due to pronounced domain shifts. At the same time, training a medical foundation model requires substantial resources, including extensive annotated data and high computational capacity. To bridge this gap with minimal overhead, we introduce MedBridge, a lightweigh…

    Submitted 27 May, 2025; originally announced May 2025.

  3. arXiv:2410.23219  [pdf, other]

    cs.CV cs.AI

    DiaMond: Dementia Diagnosis with Multi-Modal Vision Transformers Using MRI and PET

    Authors: Yitong Li, Morteza Ghahremani, Youssef Wally, Christian Wachinger

    Abstract: Diagnosing dementia, particularly for Alzheimer's Disease (AD) and frontotemporal dementia (FTD), is complex due to overlapping symptoms. While magnetic resonance imaging (MRI) and positron emission tomography (PET) data are critical for the diagnosis, integrating these modalities in deep learning faces challenges, often resulting in suboptimal performance compared to using single modalities. More…

    Submitted 30 October, 2024; originally announced October 2024.

    Comments: Accepted by IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2025

  4. Mamba? Catch The Hype Or Rethink What Really Helps for Image Registration

    Authors: Bailiang Jian, Jiazhen Pan, Morteza Ghahremani, Daniel Rueckert, Christian Wachinger, Benedikt Wiestler

    Abstract: Our findings indicate that adopting "advanced" computational elements fails to significantly improve registration accuracy. Instead, well-established registration-specific designs offer fair improvements, enhancing results by a marginal 1.5\% over the baseline. Our findings emphasize the importance of rigorous, unbiased evaluation and contribution disentanglement of all low- and high-level registr…

    Submitted 27 July, 2024; originally announced July 2024.

    Comments: WBIR 2024 Workshop on Biomedical Imaging Registration

  5. arXiv:2407.19205  [pdf, other]

    cs.CV cs.AI

    Faster Image2Video Generation: A Closer Look at CLIP Image Embedding's Impact on Spatio-Temporal Cross-Attentions

    Authors: Ashkan Taghipour, Morteza Ghahremani, Mohammed Bennamoun, Aref Miri Rekavandi, Zinuo Li, Hamid Laga, Farid Boussaid

    Abstract: This paper investigates the role of CLIP image embeddings within the Stable Video Diffusion (SVD) framework, focusing on their impact on video generation quality and computational efficiency. Our findings indicate that CLIP embeddings, while crucial for aesthetic quality, do not significantly contribute towards the subject and background consistency of video outputs. Moreover, the computationally…

    Submitted 27 July, 2024; originally announced July 2024.

  6. arXiv:2406.13472  [pdf, other]

    physics.optics

    Extraordinary Quality Factors in Dual-Band Polarization-Insensitive QuasiBound States in the Continuum

    Authors: Maryam Ghahremani, Carlos J. Zapata-Rodríguez

    Abstract: In this study, we investigate a novel "dimerized" dielectric metasurface featuring dual-mode resonances governed by symmetry-protected bound states in the continuum (BICs). The metasurface design offers advantages such as insensitivity to incident light polarization and exceptionally high quality factors exceeding 10$^5$ for low and moderate structural deviations from the monoatomic array. By intr…

    Submitted 19 June, 2024; originally announced June 2024.

  7. arXiv:2406.02485  [pdf, other]

    cs.CV

    Stable-Pose: Leveraging Transformers for Pose-Guided Text-to-Image Generation

    Authors: Jiajun Wang, Morteza Ghahremani, Yitong Li, Björn Ommer, Christian Wachinger

    Abstract: Controllable text-to-image (T2I) diffusion models have shown impressive performance in generating high-quality visual content through the incorporation of various conditions. Current methods, however, exhibit limited performance when guided by skeleton human poses, especially in complex pose conditions such as side or rear perspectives of human figures. To address this issue, we present Stable-Pos…

    Submitted 5 November, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

    Comments: Accepted by NeurIPS 2024

  8. arXiv:2405.06810  [pdf]

    physics.optics physics.bio-ph

    Simulating Light Propagation through Biological Media Using Monte-Carlo Method

    Authors: Maryam Ghahremani

    Abstract: Biological tissues are complex structures composed of many elements, which makes light-based tissue diagnostics challenging. Over the past decades, the Monte Carlo technique has been used as a fundamental and versatile approach toward modeling photon-tissue interactions. This report first describes an MC simulation of steady-state light transport in an absorbing and diffusing multi-layered structure. Fur…

    Submitted 15 May, 2024; v1 submitted 10 May, 2024; originally announced May 2024.

    Comments: arXiv admin note: text overlap with arXiv:1711.03244 by other authors

  9. arXiv:2404.12200  [pdf, other]

    physics.optics

    Metamaterial-induced-transparency engineering through quasi-bound states in the continuum by using dielectric cross-shaped trimers

    Authors: Maryam Ghahremani, Carlos J. Zapata-Rodriguez

    Abstract: This study presents a novel approach to activate a narrowband transparency line within a reflecting broadband window in all-dielectric metasurfaces, in analogy to the electromagnetically-induced transparency effect, by means of a quasi-bound state in the continuum (qBIC). We demonstrate that the resonance overlapping of a bright mode and a qBIC-based nearly-dark mode with distinct Q-factor can be…

    Submitted 18 April, 2024; originally announced April 2024.

  10. arXiv:2402.17910  [pdf, other]

    cs.CV

    Box It to Bind It: Unified Layout Control and Attribute Binding in T2I Diffusion Models

    Authors: Ashkan Taghipour, Morteza Ghahremani, Mohammed Bennamoun, Aref Miri Rekavandi, Hamid Laga, Farid Boussaid

    Abstract: While latent diffusion models (LDMs) excel at creating imaginative images, they often lack precision in semantic fidelity and spatial control over where objects are generated. To address these deficiencies, we introduce the Box-it-to-Bind-it (B2B) module - a novel, training-free approach for improving spatial control and semantic accuracy in text-to-image (T2I) diffusion models. B2B targets three…

    Submitted 27 February, 2024; originally announced February 2024.

  11. arXiv:2401.08115  [pdf, other]

    cs.CV cs.AI

    No-Clean-Reference Image Super-Resolution: Application to Electron Microscopy

    Authors: Mohammad Khateri, Morteza Ghahremani, Alejandra Sierra, Jussi Tohka

    Abstract: The inability to acquire clean high-resolution (HR) electron microscopy (EM) images over a large brain tissue volume hampers many neuroscience studies. To address this challenge, we propose a deep-learning-based image super-resolution (SR) approach to computationally reconstruct clean HR 3D-EM with a large field of view (FoV) from noisy low-resolution (LR) acquisition. Our contributions are I) Inv…

    Submitted 26 January, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

    Comments: 13 pages, 12 figures, and 2 tables

  12. arXiv:2310.00641  [pdf, other]

    cs.CV

    RegBN: Batch Normalization of Multimodal Data with Regularization

    Authors: Morteza Ghahremani, Christian Wachinger

    Abstract: Recent years have witnessed a surge of interest in integrating high-dimensional data captured by multisource sensors, driven by the impressive success of neural networks in the integration of multimodal data. However, the integration of heterogeneous multimodal data poses a significant challenge, as confounding effects and dependencies among such heterogeneous data sources introduce unwanted varia…

    Submitted 19 November, 2023; v1 submitted 1 October, 2023; originally announced October 2023.

    Journal ref: Conference on Neural Information Processing Systems (NeurIPS 2023)

  13. arXiv:2309.10646  [pdf, other]

    eess.IV cs.CV

    Self-Supervised Super-Resolution Approach for Isotropic Reconstruction of 3D Electron Microscopy Images from Anisotropic Acquisition

    Authors: Mohammad Khateri, Morteza Ghahremani, Alejandra Sierra, Jussi Tohka

    Abstract: Three-dimensional electron microscopy (3DEM) is an essential technique to investigate volumetric tissue ultra-structure. Due to technical limitations and high imaging costs, samples are often imaged anisotropically, where resolution in the axial direction ($z$) is lower than in the lateral directions $(x,y)$. This anisotropy 3DEM can hamper subsequent analysis and visualization tasks. To overcome…

    Submitted 19 September, 2023; originally announced September 2023.

  14. arXiv:2306.17811  [pdf, ps, other]

    cs.DM

    Safe Edges: A Study of Triangulation in Fill-in and Tree-Width Problems

    Authors: Mani Ghahremani, Janka Chlebikova

    Abstract: This paper considers two well-studied problems \textsc{Minimum Fill-In} (\textsc{Min Fill-In}) and \textsc{Treewidth}. Since both problems are \textsf{NP}-hard, various reduction rules simplifying an input graph have been intensively studied to better understand the structural properties relevant to these problems. Bodlaender et al. introduced the concept of a safe edge that is included in a solut…

    Submitted 30 June, 2023; originally announced June 2023.

    MSC Class: 05C75; 05C85

  15. arXiv:2204.14100  [pdf, other]

    eess.IV cs.CV

    Adversarial Distortion Learning for Medical Image Denoising

    Authors: Morteza Ghahremani, Mohammad Khateri, Alejandra Sierra, Jussi Tohka

    Abstract: We present a novel adversarial distortion learning (ADL) for denoising two- and three-dimensional (2D/3D) biomedical image data. The proposed ADL consists of two auto-encoders: a denoiser and a discriminator. The denoiser removes noise from input data and the discriminator compares the denoised result to its noise-free counterpart. This process is repeated until the discriminator cannot differenti…

    Submitted 12 March, 2024; v1 submitted 29 April, 2022; originally announced April 2022.

  16. Regional Attention Network (RAN) for Head Pose and Fine-grained Gesture Recognition

    Authors: Ardhendu Behera, Zachary Wharton, Morteza Ghahremani, Swagat Kumar, Nik Bessis

    Abstract: Affect is often expressed via non-verbal body language such as actions/gestures, which are vital indicators for human behaviors. Recent studies on recognition of fine-grained actions/gestures in monocular images have mainly focused on modeling spatial configuration of body parts representing body pose, human-objects interactions and variations in local appearance. The results show that this is a b…

    Submitted 17 January, 2021; originally announced January 2021.

    Comments: This manuscript is the accepted version of the published paper in IEEE Transactions on Affective Computing

    Journal ref: IEEE Transactions on Affective Computing 2020

  17. FFD: Fast Feature Detector

    Authors: Morteza Ghahremani, Yonghuai Liu, Bernard Tiddeman

    Abstract: Scale-invariance, good localization and robustness to noise and distortions are the main properties that a local feature detector should possess. Most existing local feature detectors find excessive unstable feature points that increase the number of keypoints to be matched and the computational time of the matching step. In this paper, we show that robust and accurate keypoints exist in the speci…

    Submitted 1 December, 2020; originally announced December 2020.

    Journal ref: IEEE Transactions on Image Processing, 2021

  18. Orderly Disorder in Point Cloud Domain

    Authors: Morteza Ghahremani, Bernard Tiddeman, Yonghuai Liu, Ardhendu Behera

    Abstract: In the real world, out-of-distribution samples, noise and distortions exist in test data. Existing deep networks developed for point cloud data analysis are prone to overfitting and a partial change in test data leads to unpredictable behaviour of the networks. In this paper, we propose a smart yet simple deep network for analysis of 3D models using `orderly disorder' theory. Orderly disorder is a…

    Submitted 21 August, 2020; originally announced August 2020.

    Journal ref: 16th European Conference on Computer Vision (ECCV2020)

  19. arXiv:1511.03717  [pdf]

    cond-mat.mtrl-sci cond-mat.mes-hall

    Electric-field Controlled Magnetization Switching in Co/Pt thin-Film Ferromagnets

    Authors: A. Siddique, S. Gu, R. Witte, M. Ghahremani, C. A. Nwokoye, A. Aslani, R. Kruk, V. Provenzano, L. H. Bennett, E. Della Torre

    Abstract: A study of dynamic and reversible voltage controlled magnetization switching in ferromagnetic Co/Pt thin film with perpendicular magnetic anisotropy at room temperature is presented. The change in the magnetic properties of the system is observed in a relatively thick film of 15 nm. A surface charge is induced by the formation of electrochemical double layer between the metallic thin film and non-…

    Submitted 11 November, 2015; originally announced November 2015.

    Comments: 5 pages, 8 figures

  20. arXiv:1511.02312  [pdf]

    cond-mat.mtrl-sci physics.ins-det

    Optimization of Magnetic Refrigerators by Tuning the Heat Transfer Medium and Operating Conditions

    Authors: Mohammadreza Ghahremani, Amir Aslani, Lawrence H. Bennett, Edward Della Torre

    Abstract: A new experimental test bed has been designed, built, and tested to evaluate the effect of the systems parameters on a reciprocating Active Magnetic Regenerator (AMR) near room temperature. Bulk gadolinium was used as the refrigerant, silicon oil as the heat transfer medium, and a magnetic field of 1.3 T was cycled. This study focuses on the methodology of single stage AMR operation conditions to…

    Submitted 7 November, 2015; originally announced November 2015.

    Comments: 5 pages, 1 table, 8 figures