Skip to main content

Showing 1–13 of 13 results for author: Taher, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2501.10131  [pdf, other

    cs.CV

    ACE: Anatomically Consistent Embeddings in Composition and Decomposition

    Authors: Ziyu Zhou, Haozhe Luo, Mohammad Reza Hosseinzadeh Taher, Jiaxuan Pang, Xiaowei Ding, Michael Gotway, Jianming Liang

    Abstract: Medical images acquired from standardized protocols show consistent macroscopic or microscopic anatomical structures, and these structures consist of composable/decomposable organs and tissues, but existing self-supervised learning (SSL) methods do not appreciate such composable/decomposable structure attributes inherent to medical images. To overcome this limitation, this paper introduces a novel… ▽ More

    Submitted 17 January, 2025; originally announced January 2025.

    Comments: Accepted by WACV 2025

  2. arXiv:2410.22446  [pdf, other

    cs.CL cs.AI

    Do Large Language Models Align with Core Mental Health Counseling Competencies?

    Authors: Viet Cuong Nguyen, Mohammad Taher, Dongwan Hong, Vinicius Konkolics Possobom, Vibha Thirunellayi Gopalakrishnan, Ekta Raj, Zihang Li, Heather J. Soled, Michael L. Birnbaum, Srijan Kumar, Munmun De Choudhury

    Abstract: The rapid evolution of Large Language Models (LLMs) presents a promising solution to the global shortage of mental health professionals. However, their alignment with essential counseling competencies remains underexplored. We introduce CounselingBench, a novel NCMHCE-based benchmark evaluating 22 general-purpose and medical-finetuned LLMs across five key competencies. While frontier models surpas… ▽ More

    Submitted 26 February, 2025; v1 submitted 29 October, 2024; originally announced October 2024.

    Comments: 10 Pages, Accepted to Findings of NAACL 2025

  3. arXiv:2404.15672  [pdf, other

    cs.CV

    Representing Part-Whole Hierarchies in Foundation Models by Learning Localizability, Composability, and Decomposability from Anatomy via Self-Supervision

    Authors: Mohammad Reza Hosseinzadeh Taher, Michael B. Gotway, Jianming Liang

    Abstract: Humans effortlessly interpret images by parsing them into part-whole hierarchies; deep learning excels in learning multi-level feature spaces, but they often lack explicit coding of part-whole relations, a prominent property of medical imaging. To overcome this limitation, we introduce Adam-v2, a new self-supervised learning framework extending Adam [79] by explicitly incorporating part-whole hier… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: Accepted at CVPR 2024 [main conference]

  4. arXiv:2402.03908  [pdf, other

    cs.CV

    EscherNet: A Generative Model for Scalable View Synthesis

    Authors: Xin Kong, Shikun Liu, Xiaoyang Lyu, Marwan Taher, Xiaojuan Qi, Andrew J. Davison

    Abstract: We introduce EscherNet, a multi-view conditioned diffusion model for view synthesis. EscherNet learns implicit and generative 3D representations coupled with a specialised camera positional encoding, allowing precise and continuous relative control of the camera transformation between an arbitrary number of reference and target views. EscherNet offers exceptional generality, flexibility, and scala… ▽ More

    Submitted 19 March, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

    Comments: CVPR2024 Project Page: https://kxhit.github.io/EscherNet

  5. arXiv:2401.02357  [pdf, other

    cs.CV

    Fit-NGP: Fitting Object Models to Neural Graphics Primitives

    Authors: Marwan Taher, Ignacio Alzugaray, Andrew J. Davison

    Abstract: Accurate 3D object pose estimation is key to enabling many robotic applications that involve challenging object interactions. In this work, we show that the density field created by a state-of-the-art efficient radiance field reconstruction method is suitable for highly accurate and robust pose estimation for objects with known 3D models, even when they are very small and with challenging reflecti… ▽ More

    Submitted 4 January, 2024; originally announced January 2024.

  6. arXiv:2309.15358  [pdf, other

    cs.CV

    Towards Foundation Models Learned from Anatomy in Medical Imaging via Self-Supervision

    Authors: Mohammad Reza Hosseinzadeh Taher, Michael B. Gotway, Jianming Liang

    Abstract: Human anatomy is the foundation of medical imaging and boasts one striking characteristic: its hierarchy in nature, exhibiting two intrinsic properties: (1) locality: each anatomical structure is morphologically distinct from the others; and (2) compositionality: each anatomical structure is an integrated part of a larger whole. We envision a foundation model for medical imaging that is consciousl… ▽ More

    Submitted 26 September, 2023; originally announced September 2023.

    Comments: International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI 2023)---Domain Adaptation and Representation Transfer

  7. arXiv:2302.01838  [pdf, other

    cs.CV

    vMAP: Vectorised Object Mapping for Neural Field SLAM

    Authors: Xin Kong, Shikun Liu, Marwan Taher, Andrew J. Davison

    Abstract: We present vMAP, an object-level dense SLAM system using neural field representations. Each object is represented by a small MLP, enabling efficient, watertight object modelling without the need for 3D priors. As an RGB-D camera browses a scene with no prior information, vMAP detects object instances on-the-fly, and dynamically adds them to its map. Specifically, thanks to the power of vectorised… ▽ More

    Submitted 13 March, 2023; v1 submitted 3 February, 2023; originally announced February 2023.

    Comments: CVPR2023 Project Page:https://kxhit.github.io/vMAP

  8. arXiv:2212.09460  [pdf

    cs.CV eess.IV

    Hardware Acceleration of Lane Detection Algorithm: A GPU Versus FPGA Comparison

    Authors: Mohamed Alshemi, Sherif Saif, Mohamed Taher

    Abstract: A Complete Computer vision system can be divided into two main categories: detection and classification. The Lane detection algorithm is a part of the computer vision detection category and has been applied in autonomous driving and smart vehicle systems. The lane detection system is responsible for lane marking in a complex road environment. At the same time, lane detection plays a crucial role i… ▽ More

    Submitted 19 December, 2022; originally announced December 2022.

  9. arXiv:2204.10437  [pdf, other

    cs.CV eess.IV

    DiRA: Discriminative, Restorative, and Adversarial Learning for Self-supervised Medical Image Analysis

    Authors: Fatemeh Haghighi, Mohammad Reza Hosseinzadeh Taher, Michael B. Gotway, Jianming Liang

    Abstract: Discriminative learning, restorative learning, and adversarial learning have proven beneficial for self-supervised learning schemes in computer vision and medical imaging. Existing efforts, however, omit their synergistic effects on each other in a ternary setup, which, we envision, can significantly benefit deep semantic representation learning. To realize this vision, we have developed DiRA, the… ▽ More

    Submitted 21 April, 2022; originally announced April 2022.

    Comments: Accepted at CVPR 2022 [main conference]

  10. arXiv:2204.07344  [pdf, other

    eess.IV cs.CV

    CAiD: Context-Aware Instance Discrimination for Self-supervised Learning in Medical Imaging

    Authors: Mohammad Reza Hosseinzadeh Taher, Fatemeh Haghighi, Michael B. Gotway, Jianming Liang

    Abstract: Recently, self-supervised instance discrimination methods have achieved significant success in learning visual representations from unlabeled photographic images. However, given the marked differences between photographic and medical images, the efficacy of instance-based objectives, focusing on learning the most discriminative global features in the image (i.e., wheels in bicycle), remains unknow… ▽ More

    Submitted 15 April, 2022; originally announced April 2022.

    Comments: Accepted at MIDL 2022 [main conference]

  11. arXiv:2108.05930  [pdf, other

    cs.CV eess.IV

    A Systematic Benchmarking Analysis of Transfer Learning for Medical Image Analysis

    Authors: Mohammad Reza Hosseinzadeh Taher, Fatemeh Haghighi, Ruibin Feng, Michael B. Gotway, Jianming Liang

    Abstract: Transfer learning from supervised ImageNet models has been frequently used in medical image analysis. Yet, no large-scale evaluation has been conducted to benchmark the efficacy of newly-developed pre-training techniques for medical image analysis, leaving several important questions unanswered. As the first step in this direction, we conduct a systematic study on the transferability of models pre… ▽ More

    Submitted 12 August, 2021; originally announced August 2021.

    Comments: International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI 2021); Domain Adaptation and Representation Transfer (DART)

  12. arXiv:2102.10680  [pdf, other

    cs.CV eess.IV

    Transferable Visual Words: Exploiting the Semantics of Anatomical Patterns for Self-supervised Learning

    Authors: Fatemeh Haghighi, Mohammad Reza Hosseinzadeh Taher, Zongwei Zhou, Michael B. Gotway, Jianming Liang

    Abstract: This paper introduces a new concept called "transferable visual words" (TransVW), aiming to achieve annotation efficiency for deep learning in medical image analysis. Medical imaging--focusing on particular parts of the body for defined clinical purposes--generates images of great similarity in anatomy across patients and yields sophisticated anatomical patterns across images, which are associated… ▽ More

    Submitted 21 February, 2021; originally announced February 2021.

    Comments: Journal version of arXiv:2007.06959, accepted by IEEE Transactions on Medical Imaging (TMI)

  13. arXiv:2007.06959  [pdf, other

    cs.CV eess.IV

    Learning Semantics-enriched Representation via Self-discovery, Self-classification, and Self-restoration

    Authors: Fatemeh Haghighi, Mohammad Reza Hosseinzadeh Taher, Zongwei Zhou, Michael B. Gotway, Jianming Liang

    Abstract: Medical images are naturally associated with rich semantics about the human anatomy, reflected in an abundance of recurring anatomical patterns, offering unique potential to foster deep semantic representation learning and yield semantically more powerful models for different medical applications. But how exactly such strong yet free semantics embedded in medical images can be harnessed for self-s… ▽ More

    Submitted 14 July, 2020; originally announced July 2020.

    Comments: International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI 2020)