Showing 1–4 of 4 results for author: Salehi, P

Searching in archive eess.
  1. arXiv:2411.13209  [pdf, other]

    cs.SD cs.AI cs.HC eess.AS

    Comparative Analysis of Audio Feature Extraction for Real-Time Talking Portrait Synthesis

    Authors: Pegah Salehi, Sajad Amouei Sheshkal, Vajira Thambawita, Sushant Gautam, Saeed S. Sabet, Dag Johansen, Michael A. Riegler, Pål Halvorsen

    Abstract: This paper examines the integration of real-time talking-head generation for interviewer training, focusing on overcoming challenges in Audio Feature Extraction (AFE), which often introduces latency and limits responsiveness in real-time applications. To address these issues, we propose and implement a fully integrated system that replaces conventional AFE models with OpenAI's Whisper, leveraging…

    Submitted 20 November, 2024; originally announced November 2024.

    Comments: 16 pages, 6 figures, 3 tables. Submitted to the MDPI journal Big Data and Cognitive Computing

    MSC Class: 68T45; 68T07; 68T01

  2. SinGAN-Seg: Synthetic training data generation for medical image segmentation

    Authors: Vajira Thambawita, Pegah Salehi, Sajad Amouei Sheshkal, Steven A. Hicks, Hugo L. Hammer, Sravanthi Parasa, Thomas de Lange, Pål Halvorsen, Michael A. Riegler

    Abstract: Analyzing medical data to find abnormalities is a time-consuming and costly task, particularly for rare abnormalities, requiring tremendous efforts from medical experts. Artificial intelligence has become a popular tool for the automatic processing of medical data, acting as a supportive tool for doctors. However, the machine learning models used to build these tools are highly dependent on the da…

    Submitted 25 April, 2022; v1 submitted 29 June, 2021; originally announced July 2021.

  3. arXiv:2005.13178  [pdf]

    cs.CV cs.LG eess.IV

    Generative Adversarial Networks (GANs): An Overview of Theoretical Model, Evaluation Metrics, and Recent Developments

    Authors: Pegah Salehi, Abdolah Chalechale, Maryam Taghizadeh

    Abstract: One of the most significant challenges in statistical signal processing and machine learning is how to obtain a generative model that can produce samples of large-scale data distributions, such as images and speech. The Generative Adversarial Network (GAN) is an effective method to address this problem. GANs provide an appropriate way to learn deep representations without widespread use of labele…

    Submitted 27 May, 2020; originally announced May 2020.

    Comments: Submitted to a journal in the computer vision field

  4. arXiv:2002.00647  [pdf]

    eess.IV cs.CV

    Pix2Pix-based Stain-to-Stain Translation: A Solution for Robust Stain Normalization in Histopathology Images Analysis

    Authors: Pegah Salehi, Abdolah Chalechale

    Abstract: The diagnosis of cancer is mainly performed through visual analysis by pathologists, who examine the morphology of the tissue slices and the spatial arrangement of the cells. If the microscopic image of a specimen is not stained, it will look colorless and textured. Therefore, chemical staining is required to create contrast and help identify specific tissue components. During tissue preparat…

    Submitted 3 February, 2020; originally announced February 2020.

    Comments: 7 pages, 6 figures, 4 tables. The 11th Iranian and the first International Conference on Machine Vision and Image Processing (MVIP 2020)