
Showing 1–6 of 6 results for author: Tran, D T

Searching in archive eess.
  1. arXiv:2505.20745  [pdf, ps, other]

    cs.SD cs.LG eess.AS

    Foundation Model Hidden Representations for Heart Rate Estimation from Auscultation

    Authors: Jingping Nie, Dung T. Tran, Karan Thakkar, Vasudha Kowtha, Jon Huang, Carlos Avendano, Erdrin Azemi, Vikramjit Mitra

    Abstract: Auscultation, particularly heart sound, is a non-invasive technique that provides essential vital sign information. Recently, self-supervised acoustic representation foundation models (FMs) have been proposed to offer insights into acoustics-based vital signs. However, there has been little exploration of the extent to which auscultation is encoded in these pre-trained FM representations. In this…

    Submitted 29 May, 2025; v1 submitted 27 May, 2025; originally announced May 2025.

    Comments: 5 pages, Interspeech 2025 conference

  2. arXiv:2503.22711  [pdf, other]

    cs.SD cs.AI cs.LG eess.AS

    Modeling speech emotion with label variance and analyzing performance across speakers and unseen acoustic conditions

    Authors: Vikramjit Mitra, Amrit Romana, Dung T. Tran, Erdrin Azemi

    Abstract: Spontaneous speech emotion data usually contain perceptual grades, where graders assign an emotion score after listening to the speech files. Such perceptual grades introduce uncertainty in labels due to variation in grader opinion. Grader variation is addressed by using consensus grades as ground truth, where the emotion with the highest vote is selected. Consensus grades fail to consider ambiguous insta…

    Submitted 24 March, 2025; originally announced March 2025.

    Comments: 11 pages, 5 figures

  3. arXiv:2501.03848  [pdf, other]

    eess.IV cs.CV

    Semise: Semi-supervised learning for severity representation in medical image

    Authors: Dung T. Tran, Hung Vu, Anh Tran, Hieu Pham, Hong Nguyen, Phong Nguyen

    Abstract: This paper introduces SEMISE, a novel method for representation learning in medical imaging that combines self-supervised and supervised learning. By leveraging both labeled and augmented data, SEMISE addresses the challenge of data scarcity and enhances the encoder's ability to extract meaningful features. This integrated approach leads to more informative representations, improving performance o…

    Submitted 7 January, 2025; originally announced January 2025.

    Comments: Accepted for presentation at the 2025 IEEE 22nd International Symposium on Biomedical Imaging (ISBI)

  4. Building a temperature forecasting model for the city with the regression neural network (RNN)

    Authors: Nguyen Phuc Tran, Duy Thanh Tran, Thi Thuy Nga Duong

    Abstract: In recent years, a study by environmental organizations in the world and Vietnam shows that weather change is quite complex. Global warming has become a serious problem in the modern world and is a concern for scientists. In the last century, it was difficult to forecast the weather because of missing weather monitoring stations and technological limitations. This made it hard to collect data for building…

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: 6 pages

    Journal ref: The 6th International Conference for Small & Medium Business in 2020 (ICSMB 2020)

  5. Recognition of Defective Mineral Wool Using Pruned ResNet Models

    Authors: Mehdi Rafiei, Dat Thanh Tran, Alexandros Iosifidis

    Abstract: Mineral wool production is a non-linear process that makes it hard to control the final quality. Therefore, having a non-destructive method to analyze the product quality and recognize defective products is critical. For this purpose, we developed a visual quality control system for mineral wool. X-ray images of wool specimens were collected to create a training set of defective and non-defective…

    Submitted 1 November, 2022; originally announced November 2022.

    Comments: 6 pages, 5 figures, 3 tables. Submitted to IEEE Transactions on Industrial Informatics

  6. arXiv:2109.01184  [pdf, other]

    cs.CV eess.IV

    Remote Multilinear Compressive Learning with Adaptive Compression

    Authors: Dat Thanh Tran, Moncef Gabbouj, Alexandros Iosifidis

    Abstract: Multilinear Compressive Learning (MCL) is an efficient signal acquisition and learning paradigm for multidimensional signals. The level of signal compression affects the detection or classification performance of an MCL model, with higher compression rates often associated with lower inference accuracy. However, higher compression rates are more amenable to a wider range of applications, especially…

    Submitted 2 September, 2021; originally announced September 2021.

    Comments: 2 figures, 6 tables