Skip to main content

Showing 1–37 of 37 results for author: Wong, T

Searching in archive eess. Search in all archives.
.
  1. arXiv:2505.08414  [pdf

    eess.IV cs.CV

    An integrated language-vision foundation model for conversational diagnostics and triaging in primary eye care

    Authors: Zhi Da Soh, Yang Bai, Kai Yu, Yang Zhou, Xiaofeng Lei, Sahil Thakur, Zann Lee, Lee Ching Linette Phang, Qingsheng Peng, Can Can Xue, Rachel Shujuan Chong, Quan V. Hoang, Lavanya Raghavan, Yih Chung Tham, Charumathi Sabanayagam, Wei-Chi Wu, Ming-Chih Ho, Jiangnan He, Preeti Gupta, Ecosse Lamoureux, Seang Mei Saw, Vinay Nangia, Songhomitra Panda-Jonas, Jie Xu, Ya Xing Wang , et al. (6 additional authors not shown)

    Abstract: Current deep learning models are mostly task specific and lack a user-friendly interface to operate. We present Meta-EyeFM, a multi-function foundation model that integrates a large language model (LLM) with vision foundation models (VFMs) for ocular disease assessment. Meta-EyeFM leverages a routing mechanism to enable accurate task-specific analysis based on text queries. Using Low Rank Adaptati… ▽ More

    Submitted 13 May, 2025; originally announced May 2025.

  2. arXiv:2502.09473  [pdf, other

    cs.LG eess.SP

    Learning to Predict Global Atrial Fibrillation Dynamics from Sparse Measurements

    Authors: Alexander Jenkins, Andrea Cini, Joseph Barker, Alexander Sharp, Arunashis Sau, Varun Valentine, Srushti Valasang, Xinyang Li, Tom Wong, Timothy Betts, Danilo Mandic, Cesare Alippi, Fu Siong Ng

    Abstract: Catheter ablation of Atrial Fibrillation (AF) consists of a one-size-fits-all treatment with limited success in persistent AF. This may be due to our inability to map the dynamics of AF with the limited resolution and coverage provided by sequential contact mapping catheters, preventing effective patient phenotyping for personalised, targeted ablation. Here we introduce FibMap, a graph recurrent n… ▽ More

    Submitted 14 February, 2025; v1 submitted 13 February, 2025; originally announced February 2025.

    Comments: Under review

  3. Automating Urban Soundscape Enhancements with AI: In-situ Assessment of Quality and Restorativeness in Traffic-Exposed Residential Areas

    Authors: Bhan Lam, Zhen-Ting Ong, Kenneth Ooi, Wen-Hui Ong, Trevor Wong, Karn N. Watcharasupat, Vanessa Boey, Irene Lee, Joo Young Hong, Jian Kang, Kar Fye Alvin Lee, Georgios Christopoulos, Woon-Seng Gan

    Abstract: Formalized in ISO 12913, the "soundscape" approach is a paradigmatic shift towards perception-based urban sound management, aiming to alleviate the substantial socioeconomic costs of noise pollution to advance the United Nations Sustainable Development Goals. Focusing on traffic-exposed outdoor residential sites, we implemented an automatic masker selection system (AMSS) utilizing natural sounds t… ▽ More

    Submitted 8 October, 2024; v1 submitted 8 July, 2024; originally announced July 2024.

    Comments: 41 pages, 4 figures. Preprint submitted to Building and Environment

    Journal ref: Building and Environment, vol. 266, p. 112106, Dec. 2024

  4. arXiv:2402.08788  [pdf

    cs.CL cs.SD eess.AS

    Syllable based DNN-HMM Cantonese Speech to Text System

    Authors: Timothy Wong, Claire Li, Sam Lam, Billy Chiu, Qin Lu, Minglei Li, Dan Xiong, Roy Shing Yu, Vincent T. Y. Ng

    Abstract: This paper reports our work on building up a Cantonese Speech-to-Text (STT) system with a syllable based acoustic model. This is a part of an effort in building a STT system to aid dyslexic students who have cognitive deficiency in writing skills but have no problem expressing their ideas through speech. For Cantonese speech recognition, the basic unit of acoustic models can either be the conventi… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: 7 pages, 3 figures, LREC 2016

    MSC Class: 94-06 ACM Class: I.2.7

  5. arXiv:2311.14065  [pdf, other

    eess.SP

    3D Printed Discrete Dielectric Lens With Improved Matching Layers

    Authors: Juan Andrés Vásquez-Peralvo, José Manuel Fernández-González, Thomas Wong

    Abstract: This paper presents a non-zoned discrete dielectric lens comprising two or three matching layers to reduce the 50-110 GHz frequency range reflections. Based on Chebyshev and binomial multi-section transformers, the designed models use matching layers at the top and bottom. In addition, the presented designs use pins instead of the conventional slots for the matching layers, thus easing the manufac… ▽ More

    Submitted 23 November, 2023; originally announced November 2023.

  6. VisionFM: a Multi-Modal Multi-Task Vision Foundation Model for Generalist Ophthalmic Artificial Intelligence

    Authors: Jianing Qiu, Jian Wu, Hao Wei, Peilun Shi, Minqing Zhang, Yunyun Sun, Lin Li, Hanruo Liu, Hongyi Liu, Simeng Hou, Yuyang Zhao, Xuehui Shi, Junfang Xian, Xiaoxia Qu, Sirui Zhu, Lijie Pan, Xiaoniao Chen, Xiaojia Zhang, Shuai Jiang, Kebing Wang, Chenlong Yang, Mingqiang Chen, Sujie Fan, Jianhua Hu, Aiguo Lv , et al. (17 additional authors not shown)

    Abstract: We present VisionFM, a foundation model pre-trained with 3.4 million ophthalmic images from 560,457 individuals, covering a broad range of ophthalmic diseases, modalities, imaging devices, and demography. After pre-training, VisionFM provides a foundation to foster multiple ophthalmic artificial intelligence (AI) applications, such as disease screening and diagnosis, disease prognosis, subclassifi… ▽ More

    Submitted 7 October, 2023; originally announced October 2023.

    Journal ref: The latest VisionFM work has been published in NEJM AI, 2024

  7. arXiv:2310.03884  [pdf, other

    cs.IT cs.LG eess.SP math.DG stat.ML

    Information Geometry for the Working Information Theorist

    Authors: Kumar Vijay Mishra, M. Ashok Kumar, Ting-Kam Leonard Wong

    Abstract: Information geometry is a study of statistical manifolds, that is, spaces of probability distributions from a geometric perspective. Its classical information-theoretic applications relate to statistical concepts such as Fisher information, sufficient statistics, and efficient estimators. Today, information geometry has emerged as an interdisciplinary field that finds applications in diverse areas… ▽ More

    Submitted 5 October, 2023; originally announced October 2023.

    Comments: 12 pages, 3 figures, 1 table

  8. arXiv:2308.07767  [pdf, other

    eess.AS cs.SD

    Preliminary investigation of the short-term in situ performance of an automatic masker selection system

    Authors: Bhan Lam, Zhen-Ting Ong, Kenneth Ooi, Wen-Hui Ong, Trevor Wong, Karn N. Watcharasupat, Woon-Seng Gan

    Abstract: Soundscape augmentation or "masking" introduces wanted sounds into the acoustic environment to improve acoustic comfort. Usually, the masker selection and playback strategies are either arbitrary or based on simple rules (e.g. -3 dBA), which may lead to sub-optimal increment or even reduction in acoustic comfort for dynamic acoustic environments. To reduce ambiguity in the selection of maskers, an… ▽ More

    Submitted 15 August, 2023; originally announced August 2023.

    Comments: paper submitted to the 52nd International Congress and Exposition on Noise Control Engineering held in Chiba, Greater Tokyo, Japan, on 20-23 August 2023 (Inter-Noise 2023)

    ACM Class: J.2; J.4

  9. Taming Reversible Halftoning via Predictive Luminance

    Authors: Cheuk-Kit Lau, Menghan Xia, Tien-Tsin Wong

    Abstract: Traditional halftoning usually drops colors when dithering images with binary dots, which makes it difficult to recover the original color information. We proposed a novel halftoning technique that converts a color image into a binary halftone with full restorability to its original version. Our novel base halftoning technique consists of two convolutional neural networks (CNNs) to produce the rev… ▽ More

    Submitted 7 February, 2024; v1 submitted 14 June, 2023; originally announced June 2023.

    Comments: published in IEEE Transactions on Visualization and Computer Graphics

  10. arXiv:2306.04114  [pdf, other

    cs.CV eess.IV

    Manga Rescreening with Interpretable Screentone Representation

    Authors: Minshan Xie, Chengze Li, Tien-Tsin Wong

    Abstract: The process of adapting or repurposing manga pages is a time-consuming task that requires manga artists to manually work on every single screentone region and apply new patterns to create novel screentones across multiple panels. To address this issue, we propose an automatic manga rescreening pipeline that aims to minimize the human effort involved in manga adaptation. Our pipeline automatically… ▽ More

    Submitted 6 June, 2023; originally announced June 2023.

    Comments: 10 pages, 11 figures

  11. arXiv:2207.12899  [pdf, other

    eess.AS cs.SD

    Assessment of a cost-effective headphone calibration procedure for soundscape evaluations

    Authors: Bhan Lam, Kenneth Ooi, Zhen-Ting Ong, Karn N. Watcharasupat, Trevor Wong, Woon-Seng Gan

    Abstract: To increase the availability and adoption of the soundscape standard, a low-cost calibration procedure for reproduction of audio stimuli over headphones was proposed as part of the global ``Soundscape Attributes Translation Project'' (SATP) for validating ISO/TS~12913-2:2018 perceived affective quality (PAQ) attribute translations. A previous preliminary study revealed significant deviations from… ▽ More

    Submitted 24 July, 2022; originally announced July 2022.

    Comments: For 24th International Congress on Acoustics

    Journal ref: in Proc. 24th Int. Congr. Acoust., 2022, pp. 1-8

  12. arXiv:2207.09221  [pdf, other

    eess.AS stat.AP

    Do uHear? Validation of uHear App for Preliminary Screening of Hearing Ability in Soundscape Studies

    Authors: Zhen-Ting Ong, Bhan Lam, Kenneth Ooi, Karn N. Watcharasupat, Trevor Wong, Woon-Seng Gan

    Abstract: Studies involving soundscape perception often exclude participants with hearing loss to prevent impaired perception from affecting experimental results. Participants are typically screened with pure tone audiometry, the "gold standard" for identifying and quantifying hearing loss at specific frequencies, and excluded if a study-dependent threshold is not met. However, procuring professional audiom… ▽ More

    Submitted 16 July, 2022; originally announced July 2022.

    Comments: Full paper submitted to 24th International Congress on Acoustics

  13. arXiv:2206.15445  [pdf, other

    eess.IV cs.CV

    Asymmetry Disentanglement Network for Interpretable Acute Ischemic Stroke Infarct Segmentation in Non-Contrast CT Scans

    Authors: Haomiao Ni, Yuan Xue, Kelvin Wong, John Volpi, Stephen T. C. Wong, James Z. Wang, Xiaolei Huang

    Abstract: Accurate infarct segmentation in non-contrast CT (NCCT) images is a crucial step toward computer-aided acute ischemic stroke (AIS) assessment. In clinical practice, bilateral symmetric comparison of brain hemispheres is usually used to locate pathological abnormalities. Recent research has explored asymmetries to assist with AIS segmentation. However, most previous symmetry-based work mixed differ… ▽ More

    Submitted 30 June, 2022; originally announced June 2022.

    Comments: MICCAI 2022

  14. arXiv:2206.01741  [pdf, other

    eess.IV cs.CV

    Patcher: Patch Transformers with Mixture of Experts for Precise Medical Image Segmentation

    Authors: Yanglan Ou, Ye Yuan, Xiaolei Huang, Stephen T. C. Wong, John Volpi, James Z. Wang, Kelvin Wong

    Abstract: We present a new encoder-decoder Vision Transformer architecture, Patcher, for medical image segmentation. Unlike standard Vision Transformers, it employs Patcher blocks that segment an image into large patches, each of which is further divided into small patches. Transformers are applied to the small patches within a large patch, which constrains the receptive field of each pixel. We intentionall… ▽ More

    Submitted 29 May, 2023; v1 submitted 3 June, 2022; originally announced June 2022.

    Comments: MICCAI 2022

  15. arXiv:2205.04728  [pdf, other

    eess.AS cs.SD

    Preliminary assessment of a cost-effective headphone calibration procedure for soundscape evaluations

    Authors: Bhan Lam, Kenneth Ooi, Karn N. Watcharasupat, Zhen-Ting Ong, Yun-Ting Lau, Trevor Wong, Woon-Seng Gan

    Abstract: The introduction of ISO 12913-2:2018 has provided a framework for standardized data collection and reporting procedures for soundscape practitioners. A strong emphasis was placed on the use of calibrated head and torso simulators (HATS) for binaural audio capture to obtain an accurate subjective impression and acoustic measure of the soundscape under evaluation. To auralise the binaural recordings… ▽ More

    Submitted 10 May, 2022; originally announced May 2022.

    Comments: Submitted to the 28th International Congress on Sound and Vibration

  16. arXiv:2204.13890  [pdf, other

    eess.AS cs.SD eess.SY

    Deployment of an IoT System for Adaptive In-Situ Soundscape Augmentation

    Authors: Trevor Wong, Karn N. Watcharasupat, Bhan Lam, Kenneth Ooi, Zhen-Ting Ong, Furi Andi Karnapi, Woon-Seng Gan

    Abstract: Soundscape augmentation is an emerging approach for noise mitigation by introducing additional sounds known as "maskers" to increase acoustic comfort. Traditionally, the choice of maskers is often predicated on expert guidance or post-hoc analysis which can be time-consuming and sometimes arbitrary. Moreover, this often results in a static set of maskers that are inflexible to the dynamic nature o… ▽ More

    Submitted 29 April, 2022; originally announced April 2022.

    Comments: To be presented at the 51st International Congress and Exposition on Noise Control Engineering

    Journal ref: INTER-NOISE and NOISE-CON Congress and Conference Proceedings, Feb. 2022, vol. 265, no. 5, pp. 2013-2021

  17. arXiv:2204.13883  [pdf, other

    eess.AS cs.LG cs.SD

    Autonomous In-Situ Soundscape Augmentation via Joint Selection of Masker and Gain

    Authors: Karn N. Watcharasupat, Kenneth Ooi, Bhan Lam, Trevor Wong, Zhen-Ting Ong, Woon-Seng Gan

    Abstract: The selection of maskers and playback gain levels in a soundscape augmentation system is crucial to its effectiveness in improving the overall acoustic comfort of a given environment. Traditionally, the selection of appropriate maskers and gain levels has been informed by expert opinion, which may not representative of the target population, or by listening tests, which can be time-consuming and l… ▽ More

    Submitted 23 July, 2022; v1 submitted 29 April, 2022; originally announced April 2022.

    Comments: Accepted to IEEE Signal Processing Letters. (c) 2022 IEEE

    Journal ref: IEEE Signal Processing Letters, Vol. 29, pp. 1749 - 1753, 2022

  18. arXiv:2203.09860  [pdf, other

    eess.IV cs.CV

    Pseudo Bias-Balanced Learning for Debiased Chest X-ray Classification

    Authors: Luyang Luo, Dunyuan Xu, Hao Chen, Tien-Tsin Wong, Pheng-Ann Heng

    Abstract: Deep learning models were frequently reported to learn from shortcuts like dataset biases. As deep learning is playing an increasingly important role in the modern healthcare system, it is of great need to combat shortcut learning in medical data as well as develop unbiased and trustworthy models. In this paper, we study the problem of developing debiased chest X-ray diagnosis models from the bias… ▽ More

    Submitted 4 August, 2022; v1 submitted 18 March, 2022; originally announced March 2022.

    Comments: To appear in MICCAI 2022. Code available at https://github.com/LLYXC/PBBL

  19. arXiv:2201.12576  [pdf, other

    cs.CV eess.IV

    Scale-arbitrary Invertible Image Downscaling

    Authors: Jinbo Xing, Wenbo Hu, Tien-Tsin Wong

    Abstract: Conventional social media platforms usually downscale the HR images to restrict their resolution to a specific size for saving transmission/storage cost, which leads to the super-resolution (SR) being highly ill-posed. Recent invertible image downscaling methods jointly model the downscaling/upscaling problems and achieve significant improvements. However, they only consider fixed integer scale fa… ▽ More

    Submitted 9 March, 2022; v1 submitted 29 January, 2022; originally announced January 2022.

  20. arXiv:2110.04491  [pdf, other

    eess.IV cs.CV

    Invertible Tone Mapping with Selectable Styles

    Authors: Zhuming Zhang, Menghan Xia, Xueting Liu, Chengze Li, Tien-Tsin Wong

    Abstract: Although digital cameras can acquire high-dynamic range (HDR) images, the captured HDR information are mostly quantized to low-dynamic range (LDR) images for display compatibility and compact storage. In this paper, we propose an invertible tone mapping method that converts the multi-exposure HDR to a true LDR (8-bit per color channel) and reserves the capability to accurately restore the original… ▽ More

    Submitted 9 October, 2021; originally announced October 2021.

  21. arXiv:2109.08311  [pdf, other

    eess.IV cs.CV cs.LG

    Adaptive Hierarchical Dual Consistency for Semi-Supervised Left Atrium Segmentation on Cross-Domain Data

    Authors: Jun Chen, Heye Zhang, Raad Mohiaddin, Tom Wong, David Firmin, Jennifer Keegan, Guang Yang

    Abstract: Semi-supervised learning provides great significance in left atrium (LA) segmentation model learning with insufficient labelled data. Generalising semi-supervised learning to cross-domain data is of high importance to further improve model robustness. However, the widely existing distribution difference and sample mismatch between different data domains hinder the generalisation of semi-supervised… ▽ More

    Submitted 20 September, 2021; v1 submitted 16 September, 2021; originally announced September 2021.

  22. Multimodal Breast Lesion Classification Using Cross-Attention Deep Networks

    Authors: Hung Q. Vo, Pengyu Yuan, Tiancheng He, Stephen T. C. Wong, Hien V. Nguyen

    Abstract: Accurate breast lesion risk estimation can significantly reduce unnecessary biopsies and help doctors decide optimal treatment plans. Most existing computer-aided systems rely solely on mammogram features to classify breast lesions. While this approach is convenient, it does not fully exploit useful information in clinical reports to achieve the optimal performance. Would clinical features signifi… ▽ More

    Submitted 21 August, 2021; originally announced August 2021.

  23. arXiv:2105.06830  [pdf, other

    eess.IV cs.CV

    Exploiting Aliasing for Manga Restoration

    Authors: Minshan Xie, Menghan Xia, Tien-Tsin Wong

    Abstract: As a popular entertainment art form, manga enriches the line drawings details with bitonal screentones. However, manga resources over the Internet usually show screentone artifacts because of inappropriate scanning/rescaling resolution. In this paper, we propose an innovative two-stage method to restore quality bitonal manga from degraded ones. Our key observation is that the aliasing induced by d… ▽ More

    Submitted 14 May, 2021; originally announced May 2021.

  24. arXiv:2105.04260  [pdf, other

    cs.CR eess.SY

    EPICTWIN: An Electric Power Digital Twin for Cyber Security Testing, Research and Education

    Authors: Nandha Kumar Kandasamy, Sarad Venugopalan, Tin Kit Wong, Leu Junming Nicholas

    Abstract: Cyber-Physical Systems (CPS) rely on advanced communication and control technologies to efficiently manage devices and the flow of information in the system. However, a wide variety of potential security challenges has emerged due to the evolution of critical infrastructures (CI) from siloed sub-systems into connected and integrated networks. This is also the case for CI such as a smart grid. Smar… ▽ More

    Submitted 10 May, 2021; originally announced May 2021.

  25. arXiv:2105.00234  [pdf, other

    eess.IV cs.CV

    JAS-GAN: Generative Adversarial Network Based Joint Atrium and Scar Segmentations on Unbalanced Atrial Targets

    Authors: Jun Chen, Guang Yang, Habib Khan, Heye Zhang, Yanping Zhang, Shu Zhao, Raad Mohiaddin, Tom Wong, David Firmin, Jennifer Keegan

    Abstract: Automated and accurate segmentations of left atrium (LA) and atrial scars from late gadolinium-enhanced cardiac magnetic resonance (LGE CMR) images are in high demand for quantifying atrial scars. The previous quantification of atrial scars relies on a two-phase segmentation for LA and atrial scars due to their large volume difference (unbalanced atrial targets). In this paper, we propose an inter… ▽ More

    Submitted 1 May, 2021; originally announced May 2021.

    Comments: Accepted by IEEE Journal of Biomedical and Health Informatics

    MSC Class: 68T01

  26. arXiv:2104.13917  [pdf, other

    eess.IV cs.CV

    LambdaUNet: 2.5D Stroke Lesion Segmentation of Diffusion-weighted MR Images

    Authors: Yanglan Ou, Ye Yuan, Xiaolei Huang, Kelvin Wong, John Volpi, James Z. Wang, Stephen T. C. Wong

    Abstract: Diffusion-weighted (DW) magnetic resonance imaging is essential for the diagnosis and treatment of ischemic stroke. DW images (DWIs) are usually acquired in multi-slice settings where lesion areas in two consecutive 2D slices are highly discontinuous due to large slice thickness and sometimes even slice gaps. Therefore, although DWIs contain rich 3D information, they cannot be treated as regular 3… ▽ More

    Submitted 29 May, 2023; v1 submitted 28 April, 2021; originally announced April 2021.

  27. arXiv:2011.05142  [pdf

    cs.CV cs.AI cs.LG eess.IV

    Multi-modal, multi-task, multi-attention (M3) deep learning detection of reticular pseudodrusen: towards automated and accessible classification of age-related macular degeneration

    Authors: Qingyu Chen, Tiarnan D. L. Keenan, Alexis Allot, Yifan Peng, Elvira Agrón, Amitha Domalpally, Caroline C. W. Klaver, Daniel T. Luttikhuizen, Marcus H. Colyer, Catherine A. Cukras, Henry E. Wiley, M. Teresa Magone, Chantal Cousineau-Krieger, Wai T. Wong, Yingying Zhu, Emily Y. Chew, Zhiyong Lu

    Abstract: Objective Reticular pseudodrusen (RPD), a key feature of age-related macular degeneration (AMD), are poorly detected by human experts on standard color fundus photography (CFP) and typically require advanced imaging modalities such as fundus autofluorescence (FAF). The objective was to develop and evaluate the performance of a novel 'M3' deep learning framework on RPD detection. Materials and Meth… ▽ More

    Submitted 11 November, 2020; v1 submitted 8 November, 2020; originally announced November 2020.

    Comments: 5 figures and 4 tables, To appear in Journal of the American Medical Informatics Association

  28. Mononizing Binocular Videos

    Authors: Wenbo Hu, Menghan Xia, Chi-Wing Fu, Tien-Tsin Wong

    Abstract: This paper presents the idea ofmono-nizingbinocular videos and a frame-work to effectively realize it. Mono-nize means we purposely convert abinocular video into a regular monocular video with the stereo informationimplicitly encoded in a visual but nearly-imperceptible form. Hence, wecan impartially distribute and show the mononized video as an ordinarymonocular video. Unlike ordinary monocular v… ▽ More

    Submitted 2 September, 2020; originally announced September 2020.

    Comments: 16 pages, 17 figures. Accepted in Siggraph Asia 2020

    Journal ref: ACM Transactions on Graphics (SIGGRAPH Asia 2020 issue)

  29. arXiv:2007.09550  [pdf, other

    eess.IV cs.CV

    Predicting risk of late age-related macular degeneration using deep learning

    Authors: Yifan Peng, Tiarnan D. Keenan, Qingyu Chen, Elvira Agrón, Alexis Allot, Wai T. Wong, Emily Y. Chew, Zhiyong Lu

    Abstract: By 2040, age-related macular degeneration (AMD) will affect approximately 288 million people worldwide. Identifying individuals at high risk of progression to late AMD, the sight-threatening stage, is critical for clinical actions, including medical interventions and timely monitoring. Although deep learning has shown promise in diagnosing/screening AMD using color fundus photographs, it remains d… ▽ More

    Submitted 18 July, 2020; originally announced July 2020.

    Comments: Accepted by npj Digital Medicine

  30. arXiv:2002.00440  [pdf

    eess.IV cs.CV

    Simultaneous Left Atrium Anatomy and Scar Segmentations via Deep Learning in Multiview Information with Attention

    Authors: Guang Yang, Jun Chen, Zhifan Gao, Shuo Li, Hao Ni, Elsa Angelini, Tom Wong, Raad Mohiaddin, Eva Nyktari, Ricardo Wage, Lei Xu, Yanping Zhang, Xiuquan Du, Heye Zhang, David Firmin, Jennifer Keegan

    Abstract: Three-dimensional late gadolinium enhanced (LGE) cardiac MR (CMR) of left atrial scar in patients with atrial fibrillation (AF) has recently emerged as a promising technique to stratify patients, to guide ablation therapy and to predict treatment success. This requires a segmentation of the high intensity scar tissue and also a segmentation of the left atrium (LA) anatomy, the latter usually being… ▽ More

    Submitted 2 February, 2020; originally announced February 2020.

    Comments: 34 pages, 10 figures, 7 tables, accepted by Future Generation Computer Systems journal

  31. arXiv:1911.00417  [pdf, other

    cs.SD cs.LG eess.AS

    Long-distance Detection of Bioacoustic Events with Per-channel Energy Normalization

    Authors: Vincent Lostanlen, Kaitlin Palmer, Elly Knight, Christopher Clark, Holger Klinck, Andrew Farnsworth, Tina Wong, Jason Cramer, Juan Pablo Bello

    Abstract: This paper proposes to perform unsupervised detection of bioacoustic events by pooling the magnitudes of spectrogram frames after per-channel energy normalization (PCEN). Although PCEN was originally developed for speech recognition, it also has beneficial effects in enhancing animal vocalizations, despite the presence of atmospheric absorption and intermittent noise. We prove that PCEN generalize… ▽ More

    Submitted 1 November, 2019; originally announced November 2019.

    Comments: 5 pages, 3 figures. Presented at the 3rd International Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE). 25--26 October 2019, New York, NY, USA

  32. arXiv:1907.10267  [pdf, other

    cs.LG cs.CV eess.IV stat.ML

    Discriminative Consistent Domain Generation for Semi-supervised Learning

    Authors: Jun Chen, Heye Zhang, Yanping Zhang, Shu Zhao, Raad Mohiaddin, Tom Wong, David Firmin, Guang Yang, Jennifer Keegan

    Abstract: Deep learning based task systems normally rely on a large amount of manually labeled training data, which is expensive to obtain and subject to operator variations. Moreover, it does not always hold that the manually labeled data and the unlabeled data are sitting in the same distribution. In this paper, we alleviate these problems by proposing a discriminative consistent domain generation (DCDG)… ▽ More

    Submitted 24 July, 2019; originally announced July 2019.

    Comments: MICCAI 2019

  33. arXiv:1907.01377  [pdf, other

    cs.CV cs.LG eess.IV

    Training Auto-encoder-based Optimizers for Terahertz Image Reconstruction

    Authors: Tak Ming Wong, Matthias Kahl, Peter Haring Bolívar, Andreas Kolb, Michael Möller

    Abstract: Terahertz (THz) sensing is a promising imaging technology for a wide variety of different applications. Extracting the interpretable and physically meaningful parameters for such applications, however, requires solving an inverse problem in which a model function determined by these parameters needs to be fitted to the measured data. Since the underlying optimization problem is nonconvex and very… ▽ More

    Submitted 29 October, 2019; v1 submitted 2 July, 2019; originally announced July 2019.

    Comments: This is a pre-print of a conference paper published in German Conference on Pattern Recognition (GCPR) 2019

    Journal ref: Pattern Recognition. DAGM GCPR 2019. Lecture Notes in Computer Science, vol 11824. Springer, Cham

  34. arXiv:1906.03153  [pdf, other

    eess.IV cs.CV

    A deep learning approach for automated detection of geographic atrophy from color fundus photographs

    Authors: Tiarnan D. Keenan, Shazia Dharssi, Yifan Peng, Qingyu Chen, Elvira Agrón, Wai T. Wong, Zhiyong Lu, Emily Y. Chew

    Abstract: Purpose: To assess the utility of deep learning in the detection of geographic atrophy (GA) from color fundus photographs; secondary aim to explore potential utility in detecting central GA (CGA). Design: A deep learning model was developed to detect the presence of GA in color fundus photographs, and two additional models to detect CGA in different scenarios. Participants: 59,812 color fundus pho… ▽ More

    Submitted 7 June, 2019; originally announced June 2019.

    Comments: Accepted for publication in Ophthalmology

  35. arXiv:1807.03710  [pdf, other

    cs.LG cs.AI cs.NE eess.SP stat.ML

    Recurrent Auto-Encoder Model for Large-Scale Industrial Sensor Signal Analysis

    Authors: Timothy Wong, Zhiyuan Luo

    Abstract: Recurrent auto-encoder model summarises sequential data through an encoder structure into a fixed-length vector and then reconstructs the original sequence through the decoder structure. The summarised vector can be used to represent time series features. In this paper, we propose relaxing the dimensionality of the decoder output so that it performs partial reconstruction. The fixed-length vector… ▽ More

    Submitted 10 July, 2018; originally announced July 2018.

    Comments: Accepted paper at the 19th International Conference on Engineering Applications of Neural Networks (EANN 2018)

    Journal ref: E. Pimenidis and C. Jayne (Eds.): EANN 2018, CCIS 893

  36. arXiv:1806.04597  [pdf, other

    cs.CV eess.IV

    Multiview Two-Task Recursive Attention Model for Left Atrium and Atrial Scars Segmentation

    Authors: Jun Chen, Guang Yang, Zhifan Gao, Hao Ni, Elsa Angelini, Raad Mohiaddin, Tom Wong, Yanping Zhang, Xiuquan Du, Heye Zhang, Jennifer Keegan, David Firmin

    Abstract: Late Gadolinium Enhanced Cardiac MRI (LGE-CMRI) for detecting atrial scars in atrial fibrillation (AF) patients has recently emerged as a promising technique to stratify patients, guide ablation therapy and predict treatment success. Visualisation and quantification of scar tissues require a segmentation of both the left atrium (LA) and the high intensity scar regions from LGE-CMRI images. These t… ▽ More

    Submitted 12 June, 2018; originally announced June 2018.

    Comments: 8 pages, 4 figures, accepted by MICCAI 2018

  37. Computational Image Enhancement for Frequency Modulated Continuous Wave (FMCW) THz Image

    Authors: Tak Ming Wong, Matthias Kahl, Peter Haring Bolívar, Andreas Kolb

    Abstract: In this paper, a novel method to enhance Frequency Modulated Continuous Wave (FMCW) THz imaging resolution beyond its diffraction limit is proposed. Our method comprises two stages. Firstly, we reconstruct the signal in depth-direction using a sinc-envelope, yielding a significant improvement in depth estimation and signal parameter extraction. The resulting high precision depth estimate is used t… ▽ More

    Submitted 2 July, 2019; v1 submitted 15 February, 2018; originally announced February 2018.

    Comments: This is a pre-print of an article published in Journal of Infrared, Millimeter, and Terahertz Waves. The final authenticated version is available online at: https://doi.org/10.1007/s10762-019-00609-w

    Journal ref: Journal of Infrared, Millimeter, and Terahertz Waves (2019)