Showing 1–21 of 21 results for author: Teo, J

  1. arXiv:2501.16077

    cs.CL

    RelCAT: Advancing Extraction of Clinical Inter-Entity Relationships from Unstructured Electronic Health Records

    Authors: Shubham Agarwal, Vlad Dinu, Thomas Searle, Mart Ratas, Anthony Shek, Dan F. Stein, James Teo, Richard Dobson

    Abstract: This study introduces RelCAT (Relation Concept Annotation Toolkit), an interactive tool, library, and workflow designed to classify relations between entities extracted from clinical narratives. Building upon the CogStack MedCAT framework, RelCAT addresses the challenge of capturing complete clinical relations dispersed within text. The toolkit implements state-of-the-art machine learning models s…

    Submitted 27 January, 2025; originally announced January 2025.

  2. arXiv:2412.10848

    cs.CL cs.AI cs.LG

    Large Language Models for Medical Forecasting -- Foresight 2

    Authors: Zeljko Kraljevic, Joshua Au Yeung, Daniel Bean, James Teo, Richard J. Dobson

    Abstract: Foresight 2 (FS2) is a large language model fine-tuned on hospital data for modelling patient timelines (GitHub 'removed for anon'). It can understand patients' clinical notes and predict SNOMED codes for a wide range of biomedical use cases, including diagnosis suggestions, risk forecasting, and procedure and medication recommendations. FS2 is trained on the free text portion of the MIMIC-III dat…

    Submitted 14 December, 2024; originally announced December 2024.

  3. Unboxing Virgil ADTs for Fun and Profit

    Authors: Bradley Wei Jie Teo, Ben L. Titzer

    Abstract: Algebraic Data Types (ADTs) are an increasingly common feature in modern programming languages. In many implementations, values of non-nullary, multi-case ADTs are allocated on the heap, which may reduce performance and increase memory usage. This work explores annotation-guided optimizations to ADT representation in Virgil, a systems-level programming language that compiles to x86, x86-64, Wasm a…

    Submitted 14 October, 2024; originally announced October 2024.

  4. arXiv:2408.17181

    cs.CL

    Improving Extraction of Clinical Event Contextual Properties from Electronic Health Records: A Comparative Study

    Authors: Shubham Agarwal, Thomas Searle, Mart Ratas, Anthony Shek, James Teo, Richard Dobson

    Abstract: Electronic Health Records are large repositories of valuable clinical data, with a significant portion stored in unstructured text format. This textual data includes clinical events (e.g., disorders, symptoms, findings, medications and procedures) in context that if extracted accurately at scale can unlock valuable downstream applications such as disease prediction. Using an existing Named Entity…

    Submitted 30 August, 2024; originally announced August 2024.

  5. DHR+S: Distributed Hybrid Rendering with Realistic Real-time Shadows for Interactive Thin Client Metaverse and Game Applications

    Authors: Yu Wei Tan, Siang Ern Low, Jonas Chow, Javon Teo, Anand Bhojan

    Abstract: Distributed hybrid rendering (DHR) is a real-time rendering approach that incorporates cloud-based ray tracing with locally rasterized graphics for interactive thin client metaverse and game applications. With cloud assistance, DHR can generate high-fidelity ray-traced graphics contents remotely and deliver them to thin clients with low graphics capability, including standalone extended reality de…

    Submitted 11 June, 2024; originally announced June 2024.

    MSC Class: 68U05; ACM Class: I.3

  6. arXiv:2404.04025

    cs.CV q-bio.QM

    Framework to generate perfusion map from CT and CTA images in patients with acute ischemic stroke: A longitudinal and cross-sectional study

    Authors: Chayanin Tangwiriyasakul, Pedro Borges, Stefano Moriconi, Paul Wright, Yee-Haur Mah, James Teo, Parashkev Nachev, Sebastien Ourselin, M. Jorge Cardoso

    Abstract: Stroke is a leading cause of disability and death. Effective treatment decisions require early and informative vascular imaging. 4D perfusion imaging is ideal but rarely available within the first hour after stroke, whereas plain CT and CTA usually are. Hence, we propose a framework to extract a predicted perfusion map (PPM) derived from CT and CTA images. In all eighteen patients, we found signif…

    Submitted 5 April, 2024; originally announced April 2024.

    Comments: Accepted and presented in SWITCH2023: Stroke Workshop on Imaging and Treatment CHallenges (MICCAI 2023, Vancouver Canada)

  7. arXiv:2403.07973

    cs.PL

    Flexible Non-intrusive Dynamic Instrumentation for WebAssembly

    Authors: Ben L. Titzer, Elizabeth Gilbert, Bradley Wei Jie Teo, Yash Anand, Kazuyuki Takayama, Heather Miller

    Abstract: A key strength of managed runtimes over hardware is the ability to gain detailed insight into the dynamic execution of programs with instrumentation. Analyses such as code coverage, execution frequency, tracing, and debugging, are all made easier in a virtual setting. As a portable, low-level bytecode, WebAssembly offers inexpensive in-process sandboxing with high performance. Yet to date, Wasm en…

    Submitted 12 March, 2024; originally announced March 2024.

    ACM Class: D.3.4

  8. arXiv:2310.04468

    cs.CL cs.AI

    Validating transformers for redaction of text from electronic health records in real-world healthcare

    Authors: Zeljko Kraljevic, Anthony Shek, Joshua Au Yeung, Ewart Jonathan Sheldon, Mohammad Al-Agil, Haris Shuaib, Xi Bai, Kawsar Noor, Anoop D. Shah, Richard Dobson, James Teo

    Abstract: Protecting patient privacy in healthcare records is a top priority, and redaction is a commonly used method for obscuring directly identifiable information in text. Rule-based methods have been widely used, but their precision is often low causing over-redaction of text and frequently not being adaptable enough for non-standardised or unconventional structures of personal health information. Deep…

    Submitted 5 October, 2023; originally announced October 2023.

  9. arXiv:2307.03777

    cs.CV

    Unsupervised 3D out-of-distribution detection with latent diffusion models

    Authors: Mark S. Graham, Walter Hugo Lopez Pinaya, Paul Wright, Petru-Daniel Tudosiu, Yee H. Mah, James T. Teo, H. Rolf Jäger, David Werring, Parashkev Nachev, Sebastien Ourselin, M. Jorge Cardoso

    Abstract: Methods for out-of-distribution (OOD) detection that scale to 3D data are crucial components of any real-world clinical deep learning system. Classic denoising diffusion probabilistic models (DDPMs) have been recently proposed as a robust way to perform reconstruction-based OOD detection on 2D datasets, but do not trivially scale to 3D data. In this work, we propose to use Latent Diffusion Models…

    Submitted 7 July, 2023; originally announced July 2023.

    Comments: Accepted at MICCAI 2023

  10. arXiv:2212.08072

    cs.CL cs.AI cs.LG

    Foresight -- Generative Pretrained Transformer (GPT) for Modelling of Patient Timelines using EHRs

    Authors: Zeljko Kraljevic, Dan Bean, Anthony Shek, Rebecca Bendayan, Harry Hemingway, Joshua Au Yeung, Alexander Deng, Alfie Baston, Jack Ross, Esther Idowu, James T Teo, Richard J Dobson

    Abstract: Background: Electronic Health Records hold detailed longitudinal information about each patient's health status and general clinical history, a large portion of which is stored within the unstructured text. Existing approaches focus mostly on structured data and a subset of single-domain outcomes. We explore how temporal modelling of patients from free text and structured data, using deep generati…

    Submitted 24 January, 2023; v1 submitted 13 December, 2022; originally announced December 2022.

  11. Discharge Summary Hospital Course Summarisation of In Patient Electronic Health Record Text with Clinical Concept Guided Deep Pre-Trained Transformer Models

    Authors: Thomas Searle, Zina Ibrahim, James Teo, Richard Dobson

    Abstract: Brief Hospital Course (BHC) summaries are succinct summaries of an entire hospital encounter, embedded within discharge summaries, written by senior clinicians responsible for the overall care of a patient. Methods to automatically produce summaries from inpatient documentation would be invaluable in reducing clinician manual burden of summarising documents under high time-pressure to admit and di…

    Submitted 10 April, 2023; v1 submitted 14 November, 2022; originally announced November 2022.

  12. arXiv:2206.03461

    cs.CV eess.IV q-bio.QM

    Fast Unsupervised Brain Anomaly Detection and Segmentation with Diffusion Models

    Authors: Walter H. L. Pinaya, Mark S. Graham, Robert Gray, Pedro F Da Costa, Petru-Daniel Tudosiu, Paul Wright, Yee H. Mah, Andrew D. MacKinnon, James T. Teo, Rolf Jager, David Werring, Geraint Rees, Parashkev Nachev, Sebastien Ourselin, M. Jorge Cardoso

    Abstract: Deep generative models have emerged as promising tools for detecting arbitrary anomalies in data, dispensing with the necessity for manual labelling. Recently, autoregressive transformers have achieved state-of-the-art performance for anomaly detection in medical imaging. Nonetheless, these models still have some intrinsic weaknesses, such as requiring images to be modelled as 1D sequences, the ac…

    Submitted 7 June, 2022; originally announced June 2022.

  13. arXiv:2205.10650

    cs.CV cs.LG

    Transformer-based out-of-distribution detection for clinically safe segmentation

    Authors: Mark S Graham, Petru-Daniel Tudosiu, Paul Wright, Walter Hugo Lopez Pinaya, U Jean-Marie, Yee Mah, James Teo, Rolf H Jäger, David Werring, Parashkev Nachev, Sebastien Ourselin, M Jorge Cardoso

    Abstract: In a clinical setting it is essential that deployed image processing systems are robust to the full range of inputs they might encounter and, in particular, do not make confidently wrong predictions. The most popular approach to safe processing is to train networks that can provide a measure of their uncertainty, but these tend to fail for inputs that are far outside the training data distribution…

    Submitted 17 May, 2023; v1 submitted 21 May, 2022; originally announced May 2022.

    Comments: Accepted at MIDL 2022 (Oral)

  14. arXiv:2111.09061

    cs.LG cs.NI

    Exploring Unsupervised Learning Methods for Automated Protocol Analysis

    Authors: Arijit Dasgupta, Yi-Xue Yan, Clarence Ong, Jenn-Yue Teo, Chia-Wei Lim

    Abstract: The ability to analyse and differentiate network protocol traffic is crucial for network resource management to provide differentiated services by Telcos. Automated Protocol Analysis (APA) is crucial to significantly improve efficiency and reduce reliance on human experts. There are numerous automated state-of-the-art unsupervised methods for clustering unknown protocols in APA. However, many such…

    Submitted 17 November, 2021; originally announced November 2021.

    Comments: Accepted to the IEEE Symposium Series on Computational Intelligence (IEEE SSCI 2021)

  15. arXiv:2107.03134

    cs.CL

    MedGPT: Medical Concept Prediction from Clinical Narratives

    Authors: Zeljko Kraljevic, Anthony Shek, Daniel Bean, Rebecca Bendayan, James Teo, Richard Dobson

    Abstract: The data available in Electronic Health Records (EHRs) provides the opportunity to transform care, and the best way to provide better care for one patient is through learning from the data available on all other patients. Temporal modelling of a patient's medical history, which takes into account the sequence of past events, can be used to predict future events such as a diagnosis of a new disorde…

    Submitted 7 July, 2021; originally announced July 2021.

    Comments: 6 pages, 2 figures, 3 tables

  16. Estimating Redundancy in Clinical Text

    Authors: Thomas Searle, Zina Ibrahim, James Teo, Richard JB Dobson

    Abstract: The current mode of use of Electronic Health Record (EHR) elicits text redundancy. Clinicians often populate new documents by duplicating existing notes, then updating accordingly. Data duplication can lead to a propagation of errors, inconsistencies and misreporting of care. Therefore, quantifying information redundancy can play an essential role in evaluating innovations that operate on clinical…

    Submitted 26 October, 2021; v1 submitted 25 May, 2021; originally announced May 2021.

    Journal ref: JBI v124 (2021)

  17. arXiv:2011.09361

    cs.LG cs.CY

    A Knowledge Distillation Ensemble Framework for Predicting Short and Long-term Hospitalisation Outcomes from Electronic Health Records Data

    Authors: Zina M Ibrahim, Daniel Bean, Thomas Searle, Honghan Wu, Anthony Shek, Zeljko Kraljevic, James Galloway, Sam Norton, James T Teo, Richard JB Dobson

    Abstract: The ability to perform accurate prognosis of patients is crucial for proactive clinical decision making, informed resource management and personalised care. Existing outcome prediction models suffer from a low recall of infrequent positive outcomes. We present a highly-scalable and robust machine learning framework to automatically predict adversity represented by mortality and ICU admission from…

    Submitted 11 June, 2021; v1 submitted 18 November, 2020; originally announced November 2020.

    Comments: 14 pages

  18. arXiv:2010.01165

    cs.CL cs.AI cs.LG

    Multi-domain Clinical Natural Language Processing with MedCAT: the Medical Concept Annotation Toolkit

    Authors: Zeljko Kraljevic, Thomas Searle, Anthony Shek, Lukasz Roguski, Kawsar Noor, Daniel Bean, Aurelie Mascio, Leilei Zhu, Amos A Folarin, Angus Roberts, Rebecca Bendayan, Mark P Richardson, Robert Stewart, Anoop D Shah, Wai Keong Wong, Zina Ibrahim, James T Teo, Richard JB Dobson

    Abstract: Electronic health records (EHR) contain large volumes of unstructured text, requiring the application of Information Extraction (IE) technologies to enable clinical analysis. We present the open-source Medical Concept Annotation Toolkit (MedCAT) that provides: a) a novel self-supervised machine learning algorithm for extracting concepts using any concept vocabulary including UMLS/SNOMED-CT; b) a f…

    Submitted 25 March, 2021; v1 submitted 2 October, 2020; originally announced October 2020.

    Comments: Preprint: 27 Pages, 3 Figures

  19. Data-Driven Multi-Objective Controller Optimization for a Magnetically-Levitated Nanopositioning System

    Authors: Xiaocong Li, Haiyue Zhu, Jun Ma, Tat Joo Teo, Chek Sing Teo, Masayoshi Tomizuka, Tong Heng Lee

    Abstract: The performance achieved with traditional model-based control system design approaches typically relies heavily upon accurate modeling of the motion dynamics. However, modeling the true dynamics of present-day increasingly complex systems can be an extremely challenging task; and the usually necessary practical approximations often render the automation system to operate in a non-optimal condition…

    Submitted 6 July, 2020; originally announced July 2020.

    Comments: 11 pages, 11 figures, to appear in IEEE/ASME Transactions on Mechatronics

  20. arXiv:1708.08743

    cs.CY physics.ed-ph

    Massive Open and Online Courses and Open Education Resources in Singapore

    Authors: Victor Lim, Lawrence Wee, Jessica Teo, Shannalyn Ng

    Abstract: This paper looks at the increasing popularity of massive open and online courses (MOOCs) and open educational resources (OERs) offered in Singapore. Despite being a relatively new phenomenon, the Singapore government has collaborated with different organizations to improve the quality and accessibility of MOOCs, and many institutions of higher learning (IHLs) are spearheading efforts to improve OE…

    Submitted 15 August, 2017; originally announced August 2017.

    Comments: 13 pages, Southeast Asian Ministers of Education (SEAMEO) Journal, Journal of Southeast Asian Education | 2017 | Volume 1

  21. A Smart Cushion for Real-Time Heart Rate Monitoring

    Authors: Chacko John Deepu, Zhihao Chen, Ju Teng Teo, Soon Huat Ng, Xiefeng Yang, Yong Lian

    Abstract: This paper presents a smart cushion for real time heart rate monitoring. The cushion comprises of an integrated micro-bending fiber sensor, which records the BCG (Ballistocardiogram) signal without direct skin-electrode contact, and an optical transceiver that does signal amplification, digitization, and pre-filtering. To remove the artifacts and extract heart rate from BCG signal, a computational…

    Submitted 29 September, 2014; originally announced September 2014.

    Comments: 2012 IEEE Biomedical Circuits and Systems Conference