Showing 1–8 of 8 results for author: Michalopoulos, G

  1. arXiv:2305.17364  [pdf, other]

    cs.CL cs.AI cs.LG

    An Investigation of Evaluation Metrics for Automated Medical Note Generation

    Authors: Asma Ben Abacha, Wen-wai Yim, George Michalopoulos, Thomas Lin

    Abstract: Recent studies on automatic note generation have shown that doctors can save significant amounts of time when using automatic clinical note generation (Knoll et al., 2022). Summarization models have been used for this task to generate clinical notes as summaries of doctor-patient conversations (Krishna et al., 2021; Cai et al., 2022). However, assessing which model would best serve clinicians in t…

    Submitted 27 May, 2023; originally announced May 2023.

    Comments: Accepted to ACL Findings 2023

  2. arXiv:2204.10408  [pdf, other]

    cs.CL cs.LG

    ICDBigBird: A Contextual Embedding Model for ICD Code Classification

    Authors: George Michalopoulos, Michal Malyska, Nicola Sahar, Alexander Wong, Helen Chen

    Abstract: The International Classification of Diseases (ICD) system is the international standard for classifying diseases and procedures during a healthcare encounter and is widely used for healthcare reporting and management purposes. Assigning correct codes for clinical procedures is important for clinical, operational, and financial decision-making in healthcare. Contextual word embedding models have ac…

    Submitted 21 April, 2022; originally announced April 2022.

    Comments: 7 pages, 1 figure, accepted in BioNLP 2022

  3. arXiv:2109.01739  [pdf]

    cs.LG

    Cohort Characteristics and Factors Associated with Cannabis Use among Adolescents in Canada Using Pattern Discovery and Disentanglement Method

    Authors: Peiyuan Zhou, Andrew K. C. Wong, Yang Yang, Scott T. Leatherdale, Kate Battista, Zahid A. Butt, George Michalopoulos, Helen Chen

    Abstract: COMPASS is a longitudinal, prospective cohort study collecting data annually from students attending high school in jurisdictions across Canada. We aimed to discover significant frequent/rare associations of behavioral factors among Canadian adolescents related to cannabis use. We use a subset of the COMPASS dataset which contains 18,761 records of students in grades 9 to 12 with 31 selected features…

    Submitted 3 September, 2021; originally announced September 2021.

    Comments: 21 pages, 3 figures, 4 tables

  4. arXiv:2107.05132  [pdf, other]

    cs.LG

    LexSubCon: Integrating Knowledge from Lexical Resources into Contextual Embeddings for Lexical Substitution

    Authors: George Michalopoulos, Ian McKillop, Alexander Wong, Helen Chen

    Abstract: Lexical substitution is the task of generating meaningful substitutes for a word in a given textual context. Contextual word embedding models have achieved state-of-the-art results in the lexical substitution task by relying on contextual information extracted from the replaced word within the sentence. However, such models do not take into account structured knowledge that exists in external lexi…

    Submitted 31 March, 2022; v1 submitted 11 July, 2021; originally announced July 2021.

    Comments: 11 pages, 1 figure

  5. arXiv:2010.10391  [pdf, other]

    cs.CL cs.AI cs.LG

    UmlsBERT: Clinical Domain Knowledge Augmentation of Contextual Embeddings Using the Unified Medical Language System Metathesaurus

    Authors: George Michalopoulos, Yuanxin Wang, Hussam Kaka, Helen Chen, Alexander Wong

    Abstract: Contextual word embedding models, such as BioBERT and Bio_ClinicalBERT, have achieved state-of-the-art results in biomedical natural language processing tasks by focusing their pre-training process on domain-specific corpora. However, such models do not take into consideration expert domain knowledge. In this work, we introduced UmlsBERT, a contextual embedding model that integrates domain knowl…

    Submitted 3 June, 2021; v1 submitted 20 October, 2020; originally announced October 2020.

    Comments: 10 pages, 3 figures, accepted in NAACL 2021

  6. Where's the Question? A Multi-channel Deep Convolutional Neural Network for Question Identification in Textual Data

    Authors: George Michalopoulos, Helen Chen, Alexander Wong

    Abstract: In most clinical practice settings, there is no rigorous reviewing of the clinical documentation, resulting in inaccurate information captured in the patient medical records. The gold standard in clinical data capturing is achieved via "expert-review", where clinicians can have a dialogue with a domain expert (reviewers) and ask them questions about data entry rules. Automatically identifying "rea…

    Submitted 15 October, 2020; originally announced October 2020.

    Comments: 12 pages, 4 figures, to be published in The 3rd Clinical Natural Language Processing Workshop

  7. arXiv:2006.10208  [pdf, other]

    cs.LG cs.DB cs.IR stat.ML

    Record fusion: A learning approach

    Authors: Alireza Heidari, George Michalopoulos, Shrinu Kushagra, Ihab F. Ilyas, Theodoros Rekatsinas

    Abstract: Record fusion is the task of aggregating multiple records that correspond to the same real-world entity in a database. We can view record fusion as a machine learning problem where the goal is to predict the "correct" value for each attribute for each entity. Given a database, we use a combination of attribute-level, record-level, and database-level signals to construct a feature vector for each ce…

    Submitted 17 June, 2020; originally announced June 2020.

    Comments: 18 pages, 9 figures

  8. arXiv:1511.08303  [pdf, ps, other]

    cs.DS

    Engineering Oracles for Time-Dependent Road Networks

    Authors: Spyros Kontogiannis, George Michalopoulos, Georgia Papastavrou, Andreas Paraskevopoulos, Dorothea Wagner, Christos Zaroliagis

    Abstract: We implement and experimentally evaluate landmark-based oracles for min-cost paths in large-scale time-dependent road networks. We exploit parallelism and lossless compression, combined with a novel travel-time approximation technique, to severely reduce preprocessing space and time. We significantly improve the FLAT oracle, improving the previous query time by $30\%$ and doubling the Dijkstra-ran…

    Submitted 26 November, 2015; originally announced November 2015.

    Comments: In ALENEX 2016