Showing 1–5 of 5 results for author: Pappas, D

  1. arXiv:2204.04711  [pdf, other]

    cs.CL cs.AI

    Data Augmentation for Biomedical Factoid Question Answering

    Authors: Dimitris Pappas, Prodromos Malakasiotis, Ion Androutsopoulos

    Abstract: We study the effect of seven data augmentation (da) methods in factoid question answering, focusing on the biomedical domain, where obtaining training instances is particularly difficult. We experiment with data from the BioASQ challenge, which we augment with training instances obtained from an artificial biomedical machine reading comprehension dataset, or via back-translation, information retri…

    Submitted 10 April, 2022; originally announced April 2022.

  2. arXiv:2106.08908  [pdf, other]

    cs.IR cs.LG

    A Neural Model for Joint Document and Snippet Ranking in Question Answering for Large Document Collections

    Authors: Dimitris Pappas, Ion Androutsopoulos

    Abstract: Question answering (QA) systems for large document collections typically use pipelines that (i) retrieve possibly relevant documents, (ii) re-rank them, (iii) rank paragraphs or other snippets of the top-ranked documents, and (iv) select spans of the top-ranked snippets as exact answers. Pipelines are conceptually simple, but errors propagate from one component to the next, without later component…

    Submitted 16 June, 2021; originally announced June 2021.

    Comments: 12 pages, 3 figures, 4 tables, ACL-IJCNLP 2021

    MSC Class: 68P20; 68P10; 68T50; 68T07

    ACM Class: H.3.3

  3. arXiv:2005.06376  [pdf, other]

    cs.CL cs.LG stat.ML

    BIOMRC: A Dataset for Biomedical Machine Reading Comprehension

    Authors: Petros Stavropoulos, Dimitris Pappas, Ion Androutsopoulos, Ryan McDonald

    Abstract: We introduce BIOMRC, a large-scale cloze-style biomedical MRC dataset. Care was taken to reduce noise, compared to the previous BIOREAD dataset of Pappas et al. (2018). Experiments show that simple heuristics do not perform well on the new dataset, and that two neural MRC models that had been tested on BIOREAD perform much better on BIOMRC, indicating that the new dataset is indeed less noisy or a…

    Submitted 13 May, 2020; originally announced May 2020.

    Comments: 10 pages, 4 figures, 5 tables

  4. arXiv:1906.05939  [pdf, other]

    cs.AI cs.CL

    Embedding Biomedical Ontologies by Jointly Encoding Network Structure and Textual Node Descriptors

    Authors: Sotiris Kotitsas, Dimitris Pappas, Ion Androutsopoulos, Ryan McDonald, Marianna Apidianaki

    Abstract: Network Embedding (NE) methods, which map network nodes to low-dimensional feature vectors, have wide applications in network analysis and bioinformatics. Many existing NE methods rely only on network structure, overlooking other information associated with the nodes, e.g., text describing the nodes. Recent attempts to combine the two sources of information only consider local network structure. W…

    Submitted 20 June, 2019; v1 submitted 13 June, 2019; originally announced June 2019.

    Comments: Proceedings of the 18th Workshop on Biomedical Natural Language Processing (BioNLP 2019) of the 57th Annual Meeting of the Association for Computational Linguistics (ACL 2019), Florence, Italy, 2019

  5. arXiv:1809.06366  [pdf, other]

    cs.IR cs.CL

    AUEB at BioASQ 6: Document and Snippet Retrieval

    Authors: Georgios-Ioannis Brokos, Polyvios Liosis, Ryan McDonald, Dimitris Pappas, Ion Androutsopoulos

    Abstract: We present AUEB's submissions to the BioASQ 6 document and snippet retrieval tasks (parts of Task 6b, Phase A). Our models use novel extensions to deep learning architectures that operate solely over the text of the query and candidate document/snippets. Our systems scored at the top or near the top for all batches of the challenge, highlighting the effectiveness of deep learning for these tasks.

    Submitted 15 September, 2018; originally announced September 2018.

    Comments: In Proceedings of the workshop BioASQ: Large-scale Biomedical Semantic Indexing and Question Answering, at the Conference on Empirical Methods in Natural Language Processing (EMNLP 2018), Brussels, Belgium, 2018. arXiv admin note: text overlap with arXiv:1809.01682