Skip to main content

Showing 1–19 of 19 results for author: Fuerst, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2507.00983  [pdf, ps, other

    eess.IV cs.CV

    DMCIE: Diffusion Model with Concatenation of Inputs and Errors to Improve the Accuracy of the Segmentation of Brain Tumors in MRI Images

    Authors: Sara Yavari, Rahul Nitin Pandya, Jacob Furst

    Abstract: Accurate segmentation of brain tumors in MRI scans is essential for reliable clinical diagnosis and effective treatment planning. Recently, diffusion models have demonstrated remarkable effectiveness in image generation and segmentation tasks. This paper introduces a novel approach to corrective segmentation based on diffusion models. We propose DMCIE (Diffusion Model with Concatenation of Inputs… ▽ More

    Submitted 1 July, 2025; originally announced July 2025.

  2. arXiv:2505.07917  [pdf, other

    cs.IR cs.AI cs.DB cs.LG

    Efficient and Reproducible Biomedical Question Answering using Retrieval Augmented Generation

    Authors: Linus Stuhlmann, Michael Alexander Saxer, Jonathan Fürst

    Abstract: Biomedical question-answering (QA) systems require effective retrieval and generation components to ensure accuracy, efficiency, and scalability. This study systematically examines a Retrieval-Augmented Generation (RAG) system for biomedical QA, evaluating retrieval strategies and response time trade-offs. We first assess state-of-the-art retrieval methods, including BM25, BioBERT, MedCPT, and a h… ▽ More

    Submitted 12 May, 2025; originally announced May 2025.

    Comments: Accepted at SDS25

  3. arXiv:2504.20033  [pdf, other

    cs.CV

    Mitigating Catastrophic Forgetting in the Incremental Learning of Medical Images

    Authors: Sara Yavari, Jacob Furst

    Abstract: This paper proposes an Incremental Learning (IL) approach to enhance the accuracy and efficiency of deep learning models in analyzing T2-weighted (T2w) MRI medical images prostate cancer detection using the PI-CAI dataset. We used multiple health centers' artificial intelligence and radiology data, focused on different tasks that looked at prostate cancer detection using MRI (PI-CAI). We utilized… ▽ More

    Submitted 28 April, 2025; originally announced April 2025.

    Comments: 15 Pages, 3 Figures, 3 Tables, 1 Algorithm, This paper will be updated

    ACM Class: I.2.6; I.2.10

  4. arXiv:2502.18179  [pdf, other

    cs.CL cs.AI

    Problem Solved? Information Extraction Design Space for Layout-Rich Documents using LLMs

    Authors: Gaye Colakoglu, Gürkan Solmaz, Jonathan Fürst

    Abstract: This paper defines and explores the design space for information extraction (IE) from layout-rich documents using large language models (LLMs). The three core challenges of layout-aware IE with LLMs are 1) data structuring, 2) model engagement, and 3) output refinement. Our study delves into the sub-problems within these core challenges, such as input representation, chunking, prompting, and selec… ▽ More

    Submitted 25 February, 2025; originally announced February 2025.

  5. arXiv:2412.18428  [pdf, other

    cs.AI cs.CL

    Explainable Multi-Modal Data Exploration in Natural Language via LLM Agent

    Authors: Farhad Nooralahzadeh, Yi Zhang, Jonathan Furst, Kurt Stockinger

    Abstract: International enterprises, organizations, or hospitals collect large amounts of multi-modal data stored in databases, text documents, images, and videos. While there has been recent progress in the separate fields of multi-modal data exploration as well as in database systems that automatically translate natural language questions to database query languages, the research challenge of querying dat… ▽ More

    Submitted 24 December, 2024; originally announced December 2024.

  6. arXiv:2411.05521  [pdf, other

    cs.DB cs.AI

    SM3-Text-to-Query: Synthetic Multi-Model Medical Text-to-Query Benchmark

    Authors: Sithursan Sivasubramaniam, Cedric Osei-Akoto, Yi Zhang, Kurt Stockinger, Jonathan Fuerst

    Abstract: Electronic health records (EHRs) are stored in various database systems with different database models on heterogeneous storage architectures, such as relational databases, document stores, or graph databases. These different database models have a big impact on query complexity and performance. While this has been a known fact in database research, its implications for the growing number of Text-… ▽ More

    Submitted 14 November, 2024; v1 submitted 8 November, 2024; originally announced November 2024.

    Comments: NeurIPS 2024 Track Datasets and Benchmarks

  7. arXiv:2409.18596  [pdf, ps, other

    cs.AI cs.CL cs.LG

    ASAG2024: A Combined Benchmark for Short Answer Grading

    Authors: Gérôme Meyer, Philip Breuer, Jonathan Fürst

    Abstract: Open-ended questions test a more thorough understanding than closed-ended questions and are often a preferred assessment method. However, open-ended questions are tedious to grade and subject to personal bias. Therefore, there have been efforts to speed up the grading process through automation. Short Answer Grading (SAG) systems aim to automatically score students' answers. Despite growth in SAG… ▽ More

    Submitted 27 September, 2024; originally announced September 2024.

    Comments: Accepted at SIGCSE-Virtual 2024

  8. arXiv:2409.10776  [pdf, other

    cs.DL cond-mat.mtrl-sci physics.chem-ph

    Research evolution of metal organic frameworks: A scientometric approach with human-in-the-loop

    Authors: Xintong Zhao, Kyle Langlois, Jacob Furst, Yuan An, Xiaohua Hu, Diego Gomez Gualdron, Fernando Uribe-Romo, Jane Greenberg

    Abstract: This paper reports on a scientometric analysis bolstered by human in the loop, domain experts, to examine the field of metal organic frameworks (MOFs) research. Scientometric analyses reveal the intellectual landscape of a field. The study engaged MOF scientists in the design and review of our research workflow. MOF materials are an essential component in next generation renewable energy storage a… ▽ More

    Submitted 16 September, 2024; originally announced September 2024.

  9. arXiv:2404.07663  [pdf, other

    cs.DB cs.AI cs.LG

    Interactive Ontology Matching with Cost-Efficient Learning

    Authors: Bin Cheng, Jonathan Fürst, Tobias Jacobs, Celia Garrido-Hidalgo

    Abstract: The creation of high-quality ontologies is crucial for data integration and knowledge-based reasoning, specifically in the context of the rising data economy. However, automatic ontology matchers are often bound to the heuristics they are based on, leaving many matches unidentified. Interactive ontology matching systems involving human experts have been introduced, but they do not solve the fundam… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

  10. arXiv:2402.08349  [pdf, other

    cs.DB cs.AI cs.CL

    Evaluating the Data Model Robustness of Text-to-SQL Systems Based on Real User Queries

    Authors: Jonathan Fürst, Catherine Kosten, Farhad Nooralahzadeh, Yi Zhang, Kurt Stockinger

    Abstract: Text-to-SQL systems (also known as NL-to-SQL systems) have become an increasingly popular solution for bridging the gap between user capabilities and SQL-based data access. These systems translate user requests in natural language to valid SQL statements for a specific database. Recent Text-to-SQL systems have benefited from the rapid improvement of transformer-based language models. However, whil… ▽ More

    Submitted 29 November, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

  11. arXiv:2310.12417  [pdf, other

    cs.DL

    Metadata for Scientific Experiment Reporting: A Case Study in Metal-Organic Frameworks

    Authors: Xintong Zhao, Kyle Langlois, Jacob Furst, Scott McClellan, Xiaohua Hu, Yuan An, Diego A. Gómez-Gualdrón, Fernando J. Uribe-Romo, Jane Greenberg

    Abstract: Research methods and procedures are core aspects of the research process. Metadata focused on these components is critical to supporting the FAIR principles, particularly reproducibility. The research reported on in this paper presents a methodological framework for metadata documentation supporting the reproducibility of research producing Metal Organic Frameworks (MOFs). The MOF case study invol… ▽ More

    Submitted 18 October, 2023; originally announced October 2023.

    Comments: Accepted by the 17th International Conference on Metadata and Semantics Research

  12. arXiv:2309.11361  [pdf, other

    cs.AI

    Knowledge Graph Question Answering for Materials Science (KGQA4MAT): Developing Natural Language Interface for Metal-Organic Frameworks Knowledge Graph (MOF-KG) Using LLM

    Authors: Yuan An, Jane Greenberg, Alex Kalinowski, Xintong Zhao, Xiaohua Hu, Fernando J. Uribe-Romo, Kyle Langlois, Jacob Furst, Diego A. Gómez-Gualdrón

    Abstract: We present a comprehensive benchmark dataset for Knowledge Graph Question Answering in Materials Science (KGQA4MAT), with a focus on metal-organic frameworks (MOFs). A knowledge graph for metal-organic frameworks (MOF-KG) has been constructed by integrating structured databases and knowledge extracted from the literature. To enhance MOF-KG accessibility for domain experts, we aim to develop a natu… ▽ More

    Submitted 6 June, 2024; v1 submitted 20 September, 2023; originally announced September 2023.

    Comments: In 17th International Conference on Metadata and Semantics Research, October 2023

  13. arXiv:2209.14454  [pdf

    cs.CV cs.AI

    CompNet: A Designated Model to Handle Combinations of Images and Designed features

    Authors: Bowen Qiu, Daniela Raicu, Jacob Furst, Roselyne Tchoua

    Abstract: Convolutional neural networks (CNNs) are one of the most popular models of Artificial Neural Networks (ANN)s in Computer Vision (CV). A variety of CNN-based structures were developed by researchers to solve problems like image classification, object detection, and image similarity measurement. Although CNNs have shown their value in most cases, they still have a downside: they easily overfit when… ▽ More

    Submitted 28 September, 2022; originally announced September 2022.

  14. arXiv:2207.04502  [pdf, other

    cs.AI

    Building Open Knowledge Graph for Metal-Organic Frameworks (MOF-KG): Challenges and Case Studies

    Authors: Yuan An, Jane Greenberg, Xintong Zhao, Xiaohua Hu, Scott McCLellan, Alex Kalinowski, Fernando J. Uribe-Romo, Kyle Langlois, Jacob Furst, Diego A. Gómez-Gualdrón, Fernando Fajardo-Rojas, Katherine Ardila

    Abstract: Metal-Organic Frameworks (MOFs) are a class of modular, porous crystalline materials that have great potential to revolutionize applications such as gas storage, molecular separations, chemical sensing, catalysis, and drug delivery. The Cambridge Structural Database (CSD) reports 10,636 synthesized MOF crystals which in addition contains ca. 114,373 MOF-like structures. The sheer number of synthes… ▽ More

    Submitted 29 November, 2023; v1 submitted 10 July, 2022; originally announced July 2022.

    Comments: Accepted by the International Workshop on Knowledge Graphs and Open Knowledge Network (OKN'22) Co-located with the 28th ACM SIGKDD Conference

  15. arXiv:2205.10900  [pdf, other

    cs.CV cs.AI

    Visual Explanations from Deep Networks via Riemann-Stieltjes Integrated Gradient-based Localization

    Authors: Mirtha Lucas, Miguel Lerma, Jacob Furst, Daniela Raicu

    Abstract: Neural networks are becoming increasingly better at tasks that involve classifying and recognizing images. At the same time techniques intended to explain the network output have been proposed. One such technique is the Gradient-based Class Activation Map (Grad-CAM), which is able to locate features of an input image at various levels of a convolutional neural network (CNN), but is sensitive to th… ▽ More

    Submitted 22 May, 2022; originally announced May 2022.

    Comments: 16 pages, 33 figures

    MSC Class: 68T45 ACM Class: I.2.m; I.4.m

  16. arXiv:2005.12848  [pdf, other

    eess.SP cs.NI eess.SY

    Group-In: Group Inference from Wireless Traces of Mobile Devices

    Authors: Gürkan Solmaz, Jonathan Fürst, Samet Aytaç, Fang-Jing Wu

    Abstract: This paper proposes Group-In, a wireless scanning system to detect static or mobile people groups in indoor or outdoor environments. Group-In collects only wireless traces from the Bluetooth-enabled mobile devices for group inference. The key problem addressed in this work is to detect not only static groups but also moving groups with a multi-phased approach based only noisy wireless Received Sig… ▽ More

    Submitted 10 June, 2020; v1 submitted 26 May, 2020; originally announced May 2020.

    Comments: This work has been funded by the EU Horizon 2020 Programme under Grant Agreements No. 731993 AUTOPILOT and No.871249 LOCUS projects. The content of this paper does not reflect the official opinion of the EU. Responsibility for the information and views expressed therein lies entirely with the authors. Proc. of ACM/IEEE IPSN'20, 2020

  17. arXiv:1907.08278  [pdf, other

    cs.DC

    Fog Function: Serverless Fog Computing for Data Intensive IoT Services

    Authors: Bin Cheng, Jonathan Fürst, Gurkan Solmaz, Takuya Sanada

    Abstract: Fog computing can support IoT services with fast response time and low bandwidth usage by moving computation from the cloud to edge devices. However, existing fog computing frameworks have limited flexibility to support dynamic service composition with a data-oriented approach. Function-as-a-Service (FaaS) is a promising programming model for fog computing to enhance flexibility, but the current e… ▽ More

    Submitted 18 July, 2019; originally announced July 2019.

  18. arXiv:1904.12676  [pdf, other

    cs.DC cs.PF

    Reinforcement Learning Based Orchestration for Elastic Services

    Authors: M. Fadel Argerich, B. Cheng, J. Fürst

    Abstract: Due to the highly variable execution context in which edge services run, adapting their behavior to the execution context is crucial to comply with their requirements. However, adapting service behavior is a challenging task because it is hard to anticipate the execution contexts in which it will be deployed, as well as assessing the impact that each behavior change will produce. In order to provi… ▽ More

    Submitted 26 April, 2019; originally announced April 2019.

    Comments: 2019 IEEE 5th World Forum on Internet of Things (WF-IoT), 6 pages

  19. arXiv:1807.02608  [pdf

    cs.LG cs.CV stat.ML

    Synthetic Sampling for Multi-Class Malignancy Prediction

    Authors: Matthew Yung, Eli T. Brown, Alexander Rasin, Jacob D. Furst, Daniela S. Raicu

    Abstract: We explore several oversampling techniques for an imbalanced multi-label classification problem, a setting often encountered when developing models for Computer-Aided Diagnosis (CADx) systems. While most CADx systems aim to optimize classifiers for overall accuracy without considering the relative distribution of each class, we look into using synthetic sampling to increase per-class performance w… ▽ More

    Submitted 6 July, 2018; originally announced July 2018.

    Comments: 5 pages, 3 figures, 4 Tables, KDD MLMH'18 Workshop