FoodSEM: Large Language Model Specialized in Food Named-Entity Linking

Gjorgjevikj, Ana; Martinc, Matej; Cenikj, Gjorgjina; Džeroski, Sašo; Seljak, Barbara Koroušić; Eftimov, Tome

Computer Science > Computation and Language

arXiv:2509.22125 (cs)

[Submitted on 26 Sep 2025]

Title:FoodSEM: Large Language Model Specialized in Food Named-Entity Linking

Authors:Ana Gjorgjevikj, Matej Martinc, Gjorgjina Cenikj, Sašo Džeroski, Barbara Koroušić Seljak, Tome Eftimov

View PDF HTML (experimental)

Abstract:This paper introduces FoodSEM, a state-of-the-art fine-tuned open-source large language model (LLM) for named-entity linking (NEL) to food-related ontologies. To the best of our knowledge, food NEL is a task that cannot be accurately solved by state-of-the-art general-purpose (large) language models or custom domain-specific models/systems. Through an instruction-response (IR) scenario, FoodSEM links food-related entities mentioned in a text to several ontologies, including FoodOn, SNOMED-CT, and the Hansard taxonomy. The FoodSEM model achieves state-of-the-art performance compared to related models/systems, with F1 scores even reaching 98% on some ontologies and datasets. The presented comparative analyses against zero-shot, one-shot, and few-shot LLM prompting baselines further highlight FoodSEM's superior performance over its non-fine-tuned version. By making FoodSEM and its related resources publicly available, the main contributions of this article include (1) publishing a food-annotated corpora into an IR format suitable for LLM fine-tuning/evaluation, (2) publishing a robust model to advance the semantic understanding of text in the food domain, and (3) providing a strong baseline on food NEL for future benchmarking.

Comments:	To appear in the Proceedings of the 28th International Conference on Discovery Science (DS 2025)
Subjects:	Computation and Language (cs.CL); Information Retrieval (cs.IR)
Cite as:	arXiv:2509.22125 [cs.CL]
	(or arXiv:2509.22125v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2509.22125

Submission history

From: Tome Eftimov [view email]
[v1] Fri, 26 Sep 2025 09:47:35 UTC (673 KB)

Computer Science > Computation and Language

Title:FoodSEM: Large Language Model Specialized in Food Named-Entity Linking

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:FoodSEM: Large Language Model Specialized in Food Named-Entity Linking

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators