Skip to main content

Showing 1–10 of 10 results for author: Smirnova, N

Searching in archive cs. Search in all archives.
.
  1. Annotating Scientific Uncertainty: A comprehensive model using linguistic patterns and comparison with existing approaches

    Authors: Panggih Kusuma Ningrum, Philipp Mayr, Nina Smirnova, Iana Atanassova

    Abstract: UnScientify, a system designed to detect scientific uncertainty in scholarly full text. The system utilizes a weakly supervised technique to identify verbally expressed uncertainty in scientific texts and their authorial references. The core methodology of UnScientify is based on a multi-faceted pipeline that integrates span pattern matching, complex sentence analysis and author reference checking… ▽ More

    Submitted 14 March, 2025; originally announced March 2025.

    Comments: Paper Accepted for Publication in the Journal of Informetrics (2025)

    Journal ref: 2025

  2. arXiv:2408.13501  [pdf

    cs.CL cs.IR

    Utilizing Large Language Models for Named Entity Recognition in Traditional Chinese Medicine against COVID-19 Literature: Comparative Study

    Authors: Xu Tong, Nina Smirnova, Sharmila Upadhyaya, Ran Yu, Jack H. Culbert, Chao Sun, Wolfgang Otto, Philipp Mayr

    Abstract: Objective: To explore and compare the performance of ChatGPT and other state-of-the-art LLMs on domain-specific NER tasks covering different entity types and domains in TCM against COVID-19 literature. Methods: We established a dataset of 389 articles on TCM against COVID-19, and manually annotated 48 of them with 6 types of entities belonging to 3 domains as the ground truth, against which the NE… ▽ More

    Submitted 24 August, 2024; originally announced August 2024.

    Comments: 22 pages with 2 figures

    ACM Class: H.3.3

  3. arXiv:2406.03067  [pdf, other

    cs.DL

    Automatically detecting scientific political science texts from a large general document index

    Authors: Nina Smirnova

    Abstract: This technical report outlines the filtering approach applied to the collection of the Bielefeld Academic Search Engine (BASE) data to extract articles from the political science domain. We combined hard and soft filters to address entries with different available metadata, e.g. title, abstract or keywords. The hard filter is a weighted keyword-based approach. The soft filter uses a multilingual B… ▽ More

    Submitted 24 June, 2024; v1 submitted 5 June, 2024; originally announced June 2024.

  4. arXiv:2404.17171  [pdf

    cs.DL

    The rise of Indo-German collaborative research: 1990-2022

    Authors: Aasif Ahmad Mir, Nina Smirnova, Jeyshankar Ramalingam, Philipp Mayr

    Abstract: The study aims to highlight the growth and development of Indo-German collaborative research over the past three decades. Moreover, this study encompasses an in-depth examination of funding acknowledgements to gain valuable insights into the financial support that underpins these collaborative endeavors. Together with this paper, we provide an openly accessible dataset of Indo-German research arti… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

    Comments: 37 pages, 9 figures, accepted paper Global Knowledge, Memory and Communication

  5. arXiv:2403.17618  [pdf, other

    cs.DL

    Technical Report: Incorporating Blogs in Pollux

    Authors: Tobias Holtdirk, Nina Smirnova

    Abstract: This technical report describes the incorporation of political blogs into Pollux, the Specialised Information Service (FID) for Political Science in Germany. Considering the widespread use of political blogs in political science research, we decided to include them in the Pollux search system to enhance the available information infrastructure. We describe the crawling and analyzing of the blogs a… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: 6 pages, 6 figures, technical report

  6. The Emergence of Preprints: Comparing Publishing Behaviour in the Global South and the Global North

    Authors: Kristin Biesenbender, Nina Smirnova, Philipp Mayr, Isabella Peters

    Abstract: Purpose: The recent proliferation of preprints could be a way for researchers worldwide to increase the availability and visibility of their research findings. Against the background of rising publication costs caused by the increasing prevalence of article processing fees, the search for other ways to publish research results besides traditional journal publication may increase. This could be esp… ▽ More

    Submitted 8 August, 2023; originally announced August 2023.

    Journal ref: Online Information Review 2024

  7. arXiv:2307.13377  [pdf, other

    cs.DL cs.CL cs.IR

    Embedding Models for Supervised Automatic Extraction and Classification of Named Entities in Scientific Acknowledgements

    Authors: Nina Smirnova, Philipp Mayr

    Abstract: Acknowledgments in scientific papers may give an insight into aspects of the scientific community, such as reward systems, collaboration patterns, and hidden research trends. The aim of the paper is to evaluate the performance of different embedding models for the task of automatic extraction and classification of acknowledged entities from the acknowledgment text in scientific papers. We trained… ▽ More

    Submitted 25 July, 2023; originally announced July 2023.

    Comments: The present paper is an extended version of the article Evaluation of Embedding Models for Automatic Extraction and Classification of Acknowledged Entities in Scientific Documents (Smirnova and Mayr, 2022) presented at the 3rd Workshop on Extraction and Evaluation of Knowledge Entities from Scientific Documents (EEKE2022). arXiv admin note: substantial text overlap with arXiv:2206.10939

    ACM Class: J.4; J.5; I.5.1; H.3.3; I.2.7

  8. A Comprehensive Analysis of Acknowledgement Texts in Web of Science: a case study on four scientific domains

    Authors: Nina Smirnova, Philipp Mayr

    Abstract: Analysis of acknowledgments is particularly interesting as acknowledgments may give information not only about funding, but they are also able to reveal hidden contributions to authorship and the researcher's collaboration patterns, context in which research was conducted, and specific aspects of the academic work. The focus of the present research is the analysis of a large sample of acknowledgem… ▽ More

    Submitted 18 October, 2022; originally announced October 2022.

    Comments: 30 pages, 14 figures, accepted in Scientometrics

    Journal ref: Scientometrics 128, (2023)

  9. arXiv:2206.10939  [pdf, other

    cs.CL cs.DL

    Evaluation of Embedding Models for Automatic Extraction and Classification of Acknowledged Entities in Scientific Documents

    Authors: Nina Smirnova, Philipp Mayr

    Abstract: Acknowledgments in scientific papers may give an insight into aspects of the scientific community, such as reward systems, collaboration patterns, and hidden research trends. The aim of the paper is to evaluate the performance of different embedding models for the task of automatic extraction and classification of acknowledged entities from the acknowledgment text in scientific papers. We trained… ▽ More

    Submitted 22 June, 2022; originally announced June 2022.

    Comments: Accepted workshop paper at EEKE2022 Workshop(JCDL2022)

  10. arXiv:1908.08266  [pdf, other

    cs.SE cs.DS

    Interactive Duplicate Search in Software Documentation

    Authors: D. V. Luciv, D. V. Koznov, A. A. Shelikhovskii, K. Yu. Romanovsky, G. A. Chernishev, A. N. Terekhov, D. A. Grigoriev, A. N. Smirnova, D. V. Borovkov, A. I. Vasenina

    Abstract: Various software features such as classes, methods, requirements, and tests often have similar functionality. This can lead to emergence of duplicates in their descriptive documentation. Uncontrolled duplicates created via copy/paste hinder the process of documentation maintenance. Therefore, the task of duplicate detection in software documentation is of importance. Solving it makes planned reuse… ▽ More

    Submitted 22 August, 2019; originally announced August 2019.

    MSC Class: 68N30 ACM Class: D.2.6; I.7.0