Skip to main content

Showing 1–11 of 11 results for author: Tkaczyk, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2501.03771  [pdf, other

    cs.DL

    Detection of metadata manipulations: Finding sneaked references in the scholarly literature

    Authors: Lonni Besançon, Guillaume Cabanac, Cyril Labbé, Alexander Magazinov, Jules di Scala, Dominika Tkaczyk, Kathryn Weber-Boer

    Abstract: We report evidence of a new set of sneaked references discovered in the scientific literature. Sneaked references are references registered in the metadata of publications without being listed in reference section or in the full text of the actual publications where they ought to be found. We document here 80,205 references sneaked in metadata of the International Journal of Innovative Science and… ▽ More

    Submitted 7 January, 2025; originally announced January 2025.

  2. arXiv:1912.10170  [pdf

    cs.CL cs.DL cs.IR cs.LG stat.ML

    NaïveRole: Author-Contribution Extraction and Parsing from Biomedical Manuscripts

    Authors: Dominika Tkaczyk, Andrew Collins, Joeran Beel

    Abstract: Information about the contributions of individual authors to scientific publications is important for assessing authors' achievements. Some biomedical publications have a short section that describes authors' roles and contributions. It is usually written in natural language and hence author contributions cannot be trivially extracted in a machine-readable format. In this paper, we present 1) A st… ▽ More

    Submitted 15 December, 2019; originally announced December 2019.

    Comments: arXiv admin note: substantial text overlap with arXiv:1802.01174

    Journal ref: 27th AIAI Irish Conference on Artificial Intelligence and Cognitive Science, 2019

  3. arXiv:1811.10369  [pdf

    cs.IR cs.CL cs.DL cs.LG

    ParsRec: A Novel Meta-Learning Approach to Recommending Bibliographic Reference Parsers

    Authors: Dominika Tkaczyk, Rohit Gupta, Riccardo Cinti, Joeran Beel

    Abstract: Bibliographic reference parsers extract machine-readable metadata such as author names, title, journal, and year from bibliographic reference strings. To extract the metadata, the parsers apply heuristics or machine learning. However, no reference parser, and no algorithm, consistently gives the best results in every scenario. For instance, one tool may be best in extracting titles in ACM citation… ▽ More

    Submitted 26 November, 2018; originally announced November 2018.

    Comments: Accepted at the 26th Irish Conference on Artificial Intelligence and Cognitive Science. This paper is an extended version of a poster published at the 12th ACM Conference on Recommender Systems, Proceedings of the 26th Irish Conference on Artificial Intelligence and Cognitive Science (AICS). Dublin, Ireland 2018

  4. arXiv:1811.02213  [pdf, ps, other

    cs.SE cs.CY cs.LG cs.RO

    Hybrid Approach to Automation, RPA and Machine Learning: a Method for the Human-centered Design of Software Robots

    Authors: Wiesław Kopeć, Marcin Skibiński, Cezary Biele, Kinga Skorupska, Dominika Tkaczyk, Anna Jaskulska, Katarzyna Abramczuk, Piotr Gago, Krzysztof Marasek

    Abstract: One of the more prominent trends within Industry 4.0 is the drive to employ Robotic Process Automation (RPA), especially as one of the elements of the Lean approach. The full implementation of RPA is riddled with challenges relating both to the reality of everyday business operations, from SMEs to SSCs and beyond, and the social effects of the changing job market. To successfully address these poi… ▽ More

    Submitted 6 November, 2018; originally announced November 2018.

    ACM Class: K.4.3; K.6.3; H.5.2; D.2.11

  5. arXiv:1808.09036  [pdf

    cs.IR

    ParsRec: Meta-Learning Recommendations for Bibliographic Reference Parsing

    Authors: Dominika Tkaczyk, Paraic Sheridan, Joeran Beel

    Abstract: Bibliographic reference parsers extract metadata (e.g. author names, title, year) from bibliographic reference strings. No reference parser consistently gives the best results in every scenario. For instance, one tool may be best in extracting titles, and another tool in extracting author names. In this paper, we address the problem of reference parsing from a recommender-systems perspective. We p… ▽ More

    Submitted 27 August, 2018; originally announced August 2018.

  6. arXiv:1805.12118  [pdf

    cs.IR

    One-at-a-time: A Meta-Learning Recommender-System for Recommendation-Algorithm Selection on Micro Level

    Authors: Andrew Collins, Dominika Tkaczyk, Joeran Beel

    Abstract: The effectiveness of recommendation algorithms is typically assessed with evaluation metrics such as root mean square error, F1, or click through rates, calculated over entire datasets. The best algorithm is typically chosen based on these overall metrics. However, there is no single-best algorithm for all users, items, and contexts. Choosing a single algorithm based on overall evaluation results… ▽ More

    Submitted 30 November, 2018; v1 submitted 30 May, 2018; originally announced May 2018.

  7. arXiv:1802.06565  [pdf

    cs.DL cs.IR

    A Study of Position Bias in Digital Library Recommender Systems

    Authors: Andrew Collins, Dominika Tkaczyk, Akiko Aizawa, Joeran Beel

    Abstract: "Position bias" describes the tendency of users to interact with items on top of a list with higher probability than with items at a lower position in the list, regardless of the items' actual relevance. In the domain of recommender systems, particularly recommender systems in digital libraries, position bias has received little attention. We conduct a study in a real-world recommender system that… ▽ More

    Submitted 19 February, 2018; originally announced February 2018.

  8. arXiv:1802.01174  [pdf

    cs.DL

    A Method for Discovering and Extracting Author Contributions Information from Scientific Biomedical Publications

    Authors: Dominika Tkaczyk, Andrew Collins, Joeran Beel

    Abstract: Creating scientific publications is a complex process, typically composed of a number of different activities, such as designing the experiments, data preparation, programming software and writing and editing the manuscript. The information about the contributions of individual authors of a paper is important in the context of assessing authors' scientific achievements. Some publications in biomed… ▽ More

    Submitted 4 February, 2018; originally announced February 2018.

  9. arXiv:1802.01168  [pdf

    cs.DL

    Machine Learning vs. Rules and Out-of-the-Box vs. Retrained: An Evaluation of Open-Source Bibliographic Reference and Citation Parsers

    Authors: Dominika Tkaczyk, Andrew Collins, Paraic Sheridan, Joeran Beel

    Abstract: Bibliographic reference parsing refers to extracting machine-readable metadata, such as the names of the authors, the title, or journal name, from bibliographic reference strings. Many approaches to this problem have been proposed so far, including regular expressions, knowledge bases and supervised machine learning. Many open source reference parsers based on various algorithms are also available… ▽ More

    Submitted 19 April, 2018; v1 submitted 4 February, 2018; originally announced February 2018.

    Comments: to appear in Proceedings of Joint Conference on Digital Libraries 2018

  10. arXiv:1710.10201  [pdf, other

    cs.DL cs.IR

    New Methods for Metadata Extraction from Scientific Literature

    Authors: Dominika Tkaczyk

    Abstract: Within the past few decades we have witnessed digital revolution, which moved scholarly communication to electronic media and also resulted in a substantial increase in its volume. Nowadays keeping track with the latest scientific achievements poses a major challenge for the researchers. Scientific information overload is a severe problem that slows down scholarly communication and knowledge propa… ▽ More

    Submitted 27 October, 2017; originally announced October 2017.

    Comments: PhD Thesis

    ACM Class: I.7.5; H.3.7

  11. arXiv:1303.6906  [pdf, ps, other

    cs.IR cs.DL

    Large scale citation matching using Apache Hadoop

    Authors: Mateusz Fedoryszak, Dominika Tkaczyk, Łukasz Bolikowski

    Abstract: During the process of citation matching links from bibliography entries to referenced publications are created. Such links are indicators of topical similarity between linked texts, are used in assessing the impact of the referenced document and improve navigation in the user interfaces of digital libraries. In this paper we present a citation matching method and show how to scale it up to handle… ▽ More

    Submitted 26 March, 2013; originally announced March 2013.

    Comments: 11 pages, 4 figures

    ACM Class: H.3.3