-
Research information in the light of artificial intelligence: quality and data ecologies
Authors:
Otmane Azeroual,
Tibor Koltay
Abstract:
This paper presents multi- and interdisciplinary approaches for finding the appropriate AI technologies for research information. Professional research information management (RIM) is becoming increasingly important as an expressly data-driven tool for researchers. It is not only the basis of scientific knowledge processes, but also related to other data. A concept and a process model of the elementary phases, from the start of the project to the ongoing operation of the AI methods in the RIM, are presented. They portray the implementation of an AI project meant to enable universities and research institutions to support their researchers in dealing with incorrect and incomplete research information while it is being stored in their RIMs. Our aim is to show how research information harmonizes with the challenges of data literacy and data quality issues related to AI, and to underline that any project can be successful if the research institutions and the various university departments involved work together and appropriate support is offered to improve research information and data management.
Submitted 6 May, 2024;
originally announced May 2024.
-
NoSQL security: can my data-driven decision-making be influenced from outside?
Authors:
Anastasija Nikiforova,
Artjoms Daskevics,
Otmane Azeroual
Abstract:
Nowadays, there are billions of interconnected devices forming Cyber-Physical Systems, Internet of Things (IoT) and Industrial Internet of Things (IIoT) ecosystems. As the number of devices and systems in use grows, along with the amount and value of data, the risk of security breaches increases. One of these risks is posed by open data sources, by which we mean databases that are not properly protected. These poorly protected databases are accessible to external actors, which poses a serious risk to the data holder and to the results of data-related activities such as analysis, forecasting, monitoring, decision-making, and policy development, and thus to contemporary society as a whole. This chapter aims at examining the state of the security of open data databases, covering both relational and NoSQL databases, with a particular focus on the latter category.
Submitted 12 January, 2023; v1 submitted 23 June, 2022;
originally announced June 2022.
-
GRAPHYP: A Scientific Knowledge Graph with Manifold Subnetworks of Communities. Detection of Scholarly Disputes in Adversarial Information Routes
Authors:
Renaud Fabre,
Otmane Azeroual,
Patrice Bellot,
Joachim Schöpfel,
Daniel Egret
Abstract:
The cognitive manifold of published content is currently expanding in all areas of science. However, Scientific Knowledge Graphs (SKGs) only provide poor pictures of the adversarial directions and scientific controversies that feed the production of knowledge. In this article, we tackle the understanding of the design of the information space of a cognitive representation of research activities, and of related bottlenecks that affect search interfaces, in the mapping of structured objects into graphs. We propose, with SKG GRAPHYP, a novel graph-designed geometric architecture which optimizes both the detection of the knowledge manifold of "cognitive communities", and the representation of alternative paths to adversarial answers to a research question, for instance in the context of academic disputes. With a methodology for designing "Manifold Subnetworks of Cognitive Communities", GRAPHYP provides a classification of distinct search paths in a research field. Users are detected from the variety of their search practices and classified into "Cognitive communities" from the analysis of the search history of their logs of scientific documentation. The manifold of practices is expressed from metrics of differentiated uses by triplets of nodes shaped into symmetrical graph subnetworks, with the following three parameters: Mass, Intensity, and Variety.
Submitted 3 May, 2022;
originally announced May 2022.
-
Implementation and user acceptance of research information systems
Authors:
Otmane Azeroual,
Joachim Schöpfel,
Gunter Saake
Abstract:
Purpose: The purpose of this paper is to present empirical evidence on the implementation, acceptance and quality-related aspects of research information systems (RIS) in academic institutions.
Design/methodology/approach: The study is based on a 2018 survey with 160 German universities and research institutions.
Findings: The paper presents recent figures about the implementation of RIS in German academic institutions, including results on the satisfaction, perceived usefulness and ease of use. It also contains information about the perceived data quality and the preferred quality management. RIS acceptance can be achieved only if the highest possible quality of the data is ensured. For this reason, the impact of data quality on the technology acceptance model (TAM) is examined, and the relation between the level of data quality and user acceptance of the associated institutional RIS is addressed.
Research limitations/implications: The data provide empirical elements for a better understanding of the role of data quality for the acceptance of RIS, in the framework of a TAM. The study puts the focus on commercial and open-source solutions while in-house developments have been excluded. Also, mainly because of the small sample size, the data analysis was limited to descriptive statistics.
Practical implications: The results are helpful for the management of RIS projects, to increase acceptance and satisfaction with the system, and for the further development of RIS functionalities.
Originality/value: The number of empirical studies on the implementation and acceptance of RIS is low, and very few address in this context the question of data quality. The study tries to fill the gap.
Submitted 16 November, 2021;
originally announced November 2021.
-
Research Intelligence (CRIS) and the Cloud: A Review
Authors:
Otmane Azeroual,
Joachim Schöpfel
Abstract:
The purpose of this paper is to explore the impact of cloud technology on current research information systems (CRIS). Based on an overview of published literature and on empirical evidence from surveys, the paper presents the main characteristics, delivery models, service levels and general benefits of cloud computing. The second part assesses how cloud computing challenges research information management, from three angles: networking, specific benefits, and the ingestion of data in the cloud. The third part describes three aspects of the implementation of current research information systems in the cloud, i.e. service models, requirements, and potential risks and barriers. The paper concludes with some perspectives for future work. The paper is written for CRIS administrators and users, in order to improve research information management and to contribute to the future development and implementation of these systems, but also for scholars and students who want detailed knowledge of this topic.
Submitted 2 October, 2019;
originally announced October 2019.
-
The Effects of Using Business Intelligence Systems on an Excellence Management and Decision-Making Process by Start-Up Companies: A Case Study
Authors:
Otmane Azeroual,
Horst Theel
Abstract:
The rapid increase in data volumes in companies has meant that momentous and comprehensive information gathering is barely possible by manual means. Business intelligence (BI) solutions can help here. They provide tools with appropriate technologies to assist with the collection, integration, storage, editing, and analysis of existing data. While a few years ago almost only large companies were interested in this topic, it has meanwhile also become necessary for start-up companies, and so the market for business intelligence has been growing for years. This article focuses on the general potential of using BI in start-ups. First, it examines which providers of BI solutions are suitable for start-ups and what opportunities exist for implementing BI systems in start-ups. Then it shows to what extent BI has prevailed in start-ups, in which areas BI techniques are used in start-ups, and what purpose BI serves there. Finally, the success factors for BI projects in start-ups are considered.
Submitted 18 January, 2019;
originally announced January 2019.
-
Improving the data quality in the research information systems
Authors:
Otmane Azeroual,
Mohammad Abuosba
Abstract:
The introduction of an integrated research information system is meant to provide scientific institutions with the necessary information on research activities and research results in assured quality. Problems such as duplication, missing values, incorrect formatting and inconsistencies can arise when research data are collected in different research information systems, and they can have a wide range of negative effects on data quality, so the subject of data quality should be addressed more effectively. This paper examines the data quality problems in research information systems and presents new techniques that enable organizations to improve the quality of their research information.
Submitted 18 January, 2019;
originally announced January 2019.
-
Data Quality Measures and Data Cleansing for Research Information Systems
Authors:
Otmane Azeroual,
Gunter Saake,
Mohammad Abuosba
Abstract:
The collection, transfer and integration of research information into different research information systems (RIS) can result in different data errors that can have a variety of negative effects on data quality. In order to detect errors at an early stage and treat them efficiently, it is necessary to determine the clean-up measures and the new techniques of data cleansing for quality improvement in research institutions. This provides an adequate and reliable basis for decision-making using an RIS and increases confidence in a given dataset. This paper presents possible measures and new techniques of data cleansing for improving the data quality in research information systems, and shows how they are to be applied to research information.
Submitted 18 January, 2019;
originally announced January 2019.
-
Text data mining and data quality management for research information systems in the context of open data and open science
Authors:
Otmane Azeroual,
Gunter Saake,
Mohammad Abuosba,
Joachim Schöpfel
Abstract:
In the implementation and use of research information systems (RIS) in scientific institutions, text data mining and semantic technologies are key technologies for the meaningful use of large amounts of data. It is not the collection of data that is difficult, but the further processing and integration of the data in RIS. Data is usually not uniformly formatted and structured, such as texts and tables that cannot be linked. These include various source systems with their different data formats, such as project and publication databases, the CERIF and RCD data models, etc. Internal and external data sources continue to develop. On the one hand, they must be constantly synchronized and the results of the data links checked. On the other hand, the texts must be processed in natural language and certain information extracted. Using text data mining, the quality of the metadata is analyzed and the entities and general keywords are identified, so that the user is supported in the search for interesting research information. The information age makes it easier to store huge amounts of data, and the increase in the number of documents on the internet, in institutions' intranets, in newswires and blogs is overwhelming. Search engines should help to specifically open up these sources of information and make them usable for administrative and research purposes. Against this backdrop, the aim of this paper is to provide an overview of text data mining techniques and the management of successful data quality for RIS in the context of open data and open science in scientific institutions and libraries, as well as to provide ideas for their application. In particular, solutions for the RIS will be presented.
Submitted 11 December, 2018;
originally announced December 2018.