A comparison of several AI techniques for authorship attribution on Romanian texts
Abstract: Determining the author of a text is a difficult task. Here we compare multiple AI techniques for classifying literary texts written by multiple authors by taking into account a limited number of parts of speech (prepositions, adverbs, and conjunctions). We also introduce a new dataset composed of texts written in the Romanian language on which we have run the algorithms. The compared methods are Arti…
Submitted 21 January, 2023; v1 submitted 9 November, 2022; originally announced November 2022.
Comments: We initially used the Accuracy evaluation tool to compute the macro-accuracy, obtaining a value of 88.84%. We later discovered that this value was erroneous; other methods gave a macro-accuracy of 80.94%. In this version of the paper we present a Python solution that uses classification_report and balanced_accuracy_score from sklearn.metrics.
MSC Class: 03B65; 62H30; 68T01; 68T05; 68T07; 68T10; 68T20; 68T30; 68T50; 91F20 ACM Class: I.2.0; I.2.6
Journal ref: Mathematics 2022, 10(23), 4589
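
The comments above mention balanced_accuracy_score from sklearn.metrics. As a rough illustration only, and not the authors' code, the sketch below computes balanced accuracy in plain Python as the mean of per-class recall, which is how scikit-learn defines that score. The function names and the author labels "A" and "B" are hypothetical; the toy data is made up and is not taken from the paper's dataset.

```python
from collections import defaultdict


def plain_accuracy(y_true, y_pred):
    """Fraction of samples whose predicted label equals the true label."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    hits = sum(1 for t, p in zip(y_true, y_pred) if t == p)
    return hits / len(y_true)


def balanced_accuracy(y_true, y_pred):
    """Mean of per-class recall, so every class (author) weighs the same."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    total = defaultdict(int)    # samples per true class
    correct = defaultdict(int)  # correctly predicted samples per true class
    for t, p in zip(y_true, y_pred):
        total[t] += 1
        if t == p:
            correct[t] += 1
    recalls = [correct[c] / total[c] for c in total]
    return sum(recalls) / len(recalls)


# Hypothetical toy labels: four texts by author "A", two by author "B".
y_true = ["A", "A", "A", "A", "B", "B"]
y_pred = ["A", "A", "A", "A", "B", "A"]

print(f"plain accuracy:    {plain_accuracy(y_true, y_pred):.4f}")
print(f"balanced accuracy: {balanced_accuracy(y_true, y_pred):.4f}")
```

On imbalanced data such as this toy example, the two scores differ because the plain accuracy is dominated by the author with more texts, while the balanced score averages the recall of each author equally.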