Authorship Attribution Based on Life-Like Network Automata

Machicao, Jeaneth; Corrêa Jr., Edilson A.; Miranda, Gisele H. B.; Amancio, Diego R.; Bruno, Odemir M.

doi:10.1371/journal.pone.0193703

Computer Science > Computation and Language

arXiv:1610.06498 (cs)

[Submitted on 20 Oct 2016]

Title:Authorship Attribution Based on Life-Like Network Automata

Authors:Jeaneth Machicao, Edilson A. Corrêa Jr., Gisele H. B. Miranda, Diego R. Amancio, Odemir M. Bruno

View PDF

Abstract:The authorship attribution is a problem of considerable practical and technical interest. Several methods have been designed to infer the authorship of disputed documents in multiple contexts. While traditional statistical methods based solely on word counts and related measurements have provided a simple, yet effective solution in particular cases; they are prone to manipulation. Recently, texts have been successfully modeled as networks, where words are represented by nodes linked according to textual similarity measurements. Such models are useful to identify informative topological patterns for the authorship recognition task. However, there is no consensus on which measurements should be used. Thus, we proposed a novel method to characterize text networks, by considering both topological and dynamical aspects of networks. Using concepts and methods from cellular automata theory, we devised a strategy to grasp informative spatio-temporal patterns from this model. Our experiments revealed an outperformance over traditional analysis relying only on topological measurements. Remarkably, we have found a dependence of pre-processing steps (such as the lemmatization) on the obtained results, a feature that has mostly been disregarded in related works. The optimized results obtained here pave the way for a better characterization of textual networks.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:1610.06498 [cs.CL]
	(or arXiv:1610.06498v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1610.06498
Journal reference:	PLoS ONE 13(3): e0193703, 2018
Related DOI:	https://doi.org/10.1371/journal.pone.0193703

Submission history

From: Diego Amancio Dr. [view email]
[v1] Thu, 20 Oct 2016 17:00:42 UTC (5,806 KB)

Computer Science > Computation and Language

Title:Authorship Attribution Based on Life-Like Network Automata

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Authorship Attribution Based on Life-Like Network Automata

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators