Statistical Inferences for Polarity Identification in Natural Language

Pröllochs, Nicolas; Feuerriegel, Stefan; Neumann, Dirk

doi:10.1371/journal.pone.0209323

arXiv:1706.06996 (cs)

[Submitted on 21 Jun 2017 (v1), last revised 5 Apr 2018 (this version, v2)]

Abstract: Information forms the basis for all human behavior, including the ubiquitous decision-making that people constantly perform in their everyday lives. It is thus the mission of researchers to understand how humans process information to reach decisions. In order to facilitate this task, this work proposes a novel method of studying the reception of granular expressions in natural language. The approach utilizes LASSO regularization as a statistical tool to extract decisive words from textual content and draw statistical inferences based on the correspondence between the occurrences of words and an exogenous response variable. Accordingly, the method immediately suggests significant implications for social sciences and Information Systems research: everyone can now identify text segments and word choices that are statistically relevant to authors or readers and, based on this knowledge, test hypotheses from behavioral research. We demonstrate the contribution of our method by examining how authors communicate subjective information through narrative materials. This allows us to answer the question of which words to choose when communicating negative information. In addition, we show that investors trade not only upon facts in financial disclosures but are also distracted by filler words and non-informative language. Practitioners, for example those in the fields of investor communications or marketing, can exploit our insights to enhance their writings based on the true perception of word choice.
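
As a rough illustration of the idea described in the abstract (regressing an exogenous response on word occurrences under an L1 penalty so that only a few words keep non-zero coefficients), the following is a minimal sketch and not the authors' implementation. The toy corpus, the response values, the penalty `lam`, and all function names are assumptions invented for this example; the solver is a plain cyclic coordinate descent written without external libraries.

```python
from collections import Counter


def soft_threshold(z, gamma):
    """Shrink z toward zero by gamma; values within [-gamma, gamma] become 0."""
    if z > gamma:
        return z - gamma
    if z < -gamma:
        return z + gamma
    return 0.0


def build_term_matrix(docs):
    """Return (vocabulary, rows) where rows[i][j] counts word j in document i."""
    tokenized = [doc.lower().split() for doc in docs]
    vocab = sorted({word for tokens in tokenized for word in tokens})
    rows = []
    for tokens in tokenized:
        counts = Counter(tokens)
        rows.append([float(counts[word]) for word in vocab])
    return vocab, rows


def lasso_coordinate_descent(X, y, lam, n_iter=200):
    """Minimize (1/2n)*||y - X b||^2 + lam*||b||_1 by cyclic coordinate descent."""
    n, p = len(X), len(X[0])
    # Center the response and each column so that no intercept term is needed.
    y_mean = sum(y) / n
    cols = []
    for j in range(p):
        mean_j = sum(X[i][j] for i in range(n)) / n
        cols.append([X[i][j] - mean_j for i in range(n)])
    resid = [v - y_mean for v in y]
    beta = [0.0] * p
    for _ in range(n_iter):
        for j, col in enumerate(cols):
            scale = sum(c * c for c in col) / n
            if scale == 0.0:
                continue  # a constant column carries no information
            # Correlation of column j with the residual that excludes its own fit.
            rho = sum(c * (r + c * beta[j]) for c, r in zip(col, resid)) / n
            new_beta = soft_threshold(rho, lam) / scale
            delta = new_beta - beta[j]
            if delta != 0.0:
                resid = [r - c * delta for c, r in zip(col, resid)]
                beta[j] = new_beta
    return beta


# Hypothetical toy corpus and response values, invented for illustration only.
docs = [
    "profit growth strong",
    "loss decline weak",
    "profit increase the",
    "loss risk the",
    "growth strong the outlook",
    "decline risk outlook",
]
response = [1.0, -1.0, 0.8, -0.9, 1.1, -1.0]

vocab, X = build_term_matrix(docs)
beta = lasso_coordinate_descent(X, response, lam=0.05)

# Words whose coefficient survived the L1 penalty.
selected = {word: round(b, 3) for word, b in zip(vocab, beta) if b != 0.0}
print(selected)
```

Raising `lam` shrinks more coefficients to exactly zero, which is what makes the penalty usable as a word-selection device; a sufficiently large value removes every word. In practice the penalty would be tuned (for example by cross-validation) and an established solver would replace this hand-written loop.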

Subjects:	Computation and Language (cs.CL); Applications (stat.AP)
Cite as:	arXiv:1706.06996 [cs.CL]
	(or arXiv:1706.06996v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1706.06996
Related DOI:	https://doi.org/10.1371/journal.pone.0209323

Submission history

From: Nicolas Pröllochs
[v1] Wed, 21 Jun 2017 16:37:54 UTC (137 KB)
[v2] Thu, 5 Apr 2018 23:45:33 UTC (82 KB)
