
Showing 1–2 of 2 results for author: Orczyk, A

  1. arXiv:1007.0936  [pdf, ps, other]

    cs.CL physics.soc-ph

    Linguistic complexity: English vs. Polish, text vs. corpus

    Authors: Jaroslaw Kwapien, Stanislaw Drozdz, Adam Orczyk

    Abstract: We analyze the rank-frequency distributions of words in selected English and Polish texts. We show that for the lemmatized (basic) word forms the scale-invariant regime breaks after about two decades, while it might be consistent for the whole range of ranks for the inflected word forms. We also find that for a corpus consisting of texts written by different authors the basic scale-invariant regim…

    Submitted 6 July, 2010; originally announced July 2010.

    Journal ref: Acta Phys. Pol. A 117, 716-720 (2010)

  2. arXiv:0901.3291  [pdf, ps, other]

    cs.CL physics.data-an

    Approaching the linguistic complexity

    Authors: Stanislaw Drozdz, Jaroslaw Kwapien, Adam Orczyk

    Abstract: We analyze the rank-frequency distributions of words in selected English and Polish texts. We compare scaling properties of these distributions in both languages. We also study a few small corpora of Polish literary texts and find that for a corpus consisting of texts written by different authors the basic scaling regime is broken more strongly than in the case of comparable corpus consisting of…

    Submitted 21 January, 2009; originally announced January 2009.

    Comments: to be published in conference proceedings

    Journal ref: Complex Sciences, Lect. Notes ICST vol.4, 1044-1050 (Springer, 2009)