-
Formal concept analysis for evaluating intrinsic dimension of a natural language
Authors:
Sergei O. Kuznetsov,
Vasilii A. Gromov,
Nikita S. Borodin,
Andrei M. Divavin
Abstract:
Results of a computational experiment to determine the intrinsic dimension of linguistic varieties for the Bengali and Russian languages are presented. Sets of words and sets of bigrams in these languages were considered separately. The method used to solve this problem is based on formal concept analysis algorithms. The intrinsic dimensions of these languages were found to be significantly smaller than the dimensions used in popular neural network models in natural language processing.
Submitted 17 November, 2023;
originally announced November 2023.
-
A Language and Its Dimensions: Intrinsic Dimensions of Language Fractal Structures
Authors:
Vasilii A. Gromov,
Nikita S. Borodin,
Asel S. Yerbolova
Abstract:
The present paper introduces a novel object of study: a language fractal structure. We hypothesize that a set of embeddings of all $n$-grams of a natural language constitutes a representative sample of this fractal set. (We use the term Hailonakea to refer to the sum total of all language fractal structures, over all $n$.) The paper estimates intrinsic (genuine) dimensions of language fractal structures for the Russian and English languages. To this end, we employ methods based on (1) topological data analysis and (2) a minimum spanning tree of a data graph for the point cloud under consideration (Steele theorem). For both languages, for all $n$, the intrinsic dimensions appear to be non-integer values (typical for fractal sets), close to 9 for both Russian and English.
Submitted 20 November, 2023; v1 submitted 16 November, 2023;
originally announced November 2023.
-
Element-Resolved Corrosion Analysis of Stainless-Type Glass-Forming Steels
Authors:
M. J. Duarte,
J. Klemm,
S. O. Klemm,
K. J. J. Mayrhofer,
M. Stratmann,
S. Borodin,
A. H. Romero,
M. Madinehe,
D. Crespo,
J. Serrano,
S. S. A. Gerstl,
P. P. Choi,
D. Raabe,
F. U. Renner
Abstract:
Ultrathin passive films effectively prevent the chemical attack of stainless steel grades in corrosive environments; their stability critically depends on the interplay between structure and chemistry of the constituents Fe-Cr-Mo. In particular, nanoscale inhomogeneities along the surface can have a tremendous impact on material failure, but are as yet barely understood. Addressing a stainless-type glass-forming Fe50Cr15Mo14C15B6 alloy and utilizing a combination of complementary high-resolution analytical techniques, we relate near-atomistic insight into different gradual nanostructures with time- and element-resolved dissolution behavior. The influence of progressive nanoscale elemental segregation on the concomitant degree of passivity is followed. A detrimental transition from Cr-controlled passivity to Mo-controlled breakdown is dissected atom-by-atom, demonstrating the importance of nanoscale knowledge for understanding corrosion.
Submitted 13 February, 2014;
originally announced February 2014.