Evaluating the irregularity of natural languages

Candelario Hernández-Gómez, Rogelio Basurto-Flores, Bibiana Obregón-Quintana, Lev Guzmán-Vargas

Research output: Contribution to journal › Article › peer-review

7 Citations (Scopus)

Abstract

In the present work, we quantify the irregularity of different European languages belonging to four linguistic families (Romance, Germanic, Uralic and Slavic) and an artificial language (Esperanto). We modified a well-known method to calculate the approximate and sample entropy of written texts. We find differences in the degree of irregularity between the families. Our method, which is based on the search for regularities in a sequence of symbols, consistently distinguishes between natural and synthetic randomized texts. Moreover, we extended our study to the case where multiple scales are accounted for, as in multiscale entropy analysis. Our results revealed that real texts have non-trivial structure compared to the ones obtained from randomization procedures.
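As a rough illustration of the idea, the sketch below computes a sample-entropy-style statistic for a sequence of symbols. It is not the authors' modified method, which the abstract does not detail. It assumes that two templates "match" only when they are exactly equal (a natural choice for characters, replacing the numeric tolerance of the usual definition), and the names `count_matching_pairs` and `sample_entropy` and the default `m=2` are hypothetical choices made for this example.

```python
import math
import random
from collections import Counter


def count_matching_pairs(symbols, length, n_templates):
    """Count unordered pairs of identical templates of the given length.

    Templates start at positions 0 .. n_templates - 1.
    """
    counts = Counter(tuple(symbols[i:i + length]) for i in range(n_templates))
    # A template seen c times contributes c * (c - 1) / 2 matching pairs.
    return sum(c * (c - 1) // 2 for c in counts.values())


def sample_entropy(symbols, m=2):
    """Sample-entropy-style irregularity of a symbol sequence.

    Assumption: exact equality of templates stands in for the tolerance
    used with numeric series. Returns ln(B / A), where B and A are the
    numbers of matching template pairs of length m and m + 1.
    """
    n_templates = len(symbols) - m
    if n_templates < 2:
        raise ValueError("sequence too short for this template length")
    b = count_matching_pairs(symbols, m, n_templates)
    a = count_matching_pairs(symbols, m + 1, n_templates)
    if a == 0 or b == 0:
        return math.inf  # no repeated templates: entropy is undefined/unbounded
    return math.log(b / a)


# Toy comparison: a repetitive text versus a shuffled copy of the same characters.
text = "the quick brown fox jumps over the lazy dog " * 20
rng = random.Random(0)  # fixed seed so the shuffle is reproducible
shuffled = list(text)
rng.shuffle(shuffled)

original_value = sample_entropy(text)
shuffled_value = sample_entropy(shuffled)
```

On this toy input the shuffled copy yields a larger value than the original, since shuffling destroys the repeated patterns that let a match of length `m` extend to length `m + 1`. A strictly periodic sequence such as `"ab" * 50` gives a value of zero.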

Original language: English
Article number: 521
Journal: Entropy
Volume: 19
Issue number: 10
DOI
Status: Published - 1 Oct 2017
