Ambigüedad en nombres hispanos

Translated title of the contribution: Ambiguity in hispanic names

Grettel Barceló, Eduardo Cendejas, Igor Bolshakov, Grigori Sidorov

Research output: Contribution to journalArticlepeer-review

5 Scopus citations

Abstract

The constitution of Hispanic names assumes a degree of ambiguity in many cases. The structure of the denominative sequences in Hispanic countries presents five fundamental problems that obstruct their interpretation: (1) the double sex deduction in personal names, as in Guadalupe; (2) the association of names and/or surnames in one name, as in Jorge Luis, whose components exist separately; (3) the composition of the elements by means of a connector; (4) the name/surname duality; and (5) the accepted omission of some of the elements of the denominative sequences. This study focuses on the automatic detection and analysis of these types of ambiguities (uncertainties). A formal grammar that determines valid interpretations of the nominal chains was developed by means of the automatic labeling of all the elements of which this grammar is composed. Furthermore, graphs of the distribution of the names and surnames are presented, the most important of which reveals that the frequency abides by Zipf's law. A corpus of 745,084 personal records was used as a data source. From these records, 93,998 type names, and 13,779 type surnames, including simple, compound, and associate ones, were taken. From these, 77,162 (82%) ambiguity sources in names and 2,739 (20%) ambiguity sources in surnames were detected. From all of the personal records analyzed, 241,992 (33%) present at least two valid interpretations in the denomination.

Translated title of the contributionAmbiguity in hispanic names
Original languageSpanish
Pages (from-to)153-169
Number of pages17
JournalRevista Signos
Volume42
Issue number70
StatePublished - 2009

Fingerprint

Dive into the research topics of 'Ambiguity in hispanic names'. Together they form a unique fingerprint.

Cite this