Resolving Ambiguities in Toponym Recognition in Cartographic Maps

Alexander Gelbukh; Serguei Levachkine; Sang Yong Han

doi:10.1007/978-3-540-25977-0_7

Resolving Ambiguities in Toponym Recognition in Cartographic Maps

Alexander Gelbukh, Serguei Levachkine, Sang Yong Han

Centro de Investigación en Computación (CIC)

Research output: Chapter in Book/Report/Conference proceeding › Chapter › peer-review

7 Scopus citations

Abstract

To date many methods and programs for automatic text recognition exist. However there are no effective text recognition systems for graphic documents. Graphic documents usually contain a great variety of textual information. As a rule the text appears in arbitrary spatial positions, in different fonts, sizes and colors. The text can touch and overlap graphic symbols. The text meaning is semantically much more ambiguous in comparison with standard text. To recognize a text of graphic documents, it is necessary first to separate it from linear objects, solids, and symbols and to define its orientation. Even so, the recognition programs nearly always produce errors. In the context of raster-to-vector conversion of graphic documents, the problem of text recognition is of special interest, because textual information can be used for verifi- . cation of vectorization results (post-processing). In this work, we propose a method that combines OCR-based text recognition in raster-scanned maps with heuristics specially adapted for cartographic data to resolve the recognition ambiguities using, among other information sources, the spatial object relationships. Our goal is to form in the vector thematic layers geographically meaningful words correctly attached to the cartographic objects.

Original language	English
Title of host publication	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Editors	Josep Llados, Young-Bin Kwon
Publisher	Springer Verlag
Pages	75-86
Number of pages	12
ISBN (Electronic)	9783540224785
DOIs	https://doi.org/10.1007/978-3-540-25977-0_7
State	Published - 2004

Publication series

Name	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume	3088
ISSN (Print)	0302-9743
ISSN (Electronic)	1611-3349

Access to Document

10.1007/978-3-540-25977-0_7

Cite this

Gelbukh, A., Levachkine, S., & Han, S. Y. (2004). Resolving Ambiguities in Toponym Recognition in Cartographic Maps. In J. Llados, & Y.-B. Kwon (Eds.), Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (pp. 75-86). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 3088). Springer Verlag. https://doi.org/10.1007/978-3-540-25977-0_7

Gelbukh, Alexander ; Levachkine, Serguei ; Han, Sang Yong. / Resolving Ambiguities in Toponym Recognition in Cartographic Maps. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). editor / Josep Llados ; Young-Bin Kwon. Springer Verlag, 2004. pp. 75-86 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).

@inbook{f2ae2255461a478796a454ade61aa08d,

title = "Resolving Ambiguities in Toponym Recognition in Cartographic Maps",

abstract = "To date many methods and programs for automatic text recognition exist. However there are no effective text recognition systems for graphic documents. Graphic documents usually contain a great variety of textual information. As a rule the text appears in arbitrary spatial positions, in different fonts, sizes and colors. The text can touch and overlap graphic symbols. The text meaning is semantically much more ambiguous in comparison with standard text. To recognize a text of graphic documents, it is necessary first to separate it from linear objects, solids, and symbols and to define its orientation. Even so, the recognition programs nearly always produce errors. In the context of raster-to-vector conversion of graphic documents, the problem of text recognition is of special interest, because textual information can be used for verifi- . cation of vectorization results (post-processing). In this work, we propose a method that combines OCR-based text recognition in raster-scanned maps with heuristics specially adapted for cartographic data to resolve the recognition ambiguities using, among other information sources, the spatial object relationships. Our goal is to form in the vector thematic layers geographically meaningful words correctly attached to the cartographic objects.",

author = "Alexander Gelbukh and Serguei Levachkine and Han, {Sang Yong}",

year = "2004",

doi = "10.1007/978-3-540-25977-0_7",

language = "Ingl{\'e}s",

series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

publisher = "Springer Verlag",

pages = "75--86",

editor = "Josep Llados and Young-Bin Kwon",

booktitle = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

address = "Alemania",

}

Gelbukh, A, Levachkine, S & Han, SY 2004, Resolving Ambiguities in Toponym Recognition in Cartographic Maps. in J Llados & Y-B Kwon (eds), Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 3088, Springer Verlag, pp. 75-86. https://doi.org/10.1007/978-3-540-25977-0_7

Resolving Ambiguities in Toponym Recognition in Cartographic Maps. / Gelbukh, Alexander; Levachkine, Serguei; Han, Sang Yong.
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). ed. / Josep Llados; Young-Bin Kwon. Springer Verlag, 2004. p. 75-86 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 3088).

Research output: Chapter in Book/Report/Conference proceeding › Chapter › peer-review

TY - CHAP

T1 - Resolving Ambiguities in Toponym Recognition in Cartographic Maps

AU - Gelbukh, Alexander

AU - Levachkine, Serguei

AU - Han, Sang Yong

PY - 2004

Y1 - 2004

N2 - To date many methods and programs for automatic text recognition exist. However there are no effective text recognition systems for graphic documents. Graphic documents usually contain a great variety of textual information. As a rule the text appears in arbitrary spatial positions, in different fonts, sizes and colors. The text can touch and overlap graphic symbols. The text meaning is semantically much more ambiguous in comparison with standard text. To recognize a text of graphic documents, it is necessary first to separate it from linear objects, solids, and symbols and to define its orientation. Even so, the recognition programs nearly always produce errors. In the context of raster-to-vector conversion of graphic documents, the problem of text recognition is of special interest, because textual information can be used for verifi- . cation of vectorization results (post-processing). In this work, we propose a method that combines OCR-based text recognition in raster-scanned maps with heuristics specially adapted for cartographic data to resolve the recognition ambiguities using, among other information sources, the spatial object relationships. Our goal is to form in the vector thematic layers geographically meaningful words correctly attached to the cartographic objects.

AB - To date many methods and programs for automatic text recognition exist. However there are no effective text recognition systems for graphic documents. Graphic documents usually contain a great variety of textual information. As a rule the text appears in arbitrary spatial positions, in different fonts, sizes and colors. The text can touch and overlap graphic symbols. The text meaning is semantically much more ambiguous in comparison with standard text. To recognize a text of graphic documents, it is necessary first to separate it from linear objects, solids, and symbols and to define its orientation. Even so, the recognition programs nearly always produce errors. In the context of raster-to-vector conversion of graphic documents, the problem of text recognition is of special interest, because textual information can be used for verifi- . cation of vectorization results (post-processing). In this work, we propose a method that combines OCR-based text recognition in raster-scanned maps with heuristics specially adapted for cartographic data to resolve the recognition ambiguities using, among other information sources, the spatial object relationships. Our goal is to form in the vector thematic layers geographically meaningful words correctly attached to the cartographic objects.

UR - http://www.scopus.com/inward/record.url?scp=35048857534&partnerID=8YFLogxK

U2 - 10.1007/978-3-540-25977-0_7

DO - 10.1007/978-3-540-25977-0_7

M3 - Capítulo

AN - SCOPUS:35048857534

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 75

EP - 86

BT - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

A2 - Llados, Josep

A2 - Kwon, Young-Bin

PB - Springer Verlag

ER -

Gelbukh A, Levachkine S, Han SY. Resolving Ambiguities in Toponym Recognition in Cartographic Maps. In Llados J, Kwon YB, editors, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Springer Verlag. 2004. p. 75-86. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). doi: 10.1007/978-3-540-25977-0_7

Resolving Ambiguities in Toponym Recognition in Cartographic Maps

Abstract

Publication series

Access to Document

Other files and links

Fingerprint

Cite this