TY - GEN
T1 - Lazy query enrichment
T2 - 11th International Conference on Database and Expert Systems Applications, DEXA 2000
AU - Gelbukh, Alexander F.
N1 - Publisher Copyright:
© Springer-Verlag Berlin Heidelberg 2000.
PY - 2000
Y1 - 2000
N2 - A full-text information retrieval system has to deal with various phenomena of string equivalence: ignore case matching, morphological inflection, derivation, synonymy, and hyponymy or hyperonymy. Technically, this can be handled either at the time of indexing by reducing equivalent strings to a common form or at the time of query processing by enriching the query with the whole set of the equivalent forms. We argue for that the latter way allows for greater flexibility and easier maintenance, while being more affordable than it is usually considered. Our proposal consists in enriching the query only with those forms that really appear in the document base. Our experiments with a thesaurus- based information retrieval system showed only insignificant increase of the query size on average with a 200-megabyte document base, even with highly inflective Spanish language.
AB - A full-text information retrieval system has to deal with various phenomena of string equivalence: ignore case matching, morphological inflection, derivation, synonymy, and hyponymy or hyperonymy. Technically, this can be handled either at the time of indexing by reducing equivalent strings to a common form or at the time of query processing by enriching the query with the whole set of the equivalent forms. We argue for that the latter way allows for greater flexibility and easier maintenance, while being more affordable than it is usually considered. Our proposal consists in enriching the query only with those forms that really appear in the document base. Our experiments with a thesaurus- based information retrieval system showed only insignificant increase of the query size on average with a 200-megabyte document base, even with highly inflective Spanish language.
UR - http://www.scopus.com/inward/record.url?scp=77954228036&partnerID=8YFLogxK
U2 - 10.1007/3-540-44469-6_49
DO - 10.1007/3-540-44469-6_49
M3 - Contribución a la conferencia
SN - 9783540679783
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 526
EP - 535
BT - Database and Expert Systems Applications - 11th International Conference, DEXA 2000, Proceedings
A2 - Ibrahim, Mohamed
A2 - Kung, Josef
A2 - Revell, Norman
PB - Springer Verlag
Y2 - 4 September 2000 through 8 September 2000
ER -