TY - GEN
T1 - Use of a weighted topic hierarchy for document classification?
AU - Gelbukh, Alexander
AU - Sidorov, Grigori
AU - Guzman-Arénas, Adolfo
N1 - Publisher Copyright:
© Springer-Verlag Berlin Heidelberg 1999.
PY - 1999
Y1 - 1999
N2 - A statistical method of document classification driven by a hierarchical topic dictionary is proposed. The method uses a dictionary with a simple structure and is insensible to inaccuracies in the dictionary. Two kinds of weights of dictionary entries, namely, relevance and discrimination weights are discussed. The first type of weights is associated with the links between words and topics and between the nodes in the tree, while the weights of the second type depend on user database. A common sense-complaint way of assignment of these weights to the topics is presented. A system for text classification Classifier based on the discussed method is described.
AB - A statistical method of document classification driven by a hierarchical topic dictionary is proposed. The method uses a dictionary with a simple structure and is insensible to inaccuracies in the dictionary. Two kinds of weights of dictionary entries, namely, relevance and discrimination weights are discussed. The first type of weights is associated with the links between words and topics and between the nodes in the tree, while the weights of the second type depend on user database. A common sense-complaint way of assignment of these weights to the topics is presented. A system for text classification Classifier based on the discussed method is described.
UR - http://www.scopus.com/inward/record.url?scp=84957808345&partnerID=8YFLogxK
U2 - 10.1007/3-540-48239-3_24
DO - 10.1007/3-540-48239-3_24
M3 - Contribución a la conferencia
SN - 3540664947
SN - 9783540664949
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 133
EP - 138
BT - Text, Speech and Dialogue - 2nd International Workshop, TSD 1999, Proceedings
A2 - Matousek, Václav
A2 - Mautner, Pavel
A2 - Oceláková, Jana
A2 - Sojka, Petr
PB - Springer Verlag
T2 - 2nd International Workshop on Text, Speech and Dialogue, TSD 1999
Y2 - 13 September 1999 through 17 September 1999
ER -