TY - JOUR
T1 - Advanced clustering technique for medical data using semantic information
AU - Shin, Kwangcheol
AU - Han, Sang Yong
AU - Gelbukh, Alexander
PY - 2004
Y1 - 2004
N2 - MEDLINE is a representative collection of medical documents supplied with original full-text natural-language abstracts as well as with representative keywords (called MeSH-terms) manually selected by the expert annotators from a pre-defined ontology and structured according to their relation to the document. We show how the structured manually assigned semantic descriptions can be combined with the original full-text abstracts to improve quality of clustering the documents into a small number of clusters. As a baseline, we compare our results with clustering using only abstracts or only MeSH-terms. Our experiments show 36% to 47% higher cluster coherence, as well as more refined keywords for the produced clusters.
UR - http://www.scopus.com/inward/record.url?scp=9444235094&partnerID=8YFLogxK
U2 - 10.1007/978-3-540-24694-7_33
DO - 10.1007/978-3-540-24694-7_33
M3 - Conference article
AN - SCOPUS:9444235094
SN - 0302-9743
VL - 2972
SP - 322
EP - 331
JO - Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science)
JF - Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science)
T2 - Third Mexican International Conference on Artificial Intelligence
Y2 - 26 April 2004 through 30 April 2004
ER -