Chi-square classifier for document categorization

Mikhail Alexandrov; Alexander Gelbukh; George Lozovoi

doi:10.1007/3-540-44686-9_45

Chi-square classifier for document categorization

Mikhail Alexandrov, Alexander Gelbukh, George Lozovoi

Centro de Investigación en Computación (CIC)

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

8 Scopus citations

Abstract

The problem of document categorization is considered. The set of domains and the keywords specific for these domains is supposed to be selected beforehand as initial data. We apply the well-known statistical hypothesis test that considers images of documents and domains as normalized vectors. In comparison with existing methods, such approach allows to take into account a random character of initial data. The classifier is developed in the framework of Document Investigator software package.

Original language	English
Title of host publication	Computational Linguistics and Intelligent Text Processing - 2nd International Conference, CICLing 2001, Proceedings
Editors	Alexander Gelbukh
Publisher	Springer Verlag
Pages	457-459
Number of pages	3
ISBN (Print)	3540416870, 9783540416876
DOIs	https://doi.org/10.1007/3-540-44686-9_45
State	Published - 2001
Event	2nd International Conference on Computational Linguistics and Intelligent Text Processing, CICLing 2001 - Mexico City, Mexico Duration: 18 Feb 2001 → 24 Feb 2001

Publication series

Name	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume	2004
ISSN (Print)	0302-9743
ISSN (Electronic)	1611-3349

Conference

Conference	2nd International Conference on Computational Linguistics and Intelligent Text Processing, CICLing 2001
Country/Territory	Mexico
City	Mexico City
Period	18/02/01 → 24/02/01

Access to Document

10.1007/3-540-44686-9_45

Cite this

Alexandrov, M., Gelbukh, A., & Lozovoi, G. (2001). Chi-square classifier for document categorization. In A. Gelbukh (Ed.), Computational Linguistics and Intelligent Text Processing - 2nd International Conference, CICLing 2001, Proceedings (pp. 457-459). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 2004). Springer Verlag. https://doi.org/10.1007/3-540-44686-9_45

Alexandrov, Mikhail ; Gelbukh, Alexander ; Lozovoi, George. / Chi-square classifier for document categorization. Computational Linguistics and Intelligent Text Processing - 2nd International Conference, CICLing 2001, Proceedings. editor / Alexander Gelbukh. Springer Verlag, 2001. pp. 457-459 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).

@inproceedings{99073076c1154907a07a03ed6819f47a,

title = "Chi-square classifier for document categorization",

abstract = "The problem of document categorization is considered. The set of domains and the keywords specific for these domains is supposed to be selected beforehand as initial data. We apply the well-known statistical hypothesis test that considers images of documents and domains as normalized vectors. In comparison with existing methods, such approach allows to take into account a random character of initial data. The classifier is developed in the framework of Document Investigator software package.",

author = "Mikhail Alexandrov and Alexander Gelbukh and George Lozovoi",

note = "Publisher Copyright: {\textcopyright} Springer-Verlag Berlin Heidelberg 2001.; 2nd International Conference on Computational Linguistics and Intelligent Text Processing, CICLing 2001 ; Conference date: 18-02-2001 Through 24-02-2001",

year = "2001",

doi = "10.1007/3-540-44686-9_45",

language = "Ingl{\'e}s",

isbn = "3540416870",

series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

publisher = "Springer Verlag",

pages = "457--459",

editor = "Alexander Gelbukh",

booktitle = "Computational Linguistics and Intelligent Text Processing - 2nd International Conference, CICLing 2001, Proceedings",

address = "Alemania",

}

Alexandrov, M, Gelbukh, A & Lozovoi, G 2001, Chi-square classifier for document categorization. in A Gelbukh (ed.), Computational Linguistics and Intelligent Text Processing - 2nd International Conference, CICLing 2001, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 2004, Springer Verlag, pp. 457-459, 2nd International Conference on Computational Linguistics and Intelligent Text Processing, CICLing 2001, Mexico City, Mexico, 18/02/01. https://doi.org/10.1007/3-540-44686-9_45

Chi-square classifier for document categorization. / Alexandrov, Mikhail; Gelbukh, Alexander; Lozovoi, George.
Computational Linguistics and Intelligent Text Processing - 2nd International Conference, CICLing 2001, Proceedings. ed. / Alexander Gelbukh. Springer Verlag, 2001. p. 457-459 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 2004).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - Chi-square classifier for document categorization

AU - Alexandrov, Mikhail

AU - Gelbukh, Alexander

AU - Lozovoi, George

PY - 2001

Y1 - 2001

N2 - The problem of document categorization is considered. The set of domains and the keywords specific for these domains is supposed to be selected beforehand as initial data. We apply the well-known statistical hypothesis test that considers images of documents and domains as normalized vectors. In comparison with existing methods, such approach allows to take into account a random character of initial data. The classifier is developed in the framework of Document Investigator software package.

AB - The problem of document categorization is considered. The set of domains and the keywords specific for these domains is supposed to be selected beforehand as initial data. We apply the well-known statistical hypothesis test that considers images of documents and domains as normalized vectors. In comparison with existing methods, such approach allows to take into account a random character of initial data. The classifier is developed in the framework of Document Investigator software package.

UR - http://www.scopus.com/inward/record.url?scp=84928609713&partnerID=8YFLogxK

U2 - 10.1007/3-540-44686-9_45

DO - 10.1007/3-540-44686-9_45

M3 - Contribución a la conferencia

SN - 3540416870

SN - 9783540416876

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 457

EP - 459

BT - Computational Linguistics and Intelligent Text Processing - 2nd International Conference, CICLing 2001, Proceedings

A2 - Gelbukh, Alexander

PB - Springer Verlag

T2 - 2nd International Conference on Computational Linguistics and Intelligent Text Processing, CICLing 2001

Y2 - 18 February 2001 through 24 February 2001

ER -

Alexandrov M, Gelbukh A, Lozovoi G. Chi-square classifier for document categorization. In Gelbukh A, editor, Computational Linguistics and Intelligent Text Processing - 2nd International Conference, CICLing 2001, Proceedings. Springer Verlag. 2001. p. 457-459. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). doi: 10.1007/3-540-44686-9_45

Chi-square classifier for document categorization

Abstract

Publication series

Conference

Access to Document

Other files and links

Fingerprint

Cite this