TY - GEN
T1 - Terms derived from frequent sequences for extractive text summarization
AU - Ledeneva, Yulia
AU - Gelbukh, Alexander
AU - García-Hernández, René Arnulfo
N1 - Funding Information:
Work done under partial support of Mexican Government (CONACyT, SNI, SIP-IPN, COTEPABE-IPN, COFAA-IPN). The authors thank Rada Mihalcea for useful discussion.
PY - 2008
Y1 - 2008
N2 - Automatic text summarization helps the user to quickly understand large volumes of information. We present a language- and domain-independent statistical-based method for single-document extractive summarization, i.e., to produce a text summary by extracting some sentences from the given text. We show experimentally that words that are parts of bigrams that repeat more than once in the text are good terms to describe the text's contents, and so are also so-called maximal frequent sentences. We also show that the frequency of the term as term weight gives good results (while we only count the occurrences of a term in repeating bigrams).
AB - Automatic text summarization helps the user to quickly understand large volumes of information. We present a language- and domain-independent statistical-based method for single-document extractive summarization, i.e., to produce a text summary by extracting some sentences from the given text. We show experimentally that words that are parts of bigrams that repeat more than once in the text are good terms to describe the text's contents, and so are also so-called maximal frequent sentences. We also show that the frequency of the term as term weight gives good results (while we only count the occurrences of a term in repeating bigrams).
UR - http://www.scopus.com/inward/record.url?scp=49949097893&partnerID=8YFLogxK
U2 - 10.1007/978-3-540-78135-6_51
DO - 10.1007/978-3-540-78135-6_51
M3 - Contribución a la conferencia
SN - 354078134X
SN - 9783540781349
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 593
EP - 604
BT - Computational Linguistics and Intelligent Text Processing - 9th International Conference, CICLing 2008, Proceedings
T2 - 9th International Conference on Computational Linguistics and Intelligent Text Processing, CICLing 2008
Y2 - 17 February 2008 through 23 February 2008
ER -