TY - GEN
T1 - Intra-document and inter-document redundancy in multi-document summarization
AU - Carrillo-Mendoza, Pabel
AU - Calvo, Hiram
AU - Gelbukh, Alexander
N1 - Publisher Copyright:
© Springer International Publishing AG 2017.
PY - 2017
Y1 - 2017
N2 - Multi-document summarization differs from single-document summarization in excessive redundancy of mentions of some events or ideas. We show how the amount of redundancy in a document collection can be used for assigning importance to sentences in multi-document extractive summarization: for instance, an idea could be important if it is redundant across documents because of its popularity; on the other hand, an idea could be important if it is not redundant across documents because of its novelty. We propose an unsupervised graph-based technique that, based on proper similarity measures, allows us to experiment with intra-document and inter-document redundancy. Our experiments on DUC corpora show promising results.
AB - Multi-document summarization differs from single-document summarization in excessive redundancy of mentions of some events or ideas. We show how the amount of redundancy in a document collection can be used for assigning importance to sentences in multi-document extractive summarization: for instance, an idea could be important if it is redundant across documents because of its popularity; on the other hand, an idea could be important if it is not redundant across documents because of its novelty. We propose an unsupervised graph-based technique that, based on proper similarity measures, allows us to experiment with intra-document and inter-document redundancy. Our experiments on DUC corpora show promising results.
KW - Cross-documents redundancy
KW - Doc2vec
KW - Graph-based methods
KW - Inter-document redundancy
KW - Intra-document redundancy
KW - Multi-document summarization
KW - Per-document redundancy
KW - Unsupervised summarization
UR - http://www.scopus.com/inward/record.url?scp=85028457892&partnerID=8YFLogxK
U2 - 10.1007/978-3-319-62434-1_9
DO - 10.1007/978-3-319-62434-1_9
M3 - Contribución a la conferencia
SN - 9783319624334
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 105
EP - 115
BT - Advances in Soft Computing - 15th Mexican International Conference on Artificial Intelligence, MICAI 2016, Proceedings
A2 - Herrera-Alcantara, Oscar
A2 - Sidorov, Grigori
PB - Springer Verlag
T2 - 15th Mexican International Conference on Artificial Intelligence, MICAI 2016
Y2 - 23 October 2016 through 28 October 2016
ER -