PPChecker: Plagiarism pattern checker in document copy detection

Nam Oh Kang, Alexander Gelbukh, Sang Yong Han

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

39 Scopus citations

Abstract

Nowadays, most of documents are produced in digital format, in which they can be easily accessed and copied. Document copy detection is a very important tool for protecting the author's copyright. We present PPChecker, a document copy detection system based on plagiarism pattern checking. PPChecker calculates the amount of data copied from the original document to the query document, based on linguistically-motivated plagiarism patterns. Experiments performed on CISI document collection show that PPChecker produces better decision information for document copy detection than existing systems.

Original languageEnglish
Title of host publicationText, Speech and Dialogue - 9th International Conference, TSD 2006, Proceedings
PublisherSpringer Verlag
Pages661-667
Number of pages7
ISBN (Print)3540390901, 9783540390909
DOIs
StatePublished - 2006
Event9th International Conference on Text, Speech and Dialogue, TSD 2006 - Brno, Czech Republic
Duration: 11 Sep 200615 Sep 2006

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume4188 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference9th International Conference on Text, Speech and Dialogue, TSD 2006
Country/TerritoryCzech Republic
CityBrno
Period11/09/0615/09/06

Keywords

  • Document Copy Detection
  • Plagiarism Pattern

Fingerprint

Dive into the research topics of 'PPChecker: Plagiarism pattern checker in document copy detection'. Together they form a unique fingerprint.

Cite this