Algorithm for extraction of subtrees of a sentence dependency parse tree

Juan Pablo Posadas-Durán, Grigori Sidorov, Helena Gómez-Adorno, Ildar Batyrshin, Elibeth Mirasol-Mélendez, Gabriela Posadas-Durán, Liliana Chanona-Hernández

Research output: Contribution to journal › Article › peer-review

7 Scopus citations

Abstract

In this paper, we introduce an algorithm for obtaining the subtrees (continuous and non-continuous syntactic n-grams) of the dependency parse tree of a sentence. The algorithm traverses the dependency tree of each sentence in a text document and extracts all of its subtrees (syntactic n-grams). Syntactic n-grams have been used successfully in the literature (by ourselves and other authors) as features for characterizing text documents with machine learning approaches in the field of Natural Language Processing.
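
To make the idea of "extracting all subtrees" concrete, the following is a minimal sketch of one way to enumerate every connected subtree of a small dependency tree. It is not the algorithm published in the paper, only an illustration under stated assumptions: the tree is given as a dictionary from a head to its list of dependents, each word is a unique node identifier (a real parser would use token indices), and the names rooted_subtrees, all_subtrees and syntactic_ngrams are hypothetical.

```python
from itertools import product


def rooted_subtrees(tree, node):
    """Return every connected subtree whose top node is `node`.

    Each subtree is a frozenset of (head, dependent) edges; the empty
    frozenset stands for the subtree consisting of `node` alone.
    """
    options_per_child = []
    for child in tree.get(node, []):
        # A child is either left out (empty edge set) or attached together
        # with one of the subtrees rooted at that child.
        attached = [frozenset({(node, child)}) | sub
                    for sub in rooted_subtrees(tree, child)]
        options_per_child.append([frozenset()] + attached)
    # Combine one option per child in every possible way.
    return [frozenset().union(*choice) for choice in product(*options_per_child)]


def all_subtrees(tree, root):
    """Collect (top_node, edges) pairs for every connected subtree under `root`."""
    result = []
    stack = [root]
    while stack:
        node = stack.pop()
        result.extend((node, edges) for edges in rooted_subtrees(tree, node))
        stack.extend(tree.get(node, []))
    return result


def syntactic_ngrams(tree, root, n):
    """Keep only the subtrees with exactly n nodes (that is, n - 1 edges)."""
    return [(top, edges) for top, edges in all_subtrees(tree, root)
            if len(edges) == n - 1]


# Assumed toy parse of "John eats red apples": head -> dependents.
example_tree = {"eats": ["John", "apples"], "apples": ["red"]}
bigrams = syntactic_ngrams(example_tree, "eats", 2)
```

In this toy tree, the subtrees of size 2 correspond to the three dependency edges. The number of subtrees can grow exponentially with the size of the tree (a head with k dependents already has 2^k subtrees rooted at it), so a practical extractor would normally bound n.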

Original language: English
Pages (from-to): 79-98
Number of pages: 20
Journal: Acta Polytechnica Hungarica
Volume: 14
Issue number: 3
State: Published - 2017

Keywords

  • Linguistic features
  • Subtrees extraction
  • Syntactic n-grams
  • Tree traversal
