NLP for shallow question answering of legal documents using graphs

Producción científica: Capítulo del libro/informe/acta de congresoContribución a la conferenciarevisión exhaustiva

18 Citas (Scopus)

Resumen

Previous work has shown that modeling relationships between articles of a regulation as vertices of a graph network works twice as better than traditional information retrieval systems for returning articles relevant to the question. In this work we experiment by using natural language techniques such as lemmatizing and using manual and automatic thesauri for improving question based document retrieval. For the construction of the graph, we follow the approach of representing the set of all the articles as a graph; the question is split in two parts, and each of them is added as part of the graph. Then several paths are constructed from part A of the question to part B, so that the shortest path contains the relevant articles to the question. We evaluate our method comparing the answers given by a traditional information retrieval system - vector space model adjusted for article retrieval, instead of document retrieval - and the answers to 21 questions given manually by the general lawyer of the National Polytechnic Institute, based on 25 different regulations (academy regulation, scholarships regulation, postgraduate studies regulation, etc.); with the answer of our system based on the same set of regulations. We found that lemmatizing increases performance in around 10%, while the use of thesaurus has a low impact.

Idioma originalInglés
Título de la publicación alojadaComputational Linguistics and Intelligent Text Processing - 10th International Conference, CICLing 2009, Proceedings
Páginas498-508
Número de páginas11
DOI
EstadoPublicada - 2009
Evento10th International Conference on Computational Linguistics and Intelligent Text Processing, CICLing 2009 - Mexico City, México
Duración: 1 mar. 20097 mar. 2009

Serie de la publicación

NombreLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volumen5449 LNCS
ISSN (versión impresa)0302-9743
ISSN (versión digital)1611-3349

Conferencia

Conferencia10th International Conference on Computational Linguistics and Intelligent Text Processing, CICLing 2009
País/TerritorioMéxico
CiudadMexico City
Período1/03/097/03/09

Huella

Profundice en los temas de investigación de 'NLP for shallow question answering of legal documents using graphs'. En conjunto forman una huella única.

Citar esto