TY - GEN
T1 - A domain independent natural language interface to databases capable of processing complex queries
AU - Rangel, Rodolfo A.Pazos
AU - Joaquín Pérez, O.
AU - Juan Javier González, B.
AU - Gelbukh, Alexander
AU - Sidorov, Grigori
AU - Myriam, J. Rodríguez M.
PY - 2005
Y1 - 2005
N2 - We present a method for creating natural language interfaces to databases (NLIDB) that allow for translating natural language queries into SQL. The method is domain independent, i.e., it avoids the tedious process of configuring the NLIDB for a given domain. We automatically generate the domain dictionary for query translation using semantic metadata of the database. Our semantic representation of a query is a graph including information from database metadata. The query is translated taking into account the parts of speech of its words (obtained with some linguistic processing). Specifically, unlike most existing NLIDBs, we take seriously auxiliary words (prepositions and conjunctions) as set theory operators, which allows for processing more complex queries. Experimental results (conducted on two Spanish databases from different domains) show that treatment of auxiliary words improves correctness of translation by 12.1%. With the developed NLIDB 82of queries were correctly translated (and thus answered). Reconfiguring the NLIDB from one domain to the other took only ten minutes.
AB - We present a method for creating natural language interfaces to databases (NLIDB) that allow for translating natural language queries into SQL. The method is domain independent, i.e., it avoids the tedious process of configuring the NLIDB for a given domain. We automatically generate the domain dictionary for query translation using semantic metadata of the database. Our semantic representation of a query is a graph including information from database metadata. The query is translated taking into account the parts of speech of its words (obtained with some linguistic processing). Specifically, unlike most existing NLIDBs, we take seriously auxiliary words (prepositions and conjunctions) as set theory operators, which allows for processing more complex queries. Experimental results (conducted on two Spanish databases from different domains) show that treatment of auxiliary words improves correctness of translation by 12.1%. With the developed NLIDB 82of queries were correctly translated (and thus answered). Reconfiguring the NLIDB from one domain to the other took only ten minutes.
UR - http://www.scopus.com/inward/record.url?scp=33646780907&partnerID=8YFLogxK
U2 - 10.1007/11579427_85
DO - 10.1007/11579427_85
M3 - Contribución a la conferencia
SN - 3540298967
SN - 9783540298960
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 833
EP - 842
BT - MICAI 2005
T2 - 4th Mexican International Conference on Artificial Intelligence, MICAI 2005
Y2 - 14 November 2005 through 18 November 2005
ER -