Change search
ReferencesLink to record
Permanent link

Direct link
Investigating PRECISE: Implementing a portable natural language interface to databases
KTH, School of Computer Science and Communication (CSC).
2015 (English)Independent thesis Basic level (degree of Bachelor), 10 credits / 15 HE creditsStudent thesis
Abstract [en]

A natural language interface to a database (NLIDB) lets a user query a database using a natural language. PRECISE (Popescu et al., 2003) is a formal model for a portable SQL NLIDB which interprets a question by pairing sentence tokens to database attributes and values with a maximum flow solution. PRECISE is said to be sound and complete for a large class of semantically tractable questions. We implemented PRECISE and deployed it on Geoquery, a database of geographical facts. PRECISE made no errors in terms of returning a single, incorrect query, giving it the highest possible precision value. However, out of the 448 questions given, PRECISE was only able to produce SQL queries for 162, giving it a recall value of 0.361. A considerable amount of sentences gave rise to multiple interpretations, which prompted PRECISE to produce no query. Moreover, PRECISE by design could not produce queries for sentences which did not contain a WH-token ({"what", "where", "when", "who", "which"}). Our implementation of PRECISE required some manual configuration when deployed on Geoquery for best recall. While the results are tied to our implementation they give an indication of the size of the semantically tractable class as well as the portability of PRECISE.

Place, publisher, year, edition, pages
National Category
Computer Science
URN: urn:nbn:se:kth:diva-166574OAI: diva2:811351
Available from: 2015-05-28 Created: 2015-05-11 Last updated: 2015-05-28Bibliographically approved

Open Access in DiVA

fulltext(754 kB)81 downloads
File information
File name FULLTEXT01.pdfFile size 754 kBChecksum SHA-512
Type fulltextMimetype application/pdf

By organisation
School of Computer Science and Communication (CSC)
Computer Science

Search outside of DiVA

GoogleGoogle Scholar
Total: 81 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Total: 97 hits
ReferencesLink to record
Permanent link

Direct link