Semantic Parsing to Manipulate Relational Database For a Management System

Abstract

Chatbots and AI assistants have established their importance in today's life. The main reason for adopting this technology is to connect with users, understand their requirements, and fulfill them. This has been achieved, but at the cost of heavy training data and complex learning models. This work proposes a simple algorithm, a model that can be implemented in different fields, each with its own work scope. The proposed model converts human language text to computer-understandable SQL queries. The model requires only data related to the specific field, saving data space. The model performs linear computation, which reduces computational complexity. This work also defines the stages at which a new methodology is implemented and which previous method was adopted to fulfill the requirement at each stage. Two datasets available online are used in this work: the ATIS dataset and WikiSQL. This work compares the computation time on the two datasets and also compares the accuracy on both. This paper builds on basic natural language processing tasks such as semantic parsing, NER, and part-of-speech tagging, and aims to achieve results through these simple methods.


Muhammad Hamzah Mushtaq

FAST National University of Computers and Emerging Sciences


Keywords

Semantic Parsing, NER, SQL, Text-to-SQL

1. Problem statement

To generate a generic model for low-end individual systems which, when provided with any limited database, would convert human language to the respective database query efficiently.

2. Introduction

Many companies have their own customer support department. Sometimes user queries pile up and there are not enough staff to handle them in time. Instead of employing a larger workforce, having a smart voice query and result system would save time and money. More importantly, a low-end system, which lacks large processing power and memory and needs to perform operations related only to its own field, struggles to run the heavy algorithms that today's approaches require: training models that need large amounts of training and testing data first, and a dependency on multiple computers/nodes for heavy processing. It is true that these high-tech solutions make our lives easier, but they are aimed at solving and achieving bigger goals. To work in a limited-scope environment, one does not need such amounts of data and dependencies. This paper aims to provide a solution for such limited-scope operations. For this purpose, we refer to a hotel room management system, which only needs to communicate data and information regarding the hotel, its vacant rooms, and their availability.

In the race to develop high-end generic solutions, we have lost sight of the achievements that could come from capturing each field individually. The same is the case for semantic parsing in speech recognition systems. Here, we are talking about a smart system which understands users' human language queries through their voice, interprets their question, and gives the required answer in human language. For example, as DDL queries are simple, the support assistant just needs to view the data from the database and reveal it to the user. If a smart user assistant handles these queries, the actual support assistants get more time for handling users with more difficult queries and issues. This shows the importance of voice assistants which serve users by communicating directly with the database. Hence, there must be a channel in which the user's natural language is translated to a query language for computers to understand and interpret. The traditional approach revolves around training over a huge amount of data, which can fail when data outside the training set occurs. Therefore, the motivation behind this research is to develop a solution which is independent of a training set and computes efficient results over a set of rules.

Understanding human language by converting it into a machine-understandable form and then retrieving information back is a tedious task. The central concept on this path is semantic parsing, which is responsible for converting human language to a query structure. It requires multiple functions such as tokenization, semantic analysis, parsing, and passing through language models. If we focus on a specific field of work where we need to implement semantic parsing of human language, then much less data is needed for training purposes. Moreover, information extraction becomes easier when there is just a single database involved. Instead of training models on various keywords and sentences, just a set of main keywords and their replacement query syntax is required. This leads to less memory and time consumption. Also, situation-specific data can be processed easily.

If we generalize the overall approaches used globally for NLP tasks [8], there are three categories, namely:

• Symbolic Approach or Rule-Based Approach
• Empirical Approach or Corpus-Based Approach
• Connectionist Approach or Using Neural Networks

Figure 1. Sample for natural language and related SQL extracted from [1]. In multi turns, these previous models save the query structure and re-query the database. The proposed model adopts a different technique for multi-turn questions by querying from previous results directly.

This research makes use of both the rule-based and the corpus-based machine learning approach. This is made possible by utilizing the text-to-SQL dataset. This dataset itself contains databases from multiple fields, mostly with limited database table usage. This is likely to suit this research, since the aim is to manipulate relational databases for single-scope databases. Secondly, the template that assists in refining the query consists of a classification model that classifies the table names and column names to the most accurate match within the dataset. The research carried out for Spider clearly outlines the specifications of multiple text-to-SQL datasets. This research does not consider the Spider dataset, since it contains multiple databases with multiple tables but with different scopes, which is not the aim of this research.

The original workflow for the research was that a black-box text-to-SQL generator would be used to provide a dummy or body of the respective SQL. This would save time. However, translating through a model would save both time and complexity. This research focuses on generating and utilizing a generic query language template which assists in designing a query, and on how to use already processed data for multi-turn dialogues. For template generation, NER will be used thoroughly. We will examine the implications of NER for query generation. Multiple datasets are available, as given in cath:cgo02. We will be using the structured databases of many of these datasets instead of their natural language to query language translations. We will compare our generated queries with those already given in the dataset to assess the correctness of our methodology. An example for comparison is shown below. We compare the total number of statements that are translated to queries and how many of them were correctly translated. Since the true translations are given within the respective datasets, we can match our results and obtain the comparison.
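The comparison described above amounts to counting how many statements were translated and how many of those match the reference query. The following is a minimal sketch of that bookkeeping, assuming an exact match after whitespace and case normalization. The paper does not state its matching criterion, so this criterion and the names `normalize_sql` and `translation_accuracy` are hypothetical.

```python
import re
from typing import List, Optional, Tuple


def normalize_sql(query: str) -> str:
    """Lowercase the query, collapse whitespace, and drop a trailing semicolon.

    Simplification: lowercasing also affects string literals inside the query.
    """
    trimmed = query.strip().rstrip(";").strip()
    return re.sub(r"\s+", " ", trimmed).lower()


def translation_accuracy(
    generated: List[Optional[str]], reference: List[str]
) -> Tuple[int, int, float]:
    """Return (translated, correct, accuracy).

    `generated[i]` is None when statement i could not be translated.
    Accuracy is the share of translated statements that match the reference.
    """
    if len(generated) != len(reference):
        raise ValueError("generated and reference must have the same length")
    translated = 0
    correct = 0
    for gen, ref in zip(generated, reference):
        if gen is None:
            continue  # statement was not translated at all
        translated += 1
        if normalize_sql(gen) == normalize_sql(ref):
            correct += 1
    accuracy = correct / translated if translated else 0.0
    return translated, correct, accuracy
```

Exact string matching is the strictest option; comparing the result sets returned by the two queries would be a more forgiving alternative.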

3. Related work

A lot of work has been done on voice assistants and chatbots, mainly on open-domain question answering (QA) [1]. For this purpose, a huge data corpus is required, drawn from various sources. The traditional approach requires training state-of-the-art models which utilize this data corpus and learn user behavior over voice commands.

The research in [1] also shows the use of domain ontology triples, which carry a specific format. This part makes it easier to generate a better query. The research uses lexicons to match instances from natural language to query language. This is a very effective technique which acts as a relay, a bridge between normal sentences and query sentences. Furthermore, the query templates are directly grouped into a particular predicate group which defines their structure and mechanism, either DDL or DML. Earlier models were trained on predefined text and the expected result, but the accuracy decreased when a new text or sentence appeared which was not covered in the training set.

Previous work also included a dialog-based structure [6] in which the user intervened to obtain better and more accurate query results, basically refining the query through questions asked by the model. This research excludes such intervention, mainly because the user should just have to input a voice command and get the results; the model must be properly trained to streamline and refine the SQL query itself.

Many methods have been implemented, ranging from rule-based to neural network approaches, as discussed in [9]. Each has its own limitations. Neural network implementations for translating human language to SQL give successful results with high accuracy but at a higher cost. They require a huge number of parameters and a huge training corpus. The authors have also highlighted that since each model is trained and tested on a different dataset, their accuracies differ because of the accuracy and correctness of the dataset itself. The authors also state that the well-known WikiSQL is based on a simple syntax pattern including SELECT FROM T [ WHERE (and )* ], where T is a given single table. With this we know that it does not support grouping, ordering, joins, or nested queries. A clear distinction between WikiSQL and other benchmark datasets is given in Figure 4, extracted from another paper.

Human-generated datasets have been evaluated efficiently in [2]. It identified that human-generated datasets lacked some of the properties needed in large-scale query sets. Up till now, work was carried out with a generic concept in mind. By generic we mean covering all aspects of information, all areas of research, and multiple topics and subjects that must be learned by the model. Hence a broader approach was necessary.
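To make the single-table WikiSQL pattern mentioned above concrete, the sketch below assembles a query with one selected column, an optional aggregate, one table, and optional AND-joined conditions. It is an illustration only: the function name `build_single_table_query` and the allowed aggregate and operator sets are assumptions, not taken from the paper or from [9].

```python
from typing import Any, List, Sequence, Tuple

# Assumed sets of aggregates and comparison operators for this sketch.
ALLOWED_AGGREGATES = {"", "MAX", "MIN", "COUNT", "SUM", "AVG"}
ALLOWED_OPERATORS = {"=", ">", "<"}


def build_single_table_query(
    table: str,
    column: str,
    aggregate: str = "",
    conditions: Sequence[Tuple[str, str, Any]] = (),
) -> Tuple[str, List[Any]]:
    """Build a parameterized single-table SELECT with AND-joined conditions.

    Table and column names are interpolated directly, so they must come
    from a trusted schema, never from raw user input.
    """
    if aggregate not in ALLOWED_AGGREGATES:
        raise ValueError(f"unsupported aggregate: {aggregate}")
    select_expr = f"{aggregate}({column})" if aggregate else column
    sql = f"SELECT {select_expr} FROM {table}"

    clauses: List[str] = []
    params: List[Any] = []
    for cond_column, operator, value in conditions:
        if operator not in ALLOWED_OPERATORS:
            raise ValueError(f"unsupported operator: {operator}")
        clauses.append(f"{cond_column} {operator} ?")  # value bound as parameter
        params.append(value)

    if clauses:
        sql += " WHERE " + " AND ".join(clauses)
    return sql, params
```

Because the builder has no slot for GROUP BY, ORDER BY, JOIN, or subqueries, it cannot express the constructs the pattern excludes.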

4. Implementation

Our first step in this research is to convert human voice to text. The text is passed through semantic parsing to develop the initial syntax for our query. We propose a rule-based approach in which the initially developed syntax is refined with the help of a predefined set of Query Language templates or rules. In this way, the computer gets to learn the correct form. Then the query is run over the database to fetch results. These queries can be of any type: Data Definition Language (DDL) or Data Manipulation Language (DML). This process is adopted for single-turn dialogue, where there is just one single operation.

Figure 2. Control flow logic of the model being implemented.

The second change to be implemented is using already gathered results for multi-turn dialogues. Instead of redesigning the query and gathering similar results with it, we propose a set of grammar and rules which define when to run a query over already gathered results and when to query the database. Also, this research will make use of NER. This will be helpful when matching the parsed query with domain ontology tuples or predefined templates.
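One possible form of the rule for multi-turn dialogues is sketched below: a follow-up that filters on a column already present in the previous result is answered from memory, and anything else goes back to the database. The paper does not spell out its grammar, so this rule, the `rooms` table, and the `DialogueSession` class are hypothetical.

```python
import sqlite3
from typing import Any, Dict, List, Optional, Sequence


def make_demo_db() -> sqlite3.Connection:
    """Create a small in-memory hotel database (hypothetical schema)."""
    conn = sqlite3.connect(":memory:")
    conn.row_factory = sqlite3.Row
    conn.execute(
        "CREATE TABLE rooms (id INTEGER PRIMARY KEY, floor INTEGER, "
        "beds INTEGER, vacant INTEGER)"
    )
    conn.executemany(
        "INSERT INTO rooms (id, floor, beds, vacant) VALUES (?, ?, ?, ?)",
        [(1, 1, 1, 1), (2, 1, 2, 0), (3, 2, 2, 1), (4, 2, 3, 1)],
    )
    return conn


class DialogueSession:
    """Keeps the last result so follow-up turns can reuse it."""

    def __init__(self, conn: sqlite3.Connection) -> None:
        self.conn = conn
        self.last_rows: Optional[List[Dict[str, Any]]] = None
        self.db_hits = 0  # number of queries actually sent to the database

    def query_database(self, sql: str, params: Sequence[Any] = ()) -> List[Dict[str, Any]]:
        self.db_hits += 1
        self.last_rows = [dict(row) for row in self.conn.execute(sql, params)]
        return self.last_rows

    def follow_up(
        self, column: str, value: Any, fallback_sql: str, fallback_params: Sequence[Any] = ()
    ) -> List[Dict[str, Any]]:
        """Filter the cached rows when they contain `column`; otherwise re-query.

        An empty or missing previous result always falls back to the database.
        """
        if self.last_rows and column in self.last_rows[0]:
            self.last_rows = [r for r in self.last_rows if r[column] == value]
            return self.last_rows
        return self.query_database(fallback_sql, fallback_params)
```

In this sketch a first turn such as "which rooms are vacant?" hits the database once, and a second turn such as "only those on floor 2" is answered from the cached rows without a new query.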

This is the first and initial part of our model, in which voice input is converted to text. The main work of this model lies in efficiently processing the user's text command and computing it into SQL query form; hence, we will use a third-party API just for converting human voice to text. For best results we would use Google's speech-to-text API. We convert human voice to text and then perform further processing on it. Also, since our model is trained to recognize English verbs or pronouns which declare what type of operation to perform, it is prone to some errors because of its dependency on the generated language.

The generated English sentence is broken down here into parts of speech and object/subject distinctions. Hence each word carries its attributes forward for query generation. The major tool here is Named Entity Recognition (NER), which tags words as names, locations, times, etc. The reason for adopting this process is to easily match the data with our already defined template. That query template gives us the refined SQL based on some predefined set of conditions. For example, when certain words fall into a certain category, a different SQL syntax will be used.

In parallel, a model is trained which generates an SQL body for the next method to work on. The model is trained over input sentences and predicts what type of SQL query should be used, for example a SELECT statement or an INSERT or DELETE statement. This model decides whether to form DDL or DML.
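The two steps above, tagging words with entity categories and predicting the statement type, can be illustrated with a toy example. The sketch below replaces the trained model with plain keyword rules and uses a small dictionary lookup in place of a real NER tagger; the word lists, tag names, and function names are all hypothetical and chosen for a hotel domain.

```python
import re
from typing import List, Tuple

# Hypothetical domain word list standing in for a real NER tagger.
ENTITY_GAZETTEER = {
    "room": "OBJECT",
    "rooms": "OBJECT",
    "guest": "PERSON",
    "guests": "PERSON",
    "today": "TIME",
    "tomorrow": "TIME",
    "tonight": "TIME",
}

# Hypothetical trigger verbs, checked in order; SELECT is the default.
STATEMENT_KEYWORDS = [
    ("INSERT", {"add", "book", "reserve", "register"}),
    ("DELETE", {"cancel", "remove", "delete"}),
    ("UPDATE", {"change", "update", "move"}),
]


def tokenize(sentence: str) -> List[str]:
    """Lowercase the sentence and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", sentence.lower())


def tag_entities(sentence: str) -> List[Tuple[str, str]]:
    """Return (token, tag) pairs for tokens found in the word list."""
    return [
        (token, ENTITY_GAZETTEER[token])
        for token in tokenize(sentence)
        if token in ENTITY_GAZETTEER
    ]


def predict_statement_type(sentence: str) -> str:
    """Pick the SQL statement type from the first matching trigger verb."""
    tokens = set(tokenize(sentence))
    for statement, keywords in STATEMENT_KEYWORDS:
        if tokens & keywords:
            return statement
    return "SELECT"
```

A trained classifier would replace `predict_statement_type` in the approach the paper describes; the keyword version only shows what the input and output of that step look like.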

The already defined SQL template also tells us whether to execute our statement on the previously generated result or to re-query the database. This helps us shorten the query and saves a lot of time which would otherwise be wasted on a conditional query over the database. Secondly, this is the part which differs from most modern techniques. Since our goal is to develop a model for low-end systems running individually for a specific field and purpose, converting language to an SQL query becomes easier. When the field and scope are narrowed down, so is the data needed to identify the meaning of human language.

The field-specific template would include precise NER subjects and their respective SQL query mapping, along with the structure for conveying results in text form. The first column would describe the NER subjects such as object, name, place, etc.; the second column would define the assigned query along with the positions of the entities. The third column denotes the structure of the results that would be displayed. For example if a SELECT statement with

Figure 3. Refining query includes the field-specific template and parsed text as input.

Figure 4. Datasets comparisons [7].

Statement | SQL mapping | Result statement
How many rooms are available? | SELECT COUNT(ID) FROM | THERE ARE (COUNT)
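The three-column template described above (NER subjects, SQL mapping with entity positions, result statement) can be sketched as a small lookup table run against a database. This is a minimal illustration under stated assumptions: the `rooms` table, its columns, the two template rows, and the names `TemplateRow` and `answer` are hypothetical, and `?` marks the positions where extracted entity values are bound.

```python
import sqlite3
from typing import Any, NamedTuple, Optional, Sequence, Tuple


class TemplateRow(NamedTuple):
    subjects: Tuple[str, ...]  # column 1: NER subjects found in the sentence
    sql: str                   # column 2: SQL mapping, "?" marks entity positions
    result: str                # column 3: structure of the result statement


# Hypothetical field-specific template for a hotel database.
TEMPLATE = [
    TemplateRow(
        ("OBJECT",),
        "SELECT COUNT(id) FROM rooms WHERE vacant = 1",
        "There are {0} rooms available.",
    ),
    TemplateRow(
        ("OBJECT", "NUMBER"),
        "SELECT COUNT(id) FROM rooms WHERE vacant = 1 AND beds = ?",
        "There are {0} rooms available with {1} beds.",
    ),
]


def make_demo_db() -> sqlite3.Connection:
    """Create a small in-memory hotel database (hypothetical schema)."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE rooms (id INTEGER PRIMARY KEY, beds INTEGER, vacant INTEGER)")
    conn.executemany(
        "INSERT INTO rooms (id, beds, vacant) VALUES (?, ?, ?)",
        [(1, 1, 1), (2, 2, 0), (3, 2, 1), (4, 3, 1)],
    )
    return conn


def answer(
    conn: sqlite3.Connection, subjects: Sequence[str], values: Sequence[Any] = ()
) -> Optional[str]:
    """Find the template row for the tagged subjects, run it, and fill the result text."""
    for row in TEMPLATE:
        if row.subjects == tuple(subjects):
            (count,) = conn.execute(row.sql, tuple(values)).fetchone()
            return row.result.format(count, *values)
    return None  # no template row covers this combination of subjects
```

Under these assumptions, "How many rooms are available?" would be tagged with a single OBJECT subject, matched to the first row, and answered with the first result statement.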