A Novel Approach to Interactive Dialogue Generation Based on Natural Language Creation with Context-Free Grammars and Sentiment Analysis

Bilexical context-free grammars (2-LCFGs) have proved to be accurate models for statistical natural language parsing. Existing dynamic programming algorithms used to parse sentences under these models have a running time of O(|w|^4), where w is the input string. A 2-LCFG is splittable if the left arguments of a lexical head are always independent of the right arguments, and vice versa. When a 2-LCFG is splittable, parsing time can be asymptotically improved to O(|w|^3). Testing this property is therefore of central interest to parsing efficiency. In this article, however, we show the negative result that splittability of 2-LCFGs is undecidable.

Generating Context-Free Grammars using Classical Planning

Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2017/613 ◽

2017 ◽

Cited By ~ 2

Author(s):

Javier Segovia-Aguas ◽

Sergio Jiménez ◽

Anders Jonsson

Keyword(s):

Input String ◽

Planning Problem ◽

Novel Approach ◽

Single Input ◽

Input Strings ◽

Context Free ◽

Context Free Grammars

This paper presents a novel approach for generating Context-Free Grammars (CFGs) from small sets of input strings (a single input string in some cases). Our approach is to compile this task into a classical planning problem whose solutions are sequences of actions that build and validate a CFG compliant with the input strings. In addition, we show that our compilation is suitable for implementing the two canonical tasks for CFGs, string production and string recognition.
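
The two canonical CFG tasks named above, string production and string recognition, can be illustrated with a minimal Python sketch. It does not reproduce the paper's planning compilation: the toy grammar for the aⁿbⁿ language, the CYK recognizer, and the names `BINARY`, `TERMINAL`, `recognize`, and `produce` are all assumptions chosen for illustration.

```python
import random

# Hypothetical toy grammar in Chomsky normal form for { a^n b^n : n >= 1 }:
#   S -> A B | A X      X -> S B      A -> 'a'      B -> 'b'
BINARY = {"S": [("A", "B"), ("A", "X")], "X": [("S", "B")]}
TERMINAL = {"A": ["a"], "B": ["b"]}


def recognize(s, start="S"):
    """String recognition with the CYK algorithm (grammar must be in CNF)."""
    n = len(s)
    if n == 0:
        return False
    # table[i][k] holds the nonterminals that derive the substring s[i : i + k + 1]
    table = [[set() for _ in range(n)] for _ in range(n)]
    for i, ch in enumerate(s):
        for nt, terminals in TERMINAL.items():
            if ch in terminals:
                table[i][0].add(nt)
    for length in range(2, n + 1):
        for i in range(n - length + 1):
            for split in range(1, length):
                left = table[i][split - 1]
                right = table[i + split][length - split - 1]
                for nt, rules in BINARY.items():
                    for b, c in rules:
                        if b in left and c in right:
                            table[i][length - 1].add(nt)
    return start in table[0][n - 1]


def produce(symbol="S", rng=random, depth=0, max_depth=6):
    """String production: expand `symbol` by randomly chosen rules.

    Past `max_depth` the first rule of each nonterminal is always used,
    which for this toy grammar guarantees termination.
    """
    if symbol in TERMINAL:
        return rng.choice(TERMINAL[symbol])
    rules = BINARY[symbol]
    rule = rules[0] if depth >= max_depth else rng.choice(rules)
    return "".join(produce(x, rng, depth + 1, max_depth) for x in rule)
```

Every string returned by `produce()` should be accepted by `recognize`, while a string such as `"abab"` should be rejected because it is not of the form aⁿbⁿ.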

Researching on Parsing

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.846-847.1376 ◽

2013 ◽

Vol 846-847 ◽

pp. 1376-1379

Author(s):

Li Fei Geng ◽

Hong Lian Li

Keyword(s):

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Linguistic Analysis ◽

Syntactic Analysis ◽

The Core ◽

Core Technology ◽

Context Free ◽

Probabilistic Context ◽

Context Free Grammars

Syntactic analysis is the core technology of natural language processing and the cornerstone of further linguistic analysis. This paper first introduces the basic grammatical systems and summarizes current parsing techniques. It then analyzes the characteristics of probabilistic context-free grammars in depth and introduces methods for improving them. Finally, it points out the difficulties of Chinese parsing.
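
As a reminder of what a probabilistic context-free grammar assigns to a parse, the sketch below multiplies the probabilities of the rules used in a tree. The toy rules, their probabilities, and the names `RULES` and `tree_probability` are assumptions made for illustration and are not taken from the paper.

```python
from math import prod

# Hypothetical toy PCFG: the probabilities of all rules sharing a left-hand
# side sum to 1 (e.g. NP: 0.6 + 0.4, VP: 0.7 + 0.3).
RULES = {
    ("S", ("NP", "VP")): 1.0,
    ("NP", ("she",)): 0.6,
    ("NP", ("fish",)): 0.4,
    ("VP", ("V", "NP")): 0.7,
    ("VP", ("eats",)): 0.3,
    ("V", ("eats",)): 1.0,
}


def tree_probability(tree):
    """Probability of a parse tree: the product of its rule probabilities.

    A tree is a tuple (label, child, ...) where each child is either a
    subtree or a word given as a plain string.
    """
    label, *children = tree
    # Right-hand side of the rule applied at this node.
    rhs = tuple(c if isinstance(c, str) else c[0] for c in children)
    p = RULES[(label, rhs)]
    return p * prod(tree_probability(c) for c in children if not isinstance(c, str))


tree = ("S", ("NP", "she"), ("VP", ("V", "eats"), ("NP", "fish")))
```

For this tree the rules used are S → NP VP, NP → she, VP → V NP, V → eats, and NP → fish, so the result is the product 1.0 × 0.6 × 0.7 × 1.0 × 0.4.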

Evolutionary Grammatical Inference

Encyclopedia of Artificial Intelligence ◽

10.4018/978-1-59904-849-9.ch091 ◽

2011 ◽

pp. 596-602

Author(s):

Ernesto Rodrigues ◽

Heitor Silvério Lopes

Keyword(s):

Genetic Programming ◽

Natural Language ◽

Search Space ◽

Grammatical Inference ◽

Language L ◽

Finite State ◽

Finite Set ◽

Context Free ◽

Pushdown Automata ◽

Context Free Grammars

Grammatical Inference (also known as grammar induction) is the problem of learning a grammar for a language from a set of examples. In a broad sense, some data is presented to a learner, which should return a grammar capable of explaining the input data to some extent. The grammar inferred from the data can then be used to classify unseen data or to provide a suitable model for it. The classical formalization of Grammatical Inference (GI) is known as Language Identification in the Limit (Gold, 1967). Here, there is a finite set S+ of strings known to belong to the language L (the positive examples) and another finite set S- of strings not belonging to L (the negative examples). The language L is said to be identifiable in the limit if there exists a procedure to find a grammar G such that S+ ⊆ L(G), S- ∩ L(G) = ∅ and, in the limit, for sufficiently large S+ and S-, L = L(G). The disjoint sets S+ and S- are given to provide clues for the inference of the production rules P of the unknown grammar G used to generate the language L. Applications of grammatical inference span such diverse fields as speech and natural language processing, gene analysis, pattern recognition, image processing, sequence prediction, information retrieval, cryptography, and many more. An excellent state-of-the-art overview of the subject is provided by de la Higuera (2005). Traditionally, most work in GI has focused on the inference of regular grammars by inducing finite-state automata, which can be learned efficiently. For context-free languages, some recent approaches have shown limited success (Starckie, Costie & Zaanen, 2004), because the search space of possible grammars is infinite. The parenthesis and palindrome languages are common test cases for the effectiveness of grammatical inference methods. Both languages are context-free. The parenthesis language is deterministic but the palindrome language is nondeterministic (de la Higuera, 2005).
The use of evolutionary methods for context-free grammatical inference is not new, but only a few attempts have been successful. Wyard (1991) used a Genetic Algorithm (GA) to infer grammars for the language of correctly balanced and nested parentheses with success, but failed on the language of sentences containing the same number of a’s and b’s (the aⁿbⁿ language). In another attempt (Wyard, 1994), he obtained positive results on the inference of two classes of context-free grammars: the class of n-symbol palindromes with 2 ≤ n ≤ 4 and a class of small natural language grammars. Sen and Janakiraman (1992) applied a GA using a pushdown automaton to the inference task and successfully learned the aⁿbⁿ language and the parentheses balancing problem, but their approach does not scale well. Huijsen (1994) applied a GA to infer context-free grammars for the parentheses balancing problem, the language of equal numbers of a’s and b’s, and the even-length 2-symbol palindromes. Huijsen used a “marker-based” encoding scheme, which has the main advantage of allowing variable-length chromosomes. The inference of regular grammars was successful, but the inference of context-free grammars failed. Overall, the results of these earlier attempts to apply GAs to context-free grammatical inference were limited. The first attempt to use Genetic Programming (GP) for grammatical inference used a pushdown automaton (Dunay, 1994) and successfully learned the parenthesis language, but failed on the aⁿbⁿ language. Korkmaz and Ucoluk (2001) also presented a GP approach using a prototype theory, which provides a way to recognize similarity between the grammars in the population. With this representation, it is possible to recognize the so-called building blocks, but the results are preliminary. Javed and his colleagues (2004) proposed a GP approach with grammar-specific heuristic operators and non-random construction of the initial grammar population. Their approach succeeded in inducing small context-free grammars. More recently, Rodrigues and Lopes (2006) proposed a hybrid GP approach that uses a confusion matrix to compute the fitness. They also proposed a local search mechanism that uses information obtained from the sentence parsing to generate a set of useful productions. The system was used for the parenthesis and palindrome languages with success.
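
A confusion-matrix fitness over positive and negative examples can be sketched as follows. The abstract does not state the exact fitness formula, so the product of sensitivity and specificity used here is an assumption, as are the function names and the sample sets; the candidate "grammar" is represented simply by a recognizer function.

```python
def confusion_matrix(accepts, positives, negatives):
    """Count (TP, FN, FP, TN) for a candidate recognizer `accepts`."""
    tp = sum(1 for s in positives if accepts(s))
    fn = len(positives) - tp
    fp = sum(1 for s in negatives if accepts(s))
    tn = len(negatives) - fp
    return tp, fn, fp, tn


def fitness(accepts, positives, negatives):
    """Assumed fitness: sensitivity * specificity, a value in [0, 1].

    It equals 1.0 exactly when the candidate accepts every positive
    example and rejects every negative example.
    """
    tp, fn, fp, tn = confusion_matrix(accepts, positives, negatives)
    sensitivity = tp / (tp + fn) if positives else 0.0
    specificity = tn / (tn + fp) if negatives else 0.0
    return sensitivity * specificity


def balanced(s):
    """Reference recognizer for the parenthesis language."""
    depth = 0
    for ch in s:
        if ch == "(":
            depth += 1
        elif ch == ")":
            depth -= 1
        else:
            return False
        if depth < 0:
            return False
    return depth == 0


# Hypothetical sample sets for the parenthesis language.
POSITIVES = ["()", "(())", "()()"]
NEGATIVES = [")(", "(()", "())"]
```

A candidate that only compares the counts of opening and closing parentheses accepts `")("` and is therefore penalized through the false-positive cell, while `balanced` scores 1.0 on these samples.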

Natural Language Understanding by Combining Statistical Methods and Extended Context-Free Grammars

Lecture Notes in Computer Science - Pattern Recognition ◽

10.1007/978-3-540-69321-5_26 ◽

2008 ◽

pp. 254-263

Author(s):

Stefan Schwärzler ◽

Joachim Schenk ◽

Frank Wallhoff ◽

Günther Ruske

Keyword(s):

Natural Language ◽

Statistical Methods ◽

Natural Language Understanding ◽

Language Understanding ◽

Context Free ◽

Context Free Grammars

Incremental Learning of Context Free Grammars by Parsing-Based Rule Generation and Rule Set Search

Transactions of the Japanese Society for Artificial Intelligence ◽

10.1527/tjsai.21.371 ◽

2006 ◽

Vol 21 ◽

pp. 371-379 ◽

Cited By ~ 1

Author(s):

Katsuhiko Nakamura ◽

Akemi Hoshina

Keyword(s):

Incremental Learning ◽

Rule Generation ◽

Rule Set ◽

Context Free ◽

Context Free Grammars

VADER Natural Language Processing in Market Sentiment Analysis

SSRN Electronic Journal ◽

10.2139/ssrn.3676706 ◽

2020 ◽

Author(s):

Jonathan Seror

Keyword(s):

Natural Language Processing ◽

Natural Language ◽

Sentiment Analysis ◽

Language Processing ◽

Market Sentiment

A Review on Sentiment Classification: Natural Language Understanding

Recent Patents on Engineering ◽

10.2174/1872212112666180731113353 ◽

2019 ◽

Vol 13 (1) ◽

pp. 20-27 ◽

Cited By ~ 1

Author(s):

Srishty Jindal ◽

Kamlesh Sharma

Keyword(s):

Natural Language ◽

Sentiment Analysis ◽

Social Networking Sites ◽

Natural Language Understanding ◽

Business Analytics ◽

Language Understanding ◽

Text Data ◽

Data Set ◽

Market Positioning ◽

Illegal Activities

Background: With the tremendous increase in the use of social networking sites for sharing emotions, views, preferences, etc., a huge volume of data and text is available on the internet. This creates the need to understand the text and analyse the data in order to determine the exact intent behind them for the greater good. Understanding text and data involves many analytical methods, several phases and multiple techniques. Efficient use of these techniques is important for an effective and relevant understanding of the text/data. This analysis can in turn be very helpful in e-commerce for targeting audiences, in social media monitoring for identifying malicious actors and taking proactive action to prevent unethical and illegal activities, and in business analytics, market positioning, etc. Method: The goal is to understand the basic steps involved in analysing text data, which can help in determining the sentiments behind it. This review provides a detailed description of the steps involved in sentiment analysis, together with recent research. Patents related to sentiment analysis and classification are reviewed to shed light on the work done in the field. Results: Sentiment analysis determines the polarity behind the text data/review. This analysis helps in increasing business revenue, in e-health, and in determining the behaviour of a person. Conclusion: This study helps in understanding the basic steps involved in natural language understanding. At each step there are multiple techniques that can be applied to the data. Different classifiers provide variable accuracy depending upon the data set and classification technique used.
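
The idea of determining the polarity behind a piece of text can be illustrated with a very small lexicon-based scorer. This is only a sketch under stated assumptions: the word list, the weights, the negation rule, and the names `LEXICON`, `NEGATIONS`, and `polarity` are hypothetical, and the code does not correspond to any specific classifier covered by the review.

```python
import re

# Hypothetical sentiment lexicon: positive weights for positive words,
# negative weights for negative words.
LEXICON = {
    "good": 1.0, "great": 2.0, "love": 2.0,
    "bad": -1.0, "terrible": -2.0, "hate": -2.0,
}
NEGATIONS = {"not", "never", "no"}


def polarity(text):
    """Classify text as 'positive', 'negative' or 'neutral'.

    Steps: lowercase and tokenize, look each token up in the lexicon,
    flip the sign of the next sentiment word after a negation, then
    sum the scores and threshold at zero.
    """
    tokens = re.findall(r"[a-z']+", text.lower())
    score = 0.0
    negate = False
    for token in tokens:
        if token in NEGATIONS:
            negate = True
            continue
        value = LEXICON.get(token, 0.0)
        if value:
            score += -value if negate else value
            negate = False
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"
```

Text without any lexicon words is reported as neutral, which is one reason practical systems rely on much larger lexicons or trained classifiers whose accuracy depends on the data set.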
