Generation of the state tree based on generative grammar over trees of strings
Автор: Lichargin D.V.
Журнал: Сибирский аэрокосмический журнал @vestnik-sibsau
Рубрика: Математика, механика, информатика
Статья в выпуске: 7 (33), 2010 года.
Бесплатный доступ
In the article the principle of state trees generation is considered based on the generative grammars over trees of strings in such objects as the sentences of natural languages, as well as two and tree dimensional images. The image of the object as a forest is considered; including the trees of object different layouts for the purpose of complex system modeling.
Natural language generation, generative grammars, semantics
Короткий адрес: https://sciup.org/148176476
IDR: 148176476
Текст научной статьи Generation of the state tree based on generative grammar over trees of strings
The problem of natural language sentences generation is one of the key issues in the field of computer science and formal grammar theories. The issue of meaningful speech generation applies to the area of semantics and computer science [1–7]. The states tree generation issue is studied well enough in computer science and in system analysis. In respect to the question of meaningful phrases tree generation the problem is first of all connected to the method of sentence generation by means of Chomsky’s generative grammars. Generative grammars are successfully applied in software such as electronic translation systems, expert systems, systems of orthography checking, etc.
The flash point of the article is the analysis prospects for using generative grammars not over strings, but over trees of strings. In this respect it is possible to solve the task of generating grammatically and semantically meaningful speech more effectively and increasing the efficiency of different images analysis and synthesis aspects.
The importance of the issue on effective generating language meaningful constructions and two or three dimensional images is generally understood and is connected with the demands of linguistic and other software.
The purpose of this research is to apply generative grammars on the necessity basis over trees as means of meaningful speech generation connected with greater heterogeneous context.
The novelty of the work is in the application of generative grammars not over strings but over trees of strings.
It is well-known that standard generative grammar over strings have the form of the four: G, where S is an initial symbol of the generative grammar, T is a set of terminal symbols, N is a set of non-terminal symbols, and R is a set of rules for transforming one string into another.
For generative grammars over trees, the strings of symbols t and n are substituted by trees (or forest – trees with equal nodes). t = t < t ’, t ’’, …, tn >, where t ’ = t ’ < t 1 , t 2 , …, tm > etc, n = n < n ’, n ’’, …, nn >, where n ’ = n ’ < n 1 , n 2 , …, nm > etc.
One of the main particularities of any system is the presence of hierarchy elements in the system. Meanwhile the hierarchy relations can sometimes be presented as a set of hierarchies in different layouts of the system consideration. For example, the sum of three systems: a sentence as a distributed narration, a sentence with the purpose to order the tea, and a sentence with the purpose to support polite dialogue can result in a meaningful sentence of the natural language. At the same time, for the generation of such complex systems with several purposes and layouts of consideration it is necessary to use more complex means of generative grammar over the trees of strings, for the purpose of generating the tree of possible natural language sentences.
The generative grammar over trees of strings is composed in the following way. Let A<_B<_C1 ^ C2_>,_, B’<...C1’ ^ C2’>...> is a rule of generative grammar over trees, from a set of such rules with the trees of strings for terminal symbols T and nonterminal symbols N, « ^ » is a symbol of transferring one string to another. S<> is an initial symbol of generative grammar over trees.
During each stage of deepening the tree of states into another generated tree, or a forest of strings reduced to the multiplication of the obtained generated tree by the generative grammar rule.
It is also possible to consider the trees of heterogeneous information A = {A
As a result, the sentence can be considered as the union (addition) of trees from different consideration layouts over the whole space (tree) of natural language points [4–6].
Let’s have a tree A , B’’<…>,…, B’’’<…>>, C 
Let’s consider an example; the tree maneuvers in a chess game: Board 
      The half-move of a knight can be such: Board <... Column [X] <^Cell [Y] 
The generation of a chair image for example, can presuppose a potential image of a person on the chair. Chair 
The principle of reducing or adding the images is the following: semantically analogue elements – tree nodes are declared to be identical, in the case for several reduction variants an additional sub-tree of possible system states is built as a result of adding system element trees or generating system state trees.
The sentence of the natural language can be presented in the form of a tree as well. For example, the tree of grammatical sentence analysis the can be simplified as: Clause 
The tree can be added to (reduced by) the tree of semantic analysis, for example, the Topic “Building” 
A tree of the following type can be used for the generation of natural language sentences:
- 
        1. Subject – Essence (the … / person / man / woman). 
- 
        2. Modality – Action over Relation (want / wish / love / adore). 
- 
        3. Predicate – Action with Clothes (buy / get / try on / wear). 
- 
        4. Object – Clothes (the … / jeans / sweater / footballer). 
The given tree can be multiplied by the following rule of generative grammar.
- 
        1. 0 ^ the. 
- 
        2. 0 ^ Attribute - Property of Clothes (stylish / fashionable / checked. 
- 
        3. Object - Clothes (The -> 0 / jeans / sweater / 
footballer).
In result, a sentence like: “the person wants to get the fashionable sweater” or “the woman wishes to buy the checked footballer” is obtained.
It can be assumed that the analyzing of the image recognition problem, natural language analysis, and a number of other problems can be effectively solved only based on their synthetic joint consideration. For example, for the translation of the word-combination “up-link communication” not into the English language as “communication with a satellite” it is necessary to use a visual image of the facts discussed in the text. This way, in a system of translations, while the text translating is a semantically visual image of narration that should be grown, a translation without a latter close to the human one is impossible.
For the realization of the principles aforementioned, it is necessary to start the elaboration of the dictionary for semantic trees of heterogeneous data: images, patterns of sentences composition, algorithms, and so on. It will be necessary to use the already existing dictionaries of sentences generation in the “Electronic Dictionary” software for the system basis.
In conclusion it is necessary to mention that generative grammar over the trees of strings is an effective means of generating the state trees for such systems, like natural language sentence and semantically loaded images. It is thought to apply the generative grammars over the trees of strings on the basis of the Semantic Trees’ Dictionary, which is a classification of heterogeneous semantic data.
 
	 
		