2nd International Conference on Research in Social Sciences | Vienna, Austria | 2019 | 3-5 December | Webster Vienna Private University | The Translation Model Based on Sentential Primitives. | oral | The main problem of language modelling is the meaning of the utterance, which is the ultimate goal of one of the directions of language model functioning (analysis) and the starting point for its opposite direction (synthesis). The paper suggests one of the options of the problem solution. In particular, dividing sentences into the set of (quasi)-synonymous “sentential primitives" and on the basis of the "primitives", building such a structure, which reflects the appropriate content of this set quite transparently. Determination of the estimated set of primitives should be based on the search of the constituents according to their characteristics, particularly, to their syntactic properties. Verbs and their dominant role in sentences are considered according to "multilevel" syntax. By this theory, a primitive is a central component (core) of the layered structure, which is "surrounded" by the periphery i.e. traditional adverbs. The central component itself involves the verb and its actants. At the same time, internal relations of primitives, i.e. dependencies of actants on the predicates and peripheries on central structures are defined by the semantic roles. Such approach to language content conveys the meaning of the utterance through the sentential primitives where the primitives are the language units. In the frame of the presentation, we will offer a translation model, which is simplified to the level of primitives. For testing the model, Georgian-English parallel corpus will be used. | www.dpublication.com/wp-content/uploads/2019/11/8046.pdf |
III International Scientific Conference "Current Issues in Applied Linguistics" | Baku, Azerbaijan | 2018 | 25-26 ოქტომბერი | Азербайджанский университет языков | Модель Автоматического Грузино-Английского Перевода «Сентенциальных Примитивов» Содержащих Имя Прилагательное. | oral | In the article the essential characteristics of the role relations of “sentential primitives” and specifically primitives, containing the adjectives of the Georgian language and a model for their automatic translation into English are considered. According to the fact which member of the sentence the adjective is connected with and what role it plays in the sentence, types of sentential primitives are separated. If during the translation process both languages are provided with means of partitioning into primitives and vice versa, compiling an expression from primitives, then the process will be brought up to the level of translation of primitives, and the translation will be significantly simplified. | https://muhaz.org/to-azerbaijan.html?page=3 |
IV International Conference "Verbal Communication Technologies-2016" | Tbilisi, Georgia | 2016 | 25-27 ნოემბერი | Georgian Technical University | GeWordNet – the Model of Georgian Lexical System | oral | WordNet is the most widely used lexical database in the field of modern information technology. It was based on the 1996 model of the human mental dictionary developed at the Princeton University Laboratory of Cognitive Sciences, which eventually became the most authoritative and widely used standard for building lexical-semantic databases. WordNet lexical-semantic thesaurus knowledge bases are used in tasks such as information retrieval, machine translation, word meaning determination, and dialog systems. EuroWordNet was created in 1999, combining WordNet dictionaries of European languages. Currently, the Department of Linguistic and Speech Systems of the Institute of Control Systems is working on the creation of Georgian WordNet (Shota Rustaveli National Science Foundation grant "Georgian Word Network Compiler GeWordNet"). The basic building block of GeWordNet is a set of synonyms (synset) that combine words with similar meanings. . It is assumed that each synset in the dictionary is a lexicalized concept of a given language. To make the dictionary easier to use, each synset is accompanied by a definition and contextual examples of synset words. In GeWordNet, synsets are interconnected by semantic relations - hyponymy, meronymy, lexical causation, presupposition, and more. Georgian WordNet is developed in two stages. First, a WordNet dictionary is created for the Georgian language, and in the second stage, the Georgian dictionary is linked to EuroWordNet through the Georgian-English interlingual index (ILI). Different levels of its equivalence with ILI index synsets can be used to describe specific language synsets: EQ_SYNONYM - Complete match between ILI index synset and language synset; EQ_NEAR_SYNONYM - synset of the target language corresponds to the synset of several ILI indexes; HAS_EQ_HYPERONYM - target language synset is more specific than index synset; HAS_EQ_HYPONYM Language synset can only communicate with index-specific synset. Georgian WordNet synsets are automatically formed using a bilingual electronic dictionary. | https://iliauni.edu.ge/uploads/other/38/38129.pdf |
| Tbilisi, Georgia | 2015 | 10-15 სექტემბერი | Arnold Chikobava Institute of Linguistics, Iv. Javakhishvili Tbilisi State University, Frankfurt University | Georgian language computer models | oral | Fundamental scientific researches in Computational Linguistics have been carried out at the Archil Eliashvili Institute of Control Systems, Georgian Technical University, for many years. Various combination methods (lexical functions, synonymous series, semantic roles and superparadigms) were developed for the Georgian language; A computer dictionary of the Georgian language has been created, which, at the same time, carries out functions of a morphological generator, in other words, it produces the full paradigm for each lexical unit. Within the framework of the fundamental issues of language modeling, a means of presenting linguistic algorithms has been developed, that allows the formulation of a bi-directional analysis-combined processor. For some languages, the filling-widening process of dictionaries has been simplified with the help of a grammar compiler, which is the most modern tool for the automatic realization of a formal language model. It is possible to compile morphological processor libraries of individual languages for different variations of any language (according to time, space, origin, genre, etc.) and so on. Automatic machine translation can be considered as the main achievement. To solve this task completely it is necessary to create lexical translator. This type of system is rather valued among the ordinary users, as it makes it easier for them to learn foreign-language texts intensively thus is much more useful while composing text. The strategy of our team is to provide reliable support for future language technologies by the theoretical and practical key issues that have been worked up in separate projects for the past years. The computer products created by our team are used in various linguistic areas. It is a challenge for linguists to create computational model of a language, taking into account its multi detections and changes. We have created main components to compile a national corpus manager. | |
11th Tbilisi Symposium on Language, Logic and Computation, TbiLLC 2015 | Tbilisi, Georgia | 2015 | 21-26 სექტემბერი | The Centre for Language, Logic and Speech at the Tbilisi State University, the Georgian Academy of Sciences and Institute for Logic, Language and Computation (ILLC) of the University of Amsterdam. | Syntax Annotation of the Georgian Literary Corpus | oral | In order to solve theoretical and applied tasks of Georgian language it is very important to draw out deeply annotated text corpora. While syntactically annotated corpora are now available for English, Czech, Russian and other languages, for Georgian they are rare. The environment, developed by our research group, offers several NLP applications, including a module of morphologic, syntactic and semantic level, a Universal Networking Language interface and a natural language interface to access SQL type databases. In this article, we research the automatic syntactic parser of Georgian Language. It includes syntactic level as well as morphologic level of Georgian language model. The basis of the linguistic model of Georgian text syntax annotation is the dependency grammar. | http://www.nplg.gov.ge/ec/ka/bibl/search.html?cmd=search&pft=biblio&qs=700%3A1%3A%E1%83%90%E1%83%9B%E1%83%98%E1%83%A0%E1%83%94%E1%83%96%E1%83%90%E1%83%A8%E1%83%95%E1%83%98%E1%83%9A%E1%83%98+%E1%83%90. |
International Scientific Conference on "Humanities in the Information Society- II" | Batumi, Georgia | 2014 | 24-26 ოქტომბერი | Batumi Shota Rustaveli State University, Faculty of Humanities | The syntactic structure of a georgian sentence | oral | (VP) phrases. To maintain the unity of the tree structure, the notion of a zero node (S-sentence) is introduced, which in the case of an infinitive verb is the parent of only the VP-verb phrase, while in the other case it is the parent of NP and VP. All members of the sentence (both main and non-main) are described. Each of them necessarily indicates its syntactic role: this is the role of the syntactic parent and the syntactic heir. The role of the parent can be both a noun phrase and a verb phrase. These phrases can freely participate in the role of heir as well. It is also shown in which syntactic construction this or that member of the sentence participates, which, in turn, has assigned all possible roles with appropriate grammatical features. The knowledge accumulated as a result of morphological analysis, which provides comprehensive syntactic information, plays an important role in the syntactic annotation structure of the Georgian sentence. For example, a noun in ergative case can only be a subject, and so on. A syntactic annotation system with such a structure allows a complete description of a Georgian sentence through grammatical features. | http://www.nplg.gov.ge/ec/ka/bibl/search.html?cmd=search&pft=biblio&qs=700%3A1%3A%E1%83%90%E1%83%9B%E1%83%98%E1%83%A0%E1%83%94%E1%83%96%E1%83%90%E1%83%A8%E1%83%95%E1%83%98%E1%83%9A%E1%83%98+%E1%83%90. |
Batumi II International Symposium in Lexicography | Batumi, Georgia | 2012 | 18-20 მაისი | Batumi Shota Rustaveli State University, Faculty of Education and Sciences; TSU Arnold Chikobava Institute of Linguistics; Ivane Javakhishvili Tbilisi State University Lexicographic Center; Shota Rustaveli National Science Foundation | Georgian language explanatory-combinatorial dictionary generator. | oral | Language modeling is one of the most important directions in modern linguistics. Linguistic models are characterized, on the one hand, by the division of the linguistic system into several levels (morphological, syntactic, semantic), and, on the other hand, by the establishment of direct connections between these levels. It is possible to rely on the "explanatory-combinatorial dictionary" for the agreed functioning and effective operation of different levels of the Georgian language model. In the computer explanatory-combinatorial dictionary of the Georgian language, all the information around the lexeme is divided into zones: first - the word commander, second - the definition of the word, third - the morphological model, fourth - the semantic-syntactic model of the word; the fifth zone is devoted to the list of lexical functions of speech. With the support of Shota Rustaveli National Science Foundation, a generator of the Georgian Explanatory-Combinatorial Dictionary was developed. To create it, we use the modern approach to the syntax and semantics of the Georgian language: "layered" syntactic structures; I. Melchuk - Theory of lexical parameters; J. Apresian - method of synonymous sequences. From the point of view of Kartvelology, this is an extension, strengthening and updating of the Georgian language description methodology in accordance with the new international standards; for the above modern theories - a new test of their cross-linguistic suitability; In terms of computer linguistics, this work provides a strong basis for creating a complete functional model of the Georgian language. | https://bsu.edu.ge/text_files/ge_file_2290_1.pdf |
Ninth International Tbilisi Symposium on Language, Logic and Computation | Kutaisi, Georgia, | 2011 | 26-30 ოქტომბერი | Institute for Logic, Language and Computation (ILLC) of the University of Amsterdam in conjunction with the Centre for Language, Logic and Speech at the Tbilisi State University, the Georgian Academy of Sciences and the Akaki Tsereteli State University. | Georgian “Ancestors” of the logical implication | oral | The lion share of mental (philosophical, logical) concepts has its origin in the natural language - “body of thinking”. The most important of those concepts should have their origin in all languages, though with some possible additional nuances which make them more understandable and near for usual users of this language. Particularly, here will be given a short review of Georgian conjunctions with semantics corresponding to the logical implication | https://archive.illc.uva.nl/Tbilisi/Tbilisi2011/Programme/Abstracts_General_Programme/index.html |
International Conference "Georgian Language and Modern Technologies" | Tbilisi, Georgia | 2011 | 7-8 ივლისი | Arnold Chikobava Institute of Linguistics | Some Questions of the Formation of the Plural in the Georgian Morphological Processor | oral | The article discusses the peculiarities of declination of some georgian language nouns in plural. Linguistic events are subject to certain regularities, but besides the general rules there are exceptions. The focus is on the forming issues of some noun plural forms. According to the already established rule, the nouns of a certain group do not use plural forms. They are called uncountable nouns. Such are the nouns of substances, abstract, collective, but it is not uncommon to use such nouns in the plural during different semantic loads. Some adjectives are turned into nouns in plural forms (reds, greens, rich, poor, etc.). In the article for ilustration a lot of word combination – collocation are presented which have already been well-established in the Georgian language, such as „ქართული ღვინოები“ - Georgian wines, „მარილების დაგროვება“- salt accumulation, „მინარალური წყლები“-mineral waters - (nouns of substances); "ფიქრები”-Thoughts, „მოტივები“-Motives, „არჩევნები“-Elections - (abstract nouns); „გუნდები“-Teams, „კრებები“-Congregations - (collective nouns): and others. | http://www.ice.ge/symposium/symp2011_2/konferencia-2011.pdf |
THE INTERNATIONAL CONFERENCE "THE GEORGIAN LANGUAGE AND MODERN TECHNOLOGIES `2009" | Tbilisi, Georgia | 2009 | 20-21 ოქტომბერი | Arnold Chikobava Institute of Linguistics | Georgian Computer Prompter | oral | Georgian computer prompter is a software which can assist the disabled to write in Georgian on the computer. As is known this problem cannot yet be solved by any current software. This system suggests the correct forms of the word and makes it easy to use the keyboard with the minimum of effort.The development of the Georgian morphological processor is provided by means of knowledge collection MESLM Multilanguage expert system. MESLM can be regarded as an instrument which imitates the human mind and creates morphological rules for any language processor. Development of the Georgian language processor by means of the MESLM system software includes three components. These are: verb, noun and other parts of speech. By this time the Georgian text corpora has been filled with about one million words. The software will assist the users to select grammatically correct words while keying in Georgian texts by means of specifying the sequence of words. | https://www.ice.ge/new/pages/news/konferencia.pdf |