A Linguistic Model of the Georgian Language (Synthesis of Nominal Word Forms)
Abstract
In this paper we present a linguistic model designed for the morphological synthesis and analysis of the Georgian language. The model fully covers the morphological structure of Georgian. Based on it, we have built software with a dedicated tool, a morphological processor, that generates and analyses word forms. We emphasise that the model does not rely on any particular grammatical theory. Rather, the language data are presented in a different, structured format that takes the theories currently in use into account. The model presented here is a collection of morphological equations (roughly 3,000 units) required to generate every word form from a single stem. Currently, the morphological processor for Georgian can synthesise more than 266 million word forms. Not all of these forms occur in electronic texts or large corpora, but all of them are viable. Such words are referred to as “potential forms”. Potential forms are crucial for studying a variety of topics in natural language processing, and they also help with specific artificial intelligence problems. In future work, we intend to determine the exact frequencies of the forms produced by the processor and to specify the domains in which they are used.
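To make the idea of generating every word form from a single stem concrete, the following is a minimal sketch in Python. It is not the paper's processor, and the rule format is an assumption: the names `CASE_SUFFIXES`, `PLURAL_MARKER`, `synthesize` and `analyze` are hypothetical. It covers only the regular declension of a consonant-final noun stem, with no stem alternations such as syncope or truncation.

```python
# Hypothetical sketch: the rule format and all names here are illustrative
# assumptions, not the paper's actual morphological equations.

# Case markers for a regular consonant-final noun stem.
CASE_SUFFIXES = {
    "nominative": "ი",
    "ergative": "მა",
    "dative": "ს",
    "genitive": "ის",
    "instrumental": "ით",
    "adverbial": "ად",
    "vocative": "ო",
}

# Plural marker inserted between the stem and the case marker.
PLURAL_MARKER = "ებ"


def synthesize(stem):
    """Generate all number/case forms: stem + number marker + case marker."""
    forms = {}
    for number, marker in (("singular", ""), ("plural", PLURAL_MARKER)):
        for case, suffix in CASE_SUFFIXES.items():
            forms[(number, case)] = stem + marker + suffix
    return forms


def analyze(word_form, stem):
    """Return the (number, case) tags whose synthesized form matches word_form."""
    return [tag for tag, form in synthesize(stem).items() if form == word_form]


if __name__ == "__main__":
    # Print the paradigm of the stem კაც- ("man").
    for (number, case), form in synthesize("კაც").items():
        print(number, case, form)
```

Here analysis is simply the inverse lookup over the synthesized paradigm. A full model would also need the additional rules (for example, postpositions and stem alternations) that account for the much larger number of forms reported above.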