The development of stemming algorithm for the Uzbek language

Author(s):  
Ilkhom Izatovich Bakaev

The automatic processing of unstructured natural-language texts is one of the pressing problems of computer text analysis and synthesis. Within this problem, the author singles out the task of text normalization, which usually involves such processes as tokenization, stemming, and lemmatization. Existing stemming algorithms are for the most part oriented towards synthetic languages with inflectional morphemes. Uzbek is an agglutinative language, characterized by the polysemy of affixal and auxiliary morphemes, and it differs substantially from languages such as English, which stemming algorithms process successfully. There are virtually no examples of effective implementation of stemming algorithms for the Uzbek language; this question is therefore of scientific interest and defines the goal of this work. In the course of the research, the author solved the task of bringing given Uzbek texts, which had been tokenized and cleared of stop words at a preliminary stage, to normal form. The author developed a method for normalizing Uzbek texts based on a stemming algorithm. The algorithm was developed using a hybrid approach that combines an algorithmic method, a lexicon of linguistic rules, and a database of the normal word forms of the Uzbek language. The precision of the proposed algorithm depends on the precision of the tokenization algorithm. The article did not explore the question of finding the roots of paired words separated by spaces, as this task is solved at the tokenization stage. The algorithm can be integrated into various automated systems for machine translation, information extraction, data retrieval, etc.
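As an illustration of the hybrid idea described in this abstract (affix rules combined with a database of normal forms), the following is a minimal sketch, not the author's algorithm. The suffix list is a small hand-picked sample of common Uzbek case, plural, and possessive endings, and the three-word lexicon, the `stem` function, and the `min_len` threshold are all assumptions made for the example.

```python
# Hypothetical sample of Uzbek suffixes (plural, case, possessive); not a complete rule set.
SUFFIXES = ("ingiz", "imiz", "ning", "lar", "dan", "ing", "im", "ni", "ga", "da", "si", "i")

# Hypothetical stand-in for the database of normal word forms.
LEXICON = {"kitob", "maktab", "bola"}


def stem(token, lexicon=LEXICON, suffixes=SUFFIXES, min_len=3):
    """Strip suffixes one at a time until the word is a known normal form
    or no further suffix can be removed."""
    word = token.lower()
    ordered = sorted(suffixes, key=len, reverse=True)  # try longest suffix first
    while word not in lexicon:
        for suffix in ordered:
            # Keep at least min_len characters so short roots are not destroyed.
            if word.endswith(suffix) and len(word) - len(suffix) >= min_len:
                word = word[: -len(suffix)]
                break
        else:
            break  # no suffix matched: return what is left
    return word


print(stem("kitoblarimizdan"))  # kitob + lar + imiz + dan
print(stem("bolalarning"))      # bola + lar + ning
```

The lexicon check is what stops stripping early once a normal form is reached; a purely rule-based stemmer would keep removing anything that looks like a suffix.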

Author(s):  
Sevil M. Radjabova

The article deals with changes in the meanings of postnominal adjectives in the process of transformation in modern English. On the basis of the linguoculturological approach and the method of linguistic analysis, the characteristic features of the change in the meanings of postnominal adjectives in English are revealed. In English, adjectives can function as predicatives. Semantically, adjectives with the predicative function are characterized by inner qualitative diversity. Qualitative adjectives mainly belong to classical predication and denote a feature of the object directly; such adjectives have more features of predication. The predicative sign of the adjective and the presence of a connotation of subjective assessment determine its semantics and use. There are differences between the constructions used in the predicate function in phrases that perform the function of the subject and in the altered form of word phrases related attributively and predicatively. The predicative relation is the most immune form of syntactic connection, and predicative connections are subject to far fewer structural restrictions than attributive constructions. Adjectives as predicative words do not have denotation and reference; they have no denotatum, only signification. On the basis of their indicative characteristics, it is possible to present all their possible semantic features. The use of the attribute before the word it defines is characteristic of English. The development from a special sign of thought to a general concept is characteristic of the whole structure of the English language; it can even be observed in word formation. In most cases, with regard to the use of the adjective as an attribute, the terms postpositional (postnominal) and prepositional (prenominal) adjectives are used.
The reasons for the change in the position of adjectives should not always be sought in the nature of the adjective, but rather in the degree to which it determines its referent. The semes that make up the meaning of a word are at different levels and are more or less stable. In adjectives, the nuclear seme, or subseme, is always found next to the differential seme. In other words, an adjective cannot be combined with nouns of all semantic groups. When an adjective is combined with a noun, a background is formed that allows or prevents the actualization of a particular seme. This activates a specific seme associated with the semantics of the given noun when the adjective is combined with it. Such semes are reflected in the joint semantics. The opposite can also be said: the adjective itself chooses the noun for word formation. After all, the same adjective behaves differently with respect to transformation in different attributive complexes. In our opinion, adjectives act as an important restrictive informative element at the content level.


Author(s):  
Oksana Danylovych

The article deals with the study of the semantic combinability of adjectives with nouns in English belle-lettres discourse. This aim involves distinguishing and analysing the factors that influence combinability. Linguistic and extralinguistic factors and the belle-lettres discourse itself are considered. The object of the investigation is the semantic combinability of adjectives with nouns in texts of the belle-lettres discourse. The subject of the investigation is statistically significant ties on the semantic level. Our study used the statistical χ² method, which shows the presence or absence of a tie. In this way, the standard elements of the contextual sets of lexical-semantic groups (LSG) of adjectives were found. The coefficient K indicates the force (intensity) of ties; on its basis the ties are divided into strong, medium, and weak. Statistically significant ties of LSG of adjectives with LSG of nouns were analyzed. Strong ties were considered, and the factors that influence the force of a tie were studied. Distinguishing the standard ties for every LSG of adjectives made it possible to determine the peculiarities of these ties for the belle-lettres discourse. Where the influence of the language factor predominates over that of the belle-lettres discourse factor, a tie is strong but is not considered characteristic of, or specific to, the belle-lettres discourse, even though such ties are found in it: LSG of adjectives "Age, time" with LSG of nouns "Piece of time, day, period, season", "Geographical position" with "Geographical objects, administrative units", "Space value as to the length, distance, duration" with "Piece of time, day, period, season", "Composition, material of object" with "Things, mechanisms". The force of a tie is influenced both by language factors and by the needs and demands of the belle-lettres discourse.
The following ties were distinguished as characteristic of the belle-lettres discourse: LSG of adjectives "Colour and brightness" with LSG of nouns "Parts of the body", "Appearance of a man" with "Parts of the body", signs of clothes as to colour and material, "Temperature" with "Natural phenomena". The specificity of the combinability of adjectives with nouns on the semantic level is caused not only by linguistic and extralinguistic factors but also by the influence of the belle-lettres discourse.
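The χ² test for the presence of a tie can be sketched for a single adjective-group and noun-group pair arranged as a 2×2 contingency table. The counts below are invented for illustration, and the abstract does not define the coefficient K; the sketch assumes the common contingency form K = sqrt(χ² / N), which may differ from the definition used in the study.

```python
import math


def chi_square_2x2(a, b, c, d):
    """Pearson chi-square for a 2x2 table:
        a = adjective LSG with noun LSG      b = adjective LSG with other nouns
        c = other adjectives with noun LSG   d = other adjectives with other nouns
    """
    n = a + b + c + d
    denom = (a + b) * (c + d) * (a + c) * (b + d)
    if denom == 0:
        raise ValueError("a row or column total is zero")
    return n * (a * d - b * c) ** 2 / denom


def contingency_k(chi2, n):
    """Assumed tie-strength coefficient K = sqrt(chi2 / N); hypothetical definition."""
    return math.sqrt(chi2 / n)


# Hypothetical co-occurrence counts, not taken from the study.
a, b, c, d = 30, 70, 20, 180
chi2 = chi_square_2x2(a, b, c, d)
k = contingency_k(chi2, a + b + c + d)

# With one degree of freedom, chi2 above 3.84 indicates a tie at the 0.05 level.
print(f"chi2 = {chi2:.2f}, tie present: {chi2 > 3.84}, K = {k:.3f}")
```

Thresholds for calling a tie strong, medium, or weak on the basis of K are not given in the abstract, so none are assumed here.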


2012 ◽  
Vol 58 (4) ◽  
pp. 351-356
Author(s):  
Mincho B. Hadjiski ◽  
Lyubka A. Doukovska ◽  
Stefan L. Kojnov

Abstract: The present paper considers nonlinear trend analysis for diagnostics and predictive maintenance. The subject is a mill fan, a device from the Maritsa East 2 thermal power plant. The choice of this power plant is not accidental: it is the largest thermal power plant on the Balkan Peninsula. Mill fans are a main part of fuel preparation in coal-fired power plants. The ability to predict eventual damage or wear without switching off the device is significant for ensuring faultless and reliable operation and avoiding the losses caused by planned maintenance. This paper addresses the needs of the Maritsa East 2 Complex, aiming to improve the ecological parameters of the electricity production process.


2018 ◽  
Author(s):  
Tsair-Wei Chien ◽  
Hsien-Yi Wang ◽  
Yang Shao ◽  
Willy Chou

BACKGROUND Researchers often spend a great deal of time and effort retrieving related journals for their studies and submissions. Authors often designate one article and then retrieve other related articles using PubMed’s service for finding cited-by or similar articles. However, to date, no study has presented the association between cited-by and similar journals related to a given journal. Authors need an effective and efficient way to find related journals on the topic of mobile health research. OBJECTIVE This study aims (1) to show the related journals for a given journal by both cited-by and similarity criteria; (2) to present the association between cited-by and similar journals related to a given journal; (3) to inspect the patterns of network density indices among clusters classified by social network analysis (SNA); and (4) to investigate the features of Kendall's coefficient of concordance (W). METHODS We obtained 676 abstracts published since 2013 from Medline using the keyword ("JMIR mHealth and uHealth"[Journal]) on June 30, 2018, and plotted the clusters of related journals on Google Maps using MS Excel modules. The features of network density indices were examined. The Kendall coefficient (W) was used to assess the concordance of clusters across indices. RESULTS This study found that (1) the journals related to JMIR mHealth and uHealth are easily presented on dashboards; (2) a mild association (= 0.14) exists between cited-by and similar journals related to JMIR mHealth and uHealth; (3) the median Impact Factors were 3.37 and 2.183 based on the representatives of the top ten clusters grouped by the cited-by and similar journals, respectively; (4) all Kendall’s coefficients (i.e., 0.82, 0.89, 0.92, and 0.75) for the four sets of density centrality show statistically significant concordance (p < 0.05). CONCLUSIONS SNA provides deep insight into the relationships of related journals to a given journal. The results of this research can provide readers with a knowledge and concept diagram to use with future submissions to a given journal in the subject category of Mobile Health Research. CLINICALTRIAL Not available
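For readers unfamiliar with Kendall's coefficient of concordance, the sketch below computes W for m rankings of the same n items using the standard no-ties formula W = 12S / (m²(n³ − n)), where S is the sum of squared deviations of the items' rank sums from their mean. The rankings are invented and stand in for, say, several indices each ranking the same clusters; how the study arranged its data is not stated in the abstract.

```python
def kendalls_w(rankings):
    """Kendall's W for a list of rankings (each a list of ranks 1..n, no ties)."""
    m = len(rankings)      # number of rankers (e.g. indices)
    n = len(rankings[0])   # number of ranked items (e.g. clusters)
    if any(sorted(r) != list(range(1, n + 1)) for r in rankings):
        raise ValueError("each ranking must be a permutation of 1..n")
    rank_sums = [sum(r[i] for r in rankings) for i in range(n)]
    mean_sum = sum(rank_sums) / n
    s = sum((x - mean_sum) ** 2 for x in rank_sums)
    return 12 * s / (m ** 2 * (n ** 3 - n))


# Hypothetical rankings of four items by three rankers.
rankings = [
    [1, 2, 3, 4],
    [2, 1, 3, 4],
    [1, 3, 2, 4],
]
print(round(kendalls_w(rankings), 3))
```

W ranges from 0 (no agreement) to 1 (identical rankings); the significance test reported in the abstract is a separate step and is not shown here.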


Author(s):  
Imelda Aisah Sarip ◽  
Kamid Kamid ◽  
Bambang Hariyadi

The aim of this research is to describe the creative thinking process of a linguistic-type student in biology problem solving. The research was conducted with a subject of the linguistic intelligence type at SMPN 6 Kota Jambi. The subject, SL, was selected based on the aim of the research. Data were collected by interview and a modified think-aloud method, and analyzed based on the creative thinking process proposed by Polya. The results show that SL could find and arrange the given problems and collect data correctly and appropriately. The problem-solving steps were carried out systematically to the end of the problem-solving process. In the last step of problem solving, SL checked her work while making rough notes to make sure that the written answers met her need.


2000 ◽  
Vol 27 (2) ◽  
pp. 177-198
Author(s):  
Garry D. Carnegie ◽  
Brad N. Potter

While accounting researchers have explored international publishing patterns in the accounting literature generally, little is known about recent contributions to the specialist international accounting history journals. Specifically, this study surveys publishing patterns in the three specialist, internationally refereed, accounting history journals in the English language during the period 1996 to 1999. The survey covers 149 contributions in total and provides empirical evidence on the location of their authors, the subject country or region in each investigation, and the time span of each study. It also classifies the literature examined based on the literature classification framework provided by Carnegie and Napier [1996].


2012 ◽  
Vol 2 (2) ◽  
pp. 56-70
Author(s):  
Petr Kopečný

This paper concentrates on the area of special educational support provided to individuals living in homes for people with disabilities in the Czech Republic and presents partial research results illustrating the state of the provision of speech therapy to users of social services facilities falling under the jurisdiction of the Ministry of Labour and Social Affairs. The subject of the research is an analysis of support for the development of the communication skills of pupils living in social services facilities. The partial results of the research outline the approaches employed by the managerial staff of the given facilities in implementing special educational procedures, describe forms of speech therapy provision in homes for people with disabilities, and compare the attitudes of teachers and social services staff to the development of communication with the importance attributed to it by speech therapists and demonstrated by the case studies performed.


2018 ◽  
Vol 170 ◽  
pp. 02006
Author(s):  
Elena Khutieva ◽  
Alexander Maizel ◽  
Marina Vlasova

The exploitation of Arctic resources is now becoming one of the most important directions of Russia’s strategic development. The coordination center for this project is St. Petersburg. The article assesses the potential of this region, which forms an essential prerequisite for the effective implementation of this work, from the standpoint of the state and prospects of the industrial clusters formed in its territory. The subjects of the cluster environment of St. Petersburg covered by the relevant state support programs are divided into three categories: 2 innovative territorial clusters, 3 territorial clusters, and 9 territorial clusters administered by the Center for cluster development. Specific recommendations for them are proposed on the basis of an analysis of their strengths and weaknesses, as well as an assessment of the opportunities and threats to their development.


Author(s):  
Alejandra Hernando-Garijo ◽  
David Hortigüela-Alcalá ◽  
Pedro Antonio Sánchez-Miguel ◽  
Sixto González-Víllora

The implementation of pedagogical models (PMs) in the subject of Physical Education (PE) is presented as a pedagogical approach that is based on the educational context as a means to overcome the serious limitations that arise from traditional approaches. The effective implementation of this approach has demonstrated benefits in terms of student motivation, student involvement and improved learning. Thus, its application and international relevance, the variability of content covered, the possibility of replicability in a variety of contexts and the fact that it favors a reflective framework and common action by teachers are some of the reasons that justify its use. In this sense, the need for teacher training, as well as the intention to generate more scientific evidence based on its application in the classroom, are some of the key aspects to be taken into account for its implementation and consequent consolidation in the educational field.


Author(s):  
Santosh Kumar Mishra ◽  
Rijul Dhir ◽  
Sriparna Saha ◽  
Pushpak Bhattacharyya

Image captioning is the process of generating a textual description of an image that aims to describe the salient parts of the given image. It is an important problem, as it involves computer vision and natural language processing, where computer vision is used for understanding images and natural language processing is used for language modeling. Much work has been done on image captioning for the English language. In this article, we have developed a model for image captioning in the Hindi language. Hindi is the official language of India, and it is the fourth most spoken language in the world, spoken in India and South Asia. To the best of our knowledge, this is the first attempt to generate image captions in the Hindi language. A dataset was manually created by translating the well-known MSCOCO dataset from English to Hindi. Different types of attention-based architectures were then developed for image captioning in the Hindi language. These attention mechanisms are new for the Hindi language, as they have never been used for it before. The results obtained with the proposed model are compared with several baselines in terms of BLEU scores, and they show that our model performs better than the others. Manual evaluation of the obtained captions in terms of adequacy and fluency also reveals the effectiveness of our proposed approach. Availability of resources: The code for the article is available at https://github.com/santosh1821cs03/Image_Captioning_Hindi_Language ; the dataset will be made available at http://www.iitp.ac.in/~ai-nlp-ml/resources.html .
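The BLEU comparison mentioned in this abstract rests on clipped (modified) n-gram precision between a generated caption and a reference. The sketch below shows that one component only, for a single reference and without the brevity penalty or the geometric mean over n-gram orders; the token lists are placeholders, and it is not the evaluation code used by the authors.

```python
from collections import Counter


def ngrams(tokens, n):
    """All contiguous n-grams of a token list, as tuples."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def modified_precision(candidate, reference, n):
    """Clipped n-gram precision: each candidate n-gram is credited at most
    as many times as it occurs in the reference."""
    cand_counts = Counter(ngrams(candidate, n))
    ref_counts = Counter(ngrams(reference, n))
    total = sum(cand_counts.values())
    if total == 0:
        return 0.0
    clipped = sum(min(count, ref_counts[gram]) for gram, count in cand_counts.items())
    return clipped / total


# Placeholder tokens; a real evaluation would use tokenized Hindi captions.
candidate = ["the", "the", "the", "cat"]
reference = ["the", "cat", "sat"]
print(modified_precision(candidate, reference, 1))  # 2 of 4 unigrams credited
print(modified_precision(candidate, reference, 2))  # 1 of 3 bigrams credited
```

Clipping is what prevents a caption that repeats one correct word from scoring perfectly, which is why the first value is 0.5 rather than 1.0.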

