Constructive and object-oriented modeling text for detection of text borrowings

2019 ◽  
Vol 4 (123) ◽  
pp. 34-47
Author(s):  
Olena Serhiivna Kuropiatnykov

The scientific community proposes models and data structures such as LERP-RSA arrays (Longest Expected Repeated Pattern Reduced Suffix Array), a tag classifier (a model based on Stanford NER's three-class model), structures based on DN-sequences, graph representations, etc. The algorithms used include Greedy String Tiling, ARPAD, shingling, statistical methods, genetic algorithms, and others. Much attention is also paid to morphological analysis, lemmatization, and text pre-processing. Only some of these models and algorithms have software implementations. The purpose of this work is to develop a text model for identifying borrowings and to bring it to a software implementation. The task is to develop an object-oriented model and a software implementation of a graph text model, applied to the problem of borrowing detection, and to obtain timing data for the implementation in order to evaluate whether it can be used in an academic environment. The main idea of the graph model is to represent the text as a weighted directed graph. The weight of a vertex is a character or a sequence of characters. The weight of an edge is the set of numbers of the paths that include the edge. The model is formalized using the apparatus of constructive-synthesizing modeling. To create graphs, a constructor and its components are defined: carrier, signature, and a set of statements of the information support of construction. Transformations are applied to the constructor: specialization, interpretation, and concretization. The object-oriented model is built on the basis of this model. It includes three classes: Vertex, Graph, and Work. An object of class Work represents the text as a set of objects of class Graph. Correspondences between the components of the two models are established. The object-oriented model is implemented in software. Data on the execution time of graph construction and text comparison are given. At this stage, the software implementation of the model has shown acceptable time performance. Further research in this direction is promising. Directions for improving the model and the program are proposed.
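
The three classes named in the abstract can be illustrated with a minimal Python sketch. The abstract does not give the class members, so everything below is an assumption made for illustration: the attribute names, the choice of one graph per paragraph and one path per sentence, whitespace tokenization, and the `shared_edge_ratio` helper are all hypothetical and are not the author's implementation.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Vertex:
    """A vertex whose weight is a character or a sequence of characters."""
    weight: str


@dataclass
class Graph:
    """Directed graph; each edge stores the set of path numbers that use it."""
    edges: dict = field(default_factory=dict)  # (Vertex, Vertex) -> set of ints
    path_count: int = 0

    def add_path(self, tokens):
        """Add one path (here: the tokens of one sentence); return its number."""
        number = self.path_count
        self.path_count += 1
        vertices = [Vertex(t) for t in tokens]
        for src, dst in zip(vertices, vertices[1:]):
            self.edges.setdefault((src, dst), set()).add(number)
        return number


@dataclass
class Work:
    """A text represented as a collection of Graph objects."""
    graphs: list = field(default_factory=list)

    @classmethod
    def from_text(cls, text):
        # Assumption: one graph per line/paragraph, one path per sentence.
        work = cls()
        for paragraph in text.split("\n"):
            graph = Graph()
            for sentence in paragraph.split("."):
                tokens = sentence.lower().split()
                if len(tokens) > 1:
                    graph.add_path(tokens)
            if graph.edges:
                work.graphs.append(graph)
        return work

    def edge_set(self):
        """All edges of the work as (source weight, target weight) pairs."""
        return {(s.weight, d.weight) for g in self.graphs for (s, d) in g.edges}


def shared_edge_ratio(a, b):
    """Hypothetical overlap score: fraction of a's edges that also occur in b."""
    ea, eb = a.edge_set(), b.edge_set()
    return len(ea & eb) / len(ea) if ea else 0.0
```

Comparing two `Work` objects with `shared_edge_ratio(suspect, source)` gives a crude indication of how much of one text's word-to-word structure reappears in the other; the comparison procedure actually used in the paper may differ.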

Electronics ◽  
2020 ◽  
Vol 9 (9) ◽  
pp. 1520
Author(s):  
Tiancheng Tang ◽  
Tianyi Yuan ◽  
Xinhuai Tang ◽  
Delai Chen

Supervised neural network models have achieved outstanding performance in the document summarization task in recent years. In practice, however, it is hard to obtain enough high-quality labeled training data for these models to generate different types of summaries. In this work, we mainly focus on improving the performance of the popular unsupervised TextRank algorithm, which requires no labeled training data for extractive summarization. We first modify the original edge weight of TextRank to take the relative position of sentences into account, and then combine the output of the improved TextRank with K-means clustering to improve the diversity of the generated summaries. To further improve the performance of our model, we incorporate external knowledge from open-source knowledge graphs into our model by entity linking. We use the knowledge graph sentence embedding and the tf-idf embedding as the input of our improved TextRank, and obtain the final score for each sentence by linear combination. Evaluations on the New York Times data set show the effectiveness of our knowledge-enhanced approach. The proposed model significantly outperforms other popular unsupervised models.
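
The idea of a position-aware edge weight can be sketched as follows. The abstract does not state the weighting formula, so the position factor below, the bag-of-words cosine similarity, and the function names are assumptions chosen only to make the mechanism concrete; the K-means step and the knowledge graph embeddings are not shown.

```python
import math
from collections import Counter


def cosine(a, b):
    """Cosine similarity between two bag-of-words Counters."""
    dot = sum(a[w] * b[w] for w in a if w in b)
    norm = (math.sqrt(sum(v * v for v in a.values()))
            * math.sqrt(sum(v * v for v in b.values())))
    return dot / norm if norm else 0.0


def position_textrank(sentences, damping=0.85, iterations=50):
    """Score sentences with TextRank on a position-weighted similarity graph."""
    n = len(sentences)
    if n == 0:
        return []
    bags = [Counter(s.lower().split()) for s in sentences]
    # Hypothetical position factor: edges pointing at earlier sentences weigh more.
    weight = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i != j:
                weight[i][j] = cosine(bags[i], bags[j]) * (1.0 + (n - j) / n)
    out_sum = [sum(row) for row in weight]
    scores = [1.0 / n] * n
    for _ in range(iterations):
        new_scores = []
        for j in range(n):
            rank = sum(scores[i] * weight[i][j] / out_sum[i]
                       for i in range(n) if out_sum[i] > 0)
            new_scores.append((1 - damping) / n + damping * rank)
        scores = new_scores
    return scores
```

A sentence that shares no words with the others receives only the baseline score `(1 - damping) / n`, while connected sentences accumulate rank from their neighbours. Because outgoing weights are normalized per sentence, the position factor only changes the ranking when a sentence has more than one neighbour.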


2019 ◽  
Vol 14 (1) ◽  
pp. 157-163
Author(s):  
Al'bina Shagieva ◽ 
Andrey Makarov ◽ 
Natal'ya Karpova ◽ 
...  

The anti-crisis management of rural municipalities is a specially organized form of management based on a set of specific management decisions taken by the governing bodies of institutions and organizations and by state and local authorities under resource constraints and increased risk. Monitoring the results of the crisis management of rural municipalities is aimed at tracking its condition and financial performance. The purpose of the crisis management of rural municipalities is to restore their livelihoods and to bring the enterprises, institutions, and organizations that provide services to the rural population out of crisis. The nature of changes in the crisis management process was assessed taking into account the full coverage of the areas of project and event management, the pace of change, the degree of achievement of goals, and the growth of economic indicators. The study developed the basic provisions for improving the information system of crisis management of rural municipalities. The information support system includes a list of indicators for assessing the course of crisis management and requirements for a data bank. The structure of the information system and a network model of the possibility of creating an agricultural cluster based on rural municipalities are proposed. The proposed information system serves and interconnects local authorities, the scientific and educational unit, and functional departmental systems: tax, statistical, environmental protection, rural infrastructure, and others. It makes it possible to create a cluster of rural municipalities in a less costly and more coordinated way. The main idea of evaluating and adjusting crisis management is to comprehensively examine the interrelationships of the parameters of the tariff, tax, investment, and subsidy policies, taking into account the transformations in the management system of rural municipalities.


Complexity ◽  
2021 ◽  
Vol 2021 ◽  
pp. 1-9
Author(s):  
Miao Teng

In this paper, we conduct an in-depth study of Japanese keyword extraction from news reports. We train word vectors from preprocessed word sets of an external document collection using the Skip-gram model in the deep learning tool Word2Vec and calculate the cosine distance between word vectors. The sliding window in TextRank is designed to connect internal document information and improve in-text semantic coherence. The main idea is to use not only the statistical and structural features of words but also the semantic features of words extracted through word-embedding techniques, i.e., multifeature fusion, to obtain the importance weights of the words themselves and the attraction weights between words, and then to iteratively calculate the final weight of each word through the graph model algorithm to determine the extracted keywords. To verify the performance of the algorithm, extensive simulation experiments were conducted on three different types of datasets. The experimental results show that the proposed keyword extraction algorithm can improve performance by a maximum of 6.45% and 20.36% compared with the existing word frequency statistics and graph model methods, respectively; MF-Rank can achieve a maximum performance improvement of 1.76% compared with PW-TF.
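
The combination of a sliding window with embedding similarity can be sketched as below. This is only an illustration under stated assumptions: the toy two-dimensional vectors stand in for trained Word2Vec vectors, the rule that an edge weight accumulates cosine similarity over co-occurrences is hypothetical, and the iterative ranking step is omitted. Cosine distance is taken as one minus the cosine similarity computed here.

```python
import math


def cosine_similarity(u, v):
    """Cosine similarity of two equal-length vectors (distance = 1 - similarity)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def build_word_graph(tokens, vectors, window=3):
    """Connect words that co-occur within `window` tokens.

    Each co-occurrence adds the pair's cosine similarity to the edge weight,
    so frequent and semantically close pairs end up with the heaviest edges.
    """
    edges = {}
    for i, word in enumerate(tokens):
        for other in tokens[i + 1 : i + window]:
            if other == word or word not in vectors or other not in vectors:
                continue
            key = tuple(sorted((word, other)))
            sim = cosine_similarity(vectors[word], vectors[other])
            edges[key] = edges.get(key, 0.0) + sim
    return edges


# Toy vectors standing in for trained embeddings (hypothetical values).
toy_vectors = {"tokyo": [1.0, 0.0], "japan": [1.0, 0.0], "election": [0.0, 1.0]}
toy_edges = build_word_graph(["tokyo", "japan", "election"], toy_vectors, window=2)
```

The resulting edge dictionary could then be fed to a weighted graph-ranking iteration to obtain a final weight per word.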


2016 ◽  
Vol 2 ◽  
pp. e48 ◽  
Author(s):  
Ivar Farup

An object-oriented computational framework for the transformation of colour data and colour metric tensors is presented. The main idea of the design is to represent the transforms between spaces as compositions of objects from a class hierarchy providing the methods for both the transforms themselves and the corresponding Jacobian matrices. In this way, new colour spaces can be implemented on the fly by transforming from any existing colour space, and colour data in various formats as well as colour metric tensors and colour difference data can easily be transformed between the colour spaces. This reduces what normally requires several days of coding to a few lines of code without introducing a significant computational overhead. The framework is implemented in the Python programming language.
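
The design idea of composing transform objects that each provide a Jacobian can be sketched in a few lines. The class and method names below are hypothetical and are not the framework's API; plain Python lists are used in place of arrays to keep the sketch self-contained.

```python
class Transform:
    """Base class: a map between colour spaces plus its Jacobian."""

    def apply(self, x):
        raise NotImplementedError

    def jacobian(self, x):
        raise NotImplementedError


class Linear(Transform):
    """x -> M x, with constant Jacobian M."""

    def __init__(self, matrix):
        self.matrix = matrix

    def apply(self, x):
        return [sum(m * v for m, v in zip(row, x)) for row in self.matrix]

    def jacobian(self, x):
        return [list(row) for row in self.matrix]


class Power(Transform):
    """Component-wise x_i -> x_i ** gamma, with a diagonal Jacobian."""

    def __init__(self, gamma):
        self.gamma = gamma

    def apply(self, x):
        return [v ** self.gamma for v in x]

    def jacobian(self, x):
        n = len(x)
        return [[self.gamma * x[i] ** (self.gamma - 1) if i == j else 0.0
                 for j in range(n)] for i in range(n)]


def matmul(a, b):
    """Product of two matrices stored as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


class Compose(Transform):
    """Apply `first`, then `second`; the Jacobian follows the chain rule."""

    def __init__(self, first, second):
        self.first, self.second = first, second

    def apply(self, x):
        return self.second.apply(self.first.apply(x))

    def jacobian(self, x):
        inner = self.first.apply(x)
        return matmul(self.second.jacobian(inner), self.first.jacobian(x))
```

With this structure, a new space is defined by wrapping existing transforms, for example `Compose(Linear([[2, 0], [0, 3]]), Power(2))`, and its Jacobian is obtained from the chain rule without any additional derivation.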


2007 ◽  
Vol 175 (4) ◽  
pp. 101-114 ◽  
Author(s):  
Ana Paula Lüdtke Ferreira ◽  
Luciana Foss ◽  
Leila Ribeiro

2012 ◽  
Vol 134 (2) ◽  
Author(s):  
Bergen Helms ◽  
Kristina Shea

Computational design synthesis aims to iteratively and automatically generate solution spaces of standard and novel design alternatives to support the innovation process. New approaches are required to generate alternative solutions at the function and behavior level as well as to ease the computational modeling of design knowledge. This paper introduces the approach of object-oriented graph grammars for the computational synthesis of product models based on a Function–Behavior–Structure (FBS) representation. The approach combines the advantages of a generic and systematic design method with a highly computable graph representation and object-oriented concepts. Through this combination, advances in terms of extendibility, efficiency, and flexible formalization of declarative and procedural engineering knowledge are achieved. Validation of the method is given through the synthesis of hybrid powertrains. The generation of hybrid powertrain solution spaces is shown, especially focusing on the impact of an evolving vocabulary, or building blocks, for synthesis. Future work includes integrating search methods in the synthesis process along with quantitative evaluation using simulation methods.


Author(s):  
Clemens Münzer ◽  
Kristina Shea ◽  
Bergen Helms

Ever since computers have been used to support human designers, a variety of representations have been used to encapsulate engineering knowledge. Computational design synthesis approaches utilize this knowledge to generate design candidates for a specified task. However, new approaches are required to enable systematic solution space exploration. This paper presents an approach that combines a graph-based, object-oriented knowledge representation with first-order logic and Boolean satisfiability. This combination is used as the foundation for a generic, automated approach for requirement-driven computational design synthesis. Available design building blocks and a design task defined through a set of requirements are modeled in a graph-based environment and then automatically transferred into a Boolean satisfiability problem and solved, considering a given solution size. The solution is then automatically transferred back to the graph-based domain. The method is validated through the synthesis of automotive powertrains. The contribution of the paper is a new method that is able both to determine whether an engineering task is solvable given a set of design building blocks and to systematically explore the solution space.
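
The step of deciding solvability for a given solution size can be illustrated with a deliberately simple stand-in. The sketch below replaces the SAT solver with exhaustive enumeration and uses a hypothetical encoding in which each building block provides a set of functions and each requirement is a clause that is satisfied when at least one of its functions is provided. The block names and functions are invented for illustration and do not come from the paper.

```python
from itertools import combinations


def synthesize(blocks, requirements, size):
    """Yield every set of `size` blocks that satisfies all requirements.

    blocks: dict mapping a block name to the set of functions it provides.
    requirements: list of clauses; each clause is a set of functions, at
        least one of which must be provided by the chosen blocks.
    Exhaustive search is used here in place of a real SAT solver.
    """
    for chosen in combinations(sorted(blocks), size):
        provided = set().union(*(blocks[name] for name in chosen))
        if all(clause & provided for clause in requirements):
            yield chosen


# Hypothetical building blocks and requirements.
example_blocks = {
    "engine": {"torque"},
    "motor": {"torque", "regeneration"},
    "battery": {"storage"},
}
example_requirements = [{"torque"}, {"storage"}]
solutions = list(synthesize(example_blocks, example_requirements, size=2))
```

An empty result for a given `size` indicates that the task is not solvable with that many blocks under this encoding, while a non-empty result enumerates the candidate solutions systematically.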

