scholarly journals Cluster Analysis and Visualization for the Legend of the Condor Heroes Based on Social Network

2021 ◽  
Vol 2021 ◽  
pp. 1-10
Author(s):  
Chao Fan ◽  
Zhihui Yang ◽  
Yuyi Yuan

The Legend of the Condor Heroes (LCH) is one of the fifteen well-known Wuxia novels penned by Jin Yong. It portrays a number of characters in the background of the Southern Song Dynasty. In this research, we attempt to analyze the relationship of characters in LCH based on social network, including network feature analysis, cluster analysis, and data visualization. Moreover, the approach can be extended to other literary works because our research provides a general framework for analyzing character relationships. We first perform lexical analysis on the corpus to extract character names and then utilize co-word analysis to build a social network of character relationships. We reckon characters as nodes and count the cooccurrences of characters as weights of links. By applying the social network analysis of created network, we can obtain network features of LCH. Furthermore, a hierarchical clustering algorithm is implemented to study the structure of LCH network. Both the dendrogram and Venn diagram are used for data visualization. An improved approach of visualizing the clustering results is advanced in order to display the group and hierarchical structure better. The final experimental results demonstrate that the proposed method shows a good effect.


2014 ◽  
Vol 543-547 ◽  
pp. 2270-2273
Author(s):  
Ha Dong Meng ◽  
Wen Bo Liu ◽  
Yu Chen Song

In order to improve the application effectiveness of cluster analysis in unsupervised image classification, a clustering algorithm based on density and adaptive density-reachable is designed and implemented. Compared with the classifying results of K-means and hierarchical method, it is evident that the algorithm has good effect on unsupervised image classification.



2015 ◽  
pp. 125-138 ◽  
Author(s):  
I. V. Goncharenko

In this article we proposed a new method of non-hierarchical cluster analysis using k-nearest-neighbor graph and discussed it with respect to vegetation classification. The method of k-nearest neighbor (k-NN) classification was originally developed in 1951 (Fix, Hodges, 1951). Later a term “k-NN graph” and a few algorithms of k-NN clustering appeared (Cover, Hart, 1967; Brito et al., 1997). In biology k-NN is used in analysis of protein structures and genome sequences. Most of k-NN clustering algorithms build «excessive» graph firstly, so called hypergraph, and then truncate it to subgraphs, just partitioning and coarsening hypergraph. We developed other strategy, the “upward” clustering in forming (assembling consequentially) one cluster after the other. Until today graph-based cluster analysis has not been considered concerning classification of vegetation datasets.



2021 ◽  
Vol 9 (1) ◽  
Author(s):  
Yihang Zhou ◽  
Kai Wang ◽  
Jian Sun ◽  
Yong Cui ◽  
Dongbo Hu

AbstractNanhai I is a highly valuable shipwreck of the Southern Song Dynasty for studying various topics, including the shipbuilding techniques. The sealing materials are of significant importance to ensure the ship’s reliability during the voyage across the ocean and they were rarely analyzed. Therefore, the sealing materials of this ship were analyzed by several analytical approaches. The sealing materials included two types, i.e., gap filler with jute fibers and surface coating without any oakum. The main components of both types of putty are calcite with minor Tung oil. The weight ratio of Ca(OH)2/Tung oil range from 4.3:1 to 7.9:1 for surface coating samples and the weight ratio of Ca(OH)2/organics is 3.1:1 for the gap filler sample. Additionally, we first find that the surface coating has a layered structure, where outer layers contain more Tung oil than inner layers. The innermost layer of the surface coating sample might be altered by organic acids from wood deterioration, causing its loose structure and grey color. The composite layers with different formula might be a result of balancing the costs and performances of the putty.



Author(s):  
N. P. Szabó ◽  
B. A. Braun ◽  
M. M. G. Abdelrahman ◽  
M. Dobróka

AbstractThe identification of lithology, fluid types, and total organic carbon content are of great priority in the exploration of unconventional hydrocarbons. As a new alternative, a further developed K-means type clustering method is suggested for the evaluation of shale gas formations. The traditional approach of cluster analysis is mainly based on the use of the Euclidean distance for grouping the objects of multivariate observations into different clusters. The high sensitivity of the L2 norm applied to non-Gaussian distributed measurement noises is well-known, which can be reduced by selecting a more suitable norm as distance metrics. To suppress the harmful effect of non-systematic errors and outlying data, the Most Frequent Value method as a robust statistical estimator is combined with the K-means clustering algorithm. The Cauchy-Steiner weights calculated by the Most Frequent Value procedure is applied to measure the weighted distance between the objects, which improves the performance of cluster analysis compared to the Euclidean norm. At the same time, the centroids are also calculated as a weighted average (using the Most Frequent Value method), instead of applying arithmetic mean. The suggested statistical method is tested using synthetic datasets as well as observed wireline logs, mud-logging data and core samples collected from the Barnett Shale Formation, USA. The synthetic experiment using extremely noisy well logs demonstrates that the newly developed robust clustering procedure is able to separate the geological-lithological units in hydrocarbon formations and provide additional information to standard well log analysis. It is also shown that the Cauchy-Steiner weighted cluster analysis is affected less by outliers, which allows a more efficient processing of poor-quality wireline logs and an improved evaluation of shale gas reservoirs.



Genetics ◽  
2001 ◽  
Vol 159 (2) ◽  
pp. 699-713
Author(s):  
Noah A Rosenberg ◽  
Terry Burke ◽  
Kari Elo ◽  
Marcus W Feldman ◽  
Paul J Freidlin ◽  
...  

Abstract We tested the utility of genetic cluster analysis in ascertaining population structure of a large data set for which population structure was previously known. Each of 600 individuals representing 20 distinct chicken breeds was genotyped for 27 microsatellite loci, and individual multilocus genotypes were used to infer genetic clusters. Individuals from each breed were inferred to belong mostly to the same cluster. The clustering success rate, measuring the fraction of individuals that were properly inferred to belong to their correct breeds, was consistently ~98%. When markers of highest expected heterozygosity were used, genotypes that included at least 8–10 highly variable markers from among the 27 markers genotyped also achieved >95% clustering success. When 12–15 highly variable markers and only 15–20 of the 30 individuals per breed were used, clustering success was at least 90%. We suggest that in species for which population structure is of interest, databases of multilocus genotypes at highly variable markers should be compiled. These genotypes could then be used as training samples for genetic cluster analysis and to facilitate assignments of individuals of unknown origin to populations. The clustering algorithm has potential applications in defining the within-species genetic units that are useful in problems of conservation.





Author(s):  
Б.Г. Вульфович

Задачей данной статьи является рассмотрение лингвопрагматических особенностей комментариев пользователей социальной сети «Твиттер» на выход Великобритании из ЕС. Анализ данных комментариев с лингвопрагматической точки зрения представляет интерес, так как показывает наиболее актуальную картину отношения пользователей социальных сетей к произошедшему событию. Приоритетными методами анализа лингвопрагматического потенциала Интернет-комментариев для нас являются: описательный метод, метод прагматического анализа, т.е. рассмотрение языкового материала в его непосредственном контексте в функциональном аспекте, метод частичной выборки, метод контекстологического описания. Контекстуальный метод был использован с целью установления особенностей комментариев в среде социальной сети «Твиттер»; описательный метод - для выявления непосредственного отношения пользователей социальных сетей к выходу Великобритании из ЕС; частичной выборки - для отбора наиболее эффективных и целостных комментариев с позиции прагматики и их реализации в данном контексте. Проведённое исследование позволило установить, что большинство людей удовлетворено результатами выхода Великобритании из ЕС и положительно отзывается об этом событии. Об этом свидетельствует как большое количество экспрессивов, использованных в интернет-комментариях в отношении данного события, так и активное употребление в них оценочной лексики. Результаты проведённого исследования могут быть применены в теоретических работах по описанию характеристик речевых актов, в курсе теоретической грамматики, стилистики, прагмалингвистики. The purpose of this article is to review the linguo-pragmatic features of Brexit represented in the comments in Twitter. Their analysis from a linguistic-pragmatic point of view may be of interest, since it shows the most relevant picture of the relationship of social network users for the event. The priority methods for analyzing the linguo-pragmatic potential of Internet comments for us are: a descriptive method, a pragmatic analysis method, i.e. consideration of linguistic material in its immediate context in the functional aspect, partial sampling method, contextual description method. The contextual method was used to establish the characteristics of comments on the Twitter social network; descriptive method was used to identify the direct relationship of social network users to the UK exit from the EU; partial sampling was used to select the most effective and holistic comments from the position of pragmatics and their implementation in this context. The study found that most people are satisfied with the results of the UK exit from the EU and respond positively to this event. The results of the study can be applied in theoretical works on the description of the characteristics of speech acts, in the course of theoretical grammar, stylistics, pragmalinguistics.



Sign in / Sign up

Export Citation Format

Share Document