scholarly journals Time Variant Multi Perspective Hierarchical Clustering Algorithm for Predicting Student Interest in Sports Mining

2019 ◽  
Vol 8 (4) ◽  
pp. 7313-7317

Predicting performance of students in sports is analyzed and studied. There are many techniques identified for the prediction of sports interest and they are not producing expected value. Towards performance development, a novel time variant multi perspective hierarchical clustering approach towards user interest prediction. The proposed time variant model reads the sports log and groups them according to the time domain. The entire log has been split into different of clusters as like time window. Then using window log, the method splits the logs according to different sports. For each time window, the method identifies the list of actions or sports played or tagged or chat with other users. Using the class of log, the method identifies the category of sports log and for each category of sports, the method compute the sports strike strength (SSS). Based on the value of SSS, the method identifies the user interest. Similarly, the interest of the student at each time window has been identified and used to generate the knowledge. The proposed method improves the performance of sports interest prediction on students with less false ratio.

2021 ◽  
pp. 1-48
Author(s):  
Zhong Hong ◽  
Kunhong Li ◽  
Mingjun Su ◽  
Guangmin Hu

Traditional constant time window-based waveform classification method is a robust tool for seismic facies analysis. However, when the interval thickness is seismically variable, the fixed time window is not able to contain the complete geologic information of interest. Therefore, the constant time window-based waveform classification method is inapplicable to conduct seismic facies analysis. To expand the application scope of seismic waveform classification in the strata with varying thickness, we propose a novel scheme for unsupervised seismic facies analysis of variable window length. The input of top and bottom horizons can guarantee the comprehensive geologic information of target interval. Throughout the whole workflow, we utilize DTW (Dynamic Time Warping) distance to measure the similarities between seismic waveforms of different lengths. Firstly, we improve the traditional spectral clustering algorithm by replacing the Euclidean distance with DTW-distance. Therefore, it can be applicable in the interval of variable thickness. Secondly, to solve the problem of large computation when applying the improved spectral clustering approach, we propose the method of seismic data thinning based on the technology of superpixel. Lastly, we combine these two algorithms and perform the integrated workflow of improved spectral clustering. The experiments on synthetic data show that the proposed workflow outperforms the traditional fixed time window-based clustering algorithm in recognizing the boundaries of different lithologies and lithologic associations with varying thickness. The practical application shows great promise for reservoir characterization of interval with varying thickness. The plane map of waveform classification provides convincing reference to delineate reservoir distribution of data set.


2016 ◽  
Vol 120 (1226) ◽  
pp. 547-571
Author(s):  
M. Edmunds ◽  
B. Evans ◽  
I. Masters ◽  
R. S. Laramee

ABSTRACTThis application paper describes a novel, cluster-based, semi-automatic, stream surface placement strategy for structured and unstructured computational fluid dynamics (CFD) data, tailored towards a specific application: The BLOODHOUND jet and rocket propelled land speed record vehicle. An existing automatic stream surface placement algorithm(8), is extensively modified to cater for large unstructured CFD simulation data. The existing algorithm uses hierarchical clustering of velocity and distance vectors to find potential stream surface seeding locations. This work replaces the hierarchical clustering algorithm, designed to work with small regular grids, with a K-means clustering approach suitable for large unstructured grids. Modifications are made to the seeding curve construction algorithm, improving the smoothness and distribution of the discretised curve in complex cases. A new distance function is described which allows the user to target particular characteristics of simulation data. The proposed algorithm reduces the required memory footprint and computational requirement compared to previous work(8). The performance and effectiveness of the proposed algorithm is demonstrated, and CFD domain expert evaluation is provided describing the value of this approach.


Author(s):  
Mohana Priya K ◽  
Pooja Ragavi S ◽  
Krishna Priya G

Clustering is the process of grouping objects into subsets that have meaning in the context of a particular problem. It does not rely on predefined classes. It is referred to as an unsupervised learning method because no information is provided about the "right answer" for any of the objects. Many clustering algorithms have been proposed and are used based on different applications. Sentence clustering is one of best clustering technique. Hierarchical Clustering Algorithm is applied for multiple levels for accuracy. For tagging purpose POS tagger, porter stemmer is used. WordNet dictionary is utilized for determining the similarity by invoking the Jiang Conrath and Cosine similarity measure. Grouping is performed with respect to the highest similarity measure value with a mean threshold. This paper incorporates many parameters for finding similarity between words. In order to identify the disambiguated words, the sense identification is performed for the adjectives and comparison is performed. semcor and machine learning datasets are employed. On comparing with previous results for WSD, our work has improvised a lot which gives a percentage of 91.2%


Author(s):  
Ana Belén Ramos-Guajardo

AbstractA new clustering method for random intervals that are measured in the same units over the same group of individuals is provided. It takes into account the similarity degree between the expected values of the random intervals that can be analyzed by means of a two-sample similarity bootstrap test. Thus, the expectations of each pair of random intervals are compared through that test and a p-value matrix is finally obtained. The suggested clustering algorithm considers such a matrix where each p-value can be seen at the same time as a kind of similarity between the random intervals. The algorithm is iterative and includes an objective stopping criterion that leads to statistically similar clusters that are different from each other. Some simulations to show the empirical performance of the proposal are developed and the approach is applied to two real-life situations.


Sensors ◽  
2021 ◽  
Vol 21 (4) ◽  
pp. 1021
Author(s):  
Zhanserik Nurlan ◽  
Tamara Zhukabayeva ◽  
Mohamed Othman

Wireless sensor networks (WSN) are networks of thousands of nodes installed in a defined physical environment to sense and monitor its state condition. The viability of such a network is directly dependent and limited by the power of batteries supplying the nodes of these networks, which represents a disadvantage of such a network. To improve and extend the life of WSNs, scientists around the world regularly develop various routing protocols that minimize and optimize the energy consumption of sensor network nodes. This article, introduces a new heterogeneous-aware routing protocol well known as Extended Z-SEP Routing Protocol with Hierarchical Clustering Approach for Wireless Heterogeneous Sensor Network or EZ-SEP, where the connection of nodes to a base station (BS) is done via a hybrid method, i.e., a certain amount of nodes communicate with the base station directly, while the remaining ones form a cluster to transfer data. Parameters of the field are unknown, and the field is partitioned into zones depending on the node energy. We reviewed the Z-SEP protocol concerning the election of the cluster head (CH) and its communication with BS and presented a novel extended mechanism for the selection of the CH based on remaining residual energy. In addition, EZ-SEP is weighted up using various estimation schemes such as base station repositioning, altering the field density, and variable nodes energy for comparison with the previous parent algorithm. EZ-SEP was executed and compared to routing protocols such as Z-SEP, SEP, and LEACH. The proposed algorithm performed using the MATLAB R2016b simulator. Simulation results show that our proposed extended version performs better than Z-SEP in the stability period due to an increase in the number of active nodes by 48%, in efficiency of network by the high packet delivery coefficient by 16% and optimizes the average power consumption compared to by 34.


Mathematics ◽  
2021 ◽  
Vol 9 (4) ◽  
pp. 370
Author(s):  
Shuangsheng Wu ◽  
Jie Lin ◽  
Zhenyu Zhang ◽  
Yushu Yang

The fuzzy clustering algorithm has become a research hotspot in many fields because of its better clustering effect and data expression ability. However, little research focuses on the clustering of hesitant fuzzy linguistic term sets (HFLTSs). To fill in the research gaps, we extend the data type of clustering to hesitant fuzzy linguistic information. A kind of hesitant fuzzy linguistic agglomerative hierarchical clustering algorithm is proposed. Furthermore, we propose a hesitant fuzzy linguistic Boole matrix clustering algorithm and compare the two clustering algorithms. The proposed clustering algorithms are applied in the field of judicial execution, which provides decision support for the executive judge to determine the focus of the investigation and the control. A clustering example verifies the clustering algorithm’s effectiveness in the context of hesitant fuzzy linguistic decision information.


Author(s):  
R. R. Gharieb ◽  
G. Gendy ◽  
H. Selim

In this paper, the standard hard C-means (HCM) clustering approach to image segmentation is modified by incorporating weighted membership Kullback–Leibler (KL) divergence and local data information into the HCM objective function. The membership KL divergence, used for fuzzification, measures the proximity between each cluster membership function of a pixel and the locally-smoothed value of the membership in the pixel vicinity. The fuzzification weight is a function of the pixel to cluster-centers distances. The used pixel to a cluster-center distance is composed of the original pixel data distance plus a fraction of the distance generated from the locally-smoothed pixel data. It is shown that the obtained membership function of a pixel is proportional to the locally-smoothed membership function of this pixel multiplied by an exponentially distributed function of the minus pixel distance relative to the minimum distance provided by the nearest cluster-center to the pixel. Therefore, since incorporating the locally-smoothed membership and data information in addition to the relative distance, which is more tolerant to additive noise than the absolute distance, the proposed algorithm has a threefold noise-handling process. The presented algorithm, named local data and membership KL divergence based fuzzy C-means (LDMKLFCM), is tested by synthetic and real-world noisy images and its results are compared with those of several FCM-based clustering algorithms.


Sign in / Sign up

Export Citation Format

Share Document