Clustering web search results using Wikipedia resource

Overlap information usually exits in the high-dimensional data. Misclassified points may be more when affinity propagation clustering is applied to these data. Concerning this problem, a new method combining principal components analysis and affinity propagation clustering is proposed. In this method, dimensionality of the original data is reduced on the premise of reserving most information of the variables. Then, affinity propagation clustering is implemented in the low-dimensional space. Thus, because the redundant information is deleted, the classification is accurate. Experiment is done by using this new method, the results of the experiment explain that this method is effective.

Download Full-text

Application of Affinity Propagation Clustering Algorithm in Fault Diagnosis of Metro Vehicle Auxiliary Inverter

Lecture Notes in Electrical Engineering - Proceedings of the 2013 International Conference on Electrical and Information Technologies for Rail Transportation (EITRT2013)-Volume II ◽

10.1007/978-3-642-53751-6_1 ◽

2014 ◽

pp. 3-9

Author(s):

Junwei Gao ◽

Zengtao Ma ◽

Yong Qin ◽

Limin Jia ◽

Dechen Yao

Keyword(s):

Fault Diagnosis ◽

Clustering Algorithm ◽

Affinity Propagation ◽

Affinity Propagation Clustering

Download Full-text

A Roadmap to Integrate Document Clustering in Information Retrieval

Information Retrieval Methods for Multidisciplinary Applications ◽

10.4018/978-1-4666-3898-3.ch003 ◽

2013 ◽

pp. 31-45

Author(s):

R. Subhashini ◽

V.Jawahar Senthil Kumar

Keyword(s):

Information Retrieval ◽

Search Engines ◽

World Wide ◽

Clustering Algorithm ◽

Web Search ◽

Full Potential ◽

Digital Information ◽

Search Results ◽

The World ◽

The Web

The World Wide Web is a large distributed digital information space. The ability to search and retrieve information from the Web efficiently and effectively is an enabling technology for realizing its full potential. Information Retrieval (IR) plays an important role in search engines. Today’s most advanced engines use the keyword-based (“bag of words”) paradigm, which has inherent disadvantages. Organizing web search results into clusters facilitates the user’s quick browsing of search results. Traditional clustering techniques are inadequate because they do not generate clusters with highly readable names. This paper proposes an approach for web search results in clustering based on a phrase based clustering algorithm. It is an alternative to a single ordered result of search engines. This approach presents a list of clusters to the user. Experimental results verify the method’s feasibility and effectiveness.

Download Full-text

Research and experiment on Affinity Propagation clustering algorithm

2011 Second International Conference on Mechanic Automation and Control Engineering ◽

10.1109/mace.2011.5988401 ◽

2011 ◽

Author(s):

Huan Zhang ◽

Kun Song

Keyword(s):

Clustering Algorithm ◽

Affinity Propagation ◽

Affinity Propagation Clustering

Download Full-text

An Improved Affinity Propagation Clustering Algorithm Based on Entropy Weight Method and Principal Component Analysis

International Journal of Database Theory and Application ◽

10.14257/ijdta.2016.9.6.23 ◽

2016 ◽

Vol 9 (6) ◽

pp. 227-238 ◽

Cited By ~ 1

Author(s):

Wang Limin ◽

Zhang Li ◽

Han Xuming ◽

Ji Qiang ◽

Mu Guangyu ◽

...

Keyword(s):

Principal Component Analysis ◽

Clustering Algorithm ◽

Principal Component ◽

Component Analysis ◽

Affinity Propagation ◽

Entropy Weight ◽

Affinity Propagation Clustering ◽

Entropy Weight Method ◽

Weight Method

Download Full-text

Dynamic equivalent modeling of two-staged photovoltaic power station clusters based on dynamic affinity propagation clustering algorithm

International Journal of Electrical Power & Energy Systems ◽

10.1016/j.ijepes.2017.08.038 ◽

2018 ◽

Vol 95 ◽

pp. 463-475 ◽

Cited By ~ 17

Author(s):

Peixin Li ◽

Wei Gu ◽

Liufang Wang ◽

Bin Xu ◽

Ming Wu ◽

...

Keyword(s):

Clustering Algorithm ◽

Affinity Propagation ◽

Power Station ◽

Photovoltaic Power ◽

Affinity Propagation Clustering ◽

Dynamic Equivalent

Download Full-text

Feature Selection of Sudden Failure Based on Affinity Propagation Clustering

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.586.241 ◽

2012 ◽

Vol 586 ◽

pp. 241-246

Author(s):

Li Min Li ◽

Zhong Sheng Wang

Keyword(s):

Feature Selection ◽

Clustering Algorithm ◽

Feature Space ◽

Failure Diagnosis ◽

Affinity Propagation ◽

Svm Classifier ◽

Incorrect Answer ◽

Sudden Failure ◽

Affinity Propagation Clustering ◽

Selection Of

When diagnosing sudden mechanical failure, in order to make the result of classification more accurate, in this article we describe an affinity propagation clustering algorithm for feature selection of sudden machinery failure diagnosis. General methods of feature selection select features by reducing dimension of the features, at the same time changing the data in the feature space, which would result in incorrect answer to the diagnosis. While affinity propagation method is based on measuring similarity between features whereby redundancy therein is removed, and selecting the exemplar subset of features, while doesn't change the data in the feature space. After testing on clustering and taking the result of PCA and affinity propagation clustering as input of a same SVM classifier, we get the conclusion that the latter has lower error than the former.

Download Full-text

A Quantitative Site-Specific Classification Approach Based on Affinity Propagation Clustering

10.21203/rs.3.rs-44214/v1 ◽

2020 ◽

Author(s):

Sayed Moustafa ◽

Farhan Khan ◽

Mohamed Metwaly ◽

Eslam A.Elawadi ◽

Nassir Al-Arifi

Keyword(s):

Urban Areas ◽

Clustering Algorithm ◽

Site Response ◽

Seismic Hazard Assessment ◽

Spectral Ratio ◽

Site Effect ◽

Affinity Propagation ◽

Site Classification ◽

Site Specific ◽

Affinity Propagation Clustering

Abstract Investigations made to evaluate the site effect characteristics and develop a reliable site classification scheme have received the paramount importance for the planning of urban areas and for a reliable site-specific seismic hazard assessment. This paper presents a new approach for site classification based on affinity propagation (AP) along with a selected set of representative horizontal to vertical spectral ratio (HVSR) curves inside King Saud University (KSU) campus. Measurements of the ambient vibrations were performed to cover the entire campus area by about 307 stations with 20 minutes recording length and sample rate of 128 Hz for each station to satisfy the criteria for reliable and unambiguous HVSR results. Predominant period values were used for identifying of site response and subsequent site classification. Empirical equations from the literature relating frequency of HVSR peak to average shear wave velocity in the upper 30m, commonly used as a proxy for site classification, were found to be unreliable, making site classification difficult. To overcome this problem, Affinity propagation clustering algorithm is used. The obtained results illustrated that microtremors spectral ratios can be remarkably robust tool in determining site effects. The survey results concluded to the preliminary seismic site classification map for the mapped area, which would be useful for future safe design of structures. Finally, the results presented in this study are encouraging prolongation of this type of study in other parts of Saudi Arabia using the microtremors data and site response functions.

Download Full-text

A Roadmap to Integrate Document Clustering in Information Retrieval

International Journal of Information Retrieval Research ◽

10.4018/ijirr.2011010103 ◽

2011 ◽

Vol 1 (1) ◽

pp. 31-44 ◽

Cited By ~ 1

Author(s):

R. Subhashini ◽

V.Jawahar Senthil Kumar

Keyword(s):

Information Retrieval ◽

Search Engines ◽

Clustering Algorithm ◽

Web Search ◽

Full Potential ◽

Digital Information ◽

Enabling Technology ◽

Clustering Techniques ◽

Search Results ◽

The World

The World Wide Web is a large distributed digital information space. The ability to search and retrieve information from the Web efficiently and effectively is an enabling technology for realizing its full potential. Information Retrieval (IR) plays an important role in search engines. Today’s most advanced engines use the keyword-based (“bag of words”) paradigm, which has inherent disadvantages. Organizing web search results into clusters facilitates the user’s quick browsing of search results. Traditional clustering techniques are inadequate because they do not generate clusters with highly readable names. This paper proposes an approach for web search results in clustering based on a phrase based clustering algorithm. It is an alternative to a single ordered result of search engines. This approach presents a list of clusters to the user. Experimental results verify the method s feasibility and effectiveness.

Download Full-text