Peringkasan Tweet Berdasarkan Trending Topic Twitter Dengan Pembobotan TF-IDF dan Single Linkage AngglomerativeHierarchical Clustering

Annisa Annisa; Yuda Munarko; Yufis Azhar

doi:10.22219/kinetik.v1i1.7

Peringkasan Tweet Berdasarkan Trending Topic Twitter Dengan Pembobotan TF-IDF dan Single Linkage AngglomerativeHierarchical Clustering

Kinetik Game Technology Information System Computer Network Computing Electronics and Control ◽

10.22219/kinetik.v1i1.7 ◽

2016 ◽

Vol 1 (1) ◽

Author(s):

Annisa Annisa ◽

Yuda Munarko ◽

Yufis Azhar

Keyword(s):

Hierarchical Clustering ◽

Main Idea ◽

Single Linkage ◽

Human Expert ◽

Efficient Single-Linkage hierarchical clustering based on partitioning

2016 11th International Conference on Computer Engineering & Systems (ICCES) ◽

10.1109/icces.2016.7821986 ◽

2016 ◽

Author(s):

Mohamed A. Mahfouz

Keyword(s):

Hierarchical Clustering ◽

Single Linkage

Download Full-text

An automatic approach to exclude interlopers from asteroid families

Monthly Notices of the Royal Astronomical Society ◽

10.1093/mnras/stx1273 ◽

2017 ◽

Vol 470 (1) ◽

pp. 576-591 ◽

Cited By ~ 12

Author(s):

Viktor Radović ◽

Bojan Novaković ◽

Valerio Carruba ◽

Dušan Marčeta

Keyword(s):

Hierarchical Clustering ◽

Family Members ◽

Main Idea ◽

Clustering Method ◽

New Approach ◽

Valuable Source ◽

Additional Step ◽

Chaining Effect ◽

Source Of Information ◽

Asteroid Families

Abstract Asteroid families are a valuable source of information to many asteroid-related researches, assuming a reliable list of their members could be obtained. However, as the number of known asteroids increases fast it becomes more and more difficult to obtain a robust list of members of an asteroid family. Here, we are proposing a new approach to deal with the problem, based on the well-known hierarchical clustering method. An additional step in the whole procedure is introduced in order to reduce a so-called chaining effect. The main idea is to prevent chaining through an already identified interloper. We show that in this way a number of potential interlopers among family members is significantly reduced. Moreover, we developed an automatic online-based portal to apply this procedure, i.e. to generate a list of family members as well as a list of potential interlopers. The Asteroid Families Portal is freely available to all interested researchers.

Download Full-text

Scalable single linkage hierarchical clustering for big data

2013 IEEE Eighth International Conference on Intelligent Sensors, Sensor Networks and Information Processing ◽

10.1109/issnip.2013.6529823 ◽

2013 ◽

Cited By ~ 16

Author(s):

T. C. Havens ◽

J. C. Bezdek ◽

M. Palaniswami

Keyword(s):

Big Data ◽

Hierarchical Clustering ◽

Single Linkage

Download Full-text

Evaluation of the Gower coefficient modifications in hierarchical clustering

Advances in Methodology and Statistics ◽

10.51936/eqvy9516 ◽

2017 ◽

Vol 14 (1) ◽

Author(s):

Zdeněk Šulc ◽

Martin Matějka ◽

Jiří Procházka ◽

Hana Řezanková

Keyword(s):

Hierarchical Clustering ◽

Mixed Type ◽

Similarity Measures ◽

Rand Index ◽

Clustering Methods ◽

Single Linkage ◽

Linkage Methods ◽

Hierarchical Clustering Methods ◽

Nominal Variables

This paper thoroughly examines three recently introduced modifications of the Gower coefficient, which were determined for data with mixed-type variables in hierarchical clustering. On the contrary to the original Gower coefficient, which only recognizes if two categories match or not in the case of nominal variables, the examined modifications offer three different approaches to measuring the similarity between categories. The examined dissimilarity measures are compared and evaluated regarding the quality of their clusters measured by three internal indices (Dunn, silhouette, McClain) and regarding their classification abilities measured by the Rand index. The comparison is performed on 810 generated datasets. In the analysis, the performance of the similarity measures is evaluated by different data characteristics (the number of variables, the number of categories, the distance of clusters, etc.) and by different hierarchical clustering methods (average, complete, McQuitty and single linkage methods). As a result, two modifications are recommended for the use in practice.

Download Full-text

Statistical properties of the single linkage hierarchical clustering estimator

Journal of Statistical Planning and Inference ◽

10.1016/j.jspi.2016.12.002 ◽

2017 ◽

Vol 185 ◽

pp. 15-28 ◽

Cited By ~ 4

Author(s):

Dekang Zhu ◽

Dan P. Guralnik ◽

Xuezhi Wang ◽

Xiang Li ◽

Bill Moran

Keyword(s):

Hierarchical Clustering ◽

Statistical Properties ◽

Single Linkage

Download Full-text

Analisa Hasil Pengelompokan Wilayah Kejadian Non-Kebakaran Menggunakan Agglomerative Hierachical Clustering di Semarang

Jurnal Tekno Kompak ◽

10.33365/jtk.v15i2.1166 ◽

2021 ◽

Vol 15 (2) ◽

pp. 63

Author(s):

Desy Exasanti ◽

Arief Jananto

Keyword(s):

Hierarchical Clustering ◽

Manhattan Distance ◽

Agglomerative Hierarchical Clustering ◽

Single Linkage ◽

Bottom Up ◽

Environment Analysis ◽

Complete Linkage ◽

Average Linkage

Abstrak−Klasterisasi merupakan metode pengelompokan dari data yang sudah diketahui label kelasnya untuk menemukan klaster baru dari hasil observasi. Dalam klasterisasi banyak metode yaitu metode terpusat, hirarki, kepadatan dan berbasis kisi, namun dalam penelitian yang dilakukan ini dipilih metode berbasis hirarki. Metode hirarki ini bekerja melakukan pengelompokan objek dengan membentuk hirarki klaster namun bukan berarti selalu digambarkan dengan hirarki dalam organsasi. Dipilihnya Agglomerative Hierarchical Clustering dimana merupakan jenis dari bawah ke atas atau biasa disebut (bottom-up) dalam metode ini objek yang akan diuji dianggap sebagai objek tunggal sebagai klaster dan lalu dilakukan iterasi untuk menemukan klaster-klaster yang lebih besar. Data yang akan digunakan adalah data non-kebakaran pada Dinas Pemadam Kebakaran Kota Semarang ynng mana akan dilakukan pengelompokan wilayah penanganan non-kebakaran. Dinas Pemadam Kebakaran melakukan penanganan bukan hanya kebakaran saja namun ada banyak hal yang sebenarnya dapat ditangani oleh petugas pemadam kebakaran, kejadian non-kebakaran ada beberapa seperti evakuasi reptil, evakuasi kucing, penyelamatan korban kecelakaan dan lain sebagainya. Dari data non-kebakaran dari 16 kecamatan di Kota Semarang pada tahun 2019 akan dilakukan uji menggunakan tiga algoritma yaitu Single Lingkage, Average Linkage dan Complete Linkage . Adapun dari algoritma Single Linkage dilakukan prosedur pemusatan dari jarak terkecil antar objek data, algoritma Average Linkage dilakukan prosedur dari jarak rata-rata objek data, sedangkan jika algoritma Complete Linkage dilakukan prosedur pemusatan dari jarak yang terbesar. Implementasi dan visualiasi dari data uji coba yang dilakukan di penilitian ini menggunakan tools WEKA 3.8.4, Wakaito Environment Analysis for Knowledge atau yang biasa dikenal dengan WEKA ini merupakan software yang menggunakan bahasa pemrograman java. Dari dataset 380 data diambil sampel 100 data untuk diuji mengunakan WEKA menggunakan metode perhtungan jarak Manhattan Distance dengan 3 cluster. Hasil dari data uji coba dapat divisualisasikan dengan visualisasi dendogram pada fitur visualize tree dan jika dilakukan visualisasi dalam bentuk grafik dapat dilakukan menggunakan fitur visualize clusters assignment.

Download Full-text

Nearest Prototype and Nearest Neighbor Clustering with Twofold Memberships Based on Inductive Property

Journal of Advanced Computational Intelligence and Intelligent Informatics ◽

10.20965/jaciii.2013.p0504 ◽

2013 ◽

Vol 17 (4) ◽

pp. 504-510

Author(s):

Satoshi Takumi ◽

◽

Sadaaki Miyamoto

Keyword(s):

Hierarchical Clustering ◽

Nearest Neighbor ◽

Classification Rules ◽

Agglomerative Hierarchical Clustering ◽

Single Linkage ◽

Natural Classification ◽

Nearest Neighbor Classification ◽

Voronoi Regions ◽

Inductive Property ◽

Neighbor Classification

The aim of this paper is to study methods of twofold membership clustering using the nearest prototype and nearest neighbor. The former uses theK-means, whereas the latter extends the single linkage in agglomerative hierarchical clustering. The concept of inductive clustering is moreover used for the both methods, which means that natural classification rules are derived as the results of clustering, a typical example of which is the Voronoi regions inK-means clustering. When the rule of nearest prototype allocation inK-means is replaced by nearest neighbor classification, we have inductive clustering related to the single linkage in agglomerative hierarchical clustering. The former method usesK-means or fuzzyc-means with noise clusters, whereby twofold memberships are derived; the latter method also derives two memberships in a different manner. Theoretical properties of the both methods are studied. Illustrative examples show implications and significances of this concept.

Download Full-text