Self-organizing map clustering technique for ANN-based spatiotemporal modeling of groundwater quality parameters

2015 ◽  
Vol 18 (2) ◽  
pp. 288-309 ◽  
Author(s):  
Vahid Nourani ◽  
Mohammad Taghi Alami ◽  
Farnaz Daneshvar Vousoughi

The present study integrates co-kriging as spatial estimator and self-organizing map (SOM) as clustering technique to identify spatially homogeneous clusters of groundwater quality data and to choose the most effective input data for feed-forward neural network (FFNN) model to simulate electrical conductivity (EC) and total dissolved solids (TDS) of groundwater. The methodology is presented in three stages. In the first stage, a geostatistics approach of co-kriging is used to estimate groundwater quality parameters at locations where the groundwater levels are measured. In stage two, a SOM clustering technique is used to identify spatially homogeneous clusters of groundwater quality data. The dominant input data, selected by spatial clustering and mutual information are then imposed into the FFNN model for one-step-ahead predictions of groundwater quality parameters at stage three. The performance of the newly proposed model is compared to a conventional linear forecasting method of multiple linear regression (MLR). The results suggest that the proposed model decreases dimensionality of the input layer and consequently the complexity of the FFNN model with acceptable efficiency in spatiotemporal simulation of groundwater quality parameters. The application of FFNN for modeling EC and TDS parameters increases the accuracy of predictions respectively up to 84.5% and 17% on average with regard to the MLR model.

Author(s):  
Melody Y. Kiang ◽  
Dorothy M. Fisher ◽  
Michael Y. Hu ◽  
Robert T. Chi

This chapter presents an extended Self-Organizing Map (SOM) network and demonstrates how it can be used to forecast market segment membership. The Kohonen’s SOM network is an unsupervised learning neural network that maps n-dimensional input data to a lower dimensional (usually one- or two-dimensional) output map while maintaining the original topological relations. We apply an extended version of SOM networks that further groups the nodes on the output map into a user-specified number of clusters to a residential market data set from AT&T. Specifically, the extended SOM is used to group survey respondents using their attitudes towards modes of communication. We then compare the extended SOM network solutions with a two-step procedure that uses the factor scores from factor analysis as inputs to K-means cluster analysis. Results using AT&T data indicate that the extended SOM network performs better than the two-step procedure.


Author(s):  
Ambarwati Ambarwati ◽  
Edi Winarko

AbstrakBerita merupakan sumber informasi yang dinantikan oleh manusia setiap harinya. Manusia membaca berita dengan kategori yang diinginkan. Jika komputer mampu mengelompokkan berita secara otomatis maka tentunya manusia akan lebih mudah membaca berita sesuai dengan kategori yang diinginkan. Pengelompokan berita yang berupa artikel secara otomatis sangatlah menarik karena mengorganisir artikel berita secara manual membutuhkan waktu dan biaya yang tidak sedikit.Tujuan penelitian ini adalah membuat sistem aplikasi untuk pengelompokkan artikel berita dengan menggunakan algoritma Self Organizing Map. Artikel berita digunakan sebagai input data. Kemudian sistem melakukan pemrosesan data untuk dikelompokkan. Proses yang dilakukan sistem meliputi preprocessing, feature extraction, clustering dan visualize.Sistem yang dikembangkan mampu menampilkan hasil clustering dengan algoritma Self Organizing Map dan memberikan visualisasi dengan smoothed data histograms berupa island map dari artikel berita. Selain itu sistem dapat menampilkan koleksi dokumen dari lima kategori berita yang ada pada tiap tahunnya dan banyaknya kata (histogram kata) yang sering muncul pada tiap arikel berita. Pengujian dari sistem ini dengan memasukan artikel berita, kemudian sistem memprosesnya dan mampu memberikan hasil cluster dari artikel berita yang dimasukan. Kata kunci—Pengelompokkan berita Indonesia, pengelompokkan berdasar histogram kata, pengelompokan berita menggunakan SOM  Abstract News is awaited information resources by humans every day. Human reading the news with the desired category. If the computer able to news clustering with automatically, humans of course will be easier to read the news according to the desired category. News clustering in the form of news articles with automatically very interesting because it organizes news articles manually takes time and costs not a little bit.The purpose of this research is to create a system application for grouping news articles by using the Self Organizing Map algorithm. News article be used as input into the system. News articles used as input data. Then the system performs data processing until to be clustered. Processes performed by the system covers: preprocessing, feature extraction, clustering and visualize.The system developed is able to display the results clustering of the Self Organizing Map algorithm and gives visualization of the Smoothed Data Histograms in the form of island map from news articles. Additionally the system can display a word histogram and news articles from five categories news in each year. Testing of this system by entering the news articles, then the system performs data processing and gives results of a cluster from news articles that input. Keywords—Indonesia news clustering, clustering based on words histograms, news clustering using SOM


Author(s):  
Fedja Hadzic ◽  
Tharam Dillon ◽  
Henry Tan ◽  
Ling. Feng ◽  
Elizabeth Chang

Association rule mining is one of the most popular pattern discovery methods used in data mining. Frequent pattern extraction is an essential step in association rule mining. Most of the proposed algorithms for extracting frequent patterns are based on the downward closure lemma concept utilizing the support and confidence framework. In this chapter we investigate an alternative method for mining frequent patterns in a transactional database. Self-Organizing Map (SOM) is an unsupervised neural network that effectively creates spatially organized internal representations of the features and abstractions detected in the input space. It is one of the most popular clustering techniques, and it reveals existing similarities in the input space by performing a topology-preserving mapping. These promising properties indicate that such a clustering technique can be used to detect frequent patterns in a top-down manner as opposed to the traditional approach that employs a bottom-up lattice search. Issues that are frequently raised when using clustering technique for the purpose of finding association rules are: (i) the completeness of association rule set, (ii) the support level for the rules generated, and (iii) the confidence level for the rules generated. We present some case studies analyzing the relationships between the SOM approach and the traditional association rule framework, and propose a way to constrain the clustering technique so that the traditional support constraint can be approximated. Throughout our experiments, we have demonstrated how a clustering approach can be used for discovering frequent patterns.


2007 ◽  
Vol 11 (4) ◽  
pp. 1309-1321 ◽  
Author(s):  
L. Peeters ◽  
F. Bação ◽  
V. Lobo ◽  
A. Dassargues

Abstract. The use of unsupervised artificial neural network techniques like the self-organizing map (SOM) algorithm has proven to be a useful tool in exploratory data analysis and clustering of multivariate data sets. In this study a variant of the SOM-algorithm is proposed, the GEO3DSOM, capable of explicitly incorporating three-dimensional spatial knowledge into the algorithm. The performance of the GEO3DSOM is compared to the performance of the standard SOM in analyzing an artificial data set and a hydrochemical data set. The hydrochemical data set consists of 131 groundwater samples collected in two detritic, phreatic, Cenozoic aquifers in Central Belgium. Both techniques succeed very well in providing more insight in the groundwater quality data set, visualizing the relationships between variables, highlighting the main differences between groups of samples and pointing out anomalous wells and well screens. The GEO3DSOM however has the advantage to provide an increased resolution while still maintaining a good generalization of the data set.


Sign in / Sign up

Export Citation Format

Share Document