scholarly journals Endmember Learning with K-Means through SCD Model in Hyperspectral Scene Reconstructions

2019 ◽  
Vol 5 (11) ◽  
pp. 85 ◽  
Author(s):  
Ayan Chatterjee ◽  
Peter W. T. Yuen

This paper proposes a simple yet effective method for improving the efficiency of sparse coding dictionary learning (DL) with an implication of enhancing the ultimate usefulness of compressive sensing (CS) technology for practical applications, such as in hyperspectral imaging (HSI) scene reconstruction. CS is the technique which allows sparse signals to be decomposed into a sparse representation “a” of a dictionary D u . The goodness of the learnt dictionary has direct impacts on the quality of the end results, e.g., in the HSI scene reconstructions. This paper proposes the construction of a concise and comprehensive dictionary by using the cluster centres of the input dataset, and then a greedy approach is adopted to learn all elements within this dictionary. The proposed method consists of an unsupervised clustering algorithm (K-Means), and it is then coupled with an advanced sparse coding dictionary (SCD) method such as the basis pursuit algorithm (orthogonal matching pursuit, OMP) for the dictionary learning. The effectiveness of the proposed K-Means Sparse Coding Dictionary (KMSCD) is illustrated through the reconstructions of several publicly available HSI scenes. The results have shown that the proposed KMSCD achieves ~40% greater accuracy, 5 times faster convergence and is twice as robust as that of the classic Spare Coding Dictionary (C-SCD) method that adopts random sampling of data for the dictionary learning. Over the five data sets that have been employed in this study, it is seen that the proposed KMSCD is capable of reconstructing these scenes with mean accuracies of approximately 20–500% better than all competing algorithms adopted in this work. Furthermore, the reconstruction efficiency of trace materials in the scene has been assessed: it is shown that the KMSCD is capable of recovering ~12% better than that of the C-SCD. These results suggest that the proposed DL using a simple clustering method for the construction of the dictionary has been shown to enhance the scene reconstruction substantially. When the proposed KMSCD is incorporated with the Fast non-negative orthogonal matching pursuit (FNNOMP) to constrain the maximum number of materials to coexist in a pixel to four, experiments have shown that it achieves approximately ten times better than that constrained by using the widely employed TMM algorithm. This may suggest that the proposed DL method using KMSCD and together with the FNNOMP will be more suitable to be the material allocation module of HSI scene simulators like the CameoSim package.

2021 ◽  
Vol 73 (1) ◽  
Author(s):  
Xian Zhang ◽  
Jin Li ◽  
Diquan Li ◽  
Yong Li ◽  
Bei Liu ◽  
...  

AbstractMagnetotelluric (MT) data processing can increase the reliability of measured data. Traditional MT data denoising methods are usually applied to entire MT time-series, which results in the loss of useful MT signals and a decrease of imaging accuracy of electromagnetic inversion. However, targeted MT noise separation can retain part of the signal unaffected by strong noise and enhance the quality of MT responses. Thus, we propose a novel method for MT noise separation that uses the refined composite multiscale dispersion entropy (RCMDE) and the orthogonal matching pursuit (OMP) algorithm. First, the RCMDE is extracted from each segment of the MT data. Then, the RCMDEs for each segment are input to the fuzzy c-mean (FCM) clustering algorithm for automatic identification of the MT signal and noise. Next, the OMP method is utilized to remove the identified noise segments independently. Finally, the reconstructed signal consists of the denoised signal segments and the identified useful signal segments. We conducted simulation experiments and algorithm evaluations on electromagnetic transfer function (EMTF) data, simulated data and measured sites. The results indicate that the RCMDE can improve the stability of multiscale dispersion entropy (MDE) and multiscale entropy (ME) by analyzing the characteristics of the signal samples library, effectively distinguishing MT signals and noise. Compared with the existing technique of denoising entire time series, the proposed method uses the RCMDE as characteristic parameter and uses the OMP algorithm for noise separation, simplifies the multi-feature fusion, and improves the accuracy of signal-noise identification. Moreover, the denoising efficiency is accelerated, and the MT response in the low-frequency band is greatly improved.


Author(s):  
Т.В. Речкалов ◽  
М.Л. Цымблер

Алгоритм PAM (Partitioning Around Medoids) представляет собой разделительный алгоритм кластеризации, в котором в качестве центров кластеров выбираются только кластеризуемые объекты (медоиды). Кластеризация на основе техники медоидов применяется в широком спектре приложений: сегментирование медицинских и спутниковых изображений, анализ ДНК-микрочипов и текстов и др. На сегодня имеются параллельные реализации PAM для систем GPU и FPGA, но отсутствуют таковые для многоядерных ускорителей архитектуры Intel Many Integrated Core (MIC). В настоящей статье предлагается новый параллельный алгоритм кластеризации PhiPAM для ускорителей Intel MIC. Вычисления распараллеливаются с помощью технологии OpenMP. Алгоритм предполагает использование специализированной компоновки данных в памяти и техники тайлинга, позволяющих эффективно векторизовать вычисления на системах Intel MIC. Эксперименты, проведенные на реальных наборах данных, показали хорошую масштабируемость алгоритма. The PAM (Partitioning Around Medoids) is a partitioning clustering algorithm where each cluster is represented by an object from the input dataset (called a medoid). The medoid-based clustering is used in a wide range of applications: the segmentation of medical and satellite images, the analysis of DNA microarrays and texts, etc. Currently, there are parallel implementations of PAM for GPU and FPGA systems, but not for Intel Many Integrated Core (MIC) accelerators. In this paper, we propose a novel parallel PhiPAM clustering algorithm for Intel MIC systems. Computations are parallelized by the OpenMP technology. The algorithm exploits a sophisticated memory data layout and loop tiling technique, which allows one to efficiently vectorize computations with Intel MIC. Experiments performed on real data sets show a good scalability of the algorithm.


Electronics ◽  
2019 ◽  
Vol 8 (2) ◽  
pp. 165 ◽  
Author(s):  
Liquan Zhao ◽  
Yunfeng Hu ◽  
Yulong Liu

The stochastic gradient matching pursuit algorithm requires the sparsity of the signal as prior information. However, this prior information is unknown in practical applications, which restricts the practical applications of the algorithm to some extent. An improved method was proposed to overcome this problem. First, a pre-evaluation strategy was used to evaluate the sparsity of the signal and the estimated sparsity was used as the initial sparsity. Second, if the number of columns of the candidate atomic matrix was smaller than that of the rows, the least square solution of the signal was calculated, otherwise, the least square solution of the signal was set as zero. Finally, if the current residual was greater than the previous residual, the estimated sparsity was adjusted by the fixed step-size and stage index, otherwise we did not need to adjust the estimated sparsity. The simulation results showed that the proposed method was better than other methods in terms of the aspect of reconstruction percentage in the larger sparsity environment.


Electronics ◽  
2020 ◽  
Vol 9 (9) ◽  
pp. 1507
Author(s):  
Hufei Zhu ◽  
Wen Chen ◽  
Yanpeng Wu

Based on the efficient inverse Cholesky factorization, we propose an implementation of OMP (called as version 0, i.e., v0) and its four memory-saving versions (i.e., the proposed v1, v2, v3 and v4). In the simulations, the proposed five versions and the existing OMP implementations have nearly the same numerical errors. Among all the OMP implementations, the proposed v0 needs the least computational complexity, and is the fastest in the simulations for almost all problem sizes. As a tradeoff between computational complexities/time and memory requirements, the proposed v1 seems to be better than all the existing ones when only considering the efficient OMP implementations storing G (i.e., the Gram matrix of the dictionary), the proposed v2 and v3 seem to be better than the only existing one when only considering the efficient implementations not storing G, and the proposed v4 seems to be better than the naive implementation that has the (known) minimum memory requirements. Moreover, all the proposed five versions only include parallelizable matrix-vector products in each iteration, and do not need any back-substitutions that are necessary in some existing efficient implementations (e.g., those utilizing the Cholesky factorization).


Author(s):  
G. Sandhya ◽  
Amalapurapu Srinag ◽  
Ganesh Babu Pantangi ◽  
Joel Abhishek Kanaparthi

The main aim of brain Magnetic Resonance Image (MRI) segmentation is to extractthe significant objects like tumors for better diagnosis and proper treatment. As the brain tumors are distinct in the sense of shapes, location, and intensity it is hard to define a general algorithm for the tumor segmentation. Accurate extraction of tumors from the brain MRIs is a challenging task due to the complex anatomical structure of brain tissues in addition to the existence of intensity inhomogeneity, partial volume effects, and noise. In this paper, a method of Sparse coding based on non-linear features is proposed for the tumor segmentation from MR images of the brain. Initially, first and second-order statistical eigenvectors of 3 × 3 size are extracted from the MRIs then the process of Sparse coding is applied to them. The kernel dictionary learning algorithm is employed to obtain the non-linear features from these processed vectors to build two individual adaptive dictionaries for healthy and pathological tissues. This work proposes dictionary learning based kernel clustering algorithm to code the pixels, and then the target pixelsare classified by utilizing the method of linear discrimination. The proposed technique is applied to several tumor MRIs, taken from the BRATS database. This technique overcomes the problem of linear inseparability produced by the high level intensity similarity between the normal and abnormal tissues of the given brain image. All the performance parameters are high for the proposed technique. Comparison of the results with some other existing methods such as Fuzzy – C- Means (FCM), Hybrid k-Mean Graph Cut (HKMGC) and Neutrosophic Set – Expert Maximum Fuzzy Sure Entropy (NS-EMFSE) demonstrates that the proposed sparse coding method is effective in segmenting the brain tumor regions.


Electronics ◽  
2019 ◽  
Vol 8 (1) ◽  
pp. 86 ◽  
Author(s):  
Xiaorui Song ◽  
Lingda Wu ◽  
Hongxing Hao ◽  
Wanpeng Xu

Processing and applications of hyperspectral images (HSI) are limited by the noise component. This paper establishes an HSI denoising algorithm by applying dictionary learning and sparse coding theory, which is extended into the spectral domain. First, the HSI noise model under additive noise assumption was studied. Considering the spectral information of HSI data, a novel dictionary learning method based on an online method is proposed to train the spectral dictionary for denoising. With the spatial–contextual information in the noisy HSI exploited as a priori knowledge, the total variation regularizer is introduced to perform the sparse coding. Finally, sparse reconstruction is implemented to produce the denoised HSI. The performance of the proposed approach is better than the existing algorithms. The experiments illustrate that the denoising result obtained by the proposed algorithm is at least 1 dB better than that of the comparison algorithms. The intrinsic details of both spatial and spectral structures can be preserved after significant denoising.


Complexity ◽  
2021 ◽  
Vol 2021 ◽  
pp. 1-12
Author(s):  
Dan Zhang ◽  
Yingcang Ma ◽  
Hu Zhao ◽  
Xiaofei Yang

Clustering algorithm is one of the important research topics in the field of machine learning. Neutrosophic clustering is the generalization of fuzzy clustering and has been applied to many fields. This paper presents a new neutrosophic clustering algorithm with the help of regularization. Firstly, the regularization term is introduced into the FC-PFS algorithm to generate sparsity, which can reduce the complexity of the algorithm on large data sets. Secondly, we propose a method to simplify the process of determining regularization parameters. Finally, experiments show that the clustering results of this algorithm on artificial data sets and real data sets are mostly better than other clustering algorithms. Our clustering algorithm is effective in most cases.


Author(s):  
MICHEL BRUYNOOGHE

The clustering of large data sets is of great interest in fields such as pattern recognition, numerical taxonomy, image or speech processing. The traditional Ascendant Hierarchical Algorithm (AHC) cannot be run for sets of more than a few thousand elements. The reducible neighborhoods clustering algorithm, which is presented in this paper, has overtaken the limits of the traditional hierarchical clustering algorithm by generating an exact hierarchy on a large data set. The theoretical justification of this algorithm is the so-called Bruynooghe reducibility principle, that lays down the condition under which the exact hierarchy may be constructed locally, by carrying out aggregations in restricted regions of the representation space. As for the Day and Edelsbrunner algorithm, the maximum theoretical time complexity of the reducible neighborhoods clustering algorithm is O(n2 log n), regardless of the chosen clustering strategy. But the reducible neighborhoods clustering algorithm uses the original data table and its practical performances are by far better than Day and Edelsbrunner’s algorithm, thus allowing the hierarchical clustering of large data sets, i.e. composed of more than 10 000 objects.


Sign in / Sign up

Export Citation Format

Share Document