Identifying vital genes of breast cancer through synergy network by part mutual information

2020 ◽  
Vol 31 (06) ◽  
pp. 2050088
Author(s):  
Xiaobo Yang ◽  
Binghui Guo ◽  
Zhilong Mi ◽  
Ziqiao Yin ◽  
Jiahui Li ◽  
...  

Breast cancer is a common malignant tumor of which pathogenic genes are widely studied. Since gene pairs are considered as biomarkers to identify cancer patients, in this paper, we use information theory to study the collaboration features of gene pairs. The measure of synergy based on mutual information (MI) is introduced to determine whether genes collaborate with each other in breast cancer. Part mutual information (PMI) is introduced to further select collaborative genes and construct a synergy network, which overcomes the shortage of MI. Furthermore, a dual network of synergy network is constructed and structural indices are calculated to identify vital genes. By decision tree and support vector machine, synergy is considered as a suitable index and dual network with PMI improves the accuracy of cancer identification. This method can be extended to identify other biological phenomenon and find collaborative genes as biomarkers.

Electronics ◽  
2021 ◽  
Vol 10 (12) ◽  
pp. 1496
Author(s):  
Hao Liang ◽  
Yiman Zhu ◽  
Dongyang Zhang ◽  
Le Chang ◽  
Yuming Lu ◽  
...  

In analog circuit, the component parameters have tolerances and the fault component parameters present a wide distribution, which brings obstacle to classification diagnosis. To tackle this problem, this article proposes a soft fault diagnosis method combining the improved barnacles mating optimizer(BMO) algorithm with the support vector machine (SVM) classifier, which can achieve the minimum redundancy and maximum relevance for feature dimension reduction with fuzzy mutual information. To be concrete, first, the improved barnacles mating optimizer algorithm is used to optimize the parameters for learning and classification. We adopt six test functions that are on three data sets from the University of California, Irvine (UCI) machine learning repository to test the performance of SVM classifier with five different optimization algorithms. The results show that the SVM classifier combined with the improved barnacles mating optimizer algorithm is characterized with high accuracy in classification. Second, fuzzy mutual information, enhanced minimum redundancy, and maximum relevance principle are applied to reduce the dimension of the feature vector. Finally, a circuit experiment is carried out to verify that the proposed method can achieve fault classification effectively when the fault parameters are both fixed and distributed. The accuracy of the proposed fault diagnosis method is 92.9% when the fault parameters are distributed, which is 1.8% higher than other classifiers on average. When the fault parameters are fixed, the accuracy rate is 99.07%, which is 0.7% higher than other classifiers on average.


2010 ◽  
Vol 36 (3) ◽  
pp. 1503-1510 ◽  
Author(s):  
U. Rajendra Acharya ◽  
E. Y. K. Ng ◽  
Jen-Hong Tan ◽  
S. Vinitha Sree

2011 ◽  
Vol 36 (4) ◽  
pp. 2505-2519 ◽  
Author(s):  
Hui-Ling Chen ◽  
Bo Yang ◽  
Gang Wang ◽  
Su-Jing Wang ◽  
Jie Liu ◽  
...  

Author(s):  
Gang Liu ◽  
Chunlei Yang ◽  
Sen Liu ◽  
Chunbao Xiao ◽  
Bin Song

A feature selection method based on mutual information and support vector machine (SVM) is proposed in order to eliminate redundant feature and improve classification accuracy. First, local correlation between features and overall correlation is calculated by mutual information. The correlation reflects the information inclusion relationship between features, so the features are evaluated and redundant features are eliminated with analyzing the correlation. Subsequently, the concept of mean impact value (MIV) is defined and the influence degree of input variables on output variables for SVM network based on MIV is calculated. The importance weights of the features described with MIV are sorted by descending order. Finally, the SVM classifier is used to implement feature selection according to the classification accuracy of feature combination which takes MIV order of feature as a reference. The simulation experiments are carried out with three standard data sets of UCI, and the results show that this method can not only effectively reduce the feature dimension and high classification accuracy, but also ensure good robustness.


Author(s):  
Yifeng Dou ◽  
Wentao Meng

As one of the most vulnerable cancers of women, the incidence rate of breast cancer in China is increasing at an annual rate of 3%, and the incidence is younger. Therefore, it is necessary to conduct research on the risk of breast cancer, including the cause of disease and the prediction of breast cancer risk based on historical data. Data based statistical learning is an important branch of modern computational intelligence technology. Using machine learning method to predict and judge unknown data provides a new idea for breast cancer diagnosis. In this paper, an improved optimization algorithm (GSP_SVM) is proposed by combining genetic algorithm, particle swarm optimization and simulated annealing with support vector machine algorithm. The results show that the classification accuracy, MCC, AUC and other indicators have reached a very high level. By comparing with other optimization algorithms, it can be seen that this method can provide effective support for decision-making of breast cancer auxiliary diagnosis, thus significantly improving the diagnosis efficiency of medical institutions. Finally, this paper also preliminarily explores the effect of applying this algorithm in detecting and classifying breast cancer in different periods, and discusses the application of this algorithm to multiple classifications by comparing it with other algorithms.


Sign in / Sign up

Export Citation Format

Share Document