Intelligent Integrated Community Model for Ethnic Minority Areas Using Data Analysis and Machine Learning

The intelligent community has become a technological trend with the continuous development of China’s Internet economy and the continuous improvement in community cooperative governance. The comprehensive service system in ethnic areas is also one of the reasons for the intelligent community. It plays an important role in community cooperative governance and comprehensive service system construction in ethnic minority areas. This paper briefly analyzes the important opportunities provided by the “wisdom community” for community cooperative governance and comprehensive service system construction in ethnic minority areas. For this purpose, an integrated model for the construction of a “culture sharing cloud system,” “public cultural service microplatform,” and “interactive end of public cultural service” is proposed. The mode of wisdom community’s public culture is utilized in the proposed integrated model. Our proposed model puts forward some countermeasures to accelerate the construction of infrastructure, innovate the digital service system of public culture, and optimize the internal organizational structure of the government. The existing datasets are analyzed to design the proposed model. Machine learning techniques are utilized to design the proposed intelligent community system. The proposed model overcomes the existing communication bottleneck for the public cultural service in ethnic minority areas and accelerates the realization of community coordination governance and the optimization of the comprehensive service system. It also improves the modernization level of minority areas. Finally, the effectiveness of the proposed model is highlighted.

Download Full-text

AN EFFICIENT MACHINE LEARNING MODEL FOR PREDICTION OF ACUTE MYOCARDIAL INFARCTION

Recent Advances in Computer Science and Communications ◽

10.2174/2666255813666200325104317 ◽

2020 ◽

Vol 13 ◽

Author(s):

Dhilsath Fathima.M ◽

S. Justin Samuel ◽

R. Hari Haran

Keyword(s):

Machine Learning ◽

Myocardial Infarction ◽

Acute Myocardial Infarction ◽

Logistic Regression ◽

Decision Tree ◽

Learning Model ◽

Training Dataset ◽

Data Set ◽

Machine Learning Model ◽

Proposed Model

Aim: This proposed work is used to develop an improved and robust machine learning model for predicting Myocardial Infarction (MI) could have substantial clinical impact. Objectives: This paper explains how to build machine learning based computer-aided analysis system for an early and accurate prediction of Myocardial Infarction (MI) which utilizes framingham heart study dataset for validation and evaluation. This proposed computer-aided analysis model will support medical professionals to predict myocardial infarction proficiently. Methods: The proposed model utilize the mean imputation to remove the missing values from the data set, then applied principal component analysis to extract the optimal features from the data set to enhance the performance of the classifiers. After PCA, the reduced features are partitioned into training dataset and testing dataset where 70% of the training dataset are given as an input to the four well-liked classifiers as support vector machine, k-nearest neighbor, logistic regression and decision tree to train the classifiers and 30% of test dataset is used to evaluate an output of machine learning model using performance metrics as confusion matrix, classifier accuracy, precision, sensitivity, F1-score, AUC-ROC curve. Results: Output of the classifiers are evaluated using performance measures and we observed that logistic regression provides high accuracy than K-NN, SVM, decision tree classifiers and PCA performs sound as a good feature extraction method to enhance the performance of proposed model. From these analyses, we conclude that logistic regression having good mean accuracy level and standard deviation accuracy compared with the other three algorithms. AUC-ROC curve of the proposed classifiers is analyzed from the output figure.4, figure.5 that logistic regression exhibits good AUC-ROC score, i.e. around 70% compared to k-NN and decision tree algorithm. Conclusion: From the result analysis, we infer that this proposed machine learning model will act as an optimal decision making system to predict the acute myocardial infarction at an early stage than an existing machine learning based prediction models and it is capable to predict the presence of an acute myocardial Infarction with human using the heart disease risk factors, in order to decide when to start lifestyle modification and medical treatment to prevent the heart disease.

Download Full-text

A novel multi-stage ensemble model with multiple K-means-based selective undersampling: An application in credit scoring

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-201954 ◽

2021 ◽

Vol 40 (5) ◽

pp. 9471-9484

Author(s):

Yilun Jin ◽

Yanan Liu ◽

Wenyu Zhang ◽

Shuai Zhang ◽

Yu Lou

Keyword(s):

Machine Learning ◽

Predictive Accuracy ◽

Credit Scoring ◽

Imbalanced Data ◽

Ensemble Model ◽

Selective Sampling ◽

Machine Learning Methods ◽

Multi Stage ◽

Proposed Model ◽

New Feature

With the advancement of machine learning, credit scoring can be performed better. As one of the widely recognized machine learning methods, ensemble learning has demonstrated significant improvements in the predictive accuracy over individual machine learning models for credit scoring. This study proposes a novel multi-stage ensemble model with multiple K-means-based selective undersampling for credit scoring. First, a new multiple K-means-based undersampling method is proposed to deal with the imbalanced data. Then, a new selective sampling mechanism is proposed to select the better-performing base classifiers adaptively. Finally, a new feature-enhanced stacking method is proposed to construct an effective ensemble model by composing the shortlisted base classifiers. In the experiments, four datasets with four evaluation indicators are used to evaluate the performance of the proposed model, and the experimental results prove the superiority of the proposed model over other benchmark models.

Download Full-text

Prediction of Pest Insect Appearance Using Sensors and Machine Learning

Sensors ◽

10.3390/s21144846 ◽

2021 ◽

Vol 21 (14) ◽

pp. 4846

Author(s):

Dušan Marković ◽

Dejan Vujičić ◽

Snežana Tanasković ◽

Borislav Đorđević ◽

Siniša Ranđić ◽

...

Keyword(s):

Machine Learning ◽

Relative Humidity ◽

Weather Conditions ◽

Daily Basis ◽

Machine Learning Algorithms ◽

Lower Percentage ◽

Timely Manner ◽

Proposed Model ◽

Set Up ◽

Accuracy Of Prediction

The appearance of pest insects can lead to a loss in yield if farmers do not respond in a timely manner to suppress their spread. Occurrences and numbers of insects can be monitored through insect traps, which include their permanent touring and checking of their condition. Another more efficient way is to set up sensor devices with a camera at the traps that will photograph the traps and forward the images to the Internet, where the pest insect’s appearance will be predicted by image analysis. Weather conditions, temperature and relative humidity are the parameters that affect the appearance of some pests, such as Helicoverpa armigera. This paper presents a model of machine learning that can predict the appearance of insects during a season on a daily basis, taking into account the air temperature and relative humidity. Several machine learning algorithms for classification were applied and their accuracy for the prediction of insect occurrence was presented (up to 76.5%). Since the data used for testing were given in chronological order according to the days when the measurement was performed, the existing model was expanded to take into account the periods of three and five days. The extended method showed better accuracy of prediction and a lower percentage of false detections. In the case of a period of five days, the accuracy of the affected detections was 86.3%, while the percentage of false detections was 11%. The proposed model of machine learning can help farmers to detect the occurrence of pests and save the time and resources needed to check the fields.

Download Full-text

BioSignal modelling for prediction of cardiac diseases using intra group selection method

Intelligent Decision Technologies ◽

10.3233/idt-200058 ◽

2021 ◽

Vol 15 (1) ◽

pp. 151-160

Author(s):

Hemant P. Kasturiwale ◽

Sujata N. Kale

Keyword(s):

Machine Learning ◽

Nervous System ◽

Random Forest ◽

Normal Sinus Rhythm ◽

Heart Defects ◽

Support Vector ◽

Autonomous Nervous System ◽

Cardiac Diseases ◽

Vast Number ◽

Proposed Model

The Autonomous Nervous System (ANS) controls the nervous system and Heart Rate Variability (HRV) can be used as a diagnostic tool to diagnose heart defects. HRV can be classified into linear and nonlinear HRV indices which are used mostly to measure the efficiency of the model. For prediction of cardiac diseases, the selection and extraction features of machine learning model are effective. The available model used till date is based on HRV indices to predict the cardiac diseases accurately. The model could hardly throw light on specifics of indices, selection process and stability of the model. The proposed model is developed considering all facet electrocardiogram amplitude (ECG), frequency components, sampling frequency, extraction methods and acquisition techniques. The machine learning based model and its performance shall be tested using the standard BioSignal method, both on the data available and on the data obtained by the author. This is unique model developed by considering the vast number of mixtures sets and more than four complex cardiac classes. The statistical analysis is performed on a variety of databases such as MIT/BIH Normal Sinus Rhythm (NSR), MIT/BIH Arrhythmia (AR) and MIT/BIH Atrial Fibrillation (AF) and Peripheral Pule Analyser using feature compatibility techniques. The classifiers are trained for prediction with approximately 40000 sets of parameters. The proposed model reaches an average accuracy of 97.87 percent and is sensitive and précised. The best features are chosen from the different HRV features that will be used for classification. The present model was checked under all possible subject scenarios, such as the raw database and the non-ECG signal. In this sense, robustness is defined not only by the specificity parameter, but also by other measuring output parameters. Support Vector Machine (SVM), K-nearest Neighbour (KNN), Ensemble Adaboost (EAB) with Random Forest (RF) are tested in a 5% higher precision band and a lower band configuration. The Random Forest has produced better results, and its robustness has been established.

Download Full-text

Prediction of Dead Oil Viscosity: Machine Learning vs. Classical Correlations

Energies ◽

10.3390/en14040930 ◽

2021 ◽

Vol 14 (4) ◽

pp. 930

Author(s):

Fahimeh Hadavimoghaddam ◽

Mehdi Ostadhassan ◽

Ehsan Heidaryan ◽

Mohammad Ali Sadri ◽

Inna Chapanova ◽

...

Keyword(s):

Machine Learning ◽

Viscosity Data ◽

Oil Viscosity ◽

Pvt Data ◽

Proposed Model ◽

Wide Range ◽

Engineering Problems ◽

Functional Forms ◽

Insight Into

Dead oil viscosity is a critical parameter to solve numerous reservoir engineering problems and one of the most unreliable properties to predict with classical black oil correlations. Determination of dead oil viscosity by experiments is expensive and time-consuming, which means developing an accurate and quick prediction model is required. This paper implements six machine learning models: random forest (RF), lightgbm, XGBoost, multilayer perceptron (MLP) neural network, stochastic real-valued (SRV) and SuperLearner to predict dead oil viscosity. More than 2000 pressure–volume–temperature (PVT) data were used for developing and testing these models. A huge range of viscosity data were used, from light intermediate to heavy oil. In this study, we give insight into the performance of different functional forms that have been used in the literature to formulate dead oil viscosity. The results show that the functional form f(γAPI,T), has the best performance, and additional correlating parameters might be unnecessary. Furthermore, SuperLearner outperformed other machine learning (ML) algorithms as well as common correlations that are based on the metric analysis. The SuperLearner model can potentially replace the empirical models for viscosity predictions on a wide range of viscosities (any oil type). Ultimately, the proposed model is capable of simulating the true physical trend of the dead oil viscosity with variations of oil API gravity, temperature and shear rate.

Download Full-text

A Machine Learning-based Iterative Design Approach to Automate User Satisfaction Degree Prediction in Smart Product-Service System

Computers & Industrial Engineering ◽

10.1016/j.cie.2022.107939 ◽

2022 ◽

pp. 107939

Author(s):

Jingchen Cong ◽

Pai Zheng ◽

Yuan Bian ◽

Chun-Hsien Chen ◽

Jianmin Li ◽

...

Keyword(s):

Machine Learning ◽

User Satisfaction ◽

Service System ◽

Design Approach ◽

Product Service System ◽

Iterative Design ◽

Satisfaction Degree ◽

Product Service ◽

Smart Product

Download Full-text

Modified Decision Tree Technique for Ransomware Detection at Runtime through API Calls

Scientific Programming ◽

10.1155/2020/8845833 ◽

2020 ◽

Vol 2020 ◽

pp. 1-10

Author(s):

Faizan Ullah ◽

Qaisar Javaid ◽

Abdu Salam ◽

Masood Ahmad ◽

Nadeem Sarwar ◽

...

Keyword(s):

Machine Learning ◽

Random Forest ◽

Decision Tree ◽

Feature Vector ◽

Machine Learning Algorithms ◽

The Novel ◽

Proposed Model ◽

Testing Accuracy ◽

Financial Losses

Ransomware (RW) is a distinctive variety of malware that encrypts the files or locks the user’s system by keeping and taking their files hostage, which leads to huge financial losses to users. In this article, we propose a new model that extracts the novel features from the RW dataset and performs classification of the RW and benign files. The proposed model can detect a large number of RW from various families at runtime and scan the network, registry activities, and file system throughout the execution. API-call series was reutilized to represent the behavior-based features of RW. The technique extracts fourteen-feature vector at runtime and analyzes it by applying online machine learning algorithms to predict the RW. To validate the effectiveness and scalability, we test 78550 recent malign and benign RW and compare with the random forest and AdaBoost, and the testing accuracy is extended at 99.56%.

Download Full-text

Soft Computing Methodology to Optimize the Integrated Dynamic Models of Cellular Manufacturing Systems in a Robust Environment

Mathematical Problems in Engineering ◽

10.1155/2021/3040391 ◽

2021 ◽

Vol 2021 ◽

pp. 1-13

Author(s):

Amir-Mohammad Golmohammadi ◽

Hasan Rasay ◽

Zaynab Akhoundpour Amiri ◽

Maryam Solgi ◽

Negar Balajeh

Keyword(s):

Machine Learning ◽

Programming Model ◽

Facility Layout ◽

Cell Formation ◽

Layout Design ◽

Metaheuristic Algorithms ◽

Sensitivity Analyses ◽

Mixed Integer ◽

Facility Layout Design ◽

Proposed Model

Machine learning, neural networks, and metaheuristic algorithms are relatively new subjects, closely related to each other: learning is somehow an intrinsic part of all of them. On the other hand, cell formation (CF) and facility layout design are the two fundamental steps in the CMS implementation. To get a successful CMS design, addressing the interrelated decisions simultaneously is important. In this article, a new nonlinear mixed-integer programming model is presented which comprehensively considers solving the integrated dynamic cell formation and inter/intracell layouts in continuous space. In the proposed model, cells are configured in flexible shapes during the planning horizon considering cell capacity in each period. This study considers the exact information about facility layout design and material handling cost. The proposed model is an NP-hard mixed-integer nonlinear programming model. To optimize the proposed problem, first, three metaheuristic algorithms, that is, Genetic Algorithm (GA), Keshtel Algorithm (KA), and Red Deer Algorithm (RDA), are employed. Then, to further improve the quality of the solutions, using machine learning approaches and combining the results of the aforementioned algorithms, a new metaheuristic algorithm is proposed. Numerical examples, sensitivity analyses, and comparisons of the performances of the algorithms are conducted.

Download Full-text

A Novel Approach for Forest Fragmentation Susceptibility Mapping and Assessment: A Case Study from the Indian Himalayan Region

Remote Sensing ◽

10.3390/rs13204090 ◽

2021 ◽

Vol 13 (20) ◽

pp. 4090

Author(s):

Amit Kumar Batar ◽

Hideaki Shibata ◽

Teiji Watanabe

Keyword(s):

Forest Fragmentation ◽

Integrated Model ◽

Himalayan Region ◽

Geographical Information ◽

Susceptibility Map ◽

Forest Landscape ◽

Indian Himalayan Region ◽

Proposed Model ◽

Causative Factors ◽

The Future

An estimation of where forest fragmentation is likely to occur is critically important for improving the integrity of the forest landscape. We prepare a forest fragmentation susceptibility map for the first time by developing an integrated model and identify its causative factors in the forest landscape. Our proposed model is based upon the synergistic use of the earth observation data, forest fragmentation approach, patch forests, causative factors, and the weight-of-evidence (WOE) method in a Geographical Information System (GIS) platform. We evaluate the applicability of the proposed model in the Indian Himalayan region, a region of rich biodiversity and environmental significance in the Indian subcontinent. To obtain a forest fragmentation susceptibility map, we used patch forests as past evidence of completely degraded forests. Subsequently, we used these patch forests in the WOE method to assign the standardized weight value to each class of causative factors tested by the Variance Inflation Factor (VIF) method. Finally, we prepare a forest fragmentation susceptibility map and classify it into five levels: very low, low, medium, high, and very high and test its validity using 30% randomly selected patch forests. Our study reveals that around 40% of the study area is highly susceptible to forest fragmentation. This study identifies that forest fragmentation is more likely to occur if proximity to built-up areas, roads, agricultural lands, and streams is low, whereas it is less likely to occur in higher altitude zones (more than 2000 m a.s.l.). Additionally, forest fragmentation will likely occur in areas mainly facing south, east, southwest, and southeast directions and on very gentle and gentle slopes (less than 25 degrees). This study identifies Himalayan moist temperate and pine forests as being likely to be most affected by forest fragmentation in the future. The results suggest that the study area would experience more forest fragmentation in the future, meaning loss of forest landscape integrity and rich biodiversity in the Indian Himalayan region. Our integrated model achieved a prediction accuracy of 88.7%, indicating good accuracy of the model. This study will be helpful to minimize forest fragmentation and improve the integrity of the forest landscape by implementing forest restoration and reforestation schemes.

Download Full-text

A Soft Voting Ensemble-Based Model for the Early Prediction of Idiopathic Pulmonary Fibrosis (IPF) Disease Severity in Lungs Disease Patients

Life ◽

10.3390/life11101092 ◽

2021 ◽

Vol 11 (10) ◽

pp. 1092

Author(s):

Sikandar Ali ◽

Ali Hussain ◽

Satyabrata Aich ◽

Moo Suk Park ◽

Man Pyo Chung ◽

...

Keyword(s):

Machine Learning ◽

Idiopathic Pulmonary Fibrosis ◽

Pulmonary Fibrosis ◽

Lung Disease ◽

Lung Diseases ◽

Early Stage ◽

Confusion Matrix ◽

Machine Learning Techniques ◽

Training Dataset ◽

Proposed Model

Idiopathic pulmonary fibrosis, which is one of the lung diseases, is quite rare but fatal in nature. The disease is progressive, and detection of severity takes a long time as well as being quite tedious. With the advent of intelligent machine learning techniques, and also the effectiveness of these techniques, it was possible to detect many lung diseases. So, in this paper, we have proposed a model that could be able to detect the severity of IPF at the early stage so that fatal situations can be controlled. For the development of this model, we used the IPF dataset of the Korean interstitial lung disease cohort data. First, we preprocessed the data while applying different preprocessing techniques and selected 26 highly relevant features from a total of 502 features for 2424 subjects. Second, we split the data into 80% training and 20% testing sets and applied oversampling on the training dataset. Third, we trained three state-of-the-art machine learning models and combined the results to develop a new soft voting ensemble-based model for the prediction of severity of IPF disease in patients with this chronic lung disease. Hyperparameter tuning was also performed to get the optimal performance of the model. Fourth, the performance of the proposed model was evaluated by calculating the accuracy, AUC, confusion matrix, precision, recall, and F1-score. Lastly, our proposed soft voting ensemble-based model achieved the accuracy of 0.7100, precision 0.6400, recall 0.7100, and F1-scores 0.6600. This proposed model will help the doctors, IPF patients, and physicians to diagnose the severity of the IPF disease in its early stages and assist them to take proactive measures to overcome this disease by enabling the doctors to take necessary decisions pertaining to the treatment of IPF disease.

Download Full-text