MM-UrbanFAC: Urban Functional Area Classification Model Based on Multimodal Machine Learning

Author(s):  
Xiujuan Xu ◽  
Yulin Bai ◽  
Yu Liu ◽  
Xiaowei Zhao ◽  
Yuzhi Sun
2021 ◽  
Author(s):  
Lu Ma ◽  
Qi Zhou ◽  
Huming Yin ◽  
Xiaojie Ang ◽  
Yu Li ◽  
...  

Abstract Background: To extract the texture features of Apparent Diffusion Coefficient (ADC) images in Mp-MRI and build a machine learning model based on radiomics texture analysis to determine its ability to distinguish benign from prostate cancer (PCa) lesions using PI-RADS 4/5 score.Materials and methods: First, use ImageJ software to obtain texture feature parameters based on ADC images; use R language to standardize texture feature parameters, and use Lasso regression to reduce the dimensionality of multiple feature parameters; then, use the feature parameters after dimensionality reduction to construct image-based groups. Learn R-Logistic, R-SVM, R-AdaBoost to identify the machine learning classification model of prostate benign and malignant nodules. Secondly, the clinical indicators of the patients were statistically analyzed, and the three clinical indicators with the largest AUC values were selected to establish a classification model based on clinical indicators of benign and malignant prostate nodules. Finally, compare the performance of the model based on radiomics texture features and clinical indicators to identify benign and malignant prostate nodules in PI-RADS 4/5.Results: The experimental results show that the AUC of the R-Logistic model test set is 0.838, which is higher than the R-SVM and R-AdaBoost classification models. At this time, the corresponding R-Logistic classification model formula is: Y_radiomics=9.396-7.464*median ADC-0.584 *kurtosis+0.627*skewness+0.576*MRI lesions volume; analysis of clinical indicators shows that the 3 indicators with the highest discrimination efficiency are PSA, Fib, LDL-C, and the corresponding C-Logistic classification model formula is: Y_clinical =-2.608 +0.324*PSA-3.045*Fib+4.147*LDL-C, the AUC value of the model training set is 0.860, which is smaller than the training set R-Logistic classification model AUC value of 0.936.Conclusion: The machine learning classifier model is established based on the texture features of radiomics. It has a good classification performance in identifying benign and malignant nodules of the prostate in PI-RADS 4/5. This has certain potential and clinical value for patients with prostate cancer to adopt different treatment methods and prognosis.


2021 ◽  
Vol 12 ◽  
Author(s):  
Rahila Sardar ◽  
Arun Sharma ◽  
Dinesh Gupta

With the availability of COVID-19-related clinical data, healthcare researchers can now explore the potential of computational technologies such as artificial intelligence (AI) and machine learning (ML) to discover biomarkers for accurate detection, early diagnosis, and prognosis for the management of COVID-19. However, the identification of biomarkers associated with survival and deaths remains a major challenge for early prognosis. In the present study, we have evaluated and developed AI-based prediction algorithms for predicting a COVID-19 patient’s survival or death based on a publicly available dataset consisting of clinical parameters and protein profile data of hospital-admitted COVID-19 patients. The best classification model based on clinical parameters achieved a maximum accuracy of 89.47% for predicting survival or death of COVID-19 patients, with a sensitivity and specificity of 85.71 and 92.45%, respectively. The classification model based on normalized protein expression values of 45 proteins achieved a maximum accuracy of 89.01% for predicting the survival or death, with a sensitivity and specificity of 92.68 and 86%, respectively. Interestingly, we identified 9 clinical and 45 protein-based putative biomarkers associated with the survival/death of COVID-19 patients. Based on our findings, few clinical features and proteins correlate significantly with the literature and reaffirm their role in the COVID-19 disease progression at the molecular level. The machine learning–based models developed in the present study have the potential to predict the survival chances of COVID-19 positive patients in the early stages of the disease or at the time of hospitalization. However, this has to be verified on a larger cohort of patients before it can be put to actual clinical practice. We have also developed a webserver CovidPrognosis, where clinical information can be uploaded to predict the survival chances of a COVID-19 patient. The webserver is available at http://14.139.62.220/covidprognosis/.


Author(s):  
Stephanie Vázquez-González ◽  
María Somodevilla-García ◽  
Rosalva Loreto López ◽  
Helena Gómez-Adorno

The aim of this article is to contextualize and describe the gathering and annotation of a conventual Hispanic and Novo Hispanic texts corpus for emotions identification. Such corpus will be the dataset for an emotions identification model based on machine learning ∖ deep learning techniques. Furthermore, this document describes several exploratory experiments carried out on the corpus. Within these experiments, it is described how the corpus is also used to obtain a lexicon mapped to polarities and emotions, and how some of the documents are hand-labeled by experts for the evaluation of the Machine Learning ∖ Deep learning -based emotion classification model. Finally, the future uses and experiments with said corpus are described.


Energies ◽  
2021 ◽  
Vol 14 (7) ◽  
pp. 1809
Author(s):  
Mohammed El Amine Senoussaoui ◽  
Mostefa Brahami ◽  
Issouf Fofana

Machine learning is widely used as a panacea in many engineering applications including the condition assessment of power transformers. Most statistics attribute the main cause of transformer failure to insulation degradation. Thus, a new, simple, and effective machine-learning approach was proposed to monitor the condition of transformer oils based on some aging indicators. The proposed approach was used to compare the performance of two machine-learning classifiers: J48 decision tree and random forest. The service-aged transformer oils were classified into four groups: the oils that can be maintained in service, the oils that should be reconditioned or filtered, the oils that should be reclaimed, and the oils that must be discarded. From the two algorithms, random forest exhibited a better performance and high accuracy with only a small amount of data. Good performance was achieved through not only the application of the proposed algorithm but also the approach of data preprocessing. Before feeding the classification model, the available data were transformed using the simple k-means method. Subsequently, the obtained data were filtered through correlation-based feature selection (CFsSubset). The resulting features were again retransformed by conducting the principal component analysis and were passed through the CFsSubset filter. The transformation and filtration of the data improved the classification performance of the adopted algorithms, especially random forest. Another advantage of the proposed method is the decrease in the number of the datasets required for the condition assessment of transformer oils, which is valuable for transformer condition monitoring.


Algorithms ◽  
2021 ◽  
Vol 14 (6) ◽  
pp. 187
Author(s):  
Aaron Barbosa ◽  
Elijah Pelofske ◽  
Georg Hahn ◽  
Hristo N. Djidjev

Quantum annealers, such as the device built by D-Wave Systems, Inc., offer a way to compute solutions of NP-hard problems that can be expressed in Ising or quadratic unconstrained binary optimization (QUBO) form. Although such solutions are typically of very high quality, problem instances are usually not solved to optimality due to imperfections of the current generations quantum annealers. In this contribution, we aim to understand some of the factors contributing to the hardness of a problem instance, and to use machine learning models to predict the accuracy of the D-Wave 2000Q annealer for solving specific problems. We focus on the maximum clique problem, a classic NP-hard problem with important applications in network analysis, bioinformatics, and computational chemistry. By training a machine learning classification model on basic problem characteristics such as the number of edges in the graph, or annealing parameters, such as the D-Wave’s chain strength, we are able to rank certain features in the order of their contribution to the solution hardness, and present a simple decision tree which allows to predict whether a problem will be solvable to optimality with the D-Wave 2000Q. We extend these results by training a machine learning regression model that predicts the clique size found by D-Wave.


Sign in / Sign up

Export Citation Format

Share Document