Decision making via semi-supervised machine learning techniques

2016 ◽  
Author(s):  
Ευτύχιος Πρωτοπαπαδάκης

The term semi-supervised learning refers to a broad field of machine learning techniques that use unlabeled data to extract additional useful information. Semi-supervision addresses problems related to processing and exploiting large volumes of data and the associated costs (e.g. processing time, human error). The ultimate goal is the safe extraction of conclusions, rules, or recommendations. Decision-making models that use semi-supervised learning techniques have several advantages. First, they need only a small number of labeled data for their initialization. Subsequently, newly arriving data are exploited and modify the model accordingly. As a result, we obtain a continuously evolving decision-making model with the least possible effort. Techniques that adapt easily and economically are particularly well suited to the control of systems whose mode of operation changes frequently. Indicative application areas of flexible decision support systems with semi-supervised learning are: production line supervision, maritime border surveillance, elderly care, financial risk assessment, inspection for structural defects, and the safeguarding of cultural heritage.
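The initialize-then-adapt loop described in this abstract can be illustrated with a self-training sketch. This is only an assumption for illustration: the abstract does not name a specific algorithm, and the nearest-centroid model, the one-dimensional features, the `margin` confidence rule, and all function names below are hypothetical choices, not the author's method.

```python
def centroids(labeled):
    """Compute the mean feature value per class from (x, label) pairs."""
    sums, counts = {}, {}
    for x, y in labeled:
        sums[y] = sums.get(y, 0.0) + x
        counts[y] = counts.get(y, 0) + 1
    return {y: sums[y] / counts[y] for y in sums}


def predict(model, x):
    """Return the class whose centroid is closest to x."""
    return min(model, key=lambda y: abs(x - model[y]))


def self_train(labeled, unlabeled, margin=1.0):
    """Start from a few labeled points, then absorb confident unlabeled ones.

    A point counts as 'confident' (hypothetical rule) when its nearest
    centroid is closer than the second nearest by at least `margin`.
    """
    labeled = list(labeled)
    model = centroids(labeled)
    for x in unlabeled:
        dists = sorted(abs(x - c) for c in model.values())
        if len(dists) < 2 or dists[1] - dists[0] >= margin:
            labeled.append((x, predict(model, x)))  # pseudo-label the point
            model = centroids(labeled)              # update the model
    return model


seed = [(0.0, "a"), (10.0, "b")]          # small labeled initialization set
stream = [1.0, 9.0, 5.0]                  # newly arriving unlabeled data
model = self_train(seed, stream)
print(model)
```

In this toy run the ambiguous point `5.0` is skipped because it is equally close to both centroids, while the two confident points shift the centroids; a real system would use a richer model and confidence estimate.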

2021 ◽  
Author(s):  
Carlos Eduardo Nass ◽  
Agustín Alejandro Ortíz Díaz ◽  
Fabiano Baldo

The growing popularity of audio and video streaming, Industry 4.0, and IoT (Internet of Things) technologies contributes to the rapid growth in the generation of various types of data. Therefore, to analyze these data for decision-making, supervised machine learning techniques need to be fast while maintaining suitable predictive performance, even in the many real-life scenarios where labeled data are expensive and hard to obtain. To overcome this problem, this work proposes an adaptation of the Very Fast C4.5 (VFC4.5) algorithm that implements a semi-supervised impurity metric presented in the literature. The results indicate that this adaptation can slightly increase the accuracy of VFC4.5 when the datasets contain very few labeled instances, but it increases the training time, especially as the number of labeled instances in the datasets increases.


2020 ◽  
Vol 28 (2) ◽  
pp. 253-265 ◽  
Author(s):  
Gabriela Bitencourt-Ferreira ◽  
Amauri Duarte da Silva ◽  
Walter Filgueira de Azevedo

Background: The elucidation of the structure of cyclin-dependent kinase 2 (CDK2) made it possible to develop targeted scoring functions for virtual screening aimed at identifying new inhibitors for this enzyme. CDK2 is a protein target for the development of drugs intended to modulate cell-cycle progression and control. Such drugs have potential anticancer activities. Objective: Our goal here is to review recent applications of machine learning methods to predict ligand-binding affinity for protein targets. To assess the predictive performance of classical scoring functions and targeted scoring functions, we focused our analysis on CDK2 structures. Methods: We have experimental structural data for hundreds of binary complexes of CDK2 with different ligands, many of them with inhibition constant information. We investigate here computational methods to calculate the binding affinity of CDK2 through classical scoring functions and machine-learning models. Results: Analysis of the predictive performance of classical scoring functions available in docking programs such as Molegro Virtual Docker, AutoDock4, and AutoDock Vina indicated that these methods failed to predict binding affinity with significant correlation with experimental data. Targeted scoring functions developed through supervised machine learning techniques showed a significant correlation with experimental data. Conclusion: Here, we described the application of supervised machine learning techniques to generate a scoring function to predict binding affinity. Machine learning models showed superior predictive performance when compared with classical scoring functions. Analysis of the computational models obtained through machine learning could capture essential structural features responsible for binding affinity against CDK2.


Author(s):  
Augusto Cerqua ◽  
Roberta Di Stefano ◽  
Marco Letta ◽  
Sara Miccoli

Abstract: Estimates of the real death toll of the COVID-19 pandemic have proven to be problematic in many countries, Italy being no exception. Mortality estimates at the local level are even more uncertain as they require stringent conditions, such as granularity and accuracy of the data at hand, which are rarely met. The “official” approach adopted by public institutions to estimate the “excess mortality” during the pandemic draws on a comparison between observed all-cause mortality data for 2020 and averages of mortality figures in the past years for the same period. In this paper, we apply the recently developed machine learning control method to build a more realistic counterfactual scenario of mortality in the absence of COVID-19. We demonstrate that supervised machine learning techniques outperform the official method by substantially improving the prediction accuracy of the local mortality in “ordinary” years, especially in small- and medium-sized municipalities. We then apply the best-performing algorithms to derive estimates of local excess mortality for the period between February and September 2020. Such estimates allow us to provide insights about the demographic evolution of the first wave of the pandemic throughout the country. To help improve diagnostic and monitoring efforts, our dataset is freely available to the research community.
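The “official” baseline mentioned in this abstract, observed deaths minus the average of past years for the same period, comes down to simple arithmetic. The sketch below shows only that baseline, not the machine learning control method the paper applies; the function name and the death counts are hypothetical illustrations, not data from the study.

```python
from statistics import mean


def excess_mortality(observed_deaths, past_years_deaths):
    """Observed deaths minus the historical average for the same period."""
    baseline = mean(past_years_deaths)  # counterfactual under the official method
    return observed_deaths - baseline


# Hypothetical municipality: deaths in the same months of five earlier years.
past = [100, 110, 90, 100, 100]
print(excess_mortality(130, past))
```

For small municipalities the historical average rests on few and noisy counts, which is consistent with the abstract's point that a learned counterfactual can predict “ordinary” years more accurately.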


Author(s):  
Linwei Hu ◽  
Jie Chen ◽  
Joel Vaughan ◽  
Soroush Aramideh ◽  
Hanyu Yang ◽  
...  
