scholarly journals Application of Machine Learning in medical data analysis illustrated with an example of association rules

2021 ◽  
Vol 192 ◽  
pp. 3134-3143
Author(s):  
Beata Butryn ◽  
Iwona Chomiak-Orsa ◽  
Krzysztof Hauke ◽  
Maciej Pondel ◽  
Agnieszka Siennicka
2021 ◽  
Vol 2021 ◽  
pp. 1-11
Author(s):  
Sibing Sun

Data science has expanded at an exponential growth with the advancement of big data technology. The data analysis techniques need to delve deeper to find valuable information (Sarac 2017). The notion of edge computing is broadly acknowledged. Edge-enabled solutions provide computing, analysis, storage, and control nearer to the edge of the network, which support the efficient processing and decision-making. Machine learning has also attained significant attention in this context due to its flexibility and its ability to provide a variety of supervised, unsupervised, and semisupervised techniques. This research presents a specific model to evaluate the potential correlation of piano teaching using machine learning. The data analysis is performed at the edges of network for efficient results (Tan et al. 2017). The association rule mining technique of machine learning is utilized with the integration of improved T-test method. The improved T-test is performed for the measurement of association rules and proposed a new measure and influence degree of association rules. It is evident from the results that the use of the degree of influence as a measure of association rules to find the potential relevance of multimedia-assistant piano teaching evaluation data is extremely feasible. It overcomes shortcomings of existing measurement standards and reduces the generation of redundant rules. The existing literature highlights the concepts of evaluation of potential correlation and evaluates the advantages. However, there is a lack of an effective strategy for piano teaching. The proposed model performs efficient calculation and storage. The feasibility and effectiveness of the proposed framework are verified using the analysis of the actual dataset. The verification results show that it is feasible and valuable to find the potential relevance of multimedia-assisted piano teaching evaluation.


2015 ◽  
Vol 764-765 ◽  
pp. 910-914
Author(s):  
Jeong Dong Kim ◽  
Suk Hyung Hwang ◽  
Doo Kwon Baik

Recently, Formal Concept Analysis (FCA) have been widely used for various purposes in many different domains such as data mining, machine learning, knowledge management and so on. In this paper, we introduce FCA as the basis for a practical and well founded methodological approach for data analysis which identifies conceptual structures among data sets. As well as, we propose a FCA-based data analysis for discovering association rules by using polarity from social contents. Additionally, we show the experiments that demonstrate how our data analysis approaches can be applied for knowledge discovery by using association rules.


2020 ◽  
Vol 13 (5) ◽  
pp. 1020-1030
Author(s):  
Pradeep S. ◽  
Jagadish S. Kallimani

Background: With the advent of data analysis and machine learning, there is a growing impetus of analyzing and generating models on historic data. The data comes in numerous forms and shapes with an abundance of challenges. The most sorted form of data for analysis is the numerical data. With the plethora of algorithms and tools it is quite manageable to deal with such data. Another form of data is of categorical nature, which is subdivided into, ordinal (order wise) and nominal (number wise). This data can be broadly classified as Sequential and Non-Sequential. Sequential data analysis is easier to preprocess using algorithms. Objective: The challenge of applying machine learning algorithms on categorical data of nonsequential nature is dealt in this paper. Methods: Upon implementing several data analysis algorithms on such data, we end up getting a biased result, which makes it impossible to generate a reliable predictive model. In this paper, we will address this problem by walking through a handful of techniques which during our research helped us in dealing with a large categorical data of non-sequential nature. In subsequent sections, we will discuss the possible implementable solutions and shortfalls of these techniques. Results: The methods are applied to sample datasets available in public domain and the results with respect to accuracy of classification are satisfactory. Conclusion: The best pre-processing technique we observed in our research is one hot encoding, which facilitates breaking down the categorical features into binary and feeding it into an Algorithm to predict the outcome. The example that we took is not abstract but it is a real – time production services dataset, which had many complex variations of categorical features. Our Future work includes creating a robust model on such data and deploying it into industry standard applications.


Sign in / Sign up

Export Citation Format

Share Document