Data Mining and Machine Learning

Instant medical care and drug suggestion service using data mining and machine learning based intelligent self-diagnosis medical system

International Journal of Advanced Life Sciences ◽

10.26627/ijals/2017/10.03.0022 ◽

2017 ◽

Vol 10 (03) ◽

pp. 318-325

Author(s):

sudha M

Keyword(s):

Machine Learning ◽

Data Mining ◽

Medical Care ◽

Medical System ◽

Using Data

Machine Learning and Data Mining Activity Results when using Projectiles in Different Sports

International Journal of Advanced Trends in Computer Science and Engineering ◽

10.30534/ijatcse/2020/103932020 ◽

2020 ◽

Vol 9 (3) ◽

pp. 3157-3160

Author(s):

Burov Alexey Gennadievich

Keyword(s):

Machine Learning ◽

Data Mining ◽

Mining Activity

Classification of Operational and Financial Variables Affecting the Bullwhip Effect in Indian Sectors: A Machine Learning Approach

Recent Patents on Computer Science ◽

10.2174/2213275911666181012121059 ◽

2019 ◽

Vol 12 (3) ◽

pp. 171-179 ◽

Cited By ~ 6

Author(s):

Sachin Gupta ◽

Anurag Saxena

Keyword(s):

Machine Learning ◽

Data Mining ◽

Supply Chain ◽

Supply Chain Management ◽

Product Life Cycle ◽

Consumer Preference ◽

Bullwhip Effect ◽

Machine Learning Techniques ◽

Chain Management ◽

Financial Variables

Background: The increased variability in production or procurement with respect to less increase of variability in demand or sales is considered as bullwhip effect. Bullwhip effect is considered as an encumbrance in optimization of supply chain as it causes inadequacy in the supply chain. Various operations and supply chain management consultants, managers and researchers are doing a rigorous study to find the causes behind the dynamic nature of the supply chain management and have listed shorter product life cycle, change in technology, change in consumer preference and era of globalization, to name a few. Most of the literature that explored bullwhip effect is found to be based on simulations and mathematical models. Exploring bullwhip effect using machine learning is the novel approach of the present study. Methods: Present study explores the operational and financial variables affecting the bullwhip effect on the basis of secondary data. Data mining and machine learning techniques are used to explore the variables affecting bullwhip effect in Indian sectors. Rapid Miner tool has been used for data mining and 10-fold cross validation has been performed. Weka Alternating Decision Tree (w-ADT) has been built for decision makers to mitigate bullwhip effect after the classification. Results: Out of the 19 selected variables affecting bullwhip effect 7 variables have been selected which have highest accuracy level with minimum deviation. Conclusion: Classification technique using machine learning provides an effective tool and techniques to explore bullwhip effect in supply chain management.

Data mining techniques with machine learning algorithm to predict patients of heart disease

IOP Conference Series Materials Science and Engineering ◽

10.1088/1757-899x/1088/1/012035 ◽

2021 ◽

Vol 1088 (1) ◽

pp. 012035

Author(s):

Mulyawan ◽

Agus Bahtiar ◽

Githera Dwilestari ◽

Fadhil Muhammad Basysyar ◽

Nana Suarna

Keyword(s):

Machine Learning ◽

Data Mining ◽

Heart Disease ◽

Learning Algorithm ◽

Machine Learning Algorithm ◽

Data Mining Techniques

Data Mining-based Financial Statement Fraud Detection: Systematic Literature Review and Meta-analysis to Estimate Data Sample Mapping of Fraudulent Companies Against Non-fraudulent Companies

Global Business Review ◽

10.1177/0972150920984857 ◽

2021 ◽

pp. 097215092098485

Author(s):

Sonika Gupta ◽

Sushil Kumar Mehta

Keyword(s):

Machine Learning ◽

Data Mining ◽

Literature Review ◽

Systematic Literature Review ◽

Classification Accuracy ◽

Meta Analysis ◽

Financial Statement ◽

Research Articles ◽

Financial Statement Fraud ◽

Data Mining Techniques

Data mining techniques have proven quite effective not only in detecting financial statement frauds but also in discovering other financial crimes, such as credit card frauds, loan and security frauds, corporate frauds, bank and insurance frauds, etc. Classification of data mining techniques, in recent years, has been accepted as one of the most credible methodologies for the detection of symptoms of financial statement frauds through scanning the published financial statements of companies. The retrieved literature that has used data mining classification techniques can be broadly categorized on the basis of the type of technique applied, as statistical techniques and machine learning techniques. The biggest challenge in executing the classification process using data mining techniques lies in collecting the data sample of fraudulent companies and mapping the sample of fraudulent companies against non-fraudulent companies. In this article, a systematic literature review (SLR) of studies from the area of financial statement fraud detection has been conducted. The review has considered research articles published between 1995 and 2020. Further, a meta-analysis has been performed to establish the effect of data sample mapping of fraudulent companies against non-fraudulent companies on the classification methods through comparing the overall classification accuracy reported in the literature. The retrieved literature indicates that a fraudulent sample can either be equally paired with non-fraudulent sample (1:1 data mapping) or be unequally mapped using 1:many ratio to increase the sample size proportionally. Based on the meta-analysis of the research articles, it can be concluded that machine learning approaches, in comparison to statistical approaches, can achieve better classification accuracy, particularly when the availability of sample data is low. High classification accuracy can be obtained with even a 1:1 mapping data set using machine learning classification approaches.

Algorithms and software for data mining and machine learning: a critical comparative view from a systematic review of the literature

The Journal of Supercomputing ◽

10.1007/s11227-021-03708-5 ◽

2021 ◽

Author(s):

Gilda Taranto-Vera ◽

Purificación Galindo-Villardón ◽

Javier Merchán-Sánchez-Jara ◽

Julio Salazar-Pozo ◽

Alex Moreno-Salazar ◽

...

Keyword(s):

Machine Learning ◽

Systematic Review ◽

Data Mining ◽

Review Of The Literature

Machine Learning and Data Mining for Emerging Trend in Cyber Dynamics

10.1007/978-3-030-66288-2 ◽

2021 ◽

Keyword(s):

Machine Learning ◽

Data Mining

Effective Prediction of Heart Disease Using Data Mining and Machine Learning: A Review

2021 International Conference on Artificial Intelligence and Smart Systems (ICAIS) ◽

10.1109/icais50930.2021.9395963 ◽

2021 ◽

Author(s):

Simran Verma ◽

Abhishek Gupta

Keyword(s):

Machine Learning ◽

Data Mining ◽

Heart Disease ◽

Using Data

Practice of Machine Learning Algorithm in Data Mining Field

2020 International Conference on Advance in Ambient Computing and Intelligence (ICAACI) ◽

10.1109/icaaci50733.2020.00016 ◽

2020 ◽

Author(s):

Yongxu Li

Keyword(s):

Machine Learning ◽

Data Mining ◽

Learning Algorithm ◽

Machine Learning Algorithm

Prediction of population behavior of Listeria monocytogenes in food using machine learning and a microbial growth and survival database

Scientific Reports ◽

10.1038/s41598-021-90164-z ◽

2021 ◽

Vol 11 (1) ◽

Author(s):

Satoko Hiura ◽

Shige Koseki ◽

Kento Koyama

Keyword(s):

Machine Learning ◽

Data Mining ◽

Listeria Monocytogenes ◽

Water Activity ◽

Bacterial Population ◽

Gradient Boosting ◽

Initial Cell ◽

Data Mining Approach ◽

Cell Counts ◽

Extreme Gradient Boosting

AbstractIn predictive microbiology, statistical models are employed to predict bacterial population behavior in food using environmental factors such as temperature, pH, and water activity. As the amount and complexity of data increase, handling all data with high-dimensional variables becomes a difficult task. We propose a data mining approach to predict bacterial behavior using a database of microbial responses to food environments. Listeria monocytogenes, which is one of pathogens, population growth and inactivation data under 1,007 environmental conditions, including five food categories (beef, culture medium, pork, seafood, and vegetables) and temperatures ranging from 0 to 25 °C, were obtained from the ComBase database (www.combase.cc). We used eXtreme gradient boosting tree, a machine learning algorithm, to predict bacterial population behavior from eight explanatory variables: ‘time’, ‘temperature’, ‘pH’, ‘water activity’, ‘initial cell counts’, ‘whether the viable count is initial cell number’, and two types of categories regarding food. The root mean square error of the observed and predicted values was approximately 1.0 log CFU regardless of food category, and this suggests the possibility of predicting viable bacterial counts in various foods. The data mining approach examined here will enable the prediction of bacterial population behavior in food by identifying hidden patterns within a large amount of data.