A TOPSIS Data Mining Demonstration and Application to Credit Scoring

Data Warehousing and Mining ◽

10.4018/978-1-59904-951-9.ch112 ◽

2008 ◽

pp. 1877-1887

Author(s):

Desheng Wu ◽

David L. Olson

Keyword(s):

Machine Learning ◽

Data Mining ◽

Credit Scoring ◽

Multiple Criteria Decision Analysis ◽

Data Sets ◽

Ideal Solution ◽

Explanatory Variables ◽

Large Sets ◽

Order Preference ◽

Learning Data

The technique for order preference by similarity to ideal solution (TOPSIS) is a technique that can consider any number of measures, seeking to identify solutions close to an ideal and far from a nadir solution. TOPSIS has traditionally been applied in multiple criteria decision analysis. In this paper we propose an approach to develop a TOPSIS classifier. We demonstrate its use in credit scoring, providing a way to deal with large sets of data using machine learning. Data sets often contain many potential explanatory variables, some preferably minimized, some preferably maximized. Results are favorable by a comparison with traditional data mining techniques of decision trees. Proposed models are validated using Mont Carlo simulation.

Download Full-text

Improved minimum-minimum roughness algorithm for clustering categorical data

International Journal of ADVANCED AND APPLIED SCIENCES ◽

10.21833/ijaas.2021.10.006 ◽

2021 ◽

Vol 8 (10) ◽

pp. 43-50

Author(s):

Truong et al. ◽

Keyword(s):

Machine Learning ◽

Data Mining ◽

Hierarchical Clustering ◽

Categorical Data ◽

Clustering Algorithm ◽

Clustering Algorithms ◽

Experimental Results ◽

Data Sets ◽

Top Down ◽

Hierarchical Clustering Algorithm

Clustering is a fundamental technique in data mining and machine learning. Recently, many researchers are interested in the problem of clustering categorical data and several new approaches have been proposed. One of the successful and pioneering clustering algorithms is the Minimum-Minimum Roughness algorithm (MMR) which is a top-down hierarchical clustering algorithm and can handle the uncertainty in clustering categorical data. However, MMR tends to choose the category with less value leaf node with more objects, leading to undesirable clustering results. To overcome such shortcomings, this paper proposes an improved version of the MMR algorithm for clustering categorical data, called IMMR (Improved Minimum-Minimum Roughness). Experimental results on actual data sets taken from UCI show that the IMMR algorithm outperforms MMR in clustering categorical data.

Download Full-text

Mitigating Gender Bias in Machine Learning Data Sets

Communications in Computer and Information Science - Bias and Social Aspects in Search and Recommendation ◽

10.1007/978-3-030-52485-2_2 ◽

2020 ◽

pp. 12-26

Author(s):

Susan Leavy ◽

Gerardine Meaney ◽

Karen Wade ◽

Derek Greene

Keyword(s):

Machine Learning ◽

Gender Bias ◽

Data Sets ◽

Learning Data

Download Full-text

Big Data Models and the Public Sector

Web Services ◽

10.4018/978-1-5225-7501-6.ch007 ◽

2019 ◽

pp. 105-126

Author(s):

N. Nawin Sona

Keyword(s):

Machine Learning ◽

Data Mining ◽

Big Data ◽

Public Sector ◽

Predictive Analytics ◽

Data Types ◽

The Public ◽

Emerging Trends ◽

Wide Range ◽

Learning Data

This chapter aims to give an overview of the wide range of Big Data approaches and technologies today. The data features of Volume, Velocity, and Variety are examined against new database technologies. It explores the complexity of data types, methodologies of storage, access and computation, current and emerging trends of data analysis, and methods of extracting value from data. It aims to address the need for clarity regarding the future of RDBMS and the newer systems. And it highlights the methods in which Actionable Insights can be built into public sector domains, such as Machine Learning, Data Mining, Predictive Analytics and others.

Download Full-text

Bayesian Modelling for Machine Learning

Encyclopedia of Information Science and Technology, First Edition ◽

10.4018/978-1-59140-553-5.ch044 ◽

2005 ◽

pp. 236-242

Author(s):

Paul Rippon ◽

Kerrie Mengersen

Keyword(s):

Machine Learning ◽

Process Model ◽

Bayesian Learning ◽

Data Sets ◽

Complex Data ◽

Sources Of Information ◽

Multiple Sources ◽

Complex Data Sets ◽

Complex Phenomena ◽

Learning Data

Learning algorithms are central to pattern recognition, artificial intelligence, machine learning, data mining, and statistical learning. The term often implies analysis of large and complex data sets with minimal human intervention. Bayesian learning has been variously described as a method of updating opinion based on new experience, updating parameters of a process model based on data, modelling and analysis of complex phenomena using multiple sources of information, posterior probabilistic expectation, and so on. In all of these guises, it has exploded in popularity over recent years.

Download Full-text

Statistical and Machine-Learning Data Mining

10.1201/b11508 ◽

2011 ◽

Cited By ~ 12

Author(s):

Bruce Ratner ◽

Stephen Day ◽

Christopher Davies

Keyword(s):

Machine Learning ◽

Data Mining ◽

Learning Data

Download Full-text

A Review on Various Algorithms used in Machine Learning

International Journal of Scientific Research in Computer Science Engineering and Information Technology ◽

10.32628/cseit1952248 ◽

2019 ◽

pp. 915-920

Author(s):

Divya Chaudhary ◽

Er. Richa Vasuja

Keyword(s):

Machine Learning ◽

Data Mining ◽

New Technologies ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Learning System ◽

Training Set ◽

Learning Data ◽

Do So

In today's scenario all of data is being generated by everyone of us . so it becomes vital for us to handle this data. To do so new technologies are being developed such as machine learning, data mining etc. This paper gives the study related to machine learning(ML).Precise approximations are repetitively being produced by Machine Learning algorithms. Machine learning system effectively “learns” how to guess from training set of completed jobs. The main purpose of the review is to give a jagged estimate or overview about the mostly used algorithms in machine learning.

Download Full-text

Sentiment Analysis of College Reviews using Machine Learning & Data Mining

International Journal for Research in Applied Science and Engineering Technology ◽

10.22214/ijraset.2019.7041 ◽

2019 ◽

Vol 7 (7) ◽

pp. 277-284

Author(s):

Mr. Ghanshyam Gupta

Keyword(s):

Machine Learning ◽

Data Mining ◽

Sentiment Analysis ◽

Learning Data

Download Full-text

Application of Machine Learning for Prediction of Lung Cancer using Omics Data

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.f3625.049620 ◽

2020 ◽

Vol 9 (6) ◽

pp. 230-236

Keyword(s):

Machine Learning ◽

Lung Cancer ◽

Data Mining ◽

Survival Analysis ◽

Early Stage ◽

Image Data ◽

Machine Learning Algorithms ◽

Machine Learning Techniques ◽

Data Sets ◽

Omics Data

Cancer is one of the deadly diseases across many countries. However, cancer can be cured, if detected at an early stage. Researchers are working on healthcare for early detection and prevention of cancer. Medical data has reached its utmost potential by providing researchers with huge data sets collected from all over the globe. In the present scenario, Machine Learning has been widely used in the area of cancer diagnosis and prognosis. Survival analysis may help in the prediction of the early onset of disease, relapse, re-occurrence of diseases and biomarker identification. Applications of machine learning and data mining methods in medical field are currently the most widespread in cancer detection and survival analysis. In this survey, different ways to detect and predict lung cancer using latest Machine learning algorithms combined with data mining has been analyzed. Comparative study of various machine learning techniques and technologies has been done over different types of data such as clinical data, omics data, image data etc.

Download Full-text

Data Mining, Machine Learning, Deep Learning, Chemometrics. Definitions, common points and Trends (Spoiler Alert: VALIDATE your models!)

Brazilian Journal of Analytical Chemistry ◽

10.30744/brjac.2179-3425.ar-38-2021 ◽

2021 ◽

Vol 8 (32) ◽

pp. 22-38

Author(s):

José Manuel Amigo

Keyword(s):

Analytical Chemistry ◽

Artificial Intelligence ◽

Machine Learning ◽

Data Mining ◽

Deep Learning ◽

Daily Life ◽

Big Data Analysis ◽

Mining Machine ◽

Learning Data ◽

Made In

Concepts like Machine Learning, Data Mining or Artificial Intelligence have become part of our daily life. This is mostly due to the incredible advances made in computation (hardware and software), the increasing capabilities of generating and storing all types of data and, especially, the benefits (societal and economical) that generate the analysis of such data. Simultaneously, Chemometrics has played an important role since the late 1970s, analyzing data within natural science (and especially in Analytical Chemistry). Even with the strong parallelisms between all of the abovementioned terms and being popular with most of us, it is still difficult to clearly define or differentiate the meaning of Machine Learning, Data Mining, Artificial Intelligence, Deep Learning and Chemometrics. This manuscript brings some light to the definitions of Machine Learning, Data Mining, Artificial Intelligence and Big Data Analysis, defines their application ranges and seeks an application space within the field of analytical chemistry (a.k.a. Chemometrics). The manuscript is full of personal, sometimes probably subjective, opinions and statements. Therefore, all opinions here are open for constructive discussion with the only purpose of Learning (like the Machines do nowadays).

Download Full-text

Forensics: Assessing model goodness: A machine learning view

Encyclopedia with Semantic Computing and Robotic Intelligence ◽

10.1142/s2529737618500156 ◽

2018 ◽

Vol 02 (02) ◽

pp. 1850015 ◽

Cited By ~ 1

Author(s):

Joseph R. Barr ◽

Joseph Cavanaugh

Keyword(s):

Machine Learning ◽

Data Mining ◽

Statistical Model ◽

Statistical Learning ◽

Statistical Models ◽

The Subject ◽

The Many ◽

The One ◽

Compare And Contrast ◽

Learning Data

It is not unusual that efforts to validate a statistical model exceed those used to build the model. Multiple techniques are used to validate, compare and contrast among competing statistical models: Some are concerned with a model’s ability to predict new data while others are concerned with model descriptiveness of the data. Without claiming to provide a comprehensive view of the landscape, in this paper we will touch on both aspects of model validation. There is much more to the subject and the reader is referred to any of the many classical statistical texts including the revised two volumes of Bickel and Docksum (2016), the one by Hastie, Tibshirani, and Friedman [The Elements of Statistical Learning: Data Mining, Inference, and Predication, 2nd edn. (Springer, 2009)], and several others listed in the bibliography.

Download Full-text