A TOPSIS Data Mining Demonstration and Application to Credit Scoring

2008 ◽  
pp. 1877-1887
Author(s):  
Desheng Wu ◽  
David L. Olson

The technique for order preference by similarity to ideal solution (TOPSIS) is a technique that can consider any number of measures, seeking to identify solutions close to an ideal and far from a nadir solution. TOPSIS has traditionally been applied in multiple criteria decision analysis. In this paper we propose an approach to develop a TOPSIS classifier. We demonstrate its use in credit scoring, providing a way to deal with large sets of data using machine learning. Data sets often contain many potential explanatory variables, some preferably minimized, some preferably maximized. Results are favorable by a comparison with traditional data mining techniques of decision trees. Proposed models are validated using Mont Carlo simulation.

2021 ◽  
Vol 8 (10) ◽  
pp. 43-50
Author(s):  
Truong et al. ◽  

Clustering is a fundamental technique in data mining and machine learning. Recently, many researchers are interested in the problem of clustering categorical data and several new approaches have been proposed. One of the successful and pioneering clustering algorithms is the Minimum-Minimum Roughness algorithm (MMR) which is a top-down hierarchical clustering algorithm and can handle the uncertainty in clustering categorical data. However, MMR tends to choose the category with less value leaf node with more objects, leading to undesirable clustering results. To overcome such shortcomings, this paper proposes an improved version of the MMR algorithm for clustering categorical data, called IMMR (Improved Minimum-Minimum Roughness). Experimental results on actual data sets taken from UCI show that the IMMR algorithm outperforms MMR in clustering categorical data.


Web Services ◽  
2019 ◽  
pp. 105-126
Author(s):  
N. Nawin Sona

This chapter aims to give an overview of the wide range of Big Data approaches and technologies today. The data features of Volume, Velocity, and Variety are examined against new database technologies. It explores the complexity of data types, methodologies of storage, access and computation, current and emerging trends of data analysis, and methods of extracting value from data. It aims to address the need for clarity regarding the future of RDBMS and the newer systems. And it highlights the methods in which Actionable Insights can be built into public sector domains, such as Machine Learning, Data Mining, Predictive Analytics and others.


Author(s):  
Paul Rippon ◽  
Kerrie Mengersen

Learning algorithms are central to pattern recognition, artificial intelligence, machine learning, data mining, and statistical learning. The term often implies analysis of large and complex data sets with minimal human intervention. Bayesian learning has been variously described as a method of updating opinion based on new experience, updating parameters of a process model based on data, modelling and analysis of complex phenomena using multiple sources of information, posterior probabilistic expectation, and so on. In all of these guises, it has exploded in popularity over recent years.


2011 ◽  
Author(s):  
Bruce Ratner ◽  
Stephen Day ◽  
Christopher Davies

Author(s):  
Divya Chaudhary ◽  
Er. Richa Vasuja

In today's scenario all of data is being generated by everyone of us . so it becomes vital for us to handle this data. To do so new technologies are being developed such as machine learning, data mining etc. This paper gives the study related to machine learning(ML).Precise approximations are repetitively being produced by Machine Learning algorithms. Machine learning system effectively “learns” how to guess from training set of completed jobs. The main purpose of the review is to give a jagged estimate or overview about the mostly used algorithms in machine learning.


Cancer is one of the deadly diseases across many countries. However, cancer can be cured, if detected at an early stage. Researchers are working on healthcare for early detection and prevention of cancer. Medical data has reached its utmost potential by providing researchers with huge data sets collected from all over the globe. In the present scenario, Machine Learning has been widely used in the area of cancer diagnosis and prognosis. Survival analysis may help in the prediction of the early onset of disease, relapse, re-occurrence of diseases and biomarker identification. Applications of machine learning and data mining methods in medical field are currently the most widespread in cancer detection and survival analysis. In this survey, different ways to detect and predict lung cancer using latest Machine learning algorithms combined with data mining has been analyzed. Comparative study of various machine learning techniques and technologies has been done over different types of data such as clinical data, omics data, image data etc.


2021 ◽  
Vol 8 (32) ◽  
pp. 22-38
Author(s):  
José Manuel Amigo

Concepts like Machine Learning, Data Mining or Artificial Intelligence have become part of our daily life. This is mostly due to the incredible advances made in computation (hardware and software), the increasing capabilities of generating and storing all types of data and, especially, the benefits (societal and economical) that generate the analysis of such data. Simultaneously, Chemometrics has played an important role since the late 1970s, analyzing data within natural science (and especially in Analytical Chemistry). Even with the strong parallelisms between all of the abovementioned terms and being popular with most of us, it is still difficult to clearly define or differentiate the meaning of Machine Learning, Data Mining, Artificial Intelligence, Deep Learning and Chemometrics. This manuscript brings some light to the definitions of Machine Learning, Data Mining, Artificial Intelligence and Big Data Analysis, defines their application ranges and seeks an application space within the field of analytical chemistry (a.k.a. Chemometrics). The manuscript is full of personal, sometimes probably subjective, opinions and statements. Therefore, all opinions here are open for constructive discussion with the only purpose of Learning (like the Machines do nowadays).


2018 ◽  
Vol 02 (02) ◽  
pp. 1850015 ◽  
Author(s):  
Joseph R. Barr ◽  
Joseph Cavanaugh

It is not unusual that efforts to validate a statistical model exceed those used to build the model. Multiple techniques are used to validate, compare and contrast among competing statistical models: Some are concerned with a model’s ability to predict new data while others are concerned with model descriptiveness of the data. Without claiming to provide a comprehensive view of the landscape, in this paper we will touch on both aspects of model validation. There is much more to the subject and the reader is referred to any of the many classical statistical texts including the revised two volumes of Bickel and Docksum (2016), the one by Hastie, Tibshirani, and Friedman [The Elements of Statistical Learning: Data Mining, Inference, and Predication, 2nd edn. (Springer, 2009)], and several others listed in the bibliography.


Sign in / Sign up

Export Citation Format

Share Document