Violation of Homogeneity: A Methodologic Issue in the Use of Data Mining Tools

On the Use of Data Mining Tools for Data Preparation in Classification Problems

2012 IEEE/ACIS 11th International Conference on Computer and Information Science ◽

10.1109/icis.2012.79 ◽

2012 ◽

Author(s):

P. M. Goncalves ◽

R. S. M. Barros ◽

D. C. L. Vieira

Keyword(s):

Data Mining ◽

Data Preparation ◽

Classification Problems ◽

Use Of Data ◽

Mining Tools

Use of Data Mining Tools in the Fields of Tea Cultivation and Tea Industry of Assam

International Journal of Computer Applications ◽

10.5120/3813-5266 ◽

2011 ◽

Vol 31 (4) ◽

pp. 27-41

Author(s):

Sadiq Hussain ◽

Nayeemuddin Ahmed

Keyword(s):

Data Mining ◽

Use Of Data ◽

Tea Industry ◽

Mining Tools

Secure Multiparty Computation for Privacy Preserving Data Mining

Encyclopedia of Data Warehousing and Mining ◽

10.4018/978-1-59140-557-3.ch189 ◽

2011 ◽

pp. 1005-1009 ◽

Cited By ~ 26

Author(s):

Yehuda Lindell

Keyword(s):

Data Mining ◽

Privacy Preserving ◽

Data Mining Algorithm ◽

Multiparty Computation ◽

Privacy Preserving Data Mining ◽

Public And Private ◽

The Public ◽

Use Of Data ◽

To Come ◽

Mining Tools

The increasing use of data-mining tools in both the public and private sectors raises concerns regarding the potentially sensitive nature of much of the data being mined. The utility to be gained from widespread data mining seems to come into direct conflict with an individual’s need and right to privacy. Privacy-preserving data-mining solutions achieve the somewhat paradoxical property of enabling a data-mining algorithm to use data without ever actually seeing it. Thus, the benefits of data mining can be enjoyed without compromising the privacy of concerned individuals.

Machine Learning and data mining tools applied for databases of low number of records

Advanced Engineering Research ◽

10.23947/2687-1653-2021-21-4-346-363 ◽

2022 ◽

Vol 21 (4) ◽

pp. 346-363

Author(s):

Hubert Anysz

Keyword(s):

Machine Learning ◽

Data Mining ◽

Computational Methods ◽

Large Datasets ◽

Learning Tools ◽

Data Preparation ◽

Preparation Methods ◽

Use Of Data ◽

Small Set ◽

Mining Tools

The use of data mining and machine learning tools is becoming increasingly common. Their usefulness is most noticeable with large datasets, where the information sought, or new relationships, must be extracted from information noise. As these tools have developed, datasets with far fewer records, usually associated with specific phenomena, are also being explored. Because of this specificity, it is most often impossible to increase the number of cases, although doing so could facilitate the search for dependences in the phenomena under study. The paper discusses the features of applying the selected tools to a small set of data. It presents methods of data preparation and methods for calculating the performance of the tools, taking into account the specifics of databases with a small number of records. The author proposes selected techniques that helped to break deadlocks in the calculations, i.e., cases in which the results were much worse than expected. The small amount of analysed data made it necessary to apply methods that improve the accuracy of forecasts and of classification. This paper is not a review of popular methods of machine learning and data mining; nevertheless, the collected and presented material will help the reader to shorten the path to obtaining satisfactory results when using the described computational methods.

Classification Techniques and Data Mining Tools Used in Medical Bioinformatics

Big Data Governance and Perspectives in Knowledge Management - Advances in Knowledge Acquisition, Transfer, and Management ◽

10.4018/978-1-5225-7077-6.ch005 ◽

2019 ◽

pp. 105-126

Author(s):

Satish Kumar David ◽

Amr T. M. Saeb ◽

Mohamed Rafiullah ◽

Khalid Rubeaan

Keyword(s):

Data Mining ◽

Genomic Analysis ◽

Resistance Pattern ◽

Medical Applications ◽

Classification Analysis ◽

Classification Techniques ◽

Data Mining Techniques ◽

Use Of Data ◽

Drug Resistance Pattern ◽

Mining Tools

Increasing volumes of data, together with the increased availability of information, mandate the use of data mining techniques to gather useful information from datasets. In this chapter, data mining techniques are described with a special emphasis on classification as an important supervised learning technique. Bioinformatics tools for medical applications, especially in medical microbiology, are discussed. This chapter presents WEKA software as a tool of choice for performing classification analysis on different kinds of available data. Uses of WEKA data mining tools for biological applications such as genomic analysis and for medical applications such as diabetes are discussed. Data mining offers novel tools for medical applications in infectious diseases; it can help in identifying the pathogen and analyzing the drug resistance pattern. For non-communicable diseases such as diabetes, it provides excellent data analysis options for analyzing large volumes of data from many clinical studies.

USE OF DATA MINING TOOLS WHEN PLANNING OF NON-STATIONARY WATER FLOODING

Oilfield Engineering ◽

10.30713/0207-2351-2020-1(613)-13-19 ◽

2020 ◽

pp. 13-19

Author(s):

R.T. Alimkhanov ◽

V.V. Rozhkova ◽

R.F. Mazitov ◽

...

Keyword(s):

Data Mining ◽

Water Flooding ◽

Use Of Data ◽

Mining Tools

Data Mining in Institutional Economics Tasks

EPJ Web of Conferences ◽

10.1051/epjconf/201817303013 ◽

2018 ◽

Vol 173 ◽

pp. 03013

Author(s):

Igor Kirilyuk ◽

Anna Kuznetsova ◽

Oleg Senko

Keyword(s):

Data Mining ◽

Institutional Economics ◽

Single Variable ◽

Institutional Type ◽

Explanatory Variables ◽

Use Of Data ◽

Different Types ◽

Mining Tools

The paper discusses problems associated with the use of data mining tools to study discrepancies between countries with different types of institutional matrices, using a variety of potential explanatory variables: climate, economic, or infrastructure indicators. An approach is presented that is based on the search for statistically valid regularities describing the dependence of the institutional type on a single variable or a pair of variables. Examples of regularities are given.

Use of data mining tools rang a sending compete instruments transport services SME sector enterprises

Zeszyty Naukowe Uniwersytetu Szczecińskiego Studia Informatica ◽

10.18276/si.2015.38-03 ◽

2015 ◽

Vol 38 ◽

pp. 37-47

Author(s):

Krzysztof Grochowski ◽

Daniel Zwierzchowski

Keyword(s):

Data Mining ◽

Use Of Data ◽

Transport Services ◽

Mining Tools
