Recent developments in data mining applications and techniques

Social networks have an outstanding marketing value and developing data mining methods for viral marketing is a hot topic in the research community. However, most social networks remain impossible to be fully analyzed and understood due to prohibiting sizes and the incapability of traditional machine learning and data mining approaches to deal with the new dimension in the learning process related to the large-scale environment where the data are produced. On one hand, the birth and evolution of such networks has posed outstanding challenges for the learning and mining community, and on the other has opened the possibility for very powerful business applications. However, little understanding exists regarding these business applications and the potential of social network mining to boost marketing. This paper presents a review of the most important state-of-the-art approaches in the machine learning and data mining community regarding analysis of social networks and their business applications. The authors review the problems related to social networks and describe the recent developments in the area discussing important achievements in the analysis of social networks and outlining future work. The focus of the review in not only on the technical aspects of the learning and mining approaches applied to social networks but also on the business potentials of such methods.

Download Full-text

Locally Adaptive Techniques for Pattern Classification

Encyclopedia of Data Warehousing and Mining, Second Edition ◽

10.4018/978-1-60566-010-3.ch182 ◽

2011 ◽

pp. 1170-1175

Author(s):

Carlotta Domeniconi ◽

Dimitrios Gunopulos

Keyword(s):

Data Mining ◽

Electronic Commerce ◽

Pattern Classification ◽

Credit Card ◽

Curse Of Dimensionality ◽

Ease Of Use ◽

Practical Significance ◽

High Dimensional ◽

Finite Samples ◽

Recent Developments

Pattern classification is a very general concept with numerous applications ranging from science, engineering, target marketing, medical diagnosis and electronic commerce to weather forecast based on satellite imagery. A typical application of pattern classification is mass mailing for marketing. For example, credit card companies often mail solicitations to consumers. Naturally, they would like to target those consumers who are most likely to respond. Often, demographic information is available for those who have responded previously to such solicitations, and this information may be used in order to target the most likely respondents. Another application is electronic commerce of the new economy. E-commerce provides a rich environment to advance the state-of-the-art in classification because it demands effective means for text classification in order to make rapid product and market recommendations. Recent developments in data mining have posed new challenges to pattern classification. Data mining is a knowledge discovery process whose aim is to discover unknown relationships and/or patterns from a large set of data, from which it is possible to predict future outcomes. As such, pattern classification becomes one of the key steps in an attempt to uncover the hidden knowledge within the data. The primary goal is usually predictive accuracy, with secondary goals being speed, ease of use, and interpretability of the resulting predictive model. While pattern classification has shown promise in many areas of practical significance, it faces difficult challenges posed by real world problems, of which the most pronounced is Bellman’s curse of dimensionality: it states the fact that the sample size required to perform accurate prediction on problems with high dimensionality is beyond feasibility. This is because in high dimensional spaces data become extremely sparse and are apart from each other. As a result, severe bias that affects any estimation process can be introduced in a high dimensional feature space with finite samples. Learning tasks with data represented as a collection of a very large number of features abound. For example, microarrays contain an overwhelming number of genes relative to the number of samples. The Internet is a vast repository of disparate information growing at an exponential rate. Efficient and effective document retrieval and classification systems are required to turn the ocean of bits around us into useful information, and eventually into knowledge. This is a challenging task, since a word level representation of documents easily leads 30000 or more dimensions. This chapter discusses classification techniques to mitigate the curse of dimensionality and reduce bias, by estimating feature relevance and selecting features accordingly. This issue has both theoretical and practical relevance, since many applications can benefit from improvement in prediction performance.

Download Full-text

FROM DATA MINING TO BEHAVIOR MINING

International Journal of Information Technology & Decision Making ◽

10.1142/s0219622006002271 ◽

2006 ◽

Vol 05 (04) ◽

pp. 703-711 ◽

Cited By ~ 21

Author(s):

ZHENGXIN CHEN

Keyword(s):

Data Mining ◽

Knowledge Discovery ◽

Knowledge Economy ◽

Graph Mining ◽

Complex Data ◽

Fundamental Change ◽

Process Data ◽

Research And Practice ◽

Behavior Mining ◽

Recent Developments

Knowledge economy requires data mining be more goal-oriented so that more tangible results can be produced. This requirement implies that the semantics of the data should be incorporated into the mining process. Data mining is ready to deal with this challenge because recent developments in data mining have shown an increasing interest on mining of complex data (as exemplified by graph mining, text mining, etc.). By incorporating the relationships of the data along with the data itself (rather than focusing on the data alone), complex data injects semantics into the mining process, thus enhancing the potential of making better contribution to knowledge economy. Since the relationships between the data reveal certain behavioral aspects underlying the plain data, this shift of mining from simple data to complex data signals a fundamental change to a new stage in the research and practice of knowledge discovery, which can be termed as behavior mining. Behavior mining also has the potential of unifying some other recent activities in data mining. We discuss important aspects on behavior mining, and discuss its implications for the future of data mining.

Download Full-text

Governance in European Hospitals

Advances in Electronic Government, Digital Divide, and Regional Development - Social and Political Implications of Data Mining ◽

10.4018/978-1-60566-230-5.ch006 ◽

2010 ◽

pp. 88-105

Author(s):

Kristof Eeckloo ◽

Luc Delesie ◽

Arthur Vleugels

Keyword(s):

Data Mining ◽

Research Agenda ◽

Practical Knowledge ◽

Real Data ◽

Checks And Balances ◽

Hospital Governance ◽

Policy Domain ◽

Recent Developments ◽

State Of Affairs ◽

The Uk

Hospital governance refers to the complex of checks and balances that determine how decisions are made within the top structures of hospitals. In this chapter, authors introduce hospital governance as a policy domain in which data mining methods have a large potential to provide insight and practical knowledge. The chapter starts by exploring the essentials of the concept, by analysing the root notion of governance and comparing it with applications in other sectors. Recent developments and examples from the UK, France and The Netherlands are outlined. Based on an evaluation of the current state of affairs, a research agenda is developed. The chapter concludes with an introduction to the European Hospital Governance Project, which follows the outlines of the described research agenda. Methods of data mining and information visualisation that are used in this project are explained by means of a real data example.

Download Full-text

Recent Developments in Damage Identification of Structures Using Data Mining

Latin American Journal of Solids and Structures ◽

10.1590/1679-78254378 ◽

2017 ◽

Vol 14 (13) ◽

pp. 2373-2401 ◽

Cited By ~ 20

Author(s):

Meisam Gordan ◽

Hashim Abdul Razak ◽

Zubaidah Ismail ◽

Khaled Ghaedi

Keyword(s):

Data Mining ◽

Damage Identification ◽

Recent Developments ◽

Using Data

Download Full-text

Exception Rules in Data Mining

Data Warehousing and Mining ◽

10.4018/978-1-59904-951-9.ch021 ◽

2008 ◽

pp. 336-342

Author(s):

Olena Daly ◽

David Taniar

Keyword(s):

Artificial Intelligence ◽

Machine Learning ◽

Data Mining ◽

Study Data ◽

Recent Developments

Data mining is a process of discovering new, unexpected, valuable patterns from existing databases (Frawley, Piatetsky-Shapiro, & Matheus, 1991). Though data mining is the evolution of a field with a long history, the term itself was only introduced relatively recently, in the 1990s. Data mining is best described as the union of historical and recent developments in statistics, artificial intelligence, and machine learning. These techniques are then used together to study data and find previously hidden trends or patterns within.

Download Full-text

Recent developments in data mining and soft computing for JET with a view on ITER

Fusion Engineering and Design ◽

10.1016/j.fusengdes.2008.12.060 ◽

2009 ◽

Vol 84 (7-11) ◽

pp. 1372-1375

Author(s):

A. Murari ◽

J. Vega ◽

G. Vagliasindi ◽

J.A. Alonso ◽

D. Alves ◽

...

Keyword(s):

Data Mining ◽

Soft Computing ◽

Recent Developments

Download Full-text

Assessing the Applicability of Three Approaches to Design-Oriented Research

Journal of Contemporary Educational Research ◽

10.26689/jcer.v5i8.2411 ◽

2021 ◽

Vol 5 (8) ◽

pp. 36-40

Author(s):

Jing Zhou ◽

Christopher Kueh ◽

Yi Lin

Keyword(s):

Data Mining ◽

Data Collection ◽

Reflective Practice ◽

Design Process ◽

Quantitative Data ◽

Research Field ◽

Efficient Design ◽

Qualitative Interviewing ◽

Recent Developments ◽

Oriented Research

The three main approaches in inquisitive research design are qualitative, quantitative, and mixed methods [1]. However, recent developments in the research field have resulted in multiple other approaches, borrowing ideas from a broad range of fields. One such approach is the practice-led approach. This approach involves an efficient design process, novel qualitative interviewing methods, together with data mining procedures from quantitative data collection [2]. This paper assesses the practice-led approach used in user experience (UX) design, together with three approaches: co-design, service design, and reflective practice.

Download Full-text

Recent Developments in Privacy-preserving Mining of Clinical Data

ACM/IMS Transactions on Data Science ◽

10.1145/3447774 ◽

2021 ◽

Vol 2 (4) ◽

pp. 1-32

Author(s):

Chance Desmet ◽

Diane J. Cook

Keyword(s):

Data Mining ◽

Clinical Data ◽

Personal Information ◽

Personal Data ◽

Privacy Preserving ◽

Clinical Applications ◽

Future Research ◽

Privacy Preserving Data Mining ◽

Recent Developments

With the dramatic improvements in both the capability to collect personal data and the capability to analyze large amounts of data, increasingly sophisticated and personal insights are being drawn. These insights are valuable for clinical applications but also open up possibilities for identification and abuse of personal information. In this article, we survey recent research on classical methods of privacy-preserving data mining. Looking at dominant techniques and recent innovations to them, we examine the applicability of these methods to the privacy-preserving analysis of clinical data. We also discuss promising directions for future research in this area.

Download Full-text

Recent Developments in Boolean Matrix Factorization

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2020/685 ◽

2020 ◽

Author(s):

Pauli Miettinen ◽

Stefan Neumann

Keyword(s):

Machine Learning ◽

Data Mining ◽

Matrix Factorization ◽

Formal Concept Analysis ◽

Boolean Matrix ◽

Low Rank ◽

Formal Concept ◽

Binary Matrix ◽

Open Questions ◽

Recent Developments

The goal of Boolean Matrix Factorization (BMF) is to approximate a given binary matrix as the product of two low-rank binary factor matrices, where the product of the factor matrices is computed under the Boolean algebra. While the problem is computationally hard, it is also attractive because the binary nature of the factor matrices makes them highly interpretable. In the last decade, BMF has received a considerable amount of attention in the data mining and formal concept analysis communities and, more recently, the machine learning and the theory communities also started studying BMF. In this survey, we give a concise summary of the efforts of all of these communities and raise some open questions which in our opinion require further investigation.

Download Full-text