Recent developments in data mining applications and techniques

Author(s):  
Ameera M. Almasoud ◽  
Hend S. Al-Khalifa ◽  
Abdulmalik Al-Salman
2013 ◽  
Vol 9 (1) ◽  
pp. 36-53
Author(s):  
Evis Trandafili ◽  
Marenglen Biba

Social networks have an outstanding marketing value and developing data mining methods for viral marketing is a hot topic in the research community. However, most social networks remain impossible to be fully analyzed and understood due to prohibiting sizes and the incapability of traditional machine learning and data mining approaches to deal with the new dimension in the learning process related to the large-scale environment where the data are produced. On one hand, the birth and evolution of such networks has posed outstanding challenges for the learning and mining community, and on the other has opened the possibility for very powerful business applications. However, little understanding exists regarding these business applications and the potential of social network mining to boost marketing. This paper presents a review of the most important state-of-the-art approaches in the machine learning and data mining community regarding analysis of social networks and their business applications. The authors review the problems related to social networks and describe the recent developments in the area discussing important achievements in the analysis of social networks and outlining future work. The focus of the review in not only on the technical aspects of the learning and mining approaches applied to social networks but also on the business potentials of such methods.


Author(s):  
Carlotta Domeniconi ◽  
Dimitrios Gunopulos

Pattern classification is a very general concept with numerous applications ranging from science, engineering, target marketing, medical diagnosis and electronic commerce to weather forecast based on satellite imagery. A typical application of pattern classification is mass mailing for marketing. For example, credit card companies often mail solicitations to consumers. Naturally, they would like to target those consumers who are most likely to respond. Often, demographic information is available for those who have responded previously to such solicitations, and this information may be used in order to target the most likely respondents. Another application is electronic commerce of the new economy. E-commerce provides a rich environment to advance the state-of-the-art in classification because it demands effective means for text classification in order to make rapid product and market recommendations. Recent developments in data mining have posed new challenges to pattern classification. Data mining is a knowledge discovery process whose aim is to discover unknown relationships and/or patterns from a large set of data, from which it is possible to predict future outcomes. As such, pattern classification becomes one of the key steps in an attempt to uncover the hidden knowledge within the data. The primary goal is usually predictive accuracy, with secondary goals being speed, ease of use, and interpretability of the resulting predictive model. While pattern classification has shown promise in many areas of practical significance, it faces difficult challenges posed by real world problems, of which the most pronounced is Bellman’s curse of dimensionality: it states the fact that the sample size required to perform accurate prediction on problems with high dimensionality is beyond feasibility. This is because in high dimensional spaces data become extremely sparse and are apart from each other. As a result, severe bias that affects any estimation process can be introduced in a high dimensional feature space with finite samples. Learning tasks with data represented as a collection of a very large number of features abound. For example, microarrays contain an overwhelming number of genes relative to the number of samples. The Internet is a vast repository of disparate information growing at an exponential rate. Efficient and effective document retrieval and classification systems are required to turn the ocean of bits around us into useful information, and eventually into knowledge. This is a challenging task, since a word level representation of documents easily leads 30000 or more dimensions. This chapter discusses classification techniques to mitigate the curse of dimensionality and reduce bias, by estimating feature relevance and selecting features accordingly. This issue has both theoretical and practical relevance, since many applications can benefit from improvement in prediction performance.


Author(s):  
ZHENGXIN CHEN

Knowledge economy requires data mining be more goal-oriented so that more tangible results can be produced. This requirement implies that the semantics of the data should be incorporated into the mining process. Data mining is ready to deal with this challenge because recent developments in data mining have shown an increasing interest on mining of complex data (as exemplified by graph mining, text mining, etc.). By incorporating the relationships of the data along with the data itself (rather than focusing on the data alone), complex data injects semantics into the mining process, thus enhancing the potential of making better contribution to knowledge economy. Since the relationships between the data reveal certain behavioral aspects underlying the plain data, this shift of mining from simple data to complex data signals a fundamental change to a new stage in the research and practice of knowledge discovery, which can be termed as behavior mining. Behavior mining also has the potential of unifying some other recent activities in data mining. We discuss important aspects on behavior mining, and discuss its implications for the future of data mining.


Author(s):  
Kristof Eeckloo ◽  
Luc Delesie ◽  
Arthur Vleugels

Hospital governance refers to the complex of checks and balances that determine how decisions are made within the top structures of hospitals. In this chapter, authors introduce hospital governance as a policy domain in which data mining methods have a large potential to provide insight and practical knowledge. The chapter starts by exploring the essentials of the concept, by analysing the root notion of governance and comparing it with applications in other sectors. Recent developments and examples from the UK, France and The Netherlands are outlined. Based on an evaluation of the current state of affairs, a research agenda is developed. The chapter concludes with an introduction to the European Hospital Governance Project, which follows the outlines of the described research agenda. Methods of data mining and information visualisation that are used in this project are explained by means of a real data example.


2017 ◽  
Vol 14 (13) ◽  
pp. 2373-2401 ◽  
Author(s):  
Meisam Gordan ◽  
Hashim Abdul Razak ◽  
Zubaidah Ismail ◽  
Khaled Ghaedi

2008 ◽  
pp. 336-342
Author(s):  
Olena Daly ◽  
David Taniar

Data mining is a process of discovering new, unexpected, valuable patterns from existing databases (Frawley, Piatetsky-Shapiro, & Matheus, 1991). Though data mining is the evolution of a field with a long history, the term itself was only introduced relatively recently, in the 1990s. Data mining is best described as the union of historical and recent developments in statistics, artificial intelligence, and machine learning. These techniques are then used together to study data and find previously hidden trends or patterns within.


2009 ◽  
Vol 84 (7-11) ◽  
pp. 1372-1375
Author(s):  
A. Murari ◽  
J. Vega ◽  
G. Vagliasindi ◽  
J.A. Alonso ◽  
D. Alves ◽  
...  

2021 ◽  
Vol 5 (8) ◽  
pp. 36-40
Author(s):  
Jing Zhou ◽  
Christopher Kueh ◽  
Yi Lin

The three main approaches in inquisitive research design are qualitative, quantitative, and mixed methods [1]. However, recent developments in the research field have resulted in multiple other approaches, borrowing ideas from a broad range of fields. One such approach is the practice-led approach. This approach involves an efficient design process, novel qualitative interviewing methods, together with data mining procedures from quantitative data collection [2]. This paper assesses the practice-led approach used in user experience (UX) design, together with three approaches: co-design, service design, and reflective practice.


2021 ◽  
Vol 2 (4) ◽  
pp. 1-32
Author(s):  
Chance Desmet ◽  
Diane J. Cook

With the dramatic improvements in both the capability to collect personal data and the capability to analyze large amounts of data, increasingly sophisticated and personal insights are being drawn. These insights are valuable for clinical applications but also open up possibilities for identification and abuse of personal information. In this article, we survey recent research on classical methods of privacy-preserving data mining. Looking at dominant techniques and recent innovations to them, we examine the applicability of these methods to the privacy-preserving analysis of clinical data. We also discuss promising directions for future research in this area.


Author(s):  
Pauli Miettinen ◽  
Stefan Neumann

The goal of Boolean Matrix Factorization (BMF) is to approximate a given binary matrix as the product of two low-rank binary factor matrices, where the product of the factor matrices is computed under the Boolean algebra. While the problem is computationally hard, it is also attractive because the binary nature of the factor matrices makes them highly interpretable. In the last decade, BMF has received a considerable amount of attention in the data mining and formal concept analysis communities and, more recently, the machine learning and the theory communities also started studying BMF. In this survey, we give a concise summary of the efforts of all of these communities and raise some open questions which in our opinion require further investigation.


Sign in / Sign up

Export Citation Format

Share Document