Graph-Based Data Mining

Author(s):  
Wenyuan Li ◽  
Wee-Keong Ng ◽  
Kok-Leong Ong

With the most expressive representation that is able to characterize the complex data, graph mining is an emerging and promising domain in data mining. Meanwhile, the graph has been well studied in a long history with many theoretical results from various foundational fields, such as mathematics, physics, and artificial intelligence. In this chapter, we systematically reviewed theories and techniques newly studied and proposed in these areas. Moreover, we focused on those approaches that are potentially valuable to graph-based data mining. These approaches provide the different perspectives and motivations for this new domain. To illustrate how the method from the other area contributes to graph-based data mining, we did a case study on a classic graph problem that can be widely applied in many application areas. Our results showed that the methods from foundational areas may contribute to graph-based data mining.

2013 ◽  
Vol 443 ◽  
pp. 402-406 ◽  
Author(s):  
Shang Gao ◽  
Mei Mei Li

With the rapid development of the number of mobile phone users has accumulated a large number of graph data, graph data mining has gradually become a hot area of research. Traditional data such as clustering, classification, frequent pattern mining gradually extended to the field of graph data mining research. Introduced at this stage graph data mining technology research progress, summarizes the characteristics of the graphical data mining, practical significance, the main problem, and scenarios to discuss and forecast chart data, especially research on uncertain graph data become trends and hot spots.


Author(s):  
ZHENGXIN CHEN

Knowledge economy requires data mining be more goal-oriented so that more tangible results can be produced. This requirement implies that the semantics of the data should be incorporated into the mining process. Data mining is ready to deal with this challenge because recent developments in data mining have shown an increasing interest on mining of complex data (as exemplified by graph mining, text mining, etc.). By incorporating the relationships of the data along with the data itself (rather than focusing on the data alone), complex data injects semantics into the mining process, thus enhancing the potential of making better contribution to knowledge economy. Since the relationships between the data reveal certain behavioral aspects underlying the plain data, this shift of mining from simple data to complex data signals a fundamental change to a new stage in the research and practice of knowledge discovery, which can be termed as behavior mining. Behavior mining also has the potential of unifying some other recent activities in data mining. We discuss important aspects on behavior mining, and discuss its implications for the future of data mining.


2021 ◽  
Author(s):  
Bongs Lainjo

Abstract Background: Information technology has continued to shape contemporary thematic trends. Advances in communication have impacted almost all themes ranging from education, engineering, healthcare, and many other aspects of our daily lives. Method: This paper attempts to review the different dynamics of the thematic IoT platforms. A select number of themes are extensively analyzed with emphasis on data mining (DM), personalized healthcare (PHC), and thematic trends of a select number of subjectively identified IoT-related publications over three years. In this paper, the number of IoT-related-publications is used as a proxy representing the number of apps. DM remains the trailblazer, serving as a theme with crosscutting qualities that drive artificial intelligence (AI), machine learning (ML), and data transformation. A case study in PHC illustrates the importance, complexity, productivity optimization, and nuances contributing to a successful IoT platform. Among the initial 99 IoT themes, 18 are extensively analyzed using the number of IoT publications to demonstrate a combination of different thematic dynamics, including subtleties that influence escalating IoT publication themes. Results: Based on findings amongst the 99 themes, the annual median IoT-related publications for all the themes over the four years were increasingly 5510, 8930, 11700, and 14800 for 2016, 2017, 2018, and 2019 respectively; indicating an upbeat prognosis of IoT dynamics. Conclusion: The vulnerabilities that come with the successful implementation of IoT systems are highlighted including the successes currently achieved by institutions promoting the benefits of IoT-related systems like the case study. Security continues to be an issue of significant importance.


2021 ◽  
Vol 6 (3) ◽  

Information technology has continued to shape contemporary thematic trends. Advances in communication have impacted almost all themes ranging from education, engineering, healthcare, and many other aspects of our daily lives. This paper attempts to review the different dynamics of the thematic IoT platforms. A select number of themes are extensively analyzed with emphasis on data mining (DM), personalized healthcare (PHC), and thematic trends of a select number of subjectively identified IoT-related publications over three years. In this paper, the number of IoT-related-publications is used as a proxy representing the number of apps. DM remains the trailblazer, serving as a theme with crosscutting qualities that drive artificial intelligence (AI), machine learning (ML), and data transformation. A case study in PHC illustrates the importance, complexity, productivity optimization, and nuances contributing to a successful IoT platform. Among the initial 99 IoT themes, 18 are extensively analyzed using the number of IoT publications to demonstrate a combination of different thematic dynamics, including subtleties that influence escalating IoT publication themes. Based on findings amongst the 99 themes, the annual median IoT-related publications for all the themes over the four years were increasingly 5510, 8930, 11700, and 14800 for 2016, 2017, 2018, and 2019 respectively; indicating an upbeat prognosis of IoT dynamics. And finally, the vulnerabilities that come with the successful implementation of IoT systems are highlighted as part of the research. Security continues to be an issue of significant importance.


Author(s):  
Rui Sarmento ◽  
Luís Trigo ◽  
Liliana Fonseca

Managers, investors, financial institutions and government agencies have a major concern on forecasting enterprise bankruptcy. It enables the sustainability assessment of critical suppliers and clients, as well as competitors and the business environment. Throughout the 20th and the 21st century, advances in statistics and computer science fields enabled the development of different trends in financial distress assessment that co-exist today. However, recent Data Mining (DM) techniques are regarded as being the most precise. IT expertise requirements in the constantly evolving DM field may have been a major obstacle to the adoption of these techniques by decision makers. Furthermore, DM software tools that are now widespread offer a broad spectrum of Artificial Intelligence algorithms and the most difficult task may be the decision of selecting the appropriate algorithm. Hence, the adoption of a good workflow method for data processing and analysis is critical for having fast and reliable results. This work presents an overview of the available bankruptcy techniques and provides a comprehensive case study exploring the latest Data Mining techniques.


2018 ◽  
Vol 1 (2) ◽  
pp. 60-72
Author(s):  
Mansour Safran

This aims to review and analyze the Jordanian experiment in the developmental regional planning field within the decentralized managerial methods, which is considered one of the primary basic provisions for applying and success of this kind of planning. The study shoed that Jordan has passed important steps in the way for implanting the decentralized administration, but these steps are still not enough to established the effective and active regional planning. The study reveled that there are many problems facing the decentralized regional planning in Jordan, despite of the clear goals that this planning is trying to achieve. These problems have resulted from the existing relationship between the decentralized administration process’ dimensions from one side, and between its levels which ranged from weak to medium decentralization from the other side, In spite of the official trends aiming at applying more of the decentralized administrative policies, still high portion of these procedures are theoretical, did not yet find a way to reality. Because any progress or success at the level of applying the decentralized administrative policies doubtless means greater effectiveness and influence on the development regional planning in life of the residents in the kingdom’s different regions. So, it is important to go a head in applying more steps and decentralized administrative procedures, gradually and continuously to guarantee the control over any negative effects that might result from Appling this kind of systems.   © 2018 JASET, International Scholars and Researchers Association


2018 ◽  
Vol 5 (1) ◽  
pp. 47-55
Author(s):  
Florensia Unggul Damayanti

Data mining help industries create intelligent decision on complex problems. Data mining algorithm can be applied to the data in order to forecasting, identity pattern, make rules and recommendations, analyze the sequence in complex data sets and retrieve fresh insights. Yet, increasing of technology and various techniques among data mining availability data give opportunity to industries to explore and gain valuable information from their data and use the information to support business decision making. This paper implement classification data mining in order to retrieve knowledge in customer databases to support marketing department while planning strategy for predict plan premium. The dataset decompose into conceptual analytic to identify characteristic data that can be used as input parameter of data mining model. Business decision and application is characterized by processing step, processing characteristic and processing outcome (Seng, J.L., Chen T.C. 2010). This paper set up experimental of data mining based on J48 and Random Forest classifiers and put a light on performance evaluation between J48 and random forest in the context of dataset in insurance industries. The experiment result are about classification accuracy and efficiency of J48 and Random Forest , also find out the most attribute that can be used to predict plan premium in context of strategic planning to support business strategy.


Sign in / Sign up

Export Citation Format

Share Document