Data Mining Using Qualitative Information on the Web

2008 ◽  
pp. 1416-1434
Author(s):  
Taeho Hong ◽  
Woojong Suh

Data mining has drawn much attention in generating the useful information from Web data. Data mining techniques have typically considered quantitative information rather than qualitative, though the qualitative information can often be used to improve the quality of a result. This chapter provides a hybrid data mining application, KBNMiner (Knowledge-Based News Miner), to predict interest rates on the basis of qualitative information on the Web as well as quantitative information stored in a database. The KBNMiner is developed through the integration of cognitive maps and neural networks. To validate the effectiveness of the KBNMiner, an experiment with Web news information is conducted and its results are discussed.

2005 ◽  
pp. 332-352
Author(s):  
Taeho Hong ◽  
Woojong Suh

Data mining has drawn much attention in generating the useful information from Web data. Data mining techniques have typically considered quantitative information rather than qualitative, though the qualitative information can often be used to improve the quality of a result. This chapter provides a hybrid data mining application, KBNMiner (Knowledge-Based News Miner), to predict interest rates on the basis of qualitative information on the Web as well as quantitative information stored in a database. The KBNMiner is developed through the integration of cognitive maps and neural networks. To validate the effectiveness of the KBNMiner, an experiment with Web news information is conducted and its results are discussed.


Author(s):  
Arun Thotapalli Sundararaman

Study of data quality for data mining application has always been a complex topic; in the recent years, this topic has gained further complexity with the advent of big data as the source for data mining and business intelligence (BI) applications. In a big data environment, data is consumed in various states and various forms serving as input for data mining, and this is the main source of added complexity. These new complexities and challenges arise from the underlying dimensions of big data (volume, variety, velocity, and value) together with the ability to consume data at various stages of transition from raw data to standardized datasets. These have created a need for expanding the traditional data quality (DQ) factors into BDQ (big data quality) factors besides the need for new BDQ assessment and measurement frameworks for data mining and BI applications. However, very limited advancement has been made in research and industry in the topic of BDQ and their relevance and criticality for data mining and BI applications. Data quality in data mining refers to the quality of the patterns or results of the models built using mining algorithms. DQ for data mining in business intelligence applications should be aligned with the objectives of the BI application. Objective measures, training/modeling approaches, and subjective measures are three major approaches that exist to measure DQ for data mining. However, there is no agreement yet on definitions or measurements or interpretations of DQ for data mining. Defining the factors of DQ for data mining and their measurement for a BI system has been one of the major challenges for researchers as well as practitioners. This chapter provides an overview of existing research in the area of BDQ definitions and measurement for data mining for BI, analyzes the gaps therein, and provides a direction for future research and practice in this area.


Author(s):  
Lan-zhong Wang

The purpose of this study is to develop a distance personalized teaching platform. The web data mining is used for the construction of the system and by analyzing the character of web data mining (WDM) and the essence of personalization teaching and instruction, based on WDM, The system contains knowledge base, individual database, WDM and web server four modules. The web data mining is used for the construction of the system and by analyzing the character of web data mining (WDM) and the essence of personalization teaching and instruction. Simulation results show that model has important enlightenment and pushing effect for promoting the individual service and improving teaching quality of modern distance education.


2018 ◽  
Vol 7 (2.7) ◽  
pp. 980 ◽  
Author(s):  
V Sai Virajitha ◽  
Dr JKR Sastry ◽  
Dr V Chandra Prakash ◽  
P Srija ◽  
M Varun

WEB sites are playing very vital role in information dissemination. Most of the businesses are using their WEB sites to promote market and conduct business. The quality of the WEB sites has indirect relationship with quantum of business conduct by the industrial establishments. Quality of a web site is based on number of characteristics; computation of the same in quantitative terms is a complex process. Structure of a WEB site plays a vital role in hosting the content in most comprehensive manner.In this paper the subjecting of the WEB to data mining and determining the structures contained in the WEB site is presented. The structures are evaluated to find the quality of the same individually and also combined considering all the structures that are mined. A method is presented in this paper using which the quality of a web site is computed considering the structure of the WEB site alone. 


Author(s):  
Vinod Podichetty ◽  
Robert Biscup

The Internet offers an unprecedented opportunity for healthcare information to be disseminated instantaneously. Quality of information, both scientific and nonscientific, and the development of tools to disseminate information securely via the Internet are the two most important issues related to achieving effective and wider exchange of health information. For the first time ever, information can be exchanged simultaneously and interactively all around the world, with the potential of being equally available to healthcare professionals as well as to patients. The big difference between yesterday's knowledge-based patient care and that of tomorrow, is a fundamental premise that patients will explore the web world with a desire to learn more about their condition, including its treatment and prognosis. This has evolved into the concept of e-health (Electronic Health). Evaluation and examination of the information being conveyed via the Internet is important and necessary in order for the Internet to be an effective tool in healthcare.


2020 ◽  
Vol 7 (2) ◽  
pp. 187
Author(s):  
Farid Ridho ◽  
Fachruddin Mansyur

<p><em>BPS is a data provider body in Indonesia. In publishing, BPS uses a variety of media, one of which is the BPS website. To get data through the BPS website, users can visit the website then download the data they need. The services obtained by data users on the BPS website depend on the quality of the website. The better the quality, the better the service experience gained by data users. The method that can be used to improve the quality of a website is the web usage mining method. Web usage mining is the application of data mining techniques on web repositories to study usage patterns. The purpose of this study is to determine the pattern of data publication requests on the BPS website which can later be used as a reference to improve the quality of BPS website services. Based on the results of the study, it was found that data users tend to access the same data with different years simultaneously. For results by grouping data by title without year, obtained quite diverse rules.</em></p><p><em><strong>Keywords</strong></em><em>: </em><em>web usage mining, association rule, apriori</em></p><p><em>BPS merupakan badan penyedia data di Indonesia. Dalam mempublikasikan datanya, BPS menggunakan berbagai media, salah satunya adalah website BPS. Untuk mendapatkan data melalui website BPS, pengguna dapat mengunjungi website kemudian mengunduh data yang mereka butuhkan. Layanan yang didapatkan oleh pengguna data pada website BPS tergantung dari kualitas website tersebut. Semakin baik kualitasnya, semakin baik pula pengalaman pelayanan yang didapatkan oleh pengguna data. Metode yang dapat digunakan untuk meningkatkan kualitas suatu website adalah metode web usage mining. Web usage mining merupakan penerapan tekhnik data mining pada web repositori untuk mempelajari pola penggunaan</em><em>. </em><em>Tujuan dari penelitian ini adalah untuk mengetahui pola permintaan publikasi data pada website BPS yang nantinya dapat digunakan sebagai acuan untuk meningkatkan kualitas layanan website BPS. Berdasarkan hasil penelitian, didapatkan bahwa pengguna data cenderung mengakses data yang sama dengan tahun yang berbeda secara bersamaan. Untuk hasil dengan mengelompokan data berdasarkan judul tanpa tahun, diperoleh rules yang cukup beragam.</em></p><p><em><strong>Kata kunci</strong></em><em>: </em><em>web usage mining, association rule, apriori</em></p>


2012 ◽  
Vol 532-533 ◽  
pp. 919-923 ◽  
Author(s):  
Feng Zhang ◽  
Li Liu

To improve the data mining efficiency, analyzed existing algorithm for data mining.However,it has some uncertain knoledge are a major concern in data mining, it is great difficulty for data mining in web knoledge,which contains more uncertainty than an affirmatory one dees. In this paper, with web mining method based on the cloud computing analysis. One is the main issues related to the web knowledge problem are detaled, the other is the commonly used methods of handling web knowledge problems in data mining are reviewed, with a diseussion about a number of their known strength and weakness. This can be used to improve the quality of information service on web and can assist the web master to optimize site architec and increase visiting efficiency. The results of experiment show that it is better than that of the existing methods proposed in the literature.


Author(s):  
Gabriela Alejandra García-Templado ◽  
Remah Y. Gharib

 Architectural design theories developed during the last decades of the 20th century - including Environmental Psychology and Pattern Theories - aimed to improve the quality of the built environment while centred on the experience of users. However, their approaches of analytical methodologies are not usually applied to understand and comprehend historic buildings from a wider architectural perspective. This study aims to deepen the analysis of historic buildings by advancing their depictions using concepts and ideas mainly established in pattern theories and contemporary best practices, in order to facilitate how modern designers may learn from the significant buildings of the past. To achieve this, a knowledge-based descriptive framework has been developed; this tool serves to enrich the architectural description of a building by including both qualitative and quantitative details, abstract and as-built characteristics, and spatial patterns which are inherent to architectural designs. Four historical palatial complexes erected in the Iberian Peninsula during the Islamic rule in al-Andalus have been selected to demonstrate the practical application and validity of this tool. The information collected through such a descriptive tool adds a layer of quantitative information that enriches the depiction of the historic buildings studied. An organized display of the resulting data provides for comparative analysis and also serves as a way to develop contemporary architectural proposals which reflect distinctive features of significant historical buildings.


Sign in / Sign up

Export Citation Format

Share Document