Implementation of Web Data Mining Technology Based on Python

Abstract: With the arrival of the era of big data, people have gradually realized the importance of data. Data is not just a resource; it is an asset. This paper studies the implementation of Web data mining technology based on Python. It analyzes the overall architecture design of a distributed web crawler system, and then analyzes in detail the principles of the crawler's URL module, web crawling module, web page parsing module, data storage module, and so on. Each functional module of the crawler system was tested on the experimental computer, and the resulting data were summarized for comparative analysis. The main significance of this paper lies in the design and implementation of a distributed web crawler system which, to a certain extent, solves the problems of slow speed, low efficiency, and poor scalability of a traditional single-computer web crawler, and improves the speed and efficiency with which the crawler captures information and web page data.
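The four modules named in the abstract (URL management, page crawling, page parsing, data storage) can be illustrated with a minimal single-process sketch. This is not the paper's implementation and it does not show the distributed part. The names `URLManager`, `PageParser`, `crawl`, and `FAKE_WEB` are hypothetical, and the fetcher is injected as a plain function so the example runs without network access.

```python
import sqlite3
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin


class URLManager:
    """URL module: hands out each URL once, in FIFO order."""

    def __init__(self, seeds):
        self.queue = deque(seeds)
        self.seen = set(seeds)

    def add(self, url):
        if url not in self.seen:
            self.seen.add(url)
            self.queue.append(url)

    def next(self):
        return self.queue.popleft() if self.queue else None


class PageParser(HTMLParser):
    """Parsing module: collects the page title and outgoing links."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.title = ""
        self.links = []
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        if tag == "title":
            self._in_title = True
        elif tag == "a":
            href = dict(attrs).get("href")
            if href:
                # Resolve relative links against the page URL.
                self.links.append(urljoin(self.base_url, href))

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title:
            self.title += data


def crawl(seeds, fetch, db, max_pages=100):
    """Run the four modules in sequence until the queue is empty."""
    db.execute("CREATE TABLE IF NOT EXISTS pages (url TEXT PRIMARY KEY, title TEXT)")
    urls = URLManager(seeds)
    stored = 0
    while stored < max_pages:
        url = urls.next()
        if url is None:
            break
        html = fetch(url)             # crawl module (injected function)
        if html is None:
            continue
        parser = PageParser(url)      # parsing module
        parser.feed(html)
        for link in parser.links:
            urls.add(link)
        # Storage module: one row per fetched page.
        db.execute("INSERT OR REPLACE INTO pages VALUES (?, ?)",
                   (url, parser.title.strip()))
        stored += 1
    db.commit()
    return stored


# Stand-in for a real HTTP fetcher: a tiny in-memory "web" (assumed data).
FAKE_WEB = {
    "http://example.test/": '<title>Home</title><a href="/a">A</a><a href="/b">B</a>',
    "http://example.test/a": '<title>Page A</title><a href="/">Home</a>',
    "http://example.test/b": '<title>Page B</title><a href="/a">A</a>',
}

db = sqlite3.connect(":memory:")
count = crawl(["http://example.test/"], FAKE_WEB.get, db)
rows = db.execute("SELECT url, title FROM pages ORDER BY url").fetchall()
```

In a distributed variant, the in-process queue and seen-set would presumably be replaced by a store shared among worker machines; the abstract does not say how the paper does this.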

Web Data Mining Technology and Network Information Security

2018 1st International Conference on Engineering, Communication and Computer Sciences (ICECCS 2018) ◽

10.23977/iceccs.2018.022 ◽

2018 ◽

Keyword(s):

Data Mining ◽

Information Security ◽

Web Data ◽

Web Data Mining ◽

Mining Technology ◽

Network Information

Application analysis of computer web data mining technology in E-commerce

10.1145/3501409.3501626 ◽

2021 ◽

Author(s):

Huiting Ju ◽

Hui Wang

Keyword(s):

Data Mining ◽

Web Data ◽

Web Data Mining ◽

Mining Technology ◽

Application Analysis

Personalized Services Research Based on Web Data Mining Technology

2009 Second International Symposium on Computational Intelligence and Design ◽

10.1109/iscid.2009.192 ◽

2009 ◽

Cited By ~ 5

Author(s):

Xiaorong Cheng ◽

Hong Liu

Keyword(s):

Data Mining ◽

Web Data ◽

Services Research ◽

Web Data Mining ◽

Mining Technology ◽

Personalized Services

Web Data Mining Technology on Cloud Computing

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.543-547.3490 ◽

2014 ◽

Vol 543-547 ◽

pp. 3490-3493

Author(s):

Yan Zhang

Keyword(s):

Data Mining ◽

Cloud Computing ◽

Efficient Method ◽

Rapid Development ◽

Present Situation ◽

Technology Research ◽

Computing Technology ◽

Web Data ◽

Web Data Mining ◽

Mining Technology

With the rapid development of cloud computing technology, traditional centralized data mining technology has become inadequate for the rapidly growing volume of data. Cloud-computing-based Web data mining technology has come into use because it is a reliable and efficient method. This article introduces the meaning, characteristics, and present situation of cloud computing; analyzes the advantages of Web data mining technology built on cloud computing; surveys and summarizes the present situation, challenges, and problems of current research on cloud computing Web data mining technology; and puts forward corresponding methods to solve these problems.

Research of Intelligent Intrusion Detection System Based on Web Data Mining Technology

2011 Fourth International Conference on Business Intelligence and Financial Engineering ◽

10.1109/bife.2011.102 ◽

2011 ◽

Cited By ~ 1

Author(s):

Wenguang Chai ◽

Chunhui Tan ◽

Yuting Duan

Keyword(s):

Data Mining ◽

Intrusion Detection ◽

Intrusion Detection System ◽

Detection System ◽

Web Data ◽

Web Data Mining ◽

Mining Technology

Research and Application in the Web Data Mining Based on the XML

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.644-650.2124 ◽

2014 ◽

Vol 644-650 ◽

pp. 2124-2127

Author(s):

Fen Liu

Keyword(s):

Data Mining ◽

Information Transmission ◽

Rapid Development ◽

Data Sources ◽

The Internet ◽

Web Data ◽

Web Data Mining ◽

Mining Technology ◽

Web Documents ◽

The Web

With the rapid development of the Internet, it has become an important resource for transmitting and sharing information. Web data is semi-structured, heterogeneous, and massive, so traditional data mining technology cannot be applied directly to Web data sources. Web data mining refers to extracting potentially useful models from Web documents or Web activities. Because of the structure and extensibility of XML, research on combining XML with Web data mining has also become popular.

Analyzing Security and Performance Issue in Web Data Mining Technology

International Journal of Computer Applications ◽

10.5120/14809-3027 ◽

2014 ◽

Vol 85 (1) ◽

pp. 45-49

Author(s):

Md Nadeem Ahmed ◽

Mohd Hussain

Keyword(s):

Data Mining ◽

Web Data ◽

Web Data Mining ◽

Mining Technology ◽

And Performance

Web Data Mining Technology and Instrument Research

Lecture Notes in Electrical Engineering - Proceedings of the 2nd International Conference on Green Communications and Networks 2012 (GCN 2012): Volume 2 ◽

10.1007/978-3-642-35567-7_29 ◽

2013 ◽

pp. 231-237

Author(s):

Yunli Lei

Keyword(s):

Data Mining ◽

Web Data ◽

Web Data Mining ◽

Mining Technology

Web Data Mining Technology and Network Information Security Precautions

IOP Conference Series Materials Science and Engineering ◽

10.1088/1757-899x/750/1/012198 ◽

2020 ◽

Vol 750 ◽

pp. 012198

Author(s):

Jun-zhong He

Keyword(s):

Data Mining ◽

Information Security ◽

Web Data ◽

Web Data Mining ◽

Mining Technology ◽

Network Information

Research on the Application of Web Mining Technique Based on XML for Unstructured Web Data Using LINQ

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.403-408.1062 ◽

2011 ◽

Vol 403-408 ◽

pp. 1062-1067 ◽

Cited By ~ 1

Author(s):

Payalpreet Kaur ◽

Raghu Garg ◽

Ravinder Singh ◽

Mandeep Singh

Keyword(s):

Data Mining ◽

Data Storage ◽

Data Exchange ◽

Web Mining ◽

Data Extraction ◽

Unstructured Data ◽

Web Data ◽

Web Data Mining ◽

XML Document ◽

Mining Model

Web data mining has gained popularity in recent years with the advancement of web mining technologies. Web data mining is the extraction of data from the web: a technique for crawling through various web resources to collect required information, which enables an individual or a company to promote business and to understand marketing dynamics, new promotions circulating on the Internet, etc. Data on the web is unstructured, irregular, and lacks a fixed unified pattern, because it is presented in HTML, which represents data in a presentation format and is unable to handle semi-structured or unstructured data. These difficulties led to the emergence of XML-based web data mining. XML was created so that richly structured documents could be used over the web. XML provides a standard for data exchange and data storage. This paper presents a web data mining model based on XML. In this model, unstructured data is first transformed to XML, the XML document is then stored in a database in the form of a string tree, and specific records are then searched using a LINQ query. If a record does not exist in the database, the model checks the specific website for updates and repeats the same steps. Finally, the data selected by the LINQ query is displayed in the web browser. Data extraction is faster because the database stores data extracted earlier by one user, which other users can then retrieve by passing a LINQ query. The model needs no separate XSL file, because it stores the XML document in the database in the form of a string tree. The model is implemented using C# with XML.
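The flow described above (transform to XML, store the document in a database, query it, re-extract on a miss) can be sketched as follows. The paper's model is written in C# and queried with LINQ; this sketch is in Python and swaps in the limited XPath support of `xml.etree.ElementTree` as the query mechanism. The class `XmlStore`, the helper `to_xml`, the sample records, and the choice to store the document as a serialized string in SQLite are all assumptions made for illustration.

```python
import sqlite3
import xml.etree.ElementTree as ET


def to_xml(site, records):
    """Wrap loosely structured records (dicts) in an XML document string.

    Dict keys are used as tag names, so they must be valid XML names.
    """
    root = ET.Element("site", name=site)
    for rec in records:
        item = ET.SubElement(root, "item")
        for key, value in rec.items():
            ET.SubElement(item, key).text = str(value)
    return ET.tostring(root, encoding="unicode")


class XmlStore:
    """Keeps one XML document per site and answers path queries from it."""

    def __init__(self, extract):
        self.extract = extract      # hypothetical site-extraction function
        self.extractions = 0        # counts how often the site was re-read
        self.db = sqlite3.connect(":memory:")
        self.db.execute("CREATE TABLE docs (site TEXT PRIMARY KEY, xml TEXT)")

    def _load(self, site):
        row = self.db.execute(
            "SELECT xml FROM docs WHERE site = ?", (site,)).fetchone()
        return ET.fromstring(row[0]) if row else None

    def _refresh(self, site):
        self.extractions += 1
        xml = to_xml(site, self.extract(site))
        self.db.execute("INSERT OR REPLACE INTO docs VALUES (?, ?)", (site, xml))
        return ET.fromstring(xml)

    def query(self, site, path):
        """Search the stored document; on a miss, re-extract and search again."""
        root = self._load(site)
        hits = root.findall(path) if root is not None else []
        if not hits:
            hits = self._refresh(site).findall(path)
        return [{child.tag: child.text for child in item} for item in hits]


def fake_extract(site):
    # Stand-in for scraping a real page (assumed sample data).
    return [
        {"title": "Python Basics", "category": "books"},
        {"title": "USB Cable", "category": "electronics"},
    ]


store = XmlStore(fake_extract)
first = store.query("shop.example", ".//item[category='books']")
second = store.query("shop.example", ".//item[category='books']")
```

The second query is answered from the stored document without another extraction, which is the speed-up the abstract attributes to the database.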
