Analysis of the Generator and Consistency of General Web Page Layout Structure Using Matching Algorithm Based on Set Difference

Abstract Topic precise crawler is a special purpose web crawler, which downloads appropriate web pages analogous to a particular topic by measuring cosine similarity or semantic similarity score. The cosine based similarity measure displays inaccurate relevance score, if topic term does not directly occur in the web page. The semantic-based similarity measure provides the precise relevance score, even if the synonyms of the given topic occur in the web page. The unavailability of the topic in the ontology produces inaccurate relevance score by the semantic focused crawlers. This paper overcomes these glitches with a hybrid string-matching algorithm by combining the semantic similarity-based measure with the probabilistic similarity-based measure. The experimental results revealed that this algorithm increased the efficiency of the focused web crawlers and achieved better Harvest Rate (HR), Precision (P) and Irrelevance Ratio (IR) than the existing web focused crawlers achieve.

Download Full-text

A Web Page Layout Self-Identification Algorithm for Web Page Display Optimization

Journal of Convergence Information Technology ◽

10.4156/jcit.vol8.issue4.77 ◽

2013 ◽

Vol 8 (4) ◽

pp. 673-681

Author(s):

Guoming Sang ◽

Zhi Liu ◽

Jun Shi

Keyword(s):

Identification Algorithm ◽

Web Page ◽

Page Layout

Download Full-text

Adaptive Web page layout for mobile devices

2014 International Conference on Computing, Management and Telecommunications (ComManTel) ◽

10.1109/commantel.2014.6825615 ◽

2014 ◽

Keyword(s):

Mobile Devices ◽

Web Page ◽

Page Layout

Download Full-text

Automated reasoning for web page layout

ACM SIGPLAN Notices ◽

10.1145/3022671.2984010 ◽

2016 ◽

Vol 51 (10) ◽

pp. 181-194 ◽

Cited By ~ 3

Author(s):

Pavel Panchekha ◽

Emina Torlak

Keyword(s):

Automated Reasoning ◽

Web Page ◽

Page Layout

Download Full-text

An Extension of the Web-Page Layout Optimization Method for Multimodal Browsing Sizes

2010 13th International Conference on Network-Based Information Systems ◽

10.1109/nbis.2010.18 ◽

2010 ◽

Cited By ~ 3

Author(s):

Nobuo Funabiki ◽

Junki Shimizu ◽

Megumi Isogai ◽

Toru Nakanishi

Keyword(s):

Optimization Method ◽

Layout Optimization ◽

Web Page ◽

Page Layout ◽

The Web

Download Full-text

A User-Centered Log-Based Information Retrieval System Using Web Log Mining

Advances in Data Mining and Database Management - Advancing Cloud Database Systems and Capacity Planning With Dynamic Applications ◽

10.4018/978-1-5225-2013-9.ch014 ◽

2017 ◽

pp. 343-362

Author(s):

Sathiyamoorthi V

Keyword(s):

Data Mining ◽

Web Mining ◽

Production Control ◽

Research Area ◽

Web Page ◽

Web Based ◽

Average Speed ◽

Layout Structure ◽

Knowledge Discovery In Database ◽

Recent Trends

It is generally observed throughout the world that in the last two decades, while the average speed of computers has almost doubled in a span of around eighteen months, the average speed of the network has doubled merely in a span of just eight months! In order to improve the performance, more and more researchers are focusing their research in the field of computers and its related technologies. Data Mining is also known as knowledge discovery in database (KDD) is one such research area. The discovered knowledge can be applied in various application areas such as marketing, fraud detection, customer retention and production control and marketing to improve their business. It discovers implicit, previously unknown and potentially useful information out of datasets. Recent trends in data mining include web mining where it discovers knowledge from web based information to improve the page layout, structure and its content thereby it reduces the user latency in accessing the web page and website performance.

Download Full-text