Automatic Detection for JavaScript Obfuscation Attacks in Web Pages through String Pattern Analysis

Author(s):  
YoungHan Choi ◽  
TaeGhyoon Kim ◽  
SeokJin Choi ◽  
CheolWon Lee
2022 ◽  
Vol 12 (1) ◽  
pp. 1-18
Author(s):  
Umamageswari Kumaresan ◽  
Kalpana Ramanujam

The intent of this research is to build an automated web scraping system capable of extracting structured data records embedded in semi-structured web pages. Most automated extraction techniques in the literature capture a repeated pattern among a set of similarly structured web pages, deduce the template used to generate those pages, and then extract the data records. All of these techniques rely on computationally intensive operations such as string pattern matching or DOM tree matching and then require manual labeling of the extracted data records. The technique discussed in this paper departs from these state-of-the-art approaches by determining informative sections in a web page through the repetition of informative content rather than syntactic structure. The experiments show that the system identified data-rich regions with 100% precision for web sites belonging to different domains. Experiments conducted on real-world web sites demonstrate the effectiveness and versatility of the proposed approach.
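As a rough illustration of locating a data-rich region through repetition of content rather than syntactic structure, the sketch below scores each DOM node by how often the texts of its direct children repeat a similar shape. It is an assumption-laden approximation, not the authors' algorithm; the scoring rule and the use of BeautifulSoup for parsing are illustrative choices.

    # Minimal sketch: find the node whose direct children carry the most
    # repeated, similarly shaped text (e.g. lists of titles, prices, ratings).
    # Not the paper's method; the scoring heuristic is an assumption.
    from collections import Counter
    from bs4 import BeautifulSoup

    def data_rich_region(html):
        soup = BeautifulSoup(html, "html.parser")
        best_node, best_score = None, 0
        for node in soup.find_all(True):
            # Text of each direct child element of this node.
            texts = [c.get_text(" ", strip=True)
                     for c in node.find_all(True, recursive=False)]
            texts = [t for t in texts if t]
            if len(texts) < 3:
                continue
            # Score: how many children share a repeated word-count "shape".
            shapes = Counter(len(t.split()) for t in texts)
            score = sum(n for n in shapes.values() if n > 1)
            if score > best_score:
                best_node, best_score = node, score
        return best_node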


Author(s):  
Fagner Christian Paes ◽  
Willian Massami Watanabe

Cross-Browser Incompatibilities (XBIs) are inconsistencies that appear in a web application when it is rendered in different browsers. The growing number of browser implementations (Internet Explorer, Microsoft Edge, Mozilla Firefox, Google Chrome) and the constant evolution of Web technology specifications have produced differences in the way browsers behave and render web pages. Web applications must behave consistently across browsers, so web developers should detect and avoid XBIs during the development process in order to overcome the differences that arise when rendering in different environments. Many web developers rely on manual inspection of web pages in several environments to detect XBIs, regardless of the cost and time that manual testing adds to development. Tools for the automatic detection of XBIs speed up the inspection of web pages, but current tools have low precision and their evaluations report a large percentage of false positives. This research aims to evaluate the use of Artificial Neural Networks to reduce the number of false positives in the automatic detection of XBIs, based on CSS (Cascading Style Sheets) properties and the relative comparison of elements in the web page.
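The sketch below shows, under stated assumptions, how CSS and relative-position differences between corresponding elements rendered in two browsers could be turned into features for a small neural network classifier. The feature set, toy data, and scikit-learn MLP are illustrative substitutes, not the network or features used in this research.

    # Illustrative sketch, not the paper's pipeline: each pair of corresponding
    # elements (same element, two browsers) becomes a feature vector of
    # geometry/CSS differences, and a small MLP labels the pair as XBI or not.
    import numpy as np
    from sklearn.neural_network import MLPClassifier

    def pair_features(a, b):
        # a, b: assumed dicts with the element's rendered geometry and CSS.
        return [
            abs(a["x"] - b["x"]), abs(a["y"] - b["y"]),      # position drift
            abs(a["w"] - b["w"]), abs(a["h"] - b["h"]),      # size drift
            float(a["display"] != b["display"]),             # CSS mismatch
        ]

    # Toy labelled data: label 1 = XBI, 0 = consistent rendering.
    pairs = [
        ({"x": 0, "y": 0, "w": 100, "h": 20, "display": "block"},
         {"x": 0, "y": 0, "w": 100, "h": 20, "display": "block"}),
        ({"x": 0, "y": 0, "w": 100, "h": 20, "display": "block"},
         {"x": 35, "y": 4, "w": 60, "h": 20, "display": "inline"}),
    ]
    labels = [0, 1]

    X = np.array([pair_features(a, b) for a, b in pairs])
    clf = MLPClassifier(hidden_layer_sizes=(8,), max_iter=500, random_state=0)
    clf.fit(X, labels)
    print(clf.predict(X))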


Author(s):  
Kanaka Durga ◽  
V Rama Krishna

Web sites often contain content that is similar or identical to content indexed elsewhere, and checking for such similar content imposes an overhead on the search engine that can severely affect its performance and quality. Several techniques have therefore been proposed by the web crawling research community to detect same or similar content in web documents, since presenting relevant data to users on the first results page is a major factor for search engines. To address these issues, we propose a methodology called Automatic Detection of Illegitimate Websites with Mutual Clustering (ADIWMC), which presents a distinctive and effective approach to detecting similarities among web pages through web clustering. Same and similar web pages and web content are detected by storing the crawled web pages in a repository. First, adwords are extracted from the crawled pages, and similarity checking between two pages is performed based on the usage of these adwords. A threshold value is set: if the similarity percentage exceeds the threshold, the duplicated content is reduced, which improves the repository and the quality of the search engine. The sections on the existing analysis and the proposed analysis explore in detail how this works.
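A minimal sketch of the threshold-based similarity check described above, assuming the extracted adwords can be treated as a set of keywords and similarity is measured as keyword overlap; the actual ADIWMC extraction and mutual-clustering steps are not reproduced here, and the threshold value is an assumed placeholder.

    # Minimal sketch: keyword-overlap similarity between two crawled pages,
    # compared against a threshold. The keyword extraction and threshold are
    # assumptions, not the paper's exact adword extraction or setting.
    import re

    STOP_WORDS = {"the", "a", "an", "and", "or", "of", "in", "to", "is", "are"}

    def keywords(text):
        tokens = re.findall(r"[a-z]+", text.lower())
        return {t for t in tokens if t not in STOP_WORDS and len(t) > 2}

    def similarity(page_a, page_b):
        ka, kb = keywords(page_a), keywords(page_b)
        return len(ka & kb) / len(ka | kb) if (ka | kb) else 0.0

    THRESHOLD = 0.8  # assumed value for illustration

    def is_duplicate(page_a, page_b, threshold=THRESHOLD):
        # Pages whose keyword overlap exceeds the threshold are treated as
        # same/similar content; one copy can be dropped from the repository.
        return similarity(page_a, page_b) >= threshold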


