scholarly journals An Improved Focused Crawler: Using Web Page Classification and Link Priority Evaluation

2016 ◽  
Vol 2016 ◽  
pp. 1-10 ◽  
Author(s):  
Houqing Lu ◽  
Donghui Zhan ◽  
Lei Zhou ◽  
Dengchao He

A focused crawler is topic-specific and aims selectively to collect web pages that are relevant to a given topic from the Internet. However, the performance of the current focused crawling can easily suffer the impact of the environments of web pages and multiple topic web pages. In the crawling process, a highly relevant region may be ignored owing to the low overall relevance of that page, and anchor text or link-context may misguide crawlers. In order to solve these problems, this paper proposes a new focused crawler. First, we build a web page classifier based on improved term weighting approach (ITFIDF), in order to gain highly relevant web pages. In addition, this paper introduces an evaluation approach of the link, link priority evaluation (LPE), which combines web page content block partition algorithm and the strategy of joint feature evaluation (JFE), to better judge the relevance between URLs on the web page and the given topic. The experimental results demonstrate that the classifier using ITFIDF outperforms TFIDF, and our focused crawler is superior to other focused crawlers based on breadth-first, best-first, anchor text only, link-context only, and content block partition in terms of harvest rate and target recall. In conclusion, our methods are significant and effective for focused crawler.

The Given dynamic web pages have many security issues, and it can be very difficult to handle.The goal of card and finger print system gives more security to dynamic web pages. In this paper, We clearly discuss and workout the real time example with the help of Internet. Specifically, by introducing this technology the cyber crime also be comes underthecontrol.Fingerprintverificationisanimportantbiometric techniqueforpersonalidentification.Inthispaper,wedescribethede signandimplementationofaprototypeautomaticidentityauthentic ationsystemwhichusesfingerprintstoauthenicatetheidentityofani ndividual.


Author(s):  
R. Rathipriya

The primary objective of this chapter is to propose Biclustering Optimization Techniques (BOT) to identify the optimal web pages from web usage data. Bio-inspired optimization techniques like Firefly algorithm and its variant are used as optimization tool to generate optimal usage profile from the given web usage dataset. Finally, empirical study is conducted on the benchmark clickstream datasets like MSNBC, MSWEB and CTI and their results are analyzed to know the performance of the proposed biclustering optimization techniques with respect to optimization techniques available in the literature.


Author(s):  
ALI SELAMAT ◽  
ZHI SAM LEE ◽  
MOHD AIZAINI MAAROF ◽  
SITI MARIYAM SHAMSUDDIN

In this paper, an improved web page classification method (IWPCM) using neural networks to identify the illicit contents of web pages is proposed. The proposed IWPCM approach is based on the improvement of feature selection of the web pages using class based feature vectors (CPBF). The CPBF feature selection approach has been calculated by considering the important term's weight for illicit web documents and reduce the dependency of the less important term's weight for normal web documents. The IWPCM approach has been examined using the modified term-weighting scheme by comparing it with several traditional term-weighting schemes for non-illicit and illicit web contents available from the web. The precision, recall, and F1 measures have been used to evaluate the effectiveness of the proposed IWPCM approach. The experimental results have shown that the proposed improved term-weighting scheme has been able to identify the non-illicit and illicit web contents available from the experimental datasets.


2011 ◽  
Vol 4 (2) ◽  
Author(s):  
Kai Kaspar ◽  
Frank Ollermann ◽  
Kai-Christoph Hamborg

This article focuses on the impact of observation time and web page structure on viewing behavior. 63 subjects observed similarly structured pages of a popular commercial internet shop. Eye movements were recorded and analyzed regarding several saccade parameters, the individual fixation distribution by means of a progressive entropy approach, and the within- as well as between-subject congruency of fixation distributions. Our results show that viewing behavior significantly changed while subjects observed individual web pages. In contrast, we only found little evidence for a change in eye movements across web pages and hence for an attention-related schema building. In this context, we also provide an example of the impact of web page elements’ position on fixation probability.


Author(s):  
Alizaman D. Gamon ◽  
Mariam Saidona Tagoranao

This study discusses the penetration of Islam in the Philippines, particularly the third wave of its expansion, which was brought by Sufi missionaries. It reinstates the historical relevance of Sufi ideas and approaches due to its contemporary relevance to the concept of social co-existence. The rational, intellectual and philosophical dimension of Islam is manifested in the cultural and traditional life of Muslim communities. The study also analyzes the impact of Muslim struggle for the development of Islamic institutions in the context of the secular state. The ongoing, unsettled debate between Islamic and government approaches to peace and development in Mindanao and Sulu continues unabated. Over the years, reforms were introduced, but in their midst, evidence of government biases and prejudices with regards to Islamic institutions have surfaced. Muslim leaders and intellectuals responded in the context of historical rights and freedom, but those views were often questioned as they are presumed to be incompatible with the national agenda for national unity. It was very recently that this incompatibility was readdressed giving support to having lasting peace and justice in Mindanao. The study argues that there have been substantial state-sponsored reforms which may contribute to the gradual advancement of Muslim communities. Though the path for the passage of Muslim concerns within the given condition is fragile and open to challenges, the study recognizes the prominence of inter-civilizational dialogue, from which the universal values of humanity will be embraced by both Muslim and non-Muslim policy makers. In addition, Muslim and non-Muslim communities in the Philippines need to embrace the universal principle of humanity and coexistence due to its relevance to the political stability and economic growth in the country.  Keywords: Muslims in the Philippines, Islamic institutions, Islamization, Muslim intellectuals, Reform. Abstrak Kajian ini mengkaji tentang kemasukan Islam, terutamanya gelombang ketiga perkembangannya, yang dibawa oleh para pendakwah sufi. Kajian itu mengembalikan semula sejarah penting tentang idea-idea dan pendekatan Sufi yang boleh digunapakai pada masa kini untuk mewujudkan keharmonian sosial di kalangan rakyat pelbagai agama. Pemahaman tentang Islam mempunyai pengaruh yang jelas terhadap kebudayaan dan tradisi Islam. Kajian ini juga menganalisis kesan perjuangan Muslim untuk pembangunan institusi Islam dalam konteks sebuah negara sekular. Perbahasan yang berterusan yang tidak menemukan penyelesaian antara pendekatan Islam dan pendekatan kerajaan untuk perdamaian serta pembangunan di Mindanao dan Sulu terus berlanjutan. Walaupun  bertahun-tahun pembaharuan telah dilakukan, namun terdapat bukti penolakan dan prasangka buruk kerajaan terhadap institusi Islam. Para pemimpin dan intelektual Muslim bertindak berdasarkan pada fakta sejarah dan hak kebebasan bersuara, namun pandangan mereka sering dipertikaikan kerana mereka dianggap tidak seiring dengan agenda dan perpaduan nasional. Baru-baru ini ketidakserasian ini mulai disuarakan semula untuk mendapat sokongan terhadap keamanan dan keadilan yang berterusan di Mindanao. Kajian ini mendapati bahawa terdapat pembaharuan yang dilakukan oleh pihak kerajaan yang boleh menyumbang ke arah  kemajuan masyarakat Islam secara beransur-ansur. Walaupun pendekatan bagi memenuhi hasrat orang Islam masih dalam keadaan yang rapuh dan penuh cabaran, namun kajian ini mengusulkan peripentingnya dialog antara peradaban dimana nilai-nilai universal manusia akan diperoleh dan dipegang oleh kedua-kedua pihak pembuat dasar iaitu  Islam dan bukan Islam. Di samping itu, umat Islam dan bukan Islam di Filipina perlu mengkaji dan mencontohi model keharmonian sosial Malaysia dan Singapura kerana kaitannya dengan kestabilan politik dan pertumbuhan ekonomi. Kata Kunci: Muslim di Filipina, institusi Islam, Islamisasi, intelektual Islam, Pembaharuan.


2018 ◽  
Vol 23 (1) ◽  
pp. 60-71
Author(s):  
Wigiyanti Masodah

Offering credit is the main activity of a Bank. There are some considerations when a bank offers credit, that includes Interest Rates, Inflation, and NPL. This study aims to find out the impact of Variable Interest Rates, Inflation variables and NPL variables on credit disbursed. The object in this study is state-owned banks. The method of analysis in this study uses multiple linear regression models. The results of the study have shown that Interest Rates and NPL gave some negative impacts on the given credit. Meanwhile, Inflation variable does not have a significant effect on credit given. Keywords: Interest Rate, Inflation, NPL, offered Credit.


2020 ◽  
Vol 14 ◽  
Author(s):  
Shefali Singhal ◽  
Poonam Tanwar

Abstract:: Now-a-days when everything is going digitalized, internet and web plays a vital role in everyone’s life. When one has to ask something or has any online task to perform, one has to use internet to access relevant web-pages throughout. These web-pages are mainly designed for large screen terminals. But due to mobility, handy and economic reasons most of the persons are using small screen terminals (SST) like mobile phone, palmtop, pagers, tablet computers and many more. Reading a web page which is actually designed for large screen terminal on a small screen is time consuming and cumbersome task because there are many irrelevant content parts which are to be scrolled or there are advertisements, etc. Here main concern is e-business users. To overcome such issues the source code of a web page is organized in tree data-structure. In this paper we are arranging each and every main heading as a root node and all the content of this heading as a child node of the logical structure. Using this structure, we regenerate a web-page automatically according to SST size. Background:: DOM and VIPS algorithms are the main background techniques which are supporting the current research. Objective:: To restructure a web page in a more user friendly and content presenting format. Method Backtracking:: Method Backtracking: Results:: web page heading queue generation. Conclusion:: Concept of logical structure supports every SST.


Energies ◽  
2021 ◽  
Vol 14 (4) ◽  
pp. 878 ◽  
Author(s):  
Oliwia Pietrzak ◽  
Krystian Pietrzak

This paper focuses on effects of implementing zero-emission buses in public transport fleets in urban areas in the context of electromobility assumptions. It fills the literature gap in the area of research on the impact of the energy mix of a given country on the issues raised in this article. The main purpose of this paper is to identify and analyse economic effects of implementing zero-emission buses in public transport in cities. The research area was the city of Szczecin, Poland. The research study was completed using the following research methods: literature review, document analysis (legal acts and internal documents), case study, ratio analysis, and comparative analysis of selected variants (investment variant and base variant). The conducted research study has shown that economic benefits resulting from implementing zero-emission buses in an urban transport fleet are limited by the current energy mix structure of the given country. An unfavourable energy mix may lead to increased emissions of SO2 and CO2 resulting from operation of this kind of vehicle. Therefore, achieving full effects in the field of electromobility in the given country depends on taking concurrent actions in order to diversify the power generation sources, and in particular on increasing the share of Renewable Energy Sources (RES).


2003 ◽  
Vol 22 (2) ◽  
pp. 87-93
Author(s):  
James Otto ◽  
Mohammad Najdawi ◽  
William Wagner

With the extensive growth of the Internet and electronic commerce, the issue of how users behave when confronted with long download times is important. This paper investigates Web switching behavior. The paper describes experiments where users were subjected to artificially delayed Web page download times to study the impact of Web site wait times on switching behavior. Two hypotheses were tested. First, that longer wait times will result in increased switching behavior. The implication being that users become frustrated with long waiting times and choose to go elsewhere. Second, that users who switch will benefit, in terms of decreased download times, from their decision to switch.


Author(s):  
Ana Guerberof Arenas ◽  
Joss Moorkens ◽  
Sharon O’Brien

AbstractThis paper presents results of the effect of different translation modalities on users when working with the Microsoft Word user interface. An experimental study was set up with 84 Japanese, German, Spanish, and English native speakers working with Microsoft Word in three modalities: the published translated version, a machine translated (MT) version (with unedited MT strings incorporated into the MS Word interface) and the published English version. An eye-tracker measured the cognitive load and usability according to the ISO/TR 16982 guidelines: i.e., effectiveness, efficiency, and satisfaction followed by retrospective think-aloud protocol. The results show that the users’ effectiveness (number of tasks completed) does not significantly differ due to the translation modality. However, their efficiency (time for task completion) and self-reported satisfaction are significantly higher when working with the released product as opposed to the unedited MT version, especially when participants are less experienced. The eye-tracking results show that users experience a higher cognitive load when working with MT and with the human-translated versions as opposed to the English original. The results suggest that language and translation modality play a significant role in the usability of software products whether users complete the given tasks or not and even if they are unaware that MT was used to translate the interface.


Sign in / Sign up

Export Citation Format

Share Document