scholarly journals User profiling for the web

2006 ◽  
Vol 3 (2) ◽  
pp. 1-29 ◽  
Author(s):  
Miha Grcar ◽  
Dunja Mladenic ◽  
Marko Grobelnik

This paper addresses a problem of personalized information delivery related to the Web, that is based on user profiling. Different approaches to user profiling have been developed. When the user profiling is used for personalization in the context of Web, we can talk about Web personalization. There are three main groups of approaches: content-based filtering collaborative filtering and Web usage mining. We provide an overview of them including recent research results in the area with especial emphases on user profiling in the perspective of Semantic Web applications.

Author(s):  
H. Inbarani ◽  
K. Thangavel

The technology behind personalization or Web page recommendation has undergone tremendous changes, and several Web-based personalization systems have been proposed in recent years. The main goal of Web personalization is to dynamically recommend Web pages based on online behavior of users. Although personalization can be accomplished in numerous ways, most Web personalization techniques fall into four major categories: decision rule-based filtering, content-based filtering, and collaborative filtering and Web usage mining. Decision rule-based filtering reviews users to obtain user demographics or static profiles, and then lets Web sites manually specify rules based on them. It delivers the appropriate content to a particular user based on the rules. However, it is not particularly useful because it depends on users knowing in advance the content that interests them. Content-based filtering relies on items being similar to what a user has liked previously. Collaborative filtering, also called social or group filtering, is the most successful personalization technology to date. Most successful recommender systems on the Web typically use explicit user ratings of products or preferences to sort user profile information into peer groups. It then tells users what products they might want to buy by combining their personal preferences with those of like-minded individuals. However, collaborative filtering has limited use for a new product that no one has seen or rated, and content-based filtering to obtain user profiles might miss novel or surprising information. Additionally, traditional Web personalization techniques, including collaborative or content-based filtering, have other problems, such as reliance on subject user ratings and static profiles or the inability to capture richer semantic relationships among Web objects. To overcome these shortcomings, the new Web personalization tool, nonintrusive personalization, attempts to increasingly incorporate Web usage mining techniques. Web usage mining can help improve the scalability, accuracy, and flexibility of recommender systems. Thus, Web usage mining can reduce the need for obtaining subjective user ratings or registration-based personal preferences. This chapter provides a survey of Web usage mining approaches.


Author(s):  
Anu Sharma ◽  
Aarti Singh

Intelligent semantic approaches (i.e., semantic web and software agents) are very useful technologies for adding meaning to the web. Adaptive web is a new era of web targeting to provide customized and personalized view of contents and services to its users. Integration of these two technologies can further add to reasoning and intelligence in recommendation process. This chapter explores the existing work done in the area of applying intelligent approaches to web personalization and highlighting ample scope for application of intelligent agents in this domain for solving many existing issues like personalized content management, user profile learning, modelling, and adaptive interactions with users.


Author(s):  
Leila Zemmouchi-Ghomari

Data play a central role in the effectiveness and efficiency of web applications, such as the Semantic Web. However, data are distributed across a very large number of online sources, due to which a significant effort is needed to integrate this data for its proper utilization. A promising solution to this issue is the linked data initiative, which is based on four principles related to publishing web data and facilitating interlinked and structured online data rather than the existing web of documents. The basic ideas, techniques, and applications of the linked data initiative are surveyed in this paper. The authors discuss some Linked Data open issues and potential tracks to address these pending questions.


2011 ◽  
pp. 78-88
Author(s):  
Alexander Mikroyannidis ◽  
Babis Theodoulidis

The rate of growth in the amount of information available in the World Wide Web has not been followed by similar advances in the way this information is organized and exploited. Web adaptation seeks to address this issue by transforming the topology of a Web site to help users in their browsing tasks. In this sense, Web usage mining techniques have been employed for years to study how the Web is used in order to make Web sites more user-friendly. The Semantic Web is an ambitious initiative aiming to transform the Web to a well-organized source of information. In particular, apart from the unstructured information of today’s Web, the Semantic Web will contain machine-processable metadata organized in ontologies. This will enhance the way we search the Web and can even allow for automatic reasoning on Web data with the use of software agents. Semantic Web adaptation brings traditional Web adaptation techniques into the new era of the Semantic Web. The idea is to enable the Semantic Web to be constantly aligned to the users’ preferences. In order to achieve this, Web usage mining and text mining methodologies are employed for the semi-automatic construction and evolution of Web ontologies. This usage-driven evolution of Web ontologies, in parallel with Web topologies evolution, can bring the Semantic Web closer to the users’ expectations.


Author(s):  
Sunny Sharma ◽  
Manisha Malhotra

Web usage mining is the use of data mining techniques to analyze user behavior in order to better serve the needs of the user. This process of personalization uses a set of techniques and methods for discovering the linking structure of information on the web. The goal of web personalization is to improve the user experience by mining the meaningful information and presented the retrieved information in a way the user intends. The arrival of big data instigated novel issues to the personalization community. This chapter provides an overview of personalization, big data, and identifies challenges related to web personalization with respect to big data. It also presents some approaches and models to fill the gap between big data and web personalization. Further, this research brings additional opportunities to web personalization from the perspective of big data.


Author(s):  
Christopher Walton

In the introductory chapter of this book, we discussed the means by which knowledge can be made available on the Web. That is, the representation of the knowledge in a form by which it can be automatically processed by a computer. To recap, we identified two essential steps that were deemed necessary to achieve this task: 1. We discussed the need to agree on a suitable structure for the knowledge that we wish to represent. This is achieved through the construction of a semantic network, which defines the main concepts of the knowledge, and the relationships between these concepts. We presented an example network that contained the main concepts to differentiate between kinds of cameras. Our network is a conceptualization, or an abstract view of a small part of the world. A conceptualization is defined formally in an ontology, which is in essence a vocabulary for knowledge representation. 2. We discussed the construction of a knowledge base, which is a store of knowledge about a domain in machine-processable form; essentially a database of knowledge. A knowledge base is constructed through the classification of a body of information according to an ontology. The result will be a store of facts and rules that describe the domain. Our example described the classification of different camera features to form a knowledge base. The knowledge base is expressed formally in the language of the ontology over which it is defined. In this chapter we elaborate on these two steps to show how we can define ontologies and knowledge bases specifically for the Web. This will enable us to construct Semantic Web applications that make use of this knowledge. The chapter is devoted to a detailed explanation of the syntax and pragmatics of the RDF, RDFS, and OWL Semantic Web standards. The resource description framework (RDF) is an established standard for knowledge representation on the Web. Taken together with the associated RDF Schema (RDFS) standard, we have a language for representing simple ontologies and knowledge bases on the Web.


Author(s):  
Marcello Pecoraro

This chapter aims at providing an overview about the use of statistical methods supporting the Web Usage Mining. Within the first part is described the framework of the Web Usage Mining as a branch of the Web Mining committed to the study of how to use a Website. Then, the data (object of the analysis) are detailed together with the problems linked to the pre-processing. Once clarified, the data origin and their treatment for a correct development of a Web Usage analysis,the focus shifts on the statistical techniques that can be applied to the analysis background, with reference to binary segmentation methods. Those latter allow the discrimination through a response variable that determines the affiliation of the users to a group by considering some characteristics detected on the same users.


2010 ◽  
pp. 751-758
Author(s):  
P. Markellou

Over the last decade, we have witnessed an explosive growth in the information available on the Web. Today, Web browsers provide easy access to myriad sources of text and multimedia data. Search engines index more than a billion pages and finding the desired information is not an easy task. This profusion of resources has prompted the need for developing automatic mining techniques on Web, thereby giving rise to the term “Web mining” (Pal, Talwar, & Mitra, 2002). Web mining is the application of data mining techniques on the Web for discovering useful patterns and can be divided into three basic categories: Web content mining, Web structure mining, and Web usage mining. Web content mining includes techniques for assisting users in locating Web documents (i.e., pages) that meet certain criteria, while Web structure mining relates to discovering information based on the Web site structure data (the data depicting the Web site map). Web usage mining focuses on analyzing Web access logs and other sources of information regarding user interactions within the Web site in order to capture, understand and model their behavioral patterns and profiles and thereby improve their experience with the Web site. As citizens requirements and needs change continuously, traditional information searching, and fulfillment of various tasks result to the loss of valuable time spent in identifying the responsible actor (public authority) and waiting in queues. At the same time, the percentage of users who acquaint with the Internet has been remarkably increased (Internet World Stats, 2005). These two facts motivate many governmental organizations to proceed with the provision of e-services via their Web sites. The ease and speed with which business transactions can be carried out over the Web has been a key driving force in the rapid growth and popularity of e-government, e-commerce, and e-business applications. In this framework, the Web is emerging as the appropriate environment for business transactions and user-organization interactions. However, since it is a large collection of semi-structured and structured information sources, Web users often suffer from information overload. Personalization is considered as a popular solution in order to alleviate this problem and to customize the Web environment to users (Eirinaki & Vazirgiannis, 2003). Web personalization can be described, as any action that makes the Web experience of a user personalized to his or her needs and wishes. Principal elements of Web personalization include modeling of Web objects (pages) and subjects (users), categorization of objects and subjects, matching between and across objects and/or subjects, and determination of the set of actions to be recommended for personalization. In the remainder of this article, we present the way an e-government application can deploy Web mining techniques in order to support intelligent and personalized interactions with citizens. Specifically, we describe the tasks that typically comprise this process, illustrate the future trends, and discuss the open issues in the field.


2008 ◽  
Vol 23 (2) ◽  
pp. 181-212 ◽  
Author(s):  
LYNDON J. B. NIXON ◽  
ELENA SIMPERL ◽  
RETO KRUMMENACHER ◽  
FRANCISCO MARTIN-RECUERDA

AbstractSemantic technologies promise to solve many challenging problems of the present Web applications. As they achieve a feasible level of maturity, they become increasingly accepted in various business settings at enterprise level. By contrast, their usability in open environments such as the Web—with respect to issues such as scalability, dynamism and openness—still requires additional investigation. In particular, Semantic Web services have inherited the Web service communication model, which is primarily based on synchronous message exchange technology such as remote procedure call (RPC), thus being incompatible with the REST (REpresentational State Transfer) architectural model of the Web. Recent advances in the field of middleware propose ‘semantic tuplespace computing’ as an instrument for coping with this situation. Arguing that truly Web-compliant Web service communication should be based, analogously to the conventional Web, on shared access to persistently published data instead of message passing, space-based middleware introduces a coordination infrastructure by means of which services can exchange information in a time- and reference-decoupled manner. In this article, we introduce the most important approaches in this newly emerging field. Our objective is to analyze and compare the solutions proposed so far, thus giving an account of the current state-of-the-art, and identifying new directions of research and development.


Author(s):  
Shaun D'Souza

The web contains vast repositories of unstructured text. We investigate the opportunity for building a knowledge graph from these text sources. We generate a set of triples which can be used in knowledge gathering and integration. We define the architecture of a language compiler for processing subject-predicate-object triples using the OpenNLP parser. We implement a depth-first search traversal on the POS tagged syntactic tree appending predicate and object information. A parser enables higher precision and higher recall extractions of syntactic relationships across conjunction boundaries. We are able to extract 2-2.5 times the correct extractions of ReVerb. The extractions are used in a variety of semantic web applications and question answering. We verify extraction of 50,000 triples on the ClueWeb dataset.


Sign in / Sign up

Export Citation Format

Share Document