Construction of Domain Ontologies

Author(s):  
Jongwoo Kim ◽  
Veda C. Storey

As the World Wide Web evolves into the Semantic Web, domain ontologies, which represent the concepts of an application domain and their associated relationships, have become increasingly important as surrogates for capturing and representing the semantics of real world applications. Much ontology development remains manual and is both difficult and time-consuming. This research presents a methodology for semi-automatically generating domain ontologies from extracted information on the World Wide Web. The methodology is implemented in a prototype that integrates existing ontology and web organization tools. The prototype is used to develop ontologies for different application domains, and an empirical analysis carried out to demonstrate the feasibility of the research.

Author(s):  
Jongwoo Kim ◽  
Veda C. Storey

As the World Wide Web evolves into the Semantic Web, domain ontologies, which represent the concepts of an application domain and their associated relationships, have become increasingly important as surrogates for capturing and representing the semantics of real world applications. Much ontology development remains manual and is both difficult and time-consuming. This research presents a methodology for semi-automatically generating domain ontologies from extracted information on the World Wide Web. The methodology is implemented in a prototype that integrates existing ontology and web organization tools. The prototype is used to develop ontologies for different application domains, and an empirical analysis carried out to demonstrate the feasibility of the research.


2012 ◽  
Vol 1 (2) ◽  
pp. 63-70
Author(s):  
Zhaoyan Jin ◽  
Quanyuan Wu

The PageRank vector of a network is very important, for it can reflect the importance of a Web page in the World Wide Web, or of a people in a social network. However, with the growth of the World Wide Web and social networks, it needs more and more time to compute the PageRank vector of a network. In many real-world applications, the degree and PageRank distributions of these complex networks conform to the Power-Law distribution. This paper utilizes the degree distribution of a network to initialize its PageRank vector, and presents a Power-Law degree distribution accelerating algorithm of PageRank computation. Experiments on four real-world datasets show that the proposed algorithm converges more quickly than the original PageRank algorithm.DOI: 10.18495/comengapp.12.063070


Author(s):  
Mahesh S. Raisinghani ◽  
Tapas R. Sahoo

In less than a decade, the World Wide Web has become popular because of the depth of information it provides and the simplicity of its usage by simple clicks through related and interlinked pages. However, the amount of information and the numerous formats in which it is presented are simply overwhelming, and it is not uncommon to get overloaded with irrelevant or unrelated information. For example, a simple search task of finding books written by an author named David Flower would fetch hundreds of pages that merely contain the words David and/or Flower. The Web contains information on millions of Web pages interwoven by the use of hyperlinks and presented in rich HTML (hypertext markup language) formats, such as images, graphics, audio, and video. This rich presentation capability makes the Web highly readable for humans, but adds no meaning to the information when read by computers. The Semantic Web, which is considered to be the next evolution of the current Web, would qualify information with well-defined meaning. This added meaning to data, called metadata, would enable computers and people to work in cooperation (Hendler, Berners-Lee, & Miller, 2002). In addition to having hyperlinked pages containing media objects, the Semantic Web will also contain resources pointing to real-world objects such as people, places, organizations, and events. These objects will be linked based on their real-world relationships. Another goal of the Semantic Web is to develop enabling standards and technologies designed to help machines understand more information on the Web so that they can support richer discovery, data integration, navigation, and automation of tasks (Berners-Lee, Hendler, & Lassila, 2001). The current Web has the potential of becoming the largest database system, but it suffers from its foundation as a presentation media. This article addresses issues involved in effectively storing and managing data on the Web and focuses on various research activities in this direction. The Semantic Web is a vision that will extend the current Web to give well-defined meaning to information, enabling computers and people to work in better cooperation. A collaborative effort between the World Wide Web Consortium (W3C) and a large number of researchers and industrial partners is defining standards and technologies required for building the Semantic Web. This effort will enable data to be understood by machines and will be used for effective discovery, automation, integration, and reuse across applications.


Author(s):  
Georg Neubauer

The main subject of the work is the visualization of typed links in Linked Data. The academic subjects relevant to the paper in general are the Semantic Web, the Web of Data and information visualization. The Semantic Web, invented by Tim Berners-Lee in 2001, was announced as an extension to the World Wide Web (Web 2.0). The actual area of investigation concerns the connectivity of information on the World Wide Web. To be able to explore such interconnections, visualizations are critical requirements as well as a major part of processing data in themselves. In the context of the Semantic Web, representation of information interrelations can be achieved using graphs. The aim of the article is to primarily describe the arrangement of Linked Data visualization concepts by establishing their principles in a theoretical approach. Putting design restrictions into context leads to practical guidelines. By describing the creation of two alternative visualizations of a commonly used web application representing Linked Data as network visualization, their compatibility was tested. The application-oriented part treats the design phase, its results, and future requirements of the project that can be derived from this test.


Author(s):  
Rafael Cunha Cardoso ◽  
Fernando da Fonseca de Souza ◽  
Ana Carolina Salgado

Currently, systems dedicated to information retrieval/extraction perform an important role on fetching relevant and qualified information from the World Wide Web (WWW). The Semantic Web can be described as the Web’s future once it introduces a set of new concepts and tools. For instance, ontology is used to insert knowledge into contents of the current WWW to give meaning to such contents. This allows software agents to better understand the Web’s content meaning so that such agents can execute more complex and useful tasks to users. This work introduces an architecture that uses some Semantic Web concepts allied to Regular Expressions (REGEX) in order to develop a system that retrieves/extracts specific domain information from the Web. A prototype, based on such architecture, was developed to find information about offers announced on supermarkets Web sites.


Author(s):  
Salvador Miranda Lima ◽  
José Moreira

The emergence of the World Wide Web made available massive amounts of data. This data, created and disseminated from many different sources, is prepared and linked in a way that is well-suited for display purposes, but automation, integration, interoperability or context-oriented search can hardly be implemented. Hence, the Semantic Web aims at promoting global information integration and semantic interoperability, through the use of metadata, ontologies and inference mechanisms. This chapter presents a Semantic Model for Tourism (SeMoT), designed for building Semantic Web enabled applications for the planning and management of touristic itineraries, taking into account the new requirements of more demanding and culturally evolved tourists. It includes an introduction to relevant tourism concepts, an overview of current trends in Web Semantics research and a presentation of the architecture, main features and a selection of representative ontologies that compose the SeMoT.


Author(s):  
Kevin Curran ◽  
Gary Gumbleton

Tim Berners-Lee, director of the World Wide Web Consortium (W3C), states that, “The Semantic Web is not a separate Web but an extension of the current one, in which information is given well-defined meaning, better enabling computers and people to work in cooperation” (Berners-Lee, 2001). The Semantic Web will bring structure to the meaningful content of Web pages, creating an environment where software agents, roaming from page to page, can readily carry out sophisticated tasks for users. The Semantic Web (SW) is a vision of the Web where information is more efficiently linked up in such a way that machines can more easily process it. It is generating interest not just because Tim Berners-Lee is advocating it, but because it aims to solve the problem of information being hidden away in HTML documents, which are easy for humans to get information out of but are difficult for machines to do so. We will discuss the Semantic Web here.


2013 ◽  
Vol 60 (1) ◽  
pp. 42-53 ◽  
Author(s):  
Alexandru Napoleon Sireteanu

Abstract In the beginning World Wide Web was syntactic and the content itself was only readable by humans. The modern web combines existing web technologies with knowledge representation formalisms. In this sense, the Semantic Web proposes the mark-up of content on the web using formal ontology that structure essential data for the purpose of comprehensive machine understanding. On the syntactical level, standardization is an important topic. Many standards which can be used to integrate different information sources have evolved. Beside the classical database interfaces like ODBC, web-oriented standard languages like HTML, XML, RDF and OWL increase in importance. As the World Wide Web offers the greatest potential for sharing information, we will base our paper on these evolving standards.


Author(s):  
Rui G. Pereira ◽  
Mario M. Freire

The World Wide Web (WWW, Web, or W3) is known as the largest accessible repository of human knowledge. It contains around 3 billion documents, which may be accessed by more than 500 million worldwide users. In only 13 years since its appearance in 1991, the Web suffered such a huge growth that it is safe to say there is no phenomenon in history that can compare to it. It reached such importance that it became an indispensable partner in the lives of people (Daconta, Obrst & Smith, 2003).


Sign in / Sign up

Export Citation Format

Share Document