scholarly journals Dataset or Not? A Study on the Veracity of Semantic Markup for Dataset Pages

2021 ◽  
pp. 338-356
Author(s):  
Tarfah Alrashed ◽  
Dimitris Paparas ◽  
Omar Benjelloun ◽  
Ying Sheng ◽  
Natasha Noy

AbstractSemantic markup, such as , allows providers on the Web to describe content using a shared controlled vocabulary. This markup is invaluable in enabling a broad range of applications, from vertical search engines, to rich snippets in search results, to actions on emails, to many others. In this paper, we focus on semantic markup for datasets, specifically in the context of developing a vertical search engine for datasets on the Web, Google’s Dataset Search. Dataset Search relies on to identify pages that describe datasets. While was the core enabling technology for this vertical search, we also discovered that we need to address the following problem: pages from 61% of internet hosts that provide markup do not actually describe datasets. We analyze the veracity of dataset markup for Dataset Search’s Web-scale corpus and categorize pages where this markup is not reliable. We then propose a way to drastically increase the quality of the dataset metadata corpus by developing a deep neural-network classifier that identifies whether or not a page with markup is a dataset page. Our classifier achieves 96.7% recall at the 95% precision point. This level of precision enables Dataset Search to circumvent the noise in semantic markup and to use the metadata to provide high quality results to users.

2018 ◽  
Vol 10 (4) ◽  
pp. 1
Author(s):  
Mileidy Alvarez-Melgarejo ◽  
Martha L. Torres-Barreto

The bibliometric method has proven to be a powerful tool for the analysis of scientific publications, in such a way that allows rating the quality of the knowledge generating process, as well as its impact on firm´s environment. This article presents a comparison between two powerful bibliographic databases in terms of their coverage and the usefulness of their content. The comparison starts with a subject associated to the relationship between resources and capabilities. The outcomes show that the search results differ between both databases. The Web Of Science (WOS), has a greater coverage than SCOPUS has.  It also has a greater impact in terms of most cited authors and publications. The search results in the WOS yield articles from 2001, while Scopus yields articles from 1976, however, some of the latter are inconsistent with the topic being searched. The analysis points to a lack of studies regarding resources as foundations of firm´s capabilities; as a result, new research on this field is suggested.


2013 ◽  
Vol 411-414 ◽  
pp. 106-109 ◽  
Author(s):  
Ya Heng Ren

Vertical Search Engine provides a professional search compared with the traditional search engine. All of the data searched by vertical search engine is relative with some one theme, which is decided by users. Usually Vector Space Model is used for judging the relativity between data in the web and the decided theme. But when elements of the theme appear repeatedly, their order is not considered by Vector Space Model. Adding a new element, the Evolved Vector Space Model is provided. The experiments show that the new model has fixed the problem and have a better performance in judging relativity.


Author(s):  
H. Arafat Ali ◽  
Ali I. El Desouky ◽  
Ahmed I. Saleh

Search engines are the most important search tools for finding useful and recent information on the Web today. They rely on crawlers that continually crawl the Web for new pages. Meanwhile, focused crawlers have become an attractive area for research in recent years. They suggest a better solution for general-purpose search engine limitations and lead to a new generation of search engines called vertical-search engines. Searching the Web vertically is to divide the Web into smaller regions; each region is related to a specific domain. In addition, one crawler is allowed to search in each domain. The innovation of this article is adding intelligence and adaptation ability to focused crawlers. Such added features will certainly guide the crawler perfectly to retrieve more relevant pages while crawling the Web. The proposed crawler has the ability to estimate the rank of the page before visiting it and adapts itself to any changes in its domain using.


2011 ◽  
Vol 3 (4) ◽  
pp. 62-70 ◽  
Author(s):  
Stephen O’Neill ◽  
Kevin Curran

Search engine optimization (SEO) is the process of improving the visibility, volume and quality of traffic to website or a web page in search engines via the natural search results. SEO can also target other areas of a search, including image search and local search. SEO is one of many different strategies used for marketing a website but SEO has been proven the most effective. An Internet marketing campaign may drive organic search results to websites or web pages but can be involved with paid advertising on search engines. All search engines have a unique way of ranking the importance of a website. Some search engines focus on the content while others review Meta tags to identify who and what a web site’s business is. Most engines use a combination of Meta tags, content, link popularity, click popularity and longevity to determine a sites ranking. To make it even more complicated, they change their ranking policies frequently. This paper provides an overview of search engine optimisation strategies and pitfalls.


Author(s):  
Stephen O’Neill ◽  
Kevin Curran

Search engine optimization (SEO) is the process of improving the visibility, volume and quality of traffic to website or a web page in search engines via the natural search results. SEO can also target other areas of a search, including image search and local search. SEO is one of many different strategies used for marketing a website but SEO has been proven the most effective. An Internet marketing campaign may drive organic search results to websites or web pages but can be involved with paid advertising on search engines. All search engines have a unique way of ranking the importance of a website. Some search engines focus on the content while others review Meta tags to identify who and what a web site’s business is. Most engines use a combination of Meta tags, content, link popularity, click popularity and longevity to determine a sites ranking. To make it even more complicated, they change their ranking policies frequently. This paper provides an overview of search engine optimisation strategies and pitfalls.


2016 ◽  
Vol 7 (1) ◽  
pp. 16-33 ◽  
Author(s):  
Himani Singal ◽  
Shruti Kohli

Trusting any information on web is psychosomatic and subliminal by nature. The decision is left on the requestor to assess, judge and corroborate the contents contained in the websites before perceiving it. This is of acute concern when websites deal with sensitive issues like health. There is no standard mechanism that embodies or characterizes how to make these ‘trust' decisions. Although all the web users make these decisions on a frequent basis, there is no method to comply with the rationale to take such decisions. This paper is an attempt to provide a solution to the problem of ‘how much the content, typically provided by any health related website should be trusted?' A probing has been done to study the users' behavior on these websites. This cram makes use of real-time analytical data collected from similarweb.com for hundred health related websites to analyze web users' behavior. The goalmouth is to develop a novel technique to re-rank search results using TRUST as a deciding factor so that more trustworthy web links appears higher in the results list. The aim is to determine and discern the users' attitudinal factors that can be captured in practice without user interaction and also capitalize on the quality of the trust estimates.


Author(s):  
Mark Oprenko

The definition of the multimorbidity concept reveals insufficient specificity of the comorbidity and multimorbidity definitions and, as a result, confusion in the use of these terms. Most authors are unanimous that the “core” of multimorbidity is presence of more than one disease in a patient. These coexisting diseases can be pathogenetically interconnected and non-interconnected. Regardless, the degree of multimorbidity always affects prognosis and quality of life.


Edupedia ◽  
2020 ◽  
Vol 5 (1) ◽  
pp. 45-53
Author(s):  
Ilzam Dhaifi

The world has been surprised by the emergence of a COVID 19 pandemic, was born in China, and widespread to various countries in the world. In Indonesia, the government issued several policies to break the COVID 19 pandemic chain, which also triggered some pro-cons in the midst of society. One of the policies government takes is the closure of learning access directly at school and moving the learning process from physical class to a virtual classroom or known as online learning. In the economic sector also affects the parents’ financial ability to provide sufficient funds to support the implementation of distance learning applied by the government. The implications of the distance education policy are of course the quality of learning, including the subjects of Islamic religious education, which is essentially aimed at planting knowledge, skills, and religious consciousness to form the character of the students. Online education must certainly be precise, in order to provide equal education services to all students, prepare teachers to master the technology, and seek the core learning of Islamic religious education can still be done well.


Author(s):  
Juan Alfredo Lino-Gamiño ◽  
Carlos Méndez-González ◽  
Eduardo José Salazar-Araujo ◽  
Pablo Adrián Magaña-Sánchez

In the value chain it is important to keep in mind the core business of the company, since it depends largely on the competitiveness of the company and its overall performance, bearing in mind that all business indicators depend on it. In this work we will study the washing process within the company WASH CONTAINERS SA DE CV, to improve the washing processes and in this way reduce times and movements in the process leading the company to reduce costs considerably within the operations company daily, having a more competitive operation and with greater profit margin in its business process. Goals: It Improve the logistics of the movement of containers for washing and with it the core business of the company. Methodology: The action research will be applied applying Business Process Management for the improvement of processes in situ, it will be developed in a certain period of time and with that it will establish an improvement projection. Contribution: The improvement of the times for the disposal of the containers and their subsequent use, allows a better competitiveness and with it the income of the company, on the other hand, the transport companies improve in performance in quantity, quality of disposition and with it their income.


2019 ◽  
Vol 54 (6) ◽  
Author(s):  
Sawsan Ali Hamid ◽  
Rana Alauldeen Abdalrahman ◽  
Inam Abdullah Lafta ◽  
Israa Al Barazanchi

Recently, web services have presented a new and evolving model for constructing the distributed system. The meteoric growth of the Web over the last few years proves the efficacy of using simple protocols over the Internet as the basis for a large number of web services and applications. Web service is a modern technology of web, which can be defined as software applications with a programmatic interface based on Internet protocol. Web services became common in the applications of the web by the help of Universal, Description, Discovery and Integration; Web Service Description Language and Simple Object Access Protocol. The architecture of web services refers to a collection of conceptual components in which common sets of standard can be defined among interoperating components. Nevertheless, the existing Web service's architecture is not impervious to some challenges, such as security problems, and the quality of services. Against this backdrop, the present study will provide an overview of these issues. Therefore, it aims to propose web services architecture model to support distributed system in terms of application and issues.


Sign in / Sign up

Export Citation Format

Share Document