Vertical Search Engines on Forestry

2010 ◽  
Vol 143-144 ◽  
pp. 1270-1274 ◽  
Author(s):  
Fan Zhang ◽  
Xiu Lan Feng ◽  
Jin Sheng Yuan

With the rapid growth of forestry information,the amount of information on forestry is increasing rapidly. Comprehensive search engine is powerful,but its speed and accuracy of industry search is limited, owing to a host of information. According to the definition of vertical search engines,Heritrix Web Crawler and full text search framework of Lucene,this paper is mainly concerned the information of capture,indexing and search strategies in order to achieve an ideal forestry vertical search engine design. Experiments compared with comprehensive have proved the effectiveness of the proposed method.

2018 ◽  
Vol 176 ◽  
pp. 03014
Author(s):  
Yaru Cao ◽  
Ning Ma ◽  
Fucheng Wan ◽  
Xiangzhen He

Based on the research of vertical search engine and cross-language information retrieval, a crosslanguage vertical search engine design for e-commerce platform is proposed. It aims to solve the problem that it is difficult for Internet users to quickly, efficiently, and comprehensively search for valuable products, especially ethnic minority netizens. Cross-language in this article mainly refers to the conversion of Chinese, English, and Tibetan. Using dictionary-based query translation method to translate query words to achieve cross-language function. Improved Heritrix designed a web crawler information collection method. Using HtmlParser to achieve structured information extraction, and using Lucene to build an index and achieve retrieval.


2014 ◽  
Vol 651-653 ◽  
pp. 1580-1585
Author(s):  
Yong Cai ◽  
Mei Li ◽  
Hao Hu ◽  
Yu Hua Ni ◽  
Nu Hua Cheng

The Traditional Chinese Medicine (TCM) industry is a traditional industry of China. A unique, comprehensive and systematic theory and method of diagnosis and treatment has been formed over thousands of years. A large amount of TCM literature and database resources need to be shared via the Internet for queries by professionals and A unique TCM-industry search engine has been developed to comply with such requirements. A new method for generating query recommendations according to the enquiry is proposed based on features of the TCM industry, on the authors’ experience and on learning from the enactment and development of the vertical search engine of the TCM industry. Different from methods used by other search engines such as Google, Yahoo or Baidu, the new method combines the features of the TCM-industry search engine and the Chinese word segmentation of the TCM industry. It uses algorithms to calculate the correlation between query recommendations and stores these in relationship database. Practice indicates that such a method generates accurate and industry-specific query recommendations promptly.It therefore has innovation and promotional value.


Author(s):  
H. Arafat Ali ◽  
Ali I. El Desouky ◽  
Ahmed I. Saleh

Search engines are the most important search tools for finding useful and recent information on the Web today. They rely on crawlers that continually crawl the Web for new pages. Meanwhile, focused crawlers have become an attractive area for research in recent years. They suggest a better solution for general-purpose search engine limitations and lead to a new generation of search engines called vertical-search engines. Searching the Web vertically is to divide the Web into smaller regions; each region is related to a specific domain. In addition, one crawler is allowed to search in each domain. The innovation of this article is adding intelligence and adaptation ability to focused crawlers. Such added features will certainly guide the crawler perfectly to retrieve more relevant pages while crawling the Web. The proposed crawler has the ability to estimate the rank of the page before visiting it and adapts itself to any changes in its domain using.


The Dark Web ◽  
2018 ◽  
pp. 319-333
Author(s):  
Sudhakar Ranjan ◽  
Komal Kumar Bhatia

Now days with the advent of internet technologies and ecommerce the need for smart search engine for human life is rising. The traditional search engines are not intelligent as well as smart and thus lead to the rise in searching costs. In this paper, architecture of a vertical search engine based on the domain specific hidden web crawler is proposed. To make a least cost vertical search engine improvement in the following techniques like: searching, indexing, ranking, transaction and query interface are suggested. The domain term analyzer filters the useless information to the maximum extent and finally provides the users with high precision information. Through the experimental result it is shown that the system works on accelerating the access, computation, storage, communication time, increased efficiency and work professionally.


2017 ◽  
Vol 7 (2) ◽  
pp. 19-33
Author(s):  
Sudhakar Ranjan ◽  
Komal Kumar Bhatia

Now days with the advent of internet technologies and ecommerce the need for smart search engine for human life is rising. The traditional search engines are not intelligent as well as smart and thus lead to the rise in searching costs. In this paper, architecture of a vertical search engine based on the domain specific hidden web crawler is proposed. To make a least cost vertical search engine improvement in the following techniques like: searching, indexing, ranking, transaction and query interface are suggested. The domain term analyzer filters the useless information to the maximum extent and finally provides the users with high precision information. Through the experimental result it is shown that the system works on accelerating the access, computation, storage, communication time, increased efficiency and work professionally.


2020 ◽  
Vol 19 (10) ◽  
pp. 1602-1618 ◽  
Author(s):  
Thibault Robin ◽  
Julien Mariethoz ◽  
Frédérique Lisacek

A key point in achieving accurate intact glycopeptide identification is the definition of the glycan composition file that is used to match experimental with theoretical masses by a glycoproteomics search engine. At present, these files are mainly built from searching the literature and/or querying data sources focused on posttranslational modifications. Most glycoproteomics search engines include a default composition file that is readily used when processing MS data. We introduce here a glycan composition visualizing and comparative tool associated with the GlyConnect database and called GlyConnect Compozitor. It offers a web interface through which the database can be queried to bring out contextual information relative to a set of glycan compositions. The tool takes advantage of compositions being related to one another through shared monosaccharide counts and outputs interactive graphs summarizing information searched in the database. These results provide a guide for selecting or deselecting compositions in a file in order to reflect the context of a study as closely as possible. They also confirm the consistency of a set of compositions based on the content of the GlyConnect database. As part of the tool collection of the Glycomics@ExPASy initiative, Compozitor is hosted at https://glyconnect.expasy.org/compozitor/ where it can be run as a web application. It is also directly accessible from the GlyConnect database.


Author(s):  
Richard Berendsen ◽  
Bogomil Kovachev ◽  
Edgar Meij ◽  
Maarten de Rijke ◽  
Wouter Weerkamp

Sign in / Sign up

Export Citation Format

Share Document