Irish on the World Wide Web

This paper reports on the process of searching with Irish words on the Irish language version of the Google Internet search engine. Five words from ‘typical’ and ‘non-typical’ domains for Irish are used, and the results are analysed in terms of the “authenticity” of the search process and results, the language usage in the sites found through the search process, and the domains represented by the results. The study identifies a number of problems encountered when searching for results in a ‘small’ language. It also indicates that the ‘official’ sector and other sectors closely related to language policy and planning are the main providers of monolingual Irish texts on the Internet, with a variety of mixed Irish and English approaches favoured by other providers.


Deconstructing Google Dataset Search

10.31229/osf.io/9vjqa ◽

2019 ◽

Author(s):

Adrienne Canino

Keyword(s):

World Wide Web ◽

Search Engine ◽

World Wide ◽

Research Data ◽

The Internet ◽

Research Information ◽

The World ◽

Pros And Cons ◽

Information Landscape ◽

Web Developers

This essay examines the beta tool from Google, Google Dataset Search. The Google Dataset Search, announced in September 2018, is a search engine specific to finding research data published on the internet. The structure and methods of the search engine are examined, as well as the methods Google recommends to web developers to make it an effective tool across the World Wide Web. The column concludes with a discussion of the pros and cons of this tool in the research information landscape.


Searching Bioinformatics Information Strategies for Effective Use of Search Engine

Biomedical Engineering ◽

10.4018/978-1-5225-3158-6.ch033 ◽

2018 ◽

pp. 742-748

Author(s):

Viveka Vardhan Jumpala

Keyword(s):

World Wide Web ◽

Search Engine ◽

Search Engines ◽

World Wide ◽

The Internet ◽

Information Strategies ◽

The World ◽

Search For Information ◽

Effective Use ◽

The Web

The Internet, an information superhighway, has practically compressed the world into a cyber colony through various networks and other internets. The development of the Internet and the emergence of the World Wide Web (WWW) have provided a common vehicle for communication and instantaneous access to search engines and databases. A search engine is designed to facilitate searching for information on the WWW. Search engines are essentially tools that help users find required information on the Web quickly and in an organized manner. Different search engines do the same job in different ways, thus giving different results for the same query. Search strategies are the new trend on the Web.


Searching Bioinformatics Information Strategies for Effective Use of Search Engine

Library and Information Services for Bioinformatics Education and Research - Advances in Library and Information Science ◽

10.4018/978-1-5225-1871-6.ch009 ◽

2017 ◽

pp. 169-176

Author(s):

Viveka Vardhan Jumpala

Keyword(s):

World Wide Web ◽

Search Engine ◽

Search Engines ◽

World Wide ◽

The Internet ◽

Information Strategies ◽

The World ◽

Search For Information ◽

Effective Use ◽

The Web


Internet Search Engines

Encyclopedia of E-Commerce, E-Government, and Mobile Commerce ◽

10.4018/978-1-59140-799-7.ch108 ◽

2011 ◽

pp. 672-677

Author(s):

Vijay Kasi ◽

Radhika Jain

Keyword(s):

Search Engine ◽

Web Sites ◽

Search Engines ◽

World Wide ◽

Relevant Information ◽

The Internet ◽

Web Pages ◽

Web Page ◽

The World ◽

The Web

In the context of the Internet, a search engine can be defined as a software program designed to help one access information, documents, and other content on the World Wide Web. The adoption and growth of the Internet in the last decade has been unprecedented. The World Wide Web has always been applauded for its simplicity and ease of use. This is evident looking at the extent of the knowledge one requires to build a Web page. The flexible nature of the Internet has enabled the rapid growth and adoption of it, making it hard to search for relevant information on the Web. The number of Web pages has been increasing at an astronomical pace, from around 2 million registered domains in 1995 to 233 million registered domains in 2004 (Consortium, 2004). The Internet, considered a distributed database of information, has the CRUD (create, retrieve, update, and delete) rule applied to it. While the Internet has been effective at creating, updating, and deleting content, it has considerably lacked in enabling the retrieval of relevant information. After all, there is no point in having a Web page that has little or no visibility on the Web. Since the 1990s when the first search program was released, we have come a long way in terms of searching for information. Although we are currently witnessing a tremendous growth in search engine technology, the growth of the Internet has overtaken it, leading to a state in which the existing search engine technology is falling short. When we apply the metrics of relevance, rigor, efficiency, and effectiveness to the search domain, it becomes very clear that we have progressed on the rigor and efficiency metrics by utilizing abundant computing power to produce faster searches with a lot of information. Rigor and efficiency are evident in the large number of indexed pages by the leading search engines (Barroso, Dean, & Holzle, 2003). However, more research needs to be done to address the relevance and effectiveness metrics. 
Users typically type in two to three keywords when searching, only to end up with a search result containing thousands of Web pages. This has made it increasingly hard to find useful, relevant information. Search engines today face a number of challenges that require them to perform rigorous searches and return relevant results efficiently so that they are effective. These challenges include the following (“Search Engines,” 2004).
1. The Web is growing at a much faster rate than any present search engine technology can index.
2. Web pages are updated frequently, forcing search engines to revisit them periodically.
3. Dynamically generated Web sites may be slow or difficult to index, or may result in excessive results from a single Web site.
4. Many dynamically generated Web sites are not able to be indexed by search engines.
5. The commercial interests of a search engine can interfere with the order of relevant results the search engine shows.
6. Content that is behind a firewall or that is password protected (such as that found in several digital libraries) is not accessible to search engines.
7. Some Web sites have started using tricks such as spamdexing and cloaking to manipulate search engines into displaying them as the top results for a set of keywords. This can pollute the search results, with more relevant links being pushed down in the result list. This is a result of the popularity of Web searches and the business potential search engines can generate today.
8. Search engines index all the content of the Web without any bounds on the sensitivity of information. This has raised a few security and privacy flags.
With the above background and challenges in mind, we lay out the article as follows. In the next section, we begin with a discussion of search engine evolution. To facilitate the examination and discussion of the progress of search engine development, we break this discussion down into three generations of search engines.
Figure 1 depicts this evolution pictorially and highlights the need for better search engine technologies. Next, we present a brief discussion on the contemporary state of search engine technology and various types of content searches available today. With this background, the next section documents various concerns about existing search engines setting the stage for better search engine technology. These concerns include information overload, relevance, representation, and categorization. Finally, we briefly address the research efforts under way to alleviate these concerns and then present our conclusion.


Developing a Framework for Assessing Information Quality on the World Wide Web

10.28945/2854 ◽

2005 ◽

Cited By ~ 9

Author(s):

Shirlee-ann Knight ◽

Janice Burn

Keyword(s):

Information Retrieval ◽

World Wide Web ◽

Search Engine ◽

Rapid Growth ◽

Information Exchange ◽

Information Quality ◽

World Wide ◽

The Internet ◽

The World

The rapid growth of the Internet as an environment for information exchange and the lack of enforceable standards regarding the information it contains have led to numerous information quality problems. A major issue is the inability of Search Engine technology to wade through the vast expanse of questionable content and return "quality" results to a user's query. This paper attempts to address some of the issues involved in determining what quality is, as it pertains to information retrieval on the Internet. The IQIP model is presented as an approach to managing the choice and implementation of quality-related algorithms of an Internet crawling Search Engine.


Wittgenstein and web facets

NASKO ◽

10.7152/nasko.v3i1.12788 ◽

2011 ◽

Vol 3 (1) ◽

pp. 33

Author(s):

Elizabeth Milonas

Keyword(s):

Search Engine ◽

Philosophy Of Language ◽

World Wide ◽

Web Search ◽

Language Usage ◽

Search Result ◽

Web Search Engine ◽

Daunting Task ◽

The World ◽

The Web

The World Wide Web has grown exponentially in the last few years. The popularity of Web search engines has also grown in a similar manner. The task of a Web search engine is to provide the Web searcher with accurate and targeted information from the plethora of information available on the Web. This is a daunting task that requires the careful usage of language to ensure accuracy. As a result, the importance of the usage and meaning of language in the Web domain has become the focus of recent research. In this paper, the author will explore Wittgenstein’s later philosophy of language as it applies to the language used in the search result pages of a Web search engine in an effort to broaden the understanding of language usage within this domain.


Library Internet Resources: Conducting Performing Arts Research Online

Theatre Survey ◽

10.1017/s004055740000329x ◽

1999 ◽

Vol 40 (1) ◽

pp. 97-104

Author(s):

Susan Brady

Keyword(s):

World Wide Web ◽

Communication Technology ◽

Performing Arts ◽

World Wide ◽

The Internet ◽

Research Libraries ◽

The Past ◽

Internet Resources ◽

The World ◽

Electronic Access

Over the past decade academic and research libraries throughout the world have taken advantage of the enormous developments in communication technology to improve services to their users. Through the Internet and the World Wide Web researchers now have convenient electronic access to library catalogs, indexes, subject bibliographies, descriptions of manuscript and archival collections, and other resources. This brief overview illustrates how libraries are facilitating performing arts research in new ways.


Public Access Technologies in Public Libraries: Effects and Implications

Information Technology and Libraries ◽

10.6017/ital.v28i2.3176 ◽

2009 ◽

Vol 28 (2) ◽

pp. 81 ◽

Cited By ~ 26

Author(s):

John Carlo Bertot

Keyword(s):

World Wide Web ◽

World Wide ◽

Public Libraries ◽

Public Access ◽

The Internet ◽

Access Technology ◽

Initial Development ◽

Early Adopters ◽

The World ◽

Computer Services

Public libraries were early adopters of Internet-based technologies and have provided public access to the Internet and computers since the early 1990s. The landscape of public-access Internet and computing was substantially different in the 1990s as the World Wide Web was only in its initial development. At that time, public libraries essentially experimented with public-access Internet and computer services, largely absorbing this service into existing service and resource provision without substantial consideration of the management, facilities, staffing, and other implications of public-access technology (PAT) services and resources. This article explores the implications for public libraries of the provision of PAT and seeks to look further to review issues and practices associated with PAT provision resources. While much research focuses on the amount of public access that public libraries provide, little offers a view of the effect of public access on libraries. This article provides insights into some of the costs, issues, and challenges associated with public access and concludes with recommendations that require continued exploration.


The Internet for Scientists and the World Wide Web for Scientists and Engineers: A Complete Reference for Navigating, Researching and Publishing Online

Physics Today ◽

10.1063/1.882414 ◽

1998 ◽

Vol 51 (10) ◽

pp. 82-83

Author(s):

Kevin O'Donnell ◽

Larry Winger ◽

Brian J. Thomas ◽

Steven Bachrach

Keyword(s):

World Wide Web ◽

World Wide ◽

The Internet ◽

The World ◽

Scientists And Engineers


A Simulation of the Structure of the World-Wide Web

Sociological Research Online ◽

10.5153/sro.684 ◽

2002 ◽

Vol 7 (1) ◽

pp. 9-25 ◽

Cited By ~ 2

Author(s):

Moses Boudourides ◽

Gerasimos Antypas

Keyword(s):

World Wide Web ◽

Power Law ◽

Web Sites ◽

World Wide ◽

The Internet ◽

Web Pages ◽

Small Worlds ◽

Web Page ◽

Simple Simulation ◽

The World

In this paper we present a simple simulation of the World-Wide Web, in which one observes the appearance of web pages belonging to different web sites, covering a number of different thematic topics and possessing links to other web pages. The goal of our simulation is to reproduce the form of the observed World-Wide Web and of its growth, using a small number of simple assumptions. In our simulation, existing web pages may generate new ones as follows: First, each web page is equipped with a topic concerning its contents. Second, links between web pages are established according to common topics. Next, new web pages may be randomly generated and subsequently they might be equipped with a topic and be assigned to web sites. By repeated iterations of these rules, our simulation appears to exhibit the observed structure of the World-Wide Web and, in particular, a power law type of growth. In order to visualise the network of web pages, we have followed N. Gilbert's (1997) methodology of scientometric simulation, assuming that web pages can be represented by points in the plane. Furthermore, the simulated graph is found to possess the property of small worlds, as is the case with a large number of other complex networks.
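
The rules named in this abstract (pages carry a topic, links follow common topics, new pages are generated at random and assigned to sites) can be illustrated with a minimal sketch. This is not the authors' model: the function names, the uniform random choice of topic and site, and the parameters `num_topics`, `num_sites`, and `links_per_page` are all assumptions made for illustration, and the sketch makes no claim to reproduce the power-law or small-world results reported above.

```python
import random
from collections import Counter


def simulate_web(steps, num_topics=5, num_sites=10, links_per_page=2, seed=0):
    """Grow a toy web graph one page per step (hypothetical simplification)."""
    rng = random.Random(seed)
    # Start from a single page with a random topic and site.
    pages = [{"topic": rng.randrange(num_topics), "site": rng.randrange(num_sites)}]
    links = []  # list of (source_page, target_page) pairs

    for _ in range(steps):
        new_id = len(pages)
        # Assumption: topic and site are drawn uniformly at random.
        topic = rng.randrange(num_topics)
        site = rng.randrange(num_sites)
        # Candidate link targets are existing pages sharing the new page's topic.
        same_topic = [i for i, p in enumerate(pages) if p["topic"] == topic]
        pages.append({"topic": topic, "site": site})
        # Link the new page to a few randomly chosen same-topic pages.
        for target in rng.sample(same_topic, min(links_per_page, len(same_topic))):
            links.append((new_id, target))
    return pages, links


def in_degree_counts(links):
    """Count how many incoming links each page has received."""
    return Counter(target for _, target in links)


if __name__ == "__main__":
    pages, links = simulate_web(steps=200)
    print(len(pages), "pages,", len(links), "links")
    print("most linked pages:", in_degree_counts(links).most_common(3))
```

Replacing the uniform choice of link targets with one weighted by existing in-degree would be one way to explore heavier-tailed link distributions, though whether that matches the paper's mechanism is not stated in the abstract.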
