The World Wide Web as Complex Data Set: Expanding the Digital Humanities into the Twentieth Century and Beyond through Internet Research

2016 ◽  
Vol 10 (1) ◽  
pp. 95-109 ◽  
Author(s):  
Michael L. Black

While intellectual property protections effectively frame digital humanities text mining as a field primarily for the study of the nineteenth century, the Internet offers an intriguing object of study for humanists working in later periods. As a complex data source, the World Wide Web presents its own methodological challenges for digital humanists, but lessons learned from projects studying large nineteenth-century corpora offer helpful starting points. Complicating matters further, legal and ethical questions surrounding web scraping, or the practice of large-scale data retrieval over the Internet, will require humanists to frame their research to distinguish it from commercial and malicious activities. This essay reviews relevant research in the digital humanities and new media studies in order to show how web scraping might contribute to humanities research questions. In addition to recommendations for addressing the complex concerns surrounding web scraping, this essay provides a basic overview of the process and some recommendations for resources.

1970 ◽  
Vol 41 (116) ◽  
pp. 153-171
Author(s):  
Helle Porsdam ◽  
Mia Rendix

DIGITAL HUMANITIES: THE FUTURE OF THE HUMANITIES | The article deals with a widespread global phenomenon within the Humanities, i.e. Digital Humanities. In the wake of the spread of the Internet, with its opportunities for sharing knowledge, open access, and the World Wide Web as a democratic agency, the Humanities faculties are in the process of adapting themselves to digital challenges and potentials. Much has been written on digitalization’s ideological and technological aspects and consequences, but only over the past five to seven years have researchers from the Humanities begun to devise ways in which the new Digital Humanities should be formed. Special centers for Digital Humanities already exist at leading American and British universities, and several specific manifestos on the digitalization of the Humanities have been written. The authors of this article wish to discuss and problematize the Digital Humanities in both an international and a national context, and the consequences of Digital Humanities that we see for the actual role of research and the potential for acknowledgement.


1999 ◽  
Vol 40 (1) ◽  
pp. 97-104
Author(s):  
Susan Brady

Over the past decade academic and research libraries throughout the world have taken advantage of the enormous developments in communication technology to improve services to their users. Through the Internet and the World Wide Web researchers now have convenient electronic access to library catalogs, indexes, subject bibliographies, descriptions of manuscript and archival collections, and other resources. This brief overview illustrates how libraries are facilitating performing arts research in new ways.


2009 ◽  
Vol 28 (2) ◽  
pp. 81 ◽  
Author(s):  
John Carlo Bertot

Public libraries were early adopters of Internet-based technologies and have provided public access to the Internet and computers since the early 1990s. The landscape of public-access Internet and computing was substantially different in the 1990s as the World Wide Web was only in its initial development. At that time, public libraries essentially experimented with public-access Internet and computer services, largely absorbing this service into existing service and resource provision without substantial consideration of the management, facilities, staffing, and other implications of public-access technology (PAT) services and resources. This article explores the implications for public libraries of the provision of PAT and seeks to look further to review issues and practices associated with PAT provision resources. While much research focuses on the amount of public access that public libraries provide, little offers a view of the effect of public access on libraries. This article provides insights into some of the costs, issues, and challenges associated with public access and concludes with recommendations that require continued exploration.


2002 ◽  
Vol 7 (1) ◽  
pp. 9-25 ◽  
Author(s):  
Moses Boudourides ◽  
Gerasimos Antypas

In this paper we present a simple simulation of the Internet World-Wide Web, in which one observes the appearance of web pages belonging to different web sites, covering a number of different thematic topics and possessing links to other web pages. The goal of our simulation is to reproduce the form of the observed World-Wide Web and of its growth, using a small number of simple assumptions. In our simulation, existing web pages may generate new ones as follows: First, each web page is equipped with a topic concerning its contents. Second, links between web pages are established according to common topics. Next, new web pages may be randomly generated, and subsequently they may be equipped with a topic and assigned to web sites. By repeated iterations of these rules, our simulation appears to exhibit the observed structure of the World-Wide Web and, in particular, a power law type of growth. In order to visualise the network of web pages, we have followed N. Gilbert's (1997) methodology of scientometric simulation, assuming that web pages can be represented by points in the plane. Furthermore, the simulated graph is found to possess the property of small worlds, as is the case with a large number of other complex networks.
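The three rules listed in this abstract can be illustrated with a short Python sketch. It is not the authors' model: the function name `simulate_web`, the uniform random choice of topic and site, the fixed number of links per new page, and every parameter value are assumptions made for illustration, and the sketch makes no claim to reproduce the power-law or small-world results reported in the paper.

```python
import random
from collections import Counter


def simulate_web(steps=500, n_topics=5, n_sites=20, links_per_page=2, seed=0):
    """Grow a toy web graph by topic-based linking (hypothetical parameters)."""
    rng = random.Random(seed)
    # Start from a single page with a random topic and site.
    pages = [{"topic": rng.randrange(n_topics), "site": rng.randrange(n_sites)}]
    links = []  # directed edges stored as (source_page, target_page)

    for _ in range(steps):
        new_id = len(pages)
        # Rule 1: every page is equipped with a topic.
        topic = rng.randrange(n_topics)
        # Rule 3: a new page is randomly generated and assigned to a site.
        site = rng.randrange(n_sites)
        # Rule 2: links are established only between pages sharing a topic.
        same_topic = [i for i, p in enumerate(pages) if p["topic"] == topic]
        pages.append({"topic": topic, "site": site})
        n_links = min(links_per_page, len(same_topic))
        for target in rng.sample(same_topic, n_links):
            links.append((new_id, target))

    return pages, links


pages, links = simulate_web()
# In-degree per page, the quantity one would inspect for a heavy-tailed shape.
in_degree = Counter(target for _, target in links)
```

Plotting the distribution of `in_degree` values on log-log axes would be the natural next step for comparing such a toy graph against the growth pattern the abstract describes.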


1998 ◽  
Vol 3 (2) ◽  
pp. 12-20 ◽  
Author(s):  
R. S. Slack

This paper argues that the World Wide Web provides a unique opportunity for sociological explication. It contends that sociological uses of the Internet for publication purposes have not as yet taken full advantage of the technology available, producing web facsimiles of printed pages. It highlights the potential for undertaking inquiries which employ the multimedia aspects of WWW technology and extends some of the insights from ethnomethodology and conversation analysis regarding retrievable data.


1997 ◽  
pp. 13-26 ◽  
Author(s):  
David Johnson ◽  
Myke Gluck

This article looks at access to geographic information through a review of information science theory and its application to the WWW. The two most common types of retrieval system are information retrieval and data retrieval. A retrieval system has seven elements: retrieval models, indexing, match and retrieval, relevance, order, query languages and query specification. The goal of information retrieval is to match the user's needs to the information that is in the system. Retrieval of geographic information is a combination of both information and data retrieval. Aids to effective retrieval of geographic information are: query languages that employ icons and natural language, automatic indexing of geographic information, and standardization of geographic information. One area that has seen an explosion of geographic information retrieval systems (GIRs) is the World Wide Web (WWW). The final section of this article discusses how seven WWW GIRs solve the problem of matching the user's information needs to the information in the system.


2021 ◽  
Vol 58 (1) ◽  
pp. 1853-1858
Author(s):  
Lesko Natalia Vladimirovna et al.

The article studies the features of the Internet as a leading institution of information law in the twenty-first century. It determines that a characteristic feature of the Internet is that geographical boundaries play no role in it: the Internet is an electronic information space of communications without borders. That is why it is difficult to ensure effective legal regulation of the Internet: there is no systematic legislation regulating the relevant types of relations on the World Wide Web, and the way the Internet functions has objective features of its own. The article states that an important step in solving the problems of Internet use is the adoption of the laws "On the protection of freedom on the Internet", "On e-democracy", and "On distance learning on the Internet". It notes that in modern society the Internet greatly influences the life of every person. As a result of globalization processes, the World Wide Web performs the function of forming a person's world-view. Unfortunately, standards that do not conform to the ideas of humanism are often promoted on the Internet. New forms of communication on the Internet have led to the separation of the cultural function of this means of mass communication, as a result of which a specific information culture is being formed. Thus, an important factor in building a global information society is the formation of the individual's new information culture on the Internet.


BMJ ◽  
1995 ◽  
Vol 311 (7019) ◽  
pp. 1552-1556 ◽  
Author(s):  
M. Pallen

2020 ◽  
Vol 2 ◽  
pp. 80-85
Author(s):  
Natavan Kamilova ◽  

In this article we attempt to address a problem that looms over our society. As people become absorbed in the Internet and adopt the language of social networks, they distort their language and forget the great traditions of each nation and the originality, harmony, and euphony of a language. Here we discuss the rich and expressive Russian language.

