A Scalable Approach to Harvest Modern Weblogs

2015 ◽  
Vol 24 (02) ◽  
pp. 1540005
Author(s):  
Vangelis Banos ◽  
Olivier Blanvillain ◽  
Nikos Kasioumis ◽  
Yannis Manolopoulos

Blogs are one of the most prominent means of communication on the web. Their content, interconnections and influence constitute a unique socio-technical artefact of our times which needs to be preserved. The BlogForever project has established best practices and developed an innovative system to harvest, preserve, manage and reuse blog content. This paper presents the latest developments of the blog crawler which is a key component of the BlogForever platform. More precisely, our work concentrates on techniques to automatically extract content such as articles, authors, dates and comments from blog posts. To achieve this goal, we introduce a simple yet robust and scalable algorithm to generate extraction rules based on string matching using the blog's web feed in conjunction with blog hypertext. Furthermore, we present a system architecture which is characterised by efficiency, modularity, scalability and interoperability with third-party systems. Finally, we conduct thorough evaluations of the performance and accuracy of our system.
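
The general idea of deriving an extraction rule by matching feed content against page hypertext can be sketched as follows. This is only an illustration under stated assumptions, not the authors' algorithm: the abstract does not give its details. The path notation, the helper names (`collect`, `learn_rule`, `apply_rule`), the similarity measure and the sample pages are all hypothetical.

```python
from difflib import SequenceMatcher
from html.parser import HTMLParser


class TextPathCollector(HTMLParser):
    """Collect (path, text) pairs for every non-empty text node in a page."""

    VOID = {"br", "img", "hr", "meta", "link", "input"}  # tags with no end tag

    def __init__(self):
        super().__init__()
        self.stack = []  # currently open elements, e.g. ["html", "body", "h1.title"]
        self.nodes = []  # (path string, stripped text)

    def handle_starttag(self, tag, attrs):
        if tag in self.VOID:
            return
        classes = (dict(attrs).get("class") or "").split()
        self.stack.append(f"{tag}.{classes[0]}" if classes else tag)

    def handle_endtag(self, tag):
        # Pop back to the most recent matching open element, if there is one.
        for i in range(len(self.stack) - 1, -1, -1):
            if self.stack[i].split(".")[0] == tag:
                del self.stack[i:]
                break

    def handle_data(self, data):
        text = data.strip()
        if text:
            self.nodes.append((" > ".join(self.stack), text))


def collect(html):
    parser = TextPathCollector()
    parser.feed(html)
    parser.close()
    return parser.nodes


def learn_rule(feed_value, html):
    """Return the path whose text is most similar to a value taken from the feed."""
    def score(node):
        return SequenceMatcher(None, feed_value.lower(), node[1].lower()).ratio()
    path, _ = max(collect(html), key=score)
    return path


def apply_rule(rule, html):
    """Extract the text found at a learned path on another page of the same blog."""
    return [text for path, text in collect(html) if path == rule]


# Hypothetical pages: page_a is a post that also appears in the feed,
# page_b is another post that shares the same template.
page_a = """<html><body><div class="nav">Home</div>
<h1 class="entry-title">Harvesting weblogs at scale</h1>
<div class="post">Body text</div></body></html>"""
page_b = """<html><body><div class="nav">Home</div>
<h1 class="entry-title">A second post</h1>
<div class="post">Other text</div></body></html>"""

rule = learn_rule("Harvesting Weblogs at Scale", page_a)
print(rule)
print(apply_rule(rule, page_b))
```

The feed supplies a known value (here a post title), the page is searched for the closest matching text, and the location of that match becomes a rule that can be reused on posts the feed no longer lists.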

2017 ◽  
pp. 067-082
Author(s):  
G.Yu. Proskudina ◽  
K.A. Kudim

The paper discusses the development of the Wikidata project, its query web service and its query language. The workflow of the web service, the query language and the result output forms are demonstrated with numerous examples. A technology for using Wikidata from third-party systems is developed. In this regard, the ExternalData extension, which is part of the MediaWiki software, is considered. Additionally, instructions for installing and configuring the extension are presented. During the test period the ExternalData extension was improved. A procedure for automatic list generation in a wiki page is developed.
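
As a rough illustration of how a third-party system might address the Wikidata query service, the sketch below builds a request URL for a small SPARQL query and reads labels from a response in the standard SPARQL JSON results layout. It is not taken from the paper: the query (instances of house cat), the helper names and the sample payload are assumptions chosen for the example, and no network call is made.

```python
from urllib.parse import urlencode

# Public SPARQL endpoint of the Wikidata Query Service.
WDQS_ENDPOINT = "https://query.wikidata.org/sparql"

# Example query: five items that are an instance of (P31) house cat (Q146).
QUERY = """
SELECT ?item ?itemLabel WHERE {
  ?item wdt:P31 wd:Q146 .
  SERVICE wikibase:label { bd:serviceParam wikibase:language "en" . }
}
LIMIT 5
"""


def build_request_url(sparql, fmt="json"):
    """Encode a SPARQL query as a GET request URL for the query service."""
    return WDQS_ENDPOINT + "?" + urlencode({"query": sparql.strip(), "format": fmt})


def labels_from_response(payload):
    """Pull the itemLabel values out of a SPARQL JSON results document."""
    return [row["itemLabel"]["value"] for row in payload["results"]["bindings"]]


# Hypothetical response body, shaped like the SPARQL JSON results format.
sample_payload = {
    "head": {"vars": ["item", "itemLabel"]},
    "results": {
        "bindings": [
            {
                "item": {"type": "uri", "value": "http://www.wikidata.org/entity/Q0000"},
                "itemLabel": {"type": "literal", "value": "Example cat"},
            }
        ]
    },
}

url = build_request_url(QUERY)
print(url)
print(labels_from_response(sample_payload))
```

A list produced this way is the kind of result that a wiki page could then render automatically.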


2008 ◽  
Vol 18 (1) ◽  
pp. 9-20 ◽  
Author(s):  
Mark Kander ◽  
Steve White

This article explains the development and use of ICD-9-CM diagnosis codes, CPT procedure codes, and HCPCS supply/device codes. Examples of appropriate coding combinations and coding rules adopted by most third-party payers are given. Additionally, references for complete code lists on the Web and a list of voice-related CPT code edits are included. The reader is given adequate information to report an evaluation or treatment session with accurate diagnosis, procedure, and supply/device codes. Speech-language pathologists can accurately code services when given adequate resources and rules and are encouraged to insert relevant codes in the medical record rather than depend on billing personnel to accurately provide this information. Consultation is available from the Division 3 Reimbursement Committee members and from [email protected].


Author(s):  
Joanne G. Carman

This article explores the accountability relationship between the state auditor’s office and non-profit organisations by examining the audit reports prepared by the North Carolina State Auditor’s Office for non-profit organisations from 2009 to 2018. The data collected for this study show that the extent to which the state auditor conducts audits of non-profit organisations is fairly limited. Yet, when it does audit them, it is doing so to police their behaviours, monitor their expenditures and ensure that they are being good stewards with the resources they have been given. The findings from this study have important implications, in that they suggest that other accountability mechanisms continue to be important, including: training and education for board members about their legal and fiduciary responsibilities; the importance of adhering to best practices and standards; and the important role that third-party watchdog organisations and accreditors can play in ensuring non-profit accountability.


2020 ◽  
Author(s):  
João Campos ◽  
André Teixeira ◽  
Thiago Ferreira ◽  
Fábio Cozman ◽  
Adriana Pagano

We introduce robot journalists that cover two pressing topics in Brazilian society: COVID-19 spread and Legal Amazon deforestation. Our approach is able to automatically analyze structured domain data, select relevant content, generate news texts and publish them on the Web. We provide a thorough description of our system architecture, report on the results of automatic evaluation, discuss some of the advantages of robot-journalism in society, and point out further steps in our work. Corpus and code are publicly available.
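
The pipeline stages named above (analyze structured data, select relevant content, generate text) can be sketched in a few lines. This is a simplified template-based illustration, not the authors' system: the record type, the 10% threshold, the sentence template and the figures are all made up for the example.

```python
from dataclasses import dataclass


@dataclass
class DailyRecord:
    region: str
    new_cases: int
    previous_cases: int


def select_content(records, threshold=0.10):
    """Keep records whose day-over-day change is at least the threshold."""
    selected = []
    for record in records:
        if record.previous_cases == 0:
            continue  # relative change is undefined without a baseline
        change = (record.new_cases - record.previous_cases) / record.previous_cases
        if abs(change) >= threshold:
            selected.append((record, change))
    return selected


def realize(record, change):
    """Turn one selected record into a sentence using a fixed template."""
    direction = "rose" if change > 0 else "fell"
    return (f"New cases in {record.region} {direction} by {abs(change):.0%} "
            f"to {record.new_cases}.")


def generate_report(records):
    return [realize(record, change) for record, change in select_content(records)]


# Made-up figures, for illustration only.
records = [
    DailyRecord("Region A", 120, 100),
    DailyRecord("Region B", 101, 100),
    DailyRecord("Region C", 45, 60),
]
report = generate_report(records)
for sentence in report:
    print(sentence)
```

Region B is dropped at the selection step because its change is too small to be newsworthy under the assumed threshold, and publishing would be a further step after `generate_report`.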


Author(s):  
Satish C. Sharma ◽  
Harshila Bagoria

Cloud computing is a new breed of service offered over the Internet, which has completely changed the way one can use the power of computers irrespective of geographic location. It has opened new avenues for organizations and businesses to offer services using the hardware, software or platforms of third-party sources, thus saving on cost and maintenance. It can transform the way systems are built and services delivered, providing libraries with an opportunity to extend their impact. Cloud computing has become a major topic of discussion and debate for any business or organization which relies on technology. Anyone connected to the Internet is probably using some type of cloud computing on a regular basis. Whether they are using Google’s Gmail, organizing photos on Flickr, or searching the Web with Bing, they are engaged in cloud computing. In this chapter, an attempt has been made to give an overview of this technology, its connection with libraries, the models in which libraries can deploy it to provide services and augment the productivity of library staff, and case studies.


2010 ◽  
pp. 2298-2309
Author(s):  
Justin Meza ◽  
Qin Zhu

Knowledge is the fact of knowing something from experience or via association. Knowledge organization is the systematic management and organization of knowledge (Hodge, 2000). With the advent of Web 2.0, mashups have become a hot new thing on the Web. A mashup is a Web site or a Web application that combines content from more than one source and delivers it in an integrated way (Fichter, 2006). In this article, we will first explore the concept of mashups and look at the components of a mashup. We will provide an overview of various mashups on the Internet. We will look at literature about knowledge and knowledge organization. Then, we will elaborate on our experiment of a mashup in an enterprise environment. We will describe how we mixed the content from two sets of sources and created a new source: a novel way of organizing and displaying HP Labs Technical Reports. The findings from our project will be included and some best practices for creating enterprise mashups will be given. The future of enterprise mashups will be discussed as well.

