A Scalable Approach to Harvest Modern Weblogs

Blogs are one of the most prominent means of communication on the web. Their content, interconnections and influence constitute a unique socio-technical artefact of our times which needs to be preserved. The BlogForever project has established best practices and developed an innovative system to harvest, preserve, manage and reuse blog content. This paper presents the latest developments of the blog crawler which is a key component of the BlogForever platform. More precisely, our work concentrates on techniques to automatically extract content such as articles, authors, dates and comments from blog posts. To achieve this goal, we introduce a simple yet robust and scalable algorithm to generate extraction rules based on string matching using the blog's web feed in conjunction with blog hypertext. Furthermore, we present a system architecture which is characterised by efficiency, modularity, scalability and interoperability with third-party systems. Finally, we conduct thorough evaluations of the performance and accuracy of our system.
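
The idea of deriving extraction rules by matching feed values against the page hypertext can be illustrated with a minimal sketch. This is not the BlogForever implementation: the rule format (a tag path with an optional class), the use of `difflib` for similarity, and the names `learn_rule` and `apply_rule` are all assumptions made for illustration.

```python
from difflib import SequenceMatcher
from html.parser import HTMLParser


class _TextNodes(HTMLParser):
    """Collect (path, text) pairs, where path is the chain of open tags."""

    VOID = {"br", "img", "hr", "meta", "link", "input"}

    def __init__(self):
        super().__init__()
        self.stack = []
        self.nodes = []

    def handle_starttag(self, tag, attrs):
        if tag in self.VOID:
            return  # void elements never get a closing tag
        cls = dict(attrs).get("class")
        self.stack.append(f"{tag}.{cls}" if cls else tag)

    def handle_endtag(self, tag):
        # Pop back to the most recent matching open tag (tolerates sloppy HTML).
        for i in range(len(self.stack) - 1, -1, -1):
            if self.stack[i].split(".")[0] == tag:
                del self.stack[i:]
                break

    def handle_data(self, data):
        text = " ".join(data.split())
        if text:
            self.nodes.append(("/".join(self.stack), text))


def parse_nodes(html):
    parser = _TextNodes()
    parser.feed(html)
    parser.close()
    return parser.nodes


def learn_rule(feed_value, html):
    """Return the path of the text node most similar to a value from the feed."""
    nodes = parse_nodes(html)  # assumed non-empty for this sketch
    best = max(nodes, key=lambda n: SequenceMatcher(None, feed_value, n[1]).ratio())
    return best[0]


def apply_rule(rule, html):
    """Extract the text of every node whose path equals the learned rule."""
    return [text for path, text in parse_nodes(html) if path == rule]
```

In this sketch a rule learned from one post whose title is known from the feed, for example `learn_rule(feed_title, post_html)`, would then be applied to other posts of the same blog with `apply_rule(rule, other_post_html)`, on the assumption that posts share a template.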

About technologies of use of external data on creating and editing of encyclopedic texts

PROBLEMS IN PROGRAMMING ◽

10.15407/pp2017.01.067 ◽

2017 ◽

pp. 067-082

Author(s):

G.Yu. Proskudina ◽

K.A. Kudim ◽

Keyword(s):

Web Service ◽

Query Language ◽

Party Systems ◽

Test Period ◽

Third Party ◽

External Data ◽

The Web

The paper discusses the development of the Wikidata project, its query web service and its query language. The workflow of the web service, the query language and the result output forms are demonstrated with numerous examples. A technology for using Wikidata from third-party systems is developed. In this connection, the ExternalData extension, which is part of the MediaWiki software, is considered. Additionally, instructions for installing and configuring the extension are presented. During the test period, the ExternalData extension was improved. A procedure for automatic list generation in a wiki page was developed.
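
As a rough illustration of how a third-party system might query Wikidata and turn the result into a list for a wiki page, the sketch below builds a request URL for the public Wikidata Query Service and formats a response in the standard SPARQL JSON results layout. It does not use the ExternalData extension and is not taken from the paper; the helper names, the example class `Q146` (house cat) and the output list format are assumptions.

```python
import json
from urllib.parse import urlencode

# Public SPARQL endpoint of the Wikidata Query Service.
WDQS_ENDPOINT = "https://query.wikidata.org/sparql"


def build_query_url(class_qid, limit=10):
    """Build a GET URL that asks for items that are instances of class_qid."""
    sparql = (
        "SELECT ?item ?itemLabel WHERE { "
        f"?item wdt:P31 wd:{class_qid} . "
        'SERVICE wikibase:label { bd:serviceParam wikibase:language "en". } '
        f"}} LIMIT {int(limit)}"
    )
    return WDQS_ENDPOINT + "?" + urlencode({"query": sparql, "format": "json"})


def to_wikitext_list(sparql_json):
    """Render SPARQL JSON results as a wikitext bullet list (hypothetical format)."""
    data = json.loads(sparql_json)
    lines = []
    for row in data["results"]["bindings"]:
        qid = row["item"]["value"].rsplit("/", 1)[-1]  # entity URI -> Q-id
        label = row["itemLabel"]["value"]
        lines.append(f"* {label} ({qid})")
    return "\n".join(lines)
```

The sketch deliberately stops short of sending the request, so it can be read and run without network access; fetching the URL and passing the response body to `to_wikitext_list` would complete the round trip.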

Coding for Evaluation and Treatment

Perspectives on Voice and Voice Disorders ◽

10.1044/vvd18.1.9 ◽

2008 ◽

Vol 18 (1) ◽

pp. 9-20 ◽

Cited By ~ 1

Author(s):

Mark Kander ◽

Steve White

Keyword(s):

Medical Record ◽

Treatment Session ◽

Accurate Diagnosis ◽

Third Party ◽

Speech Language Pathologists ◽

Adequate Information ◽

Diagnosis Codes ◽

Coding Rules ◽

Procedure Codes ◽

The Web

This article explains the development and use of ICD-9-CM diagnosis codes, CPT procedure codes, and HCPCS supply/device codes. Examples of appropriate coding combinations and coding rules adopted by most third-party payers are given. Additionally, references for complete code lists on the Web and a list of voice-related CPT code edits are included. The reader is given adequate information to report an evaluation or treatment session with accurate diagnosis, procedure, and supply/device codes. Speech-language pathologists can accurately code services when given adequate resources and rules and are encouraged to insert relevant codes in the medical record rather than depend on billing personnel to accurately provide this information. Consultation is available from the Division 3 Reimbursement Committee members and from [email protected].

Exploring the accountability relationship between non-profit organisations and the state auditor

Voluntary Sector Review ◽

10.1332/204080521x16321509753051 ◽

2021 ◽

Author(s):

Joanne G. Carman

Keyword(s):

North Carolina ◽

Best Practices ◽

The State ◽

Third Party ◽

Board Members ◽

Training And Education ◽

Non Profit ◽

The North ◽

Audit Reports

This article explores the accountability relationship between the state auditor’s office and non-profit organisations by examining the audit reports prepared by the North Carolina State Auditor’s Office for non-profit organisations from 2009 to 2018. The data collected for this study show that the extent to which the state auditor conducts audits of non-profit organisations is fairly limited. Yet, when it does audit them, it is doing so to police their behaviours, monitor their expenditures and ensure that they are being good stewards with the resources they have been given. The findings from this study have important implications, in that they suggest that other accountability mechanisms continue to be important, including: training and education for board members about their legal and fiduciary responsibilities; the importance of adhering to best practices and standards; and the important role that third-party watchdog organisations and accreditors can play in ensuring non-profit accountability.

Design of the Web-based monitoring system architecture for geomagnetically induced current

2010 International Conference on Educational and Network Technology ◽

10.1109/icent.2010.5532181 ◽

2010 ◽

Author(s):

Jian Wang ◽

Ying Wang ◽

Lian-guang Liu

Keyword(s):

Monitoring System ◽

System Architecture ◽

Induced Current ◽

Web Based ◽

Geomagnetically Induced Current ◽

The Web

Towards Fully Automated News Reporting in Brazilian Portuguese

10.5753/eniac.2020.12158 ◽

2020 ◽

Author(s):

João Campos ◽

André Teixeira ◽

Thiago Ferreira ◽

Fábio Cozman ◽

Adriana Pagano

Keyword(s):

System Architecture ◽

Brazilian Portuguese ◽

Automatic Evaluation ◽

News Reporting ◽

Brazilian Society ◽

The Web

We introduce robot journalists that cover two pressing topics in Brazilian society: COVID-19 spread and Legal Amazon deforestation. Our approach is able to automatically analyze structured domain data, select relevant content, generate news texts and publish them on the Web. We provide a thorough description of our system architecture, report on the results of automatic evaluation, discuss some of the advantages of robot-journalism in society, and point out further steps in our work. Corpus and code are publicly available.

From the Internet of Things to the Web of Things: Resource-oriented Architecture and Best Practices

Architecting the Internet of Things ◽

10.1007/978-3-642-19157-2_5 ◽

2011 ◽

pp. 97-129 ◽

Cited By ~ 199

Author(s):

Dominique Guinard ◽

Vlad Trifa ◽

Friedemann Mattern ◽

Erik Wilde

Keyword(s):

Internet Of Things ◽

Best Practices ◽

The Internet ◽

Web Of Things ◽

The Internet Of Things ◽

The Web

A Model-Based Approach for Integrating Third Party Systems with Web Applications

Lecture Notes in Computer Science - Web Engineering ◽

10.1007/11531371_57 ◽

2005 ◽

pp. 441-452 ◽

Cited By ~ 4

Author(s):

Nathalie Moreno ◽

Antonio Vallecillo

Keyword(s):

Web Applications ◽

Party Systems ◽

Third Party ◽

Model Based

A Model-based Control System Architecture for the Web-Distributed Simulation and Operation of Assembly Lines

Applied Simulation and Modelling ◽

10.2316/p.2011.715-086 ◽

2011 ◽

Cited By ~ 1

Author(s):

Jürgen Rossmann ◽

Christian Schlette ◽

Michael Schluse ◽

Martin Hoppen

Keyword(s):

Control System ◽

System Architecture ◽

Distributed Simulation ◽

Assembly Lines ◽

Model Based Control ◽

Model Based ◽

Control System Architecture ◽

The Web

Libraries and Cloud Computing Models

Advances in Library and Information Science - Cloud Computing and Virtualization Technologies in Libraries ◽

10.4018/978-1-4666-4631-5.ch008 ◽

2014 ◽

pp. 124-149

Author(s):

Satish C. Sharma ◽

Harshila Bagoria

Keyword(s):

Cloud Computing ◽

Case Studies ◽

Geographic Location ◽

Third Party ◽

The Internet ◽

Major Topic ◽

Or Organization ◽

The Web ◽

Computing Models ◽

The Way

Cloud computing is a new breed of service offered over the Internet, which has completely changed the way one can use the power of computers irrespective of geographic location. It has opened new avenues for organizations and businesses to offer services using hardware, software or platforms from third-party sources, thus saving on cost and maintenance. It can transform the way systems are built and services delivered, providing libraries with an opportunity to extend their impact. Cloud computing has become a major topic of discussion and debate for any business or organization which relies on technology. Anyone connected to the Internet is probably using some type of cloud computing on a regular basis. Whether they are using Google’s Gmail, organizing photos on Flickr, or searching the Web with Bing, they are engaged in cloud computing. This chapter gives an overview of this technology, its connection with libraries, the models in which libraries can deploy it to provide services and augment the productivity of library staff, and case studies.

Mix, Match, Rediscovery

Information Resources Management ◽

10.4018/978-1-61520-965-1.ch803 ◽

2010 ◽

pp. 2298-2309

Author(s):

Justin Meza ◽

Qin Zhu

Keyword(s):

Best Practices ◽

Web 2.0 ◽

Web Site ◽

Web Application ◽

Knowledge Organization ◽

The Internet ◽

The Future ◽

Enterprise Mashups ◽

The Web

Knowledge is the fact of knowing something from experience or via association. Knowledge organization is the systematic management and organization of knowledge (Hodge, 2000). With the advent of Web 2.0, mashups have become a hot new thing on the Web. A mashup is a Web site or a Web application that combines content from more than one source and delivers it in an integrated way (Fichter, 2006). In this article, we will first explore the concept of mashups and look at the components of a mashup. We will provide an overview of various mashups on the Internet. We will look at literature about knowledge and knowledge organization. Then, we will elaborate on our experiment with a mashup in an enterprise environment. We will describe how we mixed the content from two sets of sources and created a new source: a novel way of organizing and displaying HP Labs Technical Reports. The findings from our project will be included and some best practices for creating enterprise mashups will be given. The future of enterprise mashups will be discussed as well.
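
The core of a mashup, combining records from two sources into one integrated view, can be sketched as a join on a shared key. The function name `mash_up`, the field names and the choice of a dictionary join are hypothetical and are not drawn from the HP Labs project described above.

```python
def mash_up(reports, metadata, key="id"):
    """Join two lists of dicts on a shared key.

    Fields from `metadata` are added to each report; where both sources
    define the same field, the value from `reports` wins.
    """
    by_key = {m[key]: m for m in metadata}
    merged = []
    for report in reports:
        extra = by_key.get(report[key], {})  # no match: keep the report as-is
        merged.append({**extra, **report})
    return merged
```

Keeping unmatched reports rather than dropping them is one possible design choice; a stricter mashup might instead skip records that appear in only one source.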