web data Latest Research Papers

Spatiotemporal RDF Data Query Based on Subgraph Matching

ISPRS International Journal of Geo-Information ◽

10.3390/ijgi10120832 ◽

2021 ◽

Vol 10 (12) ◽

pp. 832

Author(s):

Xiangfu Meng ◽

Lin Zhu ◽

Qing Li ◽

Xiaoyan Zhang

Keyword(s):

Web Data ◽

Subgraph Matching ◽

Data Query ◽

Query Process ◽

Query Result ◽

Temporal Features ◽

Speed Up ◽

Rdf Data ◽

Description Framework ◽

Query Algorithm

Resource Description Framework (RDF), as a standard metadata description framework proposed by the World Wide Web Consortium (W3C), is suitable for modeling and querying Web data. With the growing importance of RDF data in Web data management, there is an increasing need for modeling and querying RDF data. Previous approaches mainly focus on querying RDF. However, a large amount of RDF data have spatial and temporal features. Therefore, it is important to study spatiotemporal RDF data query approaches. In this paper, firstly, we formally define spatiotemporal RDF data, and construct a spatiotemporal RDF model st-RDF that is used to represent and manipulate spatiotemporal RDF data. Secondly, we present a spatiotemporal RDF query algorithm stQuery based on subgraph matching. This algorithm can quickly determine whether the query result is empty for queries whose temporal or spatial range exceeds a specific range by adopting a preliminary query filtering mechanism in the query process. Thirdly, we propose a sorting strategy that calculates the matching order of query nodes to speed up the subgraph matching. Finally, we conduct experiments in terms of effect and query efficiency. The experimental results show the performance advantages of our approach.

Intelligent and adaptive web data extraction system using convolutional and long short-term memory deep learning networks

Big Data Mining and Analytics ◽

10.26599/bdma.2021.9020012 ◽

2021 ◽

Vol 4 (4) ◽

pp. 279-297

Author(s):

Sudhir Kumar Patnaik ◽

C. Narendra Babu ◽

Mukul Bhave

Keyword(s):

Deep Learning ◽

Short Term Memory ◽

Data Extraction ◽

Extraction System ◽

Learning Networks ◽

Web Data ◽

Short Term ◽

Term Memory ◽

Web Data Extraction ◽

Long Short Term Memory

Automatic Symptom Extraction from Unstructured Web Data for Designing Healthcare Systems

10.1007/978-981-16-1342-5_46 ◽

2021 ◽

pp. 599-608

Author(s):

Priyanka C. Nair ◽

Deepa Gupta ◽

B. Indira Devi

Keyword(s):

Healthcare Systems ◽

Web Data

SISTEM INFORMASI GEOGRAFIS COFFEE SHOP DI KOTA SAMARINDA BERBASIS WEB

Buletin Poltanesa ◽

10.51967/tanesa.v22i2.881 ◽

2021 ◽

Vol 22 (2) ◽

Author(s):

Rofikhotul Khoeriyah ◽

Nia Kurniadin

Keyword(s):

Web Data ◽

Coffee Shop

Coffee Shop merupakan tempat yang banyak diminati oleh masyarakat Kota Samarinda. Terdapat beberapa perbedaan antara Coffee Shop dengan kedai kopi atau warung kopi, antara lain dari segi konsep, desain interior, sarana dan prasarana, menu dan segmen pasar. Akan tetapi masyarakat dihadapkan dengan permasalahan dalam mengetahui lokasi serta informasi yang ada pada Coffee Shop. Dengan demikian diperlukan sarana informasi yang dapat diakses oleh umum, salah satu cara dengan pembuatan peta informasi berbasis Web yaitu WebGIS. Tujuan dari kegiatan penelitian ini yaitu untuk memberikan informasi lokasi dan informasi lainnya tentang Coffee Shop yang ada di Samarinda, serta penyajiannya dalam bentuk peta informasi berbasis Web. Data yang dikumpulkan berupa nilai titik koordinat dari hasil pengamatan di lapangan, serta beberapa informasi mengenai Coffee Shop dari media sosial masing-masing Coffee Shop, yang kemudian diolah menggunakan perangkat lunak Quantum GIS menjadi peta informasi berbasis Web. Hasil penelitian menunjukkan bahwa terdapat 49 Coffee Shop yang tersebar di Kota Samarinda dan data tersebut disajikan dalam bentuk WebGIS yang disertai informasi yang ada pada masing-masing Coffee Shop tersebut.

Exploiting Heterogenous Web Data – A Systematic Approach on the Example of Nintendo Switch Games

10.1145/3487664.3487674 ◽

2021 ◽

Author(s):

Sandra Boric ◽

Christine Strauss

Keyword(s):

Systematic Approach ◽

Web Data

Assessing internet and web services based webdom and virtual web-data-centric geographical study

GeoJournal ◽

10.1007/s10708-021-10549-5 ◽

2021 ◽

Author(s):

Abhay Sankar Sahu

Keyword(s):

Web Services ◽

Web Data ◽

Geographical Study

Trends in web data extraction using machine learning

Web Intelligence ◽

10.3233/web-210465 ◽

2021 ◽

pp. 1-22

Author(s):

Sudhir Kumar Patnaik ◽

C. Narendra Babu

Keyword(s):

Machine Learning ◽

Error Detection ◽

Data Extraction ◽

Machine Learning Algorithms ◽

Machine Learning Techniques ◽

Future Research ◽

Self Healing ◽

Learning Approaches ◽

Web Data ◽

Web Data Extraction

Web data extraction has seen significant development in the last decade since its inception in the early nineties. It has evolved from a simple manual way of extracting data from web page and documents to automated extraction to an intelligent extraction using machine learning algorithms, tools and techniques. Data extraction is one of the key components of end-to-end life cycle in web data extraction process that includes navigation, extraction, data enrichment and visualization. This paper presents the journey of web data extraction over the last many years highlighting evolution of tools, techniques, frameworks and algorithms for building intelligent web data extraction systems. The paper also throws light into challenges, opportunities for future research and emerging trends over the years in web data extraction with specific focus on machine learning techniques. Both traditional and machine learning approaches to manual and automated web data extraction are experimented and results published with few use cases demonstrating the challenges in web data extraction in the event of changes in the website layout. This paper introduces novel ideas such as self-healing capability in web data extraction and proactive error detection in the event of changes in website layout as an area of future research. This unique perspective will help readers to get deeper insights in to the present and future of web data extraction.

Web data mining1

10.4324/9781003025245-5 ◽

2021 ◽

pp. 46-70

Author(s):

Stefan Bosse ◽

Lena Dahlhaus ◽

Uwe Engel

Keyword(s):

Web Data

Application programming interfaces and web data for social research

10.4324/9781003025245-4 ◽

2021 ◽

pp. 33-45

Author(s):

Dominic Nyhuis

Keyword(s):

Social Research ◽

Web Data ◽

Application Programming Interfaces ◽

Application Programming ◽

Programming Interfaces

Implementation of Web Data Mining Technology Based on Python

Journal of Physics Conference Series ◽

10.1088/1742-6596/2066/1/012033 ◽

2021 ◽

Vol 2066 (1) ◽

pp. 012033

Author(s):

Guilian Feng

Keyword(s):

Data Mining ◽

Data Storage ◽

Web Page ◽

Web Data ◽

Web Crawler ◽

Slow Speed ◽

Web Data Mining ◽

Mining Technology ◽

Low Efficiency ◽

Function Module

Abstract With the arrival of the era of big data, people have gradually realized the importance of data. Data is not just a resource, it is an asset. This paper mainly studies the realization of Web data mining technology based on Python. This paper analyzes the overall architecture design of distributed web crawler system, and then analyzes in detail the principles of crawler’s URL function module, crawler’s web crawl function module, crawler’s web page parsing function module, crawler’s data storage function module and so on. Each function module of the crawler system was tested on the experimental computer, and the data information was summarized for comparative analysis. The main significance of this paper lies in the design and implementation of a distributed web crawler system, which, to a certain extent, solves the problems of slow speed, low efficiency and poor scalability of traditional single computer web crawler, and improves the speed and efficiency of web crawler in grasping information and web page data.

web data
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Spatiotemporal RDF Data Query Based on Subgraph Matching

Intelligent and adaptive web data extraction system using convolutional and long short-term memory deep learning networks

Automatic Symptom Extraction from Unstructured Web Data for Designing Healthcare Systems

SISTEM INFORMASI GEOGRAFIS COFFEE SHOP DI KOTA SAMARINDA BERBASIS WEB

Exploiting Heterogenous Web Data – A Systematic Approach on the Example of Nintendo Switch Games

Assessing internet and web services based webdom and virtual web-data-centric geographical study

Trends in web data extraction using machine learning

Web data mining1

Application programming interfaces and web data for social research

Implementation of Web Data Mining Technology Based on Python

Export Citation Format

web dataRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Spatiotemporal RDF Data Query Based on Subgraph Matching

Intelligent and adaptive web data extraction system using convolutional and long short-term memory deep learning networks

Automatic Symptom Extraction from Unstructured Web Data for Designing Healthcare Systems

SISTEM INFORMASI GEOGRAFIS COFFEE SHOP DI KOTA SAMARINDA BERBASIS WEB

Exploiting Heterogenous Web Data – A Systematic Approach on the Example of Nintendo Switch Games

Assessing internet and web services based webdom and virtual web-data-centric geographical study

Trends in web data extraction using machine learning

Web data mining1

Application programming interfaces and web data for social research

Implementation of Web Data Mining Technology Based on Python

web data
Recently Published Documents