PPHA-Popularity Prediction Based High Data Availability for Multimedia Data Center

Kuo-Chi Fang; Husnu S. Narman; Ibrahim Hussein Mwinyi; Wook-Sung Yoo

doi:10.4018/ijitn.2019010102

PPHA-Popularity Prediction Based High Data Availability for Multimedia Data Center

International Journal of Interdisciplinary Telecommunications and Networking ◽

10.4018/ijitn.2019010102 ◽

2019 ◽

Vol 11 (1) ◽

pp. 17-29

Author(s):

Kuo-Chi Fang ◽

Husnu S. Narman ◽

Ibrahim Hussein Mwinyi ◽

Wook-Sung Yoo

Keyword(s):

Data Center ◽

Service Providers ◽

Data Access ◽

Current Data ◽

Multimedia Data ◽

Data Availability ◽

Full Data ◽

Data Center Management ◽

Popularity Prediction ◽

Data Objects

Due to the growth of internet-connected devices and extensive data analysis applications in recent years, cloud computing systems are largely utilized. Because of high utilization of cloud storage systems, the demand for data center management has been increased. There are several crucial requirements of data center management, such as increase data availability, enhance durability, and decrease latency. In previous works, a replication technique is mostly used to answer those needs according to consistency requirements. However, most of the works consider full data, popular data, and geo-distance-based replications by considering storage and replication cost. Moreover, the previous data popularity based-techniques rely on the historical and current data access frequencies for replication. In this article, the authors approach this problem from a distinct aspect while developing replication techniques for a multimedia data center management system which can dynamically adapt servers of a data center by considering popularity prediction in each data access location. Therefore, they first label data objects from one to ten to track access frequencies of data objects. Then, they use those data access frequencies from each location to predict the future access frequencies of data objects to determine the replication levels and locations to replicate the data objects, and store the related data objects to close storage servers. To show the efficiency of the proposed methods, the authors conduct an extensive simulation by using real data. The results show that the proposed method has an advantage over the previous works in terms of data availability and increases the data availability up to 50%. The proposed method and related analysis can assist multimedia service providers to enhance their service qualities.

Download Full-text

BC Data ScoutTM: A New Tool To Investigate Datasets For Health Research

International Journal for Population Data Science ◽

10.23889/ijpds.v3i4.948 ◽

2018 ◽

Vol 3 (4) ◽

Author(s):

Sarah Kesselring ◽

Jessica Galo ◽

Kelly Sanderson ◽

Shirley Wong ◽

Maureen Neuman ◽

...

Keyword(s):

User Satisfaction ◽

Population Data ◽

Data Access ◽

Data Availability ◽

Administrative Databases ◽

Hospital Data ◽

Full Data ◽

High Quality Research ◽

Support Unit ◽

Access Request

IntroductionBC’s Ministry of Health (MOH) maintains many administrative databases with rich information and analytical potential. Researchers are keen to use these data for both discovery and applied research. Historically, limited views of data availability and populations therein have supported study feasibility. Therefore, we developed BC Data ScoutTM, a cohort browser. Objectives and ApproachWe developed a cohort browser service to provide information to researchers planning a study using MOH data. The objective was to create a tool that is simple to use, provides quick results and is free to users to encourage its use. A better understanding of the data available can improve study quality and expand the user-base by giving researchers access to information not previously available during the planning stages. The tool will be evaluated by examining the number of requests received and a user satisfaction survey. Plans are in place to expand into additional data sources and extend query sophistication. ResultsThe BC Data ScoutTM online tool provides cohort information in the form of highly aggregated, approximate results to researchers planning a study. It was developed by the MOH, the BC SUPPORT Unit and Population Data BC (PopData) and was launched in February 2018. The service is delivered by PopData. BC Data ScoutTM offers province-wide information for query, is accessible to a wide group of eligible researchers, and has data availability from the year 2000 onwards. Four types of MOH data are available for query: hospital data; physician data; pharmaceutical data; and demographics. In addition to determining study feasibility, the aggregate reports also help to further refine a full data access request and provide enough information to complete and strengthen a funding application. Conclusion/ImplicationsBC Data ScoutTM will be beneficial for researchers planning to request data. This preliminary information may increase the chances of meaningful research studies to obtain funding, and the production of relevant, high-quality research results. BC will be among the first jurisdictions across Canada to offer this type of feasibility service.

Download Full-text

An Adaptive Data Placement Architecture in Multicloud Environments

Scientific Programming ◽

10.1155/2020/1704258 ◽

2020 ◽

Vol 2020 ◽

pp. 1-12

Author(s):

Pengwei Wang ◽

Caihui Zhao ◽

Yi Wei ◽

Dong Wang ◽

Zhaohui Zhang

Keyword(s):

Service Providers ◽

Cloud Service ◽

Data Access ◽

Data Retrieval ◽

Data Placement ◽

Data Availability ◽

Maintenance Cost ◽

Time Varying ◽

Access Pattern ◽

Data Lifetime

Cloud service providers (CSPs) can offer infinite storage space with cheaper maintenance cost compared to the traditional storage mode. Users tend to store their data in geographical and diverse CSPs so as to avoid vendor lock-in. Static data placement has been widely studied in recent works. However, the data access pattern is often time-varying and users may pay more cost if static placement is adopted during the data lifetime. Therefore, it is a pending problem and challenge of how to dynamically store users’ data under time-varying data access pattern. To this end, we propose ADPA, an adaptive data placement architecture that can adjust the data placement scheme based on the time-varying data access pattern and subject for minimizing the total cost and maximizing the data availability. The proposed architecture includes two main components: data retrieval frequency prediction module based on LSTM and data placement optimization module based on Q-learning. The performance of ADPA is evaluated through several experimental scenarios using NASA-HTTP workload and cloud providers information.

Download Full-text

Systematic review of the status of veterinary epidemiological research in two species regarding the FAIR guiding principles

BMC Veterinary Research ◽

10.1186/s12917-021-02971-1 ◽

2021 ◽

Vol 17 (1) ◽

Author(s):

Anne Meyer ◽

Céline Faverjon ◽

Miel Hostens ◽

Arjan Stegeman ◽

Angus Cameron

Keyword(s):

Systematic Review ◽

Molecular Epidemiology ◽

Data Access ◽

Epidemiological Studies ◽

Data Availability ◽

Disease Patterns ◽

The Status ◽

Fair Principles ◽

Data Objects ◽

Veterinary Epidemiology

Abstract Background The FAIR (Findable, Accessible, Interoperable, Reusable) principles were proposed in 2016 to set a path towards reusability of research datasets. In this systematic review, we assessed the FAIRness of datasets associated with peer-reviewed articles in veterinary epidemiology research published since 2017, specifically looking at salmonids and dairy cattle. We considered the differences in practices between molecular epidemiology, the branch of epidemiology using genetic sequences of pathogens and hosts to describe disease patterns, and non-molecular epidemiology. Results A total of 152 articles were included in the assessment. Consistent with previous assessments conducted in other disciplines, our results showed that most datasets used in non-molecular epidemiological studies were not available (i.e., neither findable nor accessible). Data availability was much higher for molecular epidemiology papers, in line with a strong repository base available to scientists in this discipline. The available data objects generally scored favourably for Findable, Accessible and Reusable indicators, but Interoperability was more problematic. Conclusions None of the datasets assessed in this study met all the requirements set by the FAIR principles. Interoperability, in particular, requires specific skills in data management which may not yet be broadly available in the epidemiology community. In the discussion, we present recommendations on how veterinary research could move towards greater reusability according to FAIR principles. Overall, although many initiatives to improve data access have been started in the research community, their impact on the availability of datasets underlying published articles remains unclear to date.

Download Full-text

Hydrogen-Based Energy Storage Systems for Large-Scale Data Center Applications

Sustainability ◽

10.3390/su132212654 ◽

2021 ◽

Vol 13 (22) ◽

pp. 12654

Author(s):

Asha-Dee N. Celestine ◽

Martin Sulic ◽

Marika Wieliczko ◽

Ned T. Stetson

Keyword(s):

Data Center ◽

Large Scale ◽

Data Centers ◽

Service Providers ◽

Data Access ◽

Large Scale Data ◽

Current State ◽

Global Demand ◽

Primary Power ◽

Hydrogen Technologies

Global demand for data and data access has spurred the rapid growth of the data center industry. To meet demands, data centers must provide uninterrupted service even during the loss of primary power. Service providers seeking ways to eliminate their carbon footprint are increasingly looking to clean and sustainable energy solutions, such as hydrogen technologies, as alternatives to traditional backup generators. In this viewpoint, a survey of the current state of data centers and hydrogen-based technologies is provided along with a discussion of the hydrogen storage and infrastructure requirements needed for large-scale backup power applications at data centers.

Download Full-text

Improved Proofs Of Retrievability And Replication For Data Availability In Cloud Storage

The Computer Journal ◽

10.1093/comjnl/bxz151 ◽

2020 ◽

Vol 63 (8) ◽

pp. 1216-1230 ◽

Cited By ~ 2

Author(s):

Wei Guo ◽

Sujuan Qin ◽

Jun Lu ◽

Fei Gao ◽

Zhengping Jin ◽

...

Keyword(s):

Service Providers ◽

Cloud Service ◽

Data Availability ◽

Forgery Attack ◽

Common Strategy ◽

Comparable Performance ◽

Replication Scheme ◽

High Level ◽

Cloud Users ◽

Proofs Of Retrievability

Abstract For a high level of data availability and reliability, a common strategy for cloud service providers is to rely on replication, i.e. storing several replicas onto different servers. To provide cloud users with a strong guarantee that all replicas required by them are actually stored, many multi-replica integrity auditing schemes were proposed. However, most existing solutions are not resource economical since users need to create and upload replicas of their files by themselves. A multi-replica solution called Mirror is presented to overcome the problems, but we find that it is vulnerable to storage saving attack, by which a dishonest provider can considerably save storage costs compared to the costs of storing all the replicas honestly—while still can pass any challenge successfully. In addition, we also find that Mirror is easily subject to substitution attack and forgery attack, which pose new security risks for cloud users. To address the problems, we propose some simple yet effective countermeasures and an improved proofs of retrievability and replication scheme, which can resist the aforesaid attacks and maintain the advantages of Mirror, such as economical bandwidth and efficient verification. Experimental results show that our scheme exhibits comparable performance with Mirror while achieving high security.

Download Full-text

Energy efficient data center management under availability constraints

2015 Annual IEEE Systems Conference (SysCon) Proceedings ◽

10.1109/syscon.2015.7116780 ◽

2015 ◽

Cited By ~ 6

Author(s):

Dinkar Sitaram ◽

H.L. Phalachandra ◽

Gautham S ◽

Swathi H V ◽

Sagar TP

Keyword(s):

Data Center ◽

Energy Efficient ◽

Availability Constraints ◽

Data Center Management ◽

Efficient Data

Download Full-text

Sounding Heavy Precipitating Vertical Cloud Structures with Polarimetric Radio Occultations aboard PAZ

10.5194/egusphere-egu21-9405 ◽

2021 ◽

Author(s):

Ramon Padullés ◽

Estel Cardellach ◽

F. Joseph Turk ◽

Chi O. Ao ◽

Kuo Nung Wang ◽

...

Keyword(s):

Radio Occultation ◽

Deep Convection ◽

Heavy Precipitation ◽

Current Data ◽

Data Availability ◽

The Status ◽

Potential Applications ◽

Initial Hypothesis ◽

Forecast Models ◽

Electromagnetic Signals

The Radio Occultation and Heavy Precipitation (ROHP) experiment aboard the Spanish PAZ satellite was activated in May 2018 with the objective to demonstrate the Polarimetric Radio Occultation (PRO) concept for rain detection. This technique enhances standard RO by measuring GNSS signals at two orthogonal linear polarizations (H and V). Owing to hydrometeor asymmetry, electromagnetic signals propagating through regions of heavy precipitation would experience a differential phase delay expected to be measurable by the ROHP experiment. After 2+ years of operations, the initial hypothesis has been verified and the main scientific goals have been achieved. Soon after the activation of the experiment it became clear that PRO observables were sensitive to heavy precipitation, showing positive signatures correlated with the presence and intensity of precipitation. After a thorough on-orbit calibration, it has been demonstrated that the PAZ polarimetric observable can be used as a proxy for heavy precipitation. Furthermore, PRO measurements were shown to be sensitive to the horizontally oriented frozen hydrometeors present throughout the vertical cloud extent, providing valuable information on the vertical structure of precipitating clouds. In addition, PRO can retrieve standard thermodynamic RO products such as temperature, pressure, and water vapor. These products, provided with high vertical resolution, globally distributed and seamlessly over ocean and over land, make PRO observations a unique dataset, with potential applications ranging from the study of deep convection processes to the evaluation and diagnosis of NWP forecast models. In this presentation we will report on the status of the experiment and current data availability. We will also show the results of the sensitivity studies to heavy precipitation and frozen particles, performed using collocated observations between PAZ and GPM-DPR, GPM-GMI, and other radiometers from the GPM constellation, as well as a-priory information from the Cloudsat radar. Finally, we will address potential level-2 products we can expect from PAZ observations.

Download Full-text

The Hierarchical Distributed Agent Based Approach to a Modern Data Center Management

ITM Web of Conferences ◽

10.1051/itmconf/20171004003 ◽

2017 ◽

Vol 10 ◽

pp. 04003

Author(s):

Andrey Gavrilov ◽

Yury Leokhin

Keyword(s):

Data Center ◽

Agent Based ◽

Data Center Management

Download Full-text

Implementasi Virtual Interface Menggunakan Metode EOIP Tunnel Pada Jaringan WAN PT. Indo Matra Lestari

Jurnal Teknik Komputer ◽

10.31294/jtk.v6i1.6830 ◽

2020 ◽

Vol 6 (1) ◽

pp. 103-110

Author(s):

Sidik Sidik ◽

Ade Sudaryana ◽

Rame Santoso

Keyword(s):

Data Center ◽

Data Access ◽

Local Network ◽

Virtual Interface ◽

Head Office ◽

Secure Network ◽

Pass Through ◽

Point To Point ◽

Access To Data ◽

Branch Office

Computer networks have become an important point in companies that have many branch offices to coordinate the transfer of data. PT Indo Matra Lestari's connection uses a VPN system using the PPTP method. Data Center is used as a VPN server, the client is the Head Office and Citereup Branch Offices. Between the Head Office and the Citereup Branch Office there is no direct connection so access to data made between the Head Office and the Citereup Branch Office is slow, because the data must pass through the Data Center before reaching its destination. Moreover, the data accessed is private to the company and only accessed on the local network. The solution used to create a direct and secure network path between the Head Office and Branch Offices is to use the EoIP Tunnel on the proxy router. Tunneling method in EoIP can make network bridging between proxy devices, EoIP Tunnel will change to Virtual Interface on the proxy router so that it is as if the proxy router is connected locally. Tunnel ID on the EoIP Tunnel functions as a tunneling path security. The application of the EoIP Tunnel makes the point to point connection point between Mikrotik devices faster in data access because the data access is directed to the destination. In order for this EoIP Tunnel connection to run optimally and well, a network management is needed in managing internet bandwidth usage

Download Full-text

Resource-Aware Cooperative Caching on Mobile Ad-hoc Peer to Peer Networks

10.32920/ryerson.14655603.v1 ◽

2021 ◽

Author(s):

Hoda R.K. Nejad

Keyword(s):

Ad Hoc Networks ◽

Overlay Networks ◽

Ad Hoc ◽

Data Access ◽

Peer To Peer ◽

Data Availability ◽

Cooperative Caching ◽

P2p Systems ◽

Hoc Networks ◽

Resource Aware

With the emergence of wireless devices, service delivery for ad-hoc networks has started to attract a lot of attention recently. Ad-hoc networks provide an attractive solution for networking in the situations where network infrastructure or service subscription is not available. We believe that overlay networks, particularly peer-to-peer (P2P) systems, is a good abstraction for application design and deployment over ad-hoc networks. The principal benefit of this approach is that application states are only maintained by the nodes involved in the application execution and all other nodes only perform networking related functions. On the other hand, data access applications in Ad-hoc networks suffer from restricted resources. In this thesis, we explore how to use Cooperative Caching to improve data access efficiency in Ad-hoc network. We propose a Resource-Aware Cooperative Caching P2P system (RACC) for data access applications in Ad-hoc networks. The objective is to improve data availability by considering energy of each node, demand and supply of network. We evaluated and compared the performance of RACC with Simple Cache, CachePath and CacheData schemes. Our simulation results show that RACC improves the lay of query as well as energy usage of the network as compared to Simple Cache, CachePath and CacheData.

Download Full-text