PPHA-Popularity Prediction Based High Data Availability for Multimedia Data Center

Author(s):  
Kuo-Chi Fang ◽  
Husnu S. Narman ◽  
Ibrahim Hussein Mwinyi ◽  
Wook-Sung Yoo

Due to the growth of internet-connected devices and extensive data analysis applications in recent years, cloud computing systems are largely utilized. Because of high utilization of cloud storage systems, the demand for data center management has been increased. There are several crucial requirements of data center management, such as increase data availability, enhance durability, and decrease latency. In previous works, a replication technique is mostly used to answer those needs according to consistency requirements. However, most of the works consider full data, popular data, and geo-distance-based replications by considering storage and replication cost. Moreover, the previous data popularity based-techniques rely on the historical and current data access frequencies for replication. In this article, the authors approach this problem from a distinct aspect while developing replication techniques for a multimedia data center management system which can dynamically adapt servers of a data center by considering popularity prediction in each data access location. Therefore, they first label data objects from one to ten to track access frequencies of data objects. Then, they use those data access frequencies from each location to predict the future access frequencies of data objects to determine the replication levels and locations to replicate the data objects, and store the related data objects to close storage servers. To show the efficiency of the proposed methods, the authors conduct an extensive simulation by using real data. The results show that the proposed method has an advantage over the previous works in terms of data availability and increases the data availability up to 50%. The proposed method and related analysis can assist multimedia service providers to enhance their service qualities.

Author(s):  
Sarah Kesselring ◽  
Jessica Galo ◽  
Kelly Sanderson ◽  
Shirley Wong ◽  
Maureen Neuman ◽  
...  

IntroductionBC’s Ministry of Health (MOH) maintains many administrative databases with rich information and analytical potential. Researchers are keen to use these data for both discovery and applied research. Historically, limited views of data availability and populations therein have supported study feasibility. Therefore, we developed BC Data ScoutTM, a cohort browser. Objectives and ApproachWe developed a cohort browser service to provide information to researchers planning a study using MOH data. The objective was to create a tool that is simple to use, provides quick results and is free to users to encourage its use. A better understanding of the data available can improve study quality and expand the user-base by giving researchers access to information not previously available during the planning stages. The tool will be evaluated by examining the number of requests received and a user satisfaction survey. Plans are in place to expand into additional data sources and extend query sophistication. ResultsThe BC Data ScoutTM online tool provides cohort information in the form of highly aggregated, approximate results to researchers planning a study. It was developed by the MOH, the BC SUPPORT Unit and Population Data BC (PopData) and was launched in February 2018. The service is delivered by PopData. BC Data ScoutTM offers province-wide information for query, is accessible to a wide group of eligible researchers, and has data availability from the year 2000 onwards. Four types of MOH data are available for query: hospital data; physician data; pharmaceutical data; and demographics. In addition to determining study feasibility, the aggregate reports also help to further refine a full data access request and provide enough information to complete and strengthen a funding application. Conclusion/ImplicationsBC Data ScoutTM will be beneficial for researchers planning to request data. This preliminary information may increase the chances of meaningful research studies to obtain funding, and the production of relevant, high-quality research results. BC will be among the first jurisdictions across Canada to offer this type of feasibility service.


2020 ◽  
Vol 2020 ◽  
pp. 1-12
Author(s):  
Pengwei Wang ◽  
Caihui Zhao ◽  
Yi Wei ◽  
Dong Wang ◽  
Zhaohui Zhang

Cloud service providers (CSPs) can offer infinite storage space with cheaper maintenance cost compared to the traditional storage mode. Users tend to store their data in geographical and diverse CSPs so as to avoid vendor lock-in. Static data placement has been widely studied in recent works. However, the data access pattern is often time-varying and users may pay more cost if static placement is adopted during the data lifetime. Therefore, it is a pending problem and challenge of how to dynamically store users’ data under time-varying data access pattern. To this end, we propose ADPA, an adaptive data placement architecture that can adjust the data placement scheme based on the time-varying data access pattern and subject for minimizing the total cost and maximizing the data availability. The proposed architecture includes two main components: data retrieval frequency prediction module based on LSTM and data placement optimization module based on Q-learning. The performance of ADPA is evaluated through several experimental scenarios using NASA-HTTP workload and cloud providers information.


2021 ◽  
Vol 17 (1) ◽  
Author(s):  
Anne Meyer ◽  
Céline Faverjon ◽  
Miel Hostens ◽  
Arjan Stegeman ◽  
Angus Cameron

Abstract Background The FAIR (Findable, Accessible, Interoperable, Reusable) principles were proposed in 2016 to set a path towards reusability of research datasets. In this systematic review, we assessed the FAIRness of datasets associated with peer-reviewed articles in veterinary epidemiology research published since 2017, specifically looking at salmonids and dairy cattle. We considered the differences in practices between molecular epidemiology, the branch of epidemiology using genetic sequences of pathogens and hosts to describe disease patterns, and non-molecular epidemiology. Results A total of 152 articles were included in the assessment. Consistent with previous assessments conducted in other disciplines, our results showed that most datasets used in non-molecular epidemiological studies were not available (i.e., neither findable nor accessible). Data availability was much higher for molecular epidemiology papers, in line with a strong repository base available to scientists in this discipline. The available data objects generally scored favourably for Findable, Accessible and Reusable indicators, but Interoperability was more problematic. Conclusions None of the datasets assessed in this study met all the requirements set by the FAIR principles. Interoperability, in particular, requires specific skills in data management which may not yet be broadly available in the epidemiology community. In the discussion, we present recommendations on how veterinary research could move towards greater reusability according to FAIR principles. Overall, although many initiatives to improve data access have been started in the research community, their impact on the availability of datasets underlying published articles remains unclear to date.


2021 ◽  
Vol 13 (22) ◽  
pp. 12654
Author(s):  
Asha-Dee N. Celestine ◽  
Martin Sulic ◽  
Marika Wieliczko ◽  
Ned T. Stetson

Global demand for data and data access has spurred the rapid growth of the data center industry. To meet demands, data centers must provide uninterrupted service even during the loss of primary power. Service providers seeking ways to eliminate their carbon footprint are increasingly looking to clean and sustainable energy solutions, such as hydrogen technologies, as alternatives to traditional backup generators. In this viewpoint, a survey of the current state of data centers and hydrogen-based technologies is provided along with a discussion of the hydrogen storage and infrastructure requirements needed for large-scale backup power applications at data centers.


2020 ◽  
Vol 63 (8) ◽  
pp. 1216-1230 ◽  
Author(s):  
Wei Guo ◽  
Sujuan Qin ◽  
Jun Lu ◽  
Fei Gao ◽  
Zhengping Jin ◽  
...  

Abstract For a high level of data availability and reliability, a common strategy for cloud service providers is to rely on replication, i.e. storing several replicas onto different servers. To provide cloud users with a strong guarantee that all replicas required by them are actually stored, many multi-replica integrity auditing schemes were proposed. However, most existing solutions are not resource economical since users need to create and upload replicas of their files by themselves. A multi-replica solution called Mirror is presented to overcome the problems, but we find that it is vulnerable to storage saving attack, by which a dishonest provider can considerably save storage costs compared to the costs of storing all the replicas honestly—while still can pass any challenge successfully. In addition, we also find that Mirror is easily subject to substitution attack and forgery attack, which pose new security risks for cloud users. To address the problems, we propose some simple yet effective countermeasures and an improved proofs of retrievability and replication scheme, which can resist the aforesaid attacks and maintain the advantages of Mirror, such as economical bandwidth and efficient verification. Experimental results show that our scheme exhibits comparable performance with Mirror while achieving high security.


2021 ◽  
Author(s):  
Ramon Padullés ◽  
Estel Cardellach ◽  
F. Joseph Turk ◽  
Chi O. Ao ◽  
Kuo Nung Wang ◽  
...  

<p><span>The Radio Occultation and Heavy Precipitation (ROHP) experiment aboard the Spanish PAZ satellite was activated in May 2018 with the objective to </span><span>demonstrate</span><span> the Polarimetric Radio Occultation (PRO) concept for rain detection. This technique enhances standard RO by </span><span>measuring</span><span> GNSS signals at two orthogonal linear polarizations (H and V). Owing to hydrometeor asymmetry, electromagnetic signals propagating through regions of heavy precipitation would experience a differential phase delay expected to be measurable by the ROHP experiment. </span></p><p><span>After 2+ years of operations, the initial hypothesis has been </span><span>verified</span><span> and the main scientific goals have been achieved. Soon after the activation of the experiment it became clear that PRO observables were sensitive to heavy precipitation, showing positive signatures correlated with the presence and intensity of precipitation. After a thorough on-orbit calibration, it has been demonstrated that </span><span>the </span><span>PAZ </span><span>polarimetric </span><span>observable can be used as a proxy for heavy precipitation. Furthermore, PRO </span><span>measurements were</span><span> shown </span><span>to be</span><span> sensitive to the horizontally oriented frozen hydrometeors present throughout the vertical cloud extent, providing </span><span>valuable </span><span>information on the vertical structure of precipitating clouds. </span></p><p><span>In addition, PRO can retrieve standard thermodynamic RO products such as temperature, pressure, and water vapor. These products, provided with high vertical resolution, globally distributed and seamlessly over ocean and over land, make PRO observations a unique dataset, with potential applications ranging from the study of deep convection processes to the evaluation and diagnosis of NWP forecast models. </span></p><p><span>In this presentation we will report on the status of the experiment and current data availability. We will also show the results of the sensitivity studies to heavy precipitation and frozen particles, performed using collocated observations between PAZ and GPM-DPR, GPM-GMI, and other radiometers from the GPM constellation, as well as a-priory information from the Cloudsat radar. Finally, we will address potential level-2 products we can expect from PAZ observations.</span></p>


2020 ◽  
Vol 6 (1) ◽  
pp. 103-110
Author(s):  
Sidik Sidik ◽  
Ade Sudaryana ◽  
Rame Santoso

Computer networks have become an important point in companies that have many branch offices to coordinate the transfer of data. PT Indo Matra Lestari's connection uses a VPN system using the PPTP method. Data Center is used as a VPN server, the client is the Head Office and Citereup Branch Offices. Between the Head Office and the Citereup Branch Office there is no direct connection so access to data made between the Head Office and the Citereup Branch Office is slow, because the data must pass through the Data Center before reaching its destination. Moreover, the data accessed is private to the company and only accessed on the local network. The solution used to create a direct and secure network path between the Head Office and Branch Offices is to use the EoIP Tunnel on the proxy router. Tunneling method in EoIP can make network bridging between proxy devices, EoIP Tunnel will change to Virtual Interface on the proxy router so that it is as if the proxy router is connected locally. Tunnel ID on the EoIP Tunnel functions as a tunneling path security. The application of the EoIP Tunnel makes the point to point connection point between Mikrotik devices faster in data access because the data access is directed to the destination. In order for this EoIP Tunnel connection to run optimally and well, a network management is needed in managing internet bandwidth usage


2021 ◽  
Author(s):  
Hoda R.K. Nejad

With the emergence of wireless devices, service delivery for ad-hoc networks has started to attract a lot of attention recently. Ad-hoc networks provide an attractive solution for networking in the situations where network infrastructure or service subscription is not available. We believe that overlay networks, particularly peer-to-peer (P2P) systems, is a good abstraction for application design and deployment over ad-hoc networks. The principal benefit of this approach is that application states are only maintained by the nodes involved in the application execution and all other nodes only perform networking related functions. On the other hand, data access applications in Ad-hoc networks suffer from restricted resources. In this thesis, we explore how to use Cooperative Caching to improve data access efficiency in Ad-hoc network. We propose a Resource-Aware Cooperative Caching P2P system (RACC) for data access applications in Ad-hoc networks. The objective is to improve data availability by considering energy of each node, demand and supply of network. We evaluated and compared the performance of RACC with Simple Cache, CachePath and CacheData schemes. Our simulation results show that RACC improves the lay of query as well as energy usage of the network as compared to Simple Cache, CachePath and CacheData.


Sign in / Sign up

Export Citation Format

Share Document