scholarly journals Software-defined data storage systems

2021 ◽  
Vol 1 ◽  
pp. 17-23
Author(s):  
Yu.M. Lysetskyi ◽  
◽  
S.V. Kozachenko ◽  

Every year the amount of generated data grows exponentially which entails an increase in both the number and capacity of data storage systems. The highest capacity is required for data storage systems that are used to store backups and archives, file storages with shared access, testing and development environments, virtual machine storages, corporate or public web services. To solve such tasks, nowadays manufacturers offer three types of storage systems: block and file storages which have already become a standard used for implementing IT infrastructures, and software-defined storage systems. They allow to create data storages on non-specialized equipment, such as a group of x86-64 server nodes managed by general-purpose operating systems. The main feature of software-defined data storages is the transfer of storage functions from the hardware level to the software level where these storage functions are defined not by physical features of the hardware but by the software selected for specific tasks solving. Today there are three main singled out technologies characterized by scalable architecture that allow to in-crease efficiency and storage volume through adding new nodes to a single pool: Ceph, DELL EMC VxFlex OS, HP StoreVirtual VSA. Software-defined data storages have the following advantages: fault tolerance, efficiency, flexibility and economy. Utilization of software-defined storages allows to increase efficiency of IT infrastructure and reduce its maintenance costs; to build a hybrid infrastructure that would allow to use internal and external cloud resources; to increase efficiency of both services and us-ers by providing reliable connection by using the most convenient devices; to build a portal as a single point of services and resources control.

2020 ◽  
pp. 082-093
Author(s):  
S.Yu. Punda ◽  
◽  

A review of modern data storage architectures was conducted, the advantages and disadvantages of each of them were given. The data storage systems of the IBM FlashSystem family were analyzed, as well as Spectrum Virtualize software, which is responsible for virtualization, compression, distribution and replication of data stored on the storage system. A mathematical model of the data storage system of IBM Storwize v5030E was developed. Well-known metrics are used to evaluate its performance when using spindle and solid-state drives. The effect of hardware and software data compression on system performance has been experimentally revealed. Recommendations are formulated by which it is possible to determine which media and which technology stack should be used by a business user to complete the tasks assigned to him.


Currently, resources in data centers are used extremely inefficiently. Storage systems are loaded on average about 25%, and servers and network resources - up to 30%. After implementing virtualization, the resource load level in a well-managed server environment increases by 30% to 90%. Virtualization undoubtedly provides many advantages in an infrastructure. One of the most important is the ability to easily create and manage backups of virtual machines, as well as quick recovery if necessary after disasters or accidents. Recovery time is many times faster than when applications and the operating system are hosted on a real server, while the loss of information with proper management is from zero to minimal. The available weekly and daily backups in Proxmox VE are not always flexible enough to properly organize backups in an IT infrastructure. In most companies and organizations there are virtual and real servers that play a significant role, but the data in them, as well as operating systems change very rarely. With existing methods, weekly backups need to be set up to ensure the reliability of the data and to recover quickly in the event of a disaster or accident. The paper aims to research and propose approaches which can extend the bult-in backup process by adding monthly backups for Proxmox VE. The research discusses the optimization of the process of creating backups to reduce network traffic between nodes and storage, as well as optimizing stored storage data.


2020 ◽  
Vol 245 ◽  
pp. 04038 ◽  
Author(s):  
Luca Mascetti ◽  
Maria Arsuaga Rios ◽  
Enrico Bocchi ◽  
Joao Calado Vicente ◽  
Belinda Chan Kwok Cheong ◽  
...  

The CERN IT Storage group operates multiple distributed storage systems to support all CERN data storage requirements: the physics data generated by LHC and non-LHC experiments; object and file storage for infrastructure services; block storage for the CERN cloud system; filesystems for general use and specialized HPC clusters; content distribution filesystem for software distribution and condition databases; and sync&share cloud storage for end-user files. The total integrated capacity of these systems exceeds 0.6 Exabyte. Large-scale experiment data taking has been supported by EOS and CASTOR for the last 10+ years. Particular highlights for 2018 include the special HeavyIon run which was the last part of the LHC Run2 Programme: the IT storage systems sustained over 10GB/s to flawlessly collect and archive more than 13 PB of data in a single month. While the tape archival continues to be handled by CASTOR, the effort to migrate the current experiment workflows to the new CERN Tape Archive system (CTA) is underway. Ceph infrastructure has operated for more than 5 years to provide block storage to CERN IT private OpenStack cloud, a shared filesystem (CephFS) to HPC clusters and NFS storage to replace commercial Filers. S3 service was introduced in 2018, following increased user requirements for S3-compatible object storage from physics experiments and IT use-cases. Since its introduction in 2014N, CERNBox has become a ubiquitous cloud storage interface for all CERN user groups: physicists, engineers and administration. CERNBox provides easy access to multi-petabyte data stores from a multitude of mobile and desktop devices and all mainstream, modern operating systems (Linux, Windows, macOS, Android, iOS). CERNBox provides synchronized storage for end-user’s devices as well as easy sharing for individual users and e-groups. CERNBox has also become a storage platform to host online applications to process the data such as SWAN (Service for Web-based Analysis) as well as file editors such as Collabora Online, Only Office, Draw.IO and more. An increasing number of online applications in the Windows infrastructure uses CIFS/SMB access to CERNBox files. CVMFS provides software repositories for all experiments across the WLCG infrastructure and has recently been optimized to efficiently handle nightlybuilds. While AFS continues to provide general-purpose filesystem for internal CERN users, especially as $HOME login area on central computing infrastructure, the migration of project and web spaces has significantly advanced. In this paper, we report on the experiences from the last year of LHC RUN2 data taking and evolution of our services in the past year.. We will highlight upcoming changes and future improvements and challenges.


Author(s):  
T. A. Dodson ◽  
E. Völkl ◽  
L. F. Allard ◽  
T. A. Nolan

The process of moving to a fully digital microscopy laboratory requires changes in instrumentation, computing hardware, computing software, data storage systems, and data networks, as well as in the operating procedures of each facility. Moving from analog to digital systems in the microscopy laboratory is similar to the instrumentation projects being undertaken in many scientific labs. A central problem of any of these projects is to create the best combination of hardware and software to effectively control the parameters of data collection and then to actually acquire data from the instrument. This problem is particularly acute for the microscopist who wishes to "digitize" the operation of a transmission or scanning electron microscope. Although the basic physics of each type of instrument and the type of data (images & spectra) generated by each are very similar, each manufacturer approaches automation differently. The communications interfaces vary as well as the command language used to control the instrument.


Author(s):  
D. V. Gribanov

Introduction. This article is devoted to legal regulation of digital assets turnover, utilization possibilities of distributed computing and distributed data storage systems in activities of public authorities and entities of public control. The author notes that some national and foreign scientists who study a “blockchain” technology (distributed computing and distributed data storage systems) emphasize its usefulness in different activities. Data validation procedure of digital transactions, legal regulation of creation, issuance and turnover of digital assets need further attention.Materials and methods. The research is based on common scientific (analysis, analogy, comparing) and particular methods of cognition of legal phenomena and processes (a method of interpretation of legal rules, a technical legal method, a formal legal method and a formal logical one).Results of the study. The author conducted an analysis which resulted in finding some advantages of the use of the “blockchain” technology in the sphere of public control which are as follows: a particular validation system; data that once were entered in the system of distributed data storage cannot be erased or forged; absolute transparency of succession of actions while exercising governing powers; automatic repeat of recurring actions. The need of fivefold validation of exercising governing powers is substantiated. The author stresses that the fivefold validation shall ensure complex control over exercising of powers by the civil society, the entities of public control and the Russian Federation as a federal state holding sovereignty over its territory. The author has also conducted a brief analysis of judicial decisions concerning digital transactions.Discussion and conclusion. The use of the distributed data storage system makes it easier to exercise control due to the decrease of risks of forge, replacement or termination of data. The author suggests defining digital transaction not only as some actions with digital assets, but also as actions toward modification and addition of information about legal facts with a purpose of its establishment in the systems of distributed data storage. The author suggests using the systems of distributed data storage for independent validation of information about activities of the bodies of state authority. In the author’s opinion, application of the “blockchain” technology may result not only in the increase of efficiency of public control, but also in the creation of a new form of public control – automatic control. It is concluded there is no legislation basis for regulation of legal relations concerning distributed data storage today.


2020 ◽  
pp. 102-109
Author(s):  
D.KH. DOMULLODZHANOV ◽  
◽  
R. RAHMATILLOEV

The article presents the results of the field studies and observations that carried out on the territory of the hilly, low-mountain and foothill agro landscapes of the Kyzylsu-yuzhnaya (Kyzylsu-Southern) River Basin of Tajikistan. Taking into account the high-altitude location of households and the amount of precipitation in the river basin, the annual volumes of water accumulated with the use of low-cost systems of collection and storage of precipitation have been clarified. The amount of water accumulated in the precipitation collection and storage systems has been established, the volume of water used for communal and domestic needs,the watering of livestock and the amount of water that can be used to irrigate crops in the have been determined. Possible areas of irrigation of household plots depending on the different availability of precipitation have been determined. It has been established that in wet years (with precipitation of about 10%) the amount of water collected using drip irrigation will be sufficient for irrigation of 0.13 hectares, and in dry years (with 90% of precipitation) it will be possible to irrigate only 0.03 ha of the household plot. On the basis of the basin, the total area of irrigation in wet years can be 4497 ha, and in dry years only 1087 ha. Taking into account the forecasts of population growth by 2030 and an increase in the number of households, the total area of irrigation of farmlands in wet years may reach 5703 hectares,and in dry years – 1379 hectares. Growing crops on household plots under irrigation contributes to a significant increase in land productivity and increases the efficiency of water use of the Kyzylsu-yuzhnaya basin.


Sign in / Sign up

Export Citation Format

Share Document