large data set
Recently Published Documents


TOTAL DOCUMENTS

187
(FIVE YEARS 55)

H-INDEX

24
(FIVE YEARS 3)

2021 ◽  
Vol 13 (1) ◽  
pp. 90-96
Author(s):  
F.B. Abdullahi ◽  
F. Coenen
Keyword(s):  
Data Set ◽  

No Abstract


2021 ◽  
Vol 13 (12) ◽  
pp. 303
Author(s):  
Xiaoliang Wang ◽  
Peiquan Jin

The traditional page-grained buffer manager in database systems has a low hit ratio when only a few tuples within a page are frequently accessed. To handle this issue, this paper proposes a new buffering scheme called the AMG-Buffer (Adaptive Multi-Grained Buffer). The AMG-Buffer organizes the whole buffer into two page buffers and a tuple buffer. In this way, it can hold more hot tuples than a single page-grained buffer. Further, we notice that the tuple buffer may cause additional read I/Os when writing dirty tuples to disk. Thus, we introduce a new metric named clustering rate to quantify the hot-tuple rate in a page. The use of the tuple buffer is determined by the clustering rate, allowing the AMG-Buffer to adapt to different workloads. We conduct experiments on various workloads to compare the AMG-Buffer with several existing schemes, including LRU, LIRS, CFLRU, CFDC, and MG-Buffer. The results show that the AMG-Buffer can significantly improve the hit ratio and reduce I/Os compared to its competitors. Moreover, the AMG-Buffer achieves the best performance on a dynamic workload as well as on a large data set, suggesting its adaptivity to changing workloads and its scalability.


2021 ◽  
Vol 30 (6) ◽  
pp. 1131-1140
Author(s):  
LIU Chuanlu ◽  
WANG Shuliang ◽  
YUAN Hanning ◽  
GENG Jing

2021 ◽  
pp. 0308518X2110478
Author(s):  
Rachel G McKane ◽  
David J Hess

Ridesourcing advocates and companies promise many benefits to cities, such as increased accessibility, a solution to the last-mile transit problem, and even reduced need for automobiles. However, an important body of research has indicated that ridesourcing is more heavily used by more privileged consumers and in more affluent and whiter neighborhoods. Questions have also emerged about the effects of ridesourcing on public transportation. This study builds on a mobility disparities perspective by analyzing ridesourcing in the context of urban inequality, including gentrification and displacement. Using a large data set from the Chicago area, this study shows that ridesourcing is associated with areas that have seen rising rents and have become whiter and more educated. The results also show that ridesourcing is more prevalent in areas that are accessible by public transportation. Although the causal relationship between ridesourcing and gentrification is complex, the study suggests a new direction in the literature that embeds the analysis of ridesourcing in the broader frameworks of unequal urban development and neoliberalization. The study also suggests policy approaches that could help to reduce some of the connections between ridesourcing and urban inequity.


2021 ◽  
Vol ahead-of-print (ahead-of-print) ◽  
Author(s):  
Vyacheslav I. Zavalin ◽  
Shawne D. Miksa

Purpose: This paper aims to discuss the challenges encountered in collecting, cleaning and analyzing a large data set of bibliographic metadata records in machine-readable cataloging (MARC 21) format. Possible solutions are presented. Design/methodology/approach: This mixed-method study relied on content analysis and social network analysis. The study examined subject representation in MARC 21 metadata records created in 2020 in WorldCat – the largest international database of “big smart data.” The methodological challenges that were encountered and their solutions are examined. Findings: In this general review paper with a focus on methodological issues, the discussion of challenges is followed by a discussion of solutions developed and tested as part of this study. Data collection, processing, analysis and visualization are addressed separately. Lessons learned and conclusions related to challenges and solutions for the design of a large-scale study evaluating MARC 21 bibliographic metadata from WorldCat are given. Overall recommendations for the design and implementation of future research are suggested. Originality/value: There are no previous publications that address the challenges and solutions of data collection and analysis of WorldCat’s “big smart data” in the form of MARC 21 data. This is the first study to use a large data set to systematically examine MARC 21 library metadata records created after the most recent addition of new fields and subfields to the MARC 21 Bibliographic Format standard in 2019, based on Resource Description and Access rules. It is also the first to focus its analyses on the networks formed by subject terms shared by MARC 21 bibliographic records in a data set extracted from the heterogeneous centralized database WorldCat.


2021 ◽  
pp. 102586
Author(s):  
Chuanjun Du ◽  
Ruoying He ◽  
Zhiyu Liu ◽  
Tao Huang ◽  
Lifang Wang ◽  
...  

2021 ◽  
Vol 57 (4) ◽  
Author(s):  
M. Gómez-Ramos ◽  
A. Obertelli ◽  
Y. L. Sun

We review the ambiguities in the nuclear information extracted from breakup reactions, focusing on those originating from the description of the reaction mechanism and the overall ambiguity inherent to their interpretation in terms of shell occupancies. We present the current discussion about nucleon knockout reactions and how the understanding of the reaction mechanism would help reduce uncertainties. For the former, we consider the case of $^{11}$Li, due to the existing large data set. For the latter, we recall the paradigmatic example of the electro-dissociation of the deuteron to address the question of the scale and scheme dependence arising from the theoretical framework used for the interpretation.

