large data set Latest Research Papers

Finding banded patterns in large data set using segmentation

Bayero Journal of Pure and Applied Sciences ◽

10.4314/bajopas.v13i1.13 ◽

2021 ◽

Vol 13 (1) ◽

pp. 90-96

Author(s):

F.B. Abdullahi ◽

F. Coenen

Keyword(s):

Large Data ◽

Data Set ◽

Large Data Set

No Abstract

Adaptive Multi-Grained Buffer Management for Database Systems

Future Internet ◽

10.3390/fi13120303 ◽

2021 ◽

Vol 13 (12) ◽

pp. 303

Author(s):

Xiaoliang Wang ◽

Peiquan Jin

Keyword(s):

Buffer Management ◽

Large Data ◽

Database Systems ◽

Data Set ◽

Large Data Set

The traditional page-grained buffer manager in database systems has a low hit ratio when only a few tuples within a page are frequently accessed. To handle this issue, this paper proposes a new buffering scheme called the AMG-Buffer (Adaptive Multi-Grained Buffer). The AMG-Buffer organizes the whole buffer into two page buffers and a tuple buffer. In this way, it can hold more hot tuples than a single page-grained buffer. Further, we notice that the tuple buffer may cause additional read I/Os when writing dirty tuples to disk. Thus, we introduce a new metric named clustering rate to quantify the hot-tuple rate in a page. The use of the tuple buffer is determined by the clustering rate, allowing the AMG-Buffer to adapt to different workloads. We conduct experiments on various workloads to compare the AMG-Buffer with several existing schemes, including LRU, LIRS, CFLRU, CFDC, and MG-Buffer. The results show that the AMG-Buffer can significantly improve the hit ratio and reduce I/Os compared to its competitors. Moreover, the AMG-Buffer achieves the best performance on a dynamic workload as well as on a large data set, suggesting that it adapts to changing workloads and scales to large data.

Detecting Three‐Dimensional Associations in Large Data Set

Chinese Journal of Electronics ◽

10.1049/cje.2021.08.008 ◽

2021 ◽

Vol 30 (6) ◽

pp. 1131-1140

Author(s):

LIU Chuanlu ◽

WANG Shuliang ◽

YUAN Hanning ◽

GENG Jing

Keyword(s):

Three Dimensional ◽

Large Data ◽

Data Set ◽

Large Data Set

Ridesourcing and urban inequality in Chicago: Connecting mobility disparities to unequal development, gentrification, and displacement

Environment and Planning A Economy and Space ◽

10.1177/0308518x211047872 ◽

2021 ◽

pp. 0308518X2110478

Author(s):

Rachel G McKane ◽

David J Hess

Keyword(s):

Urban Development ◽

Causal Relationship ◽

Public Transportation ◽

Large Data ◽

Data Set ◽

Urban Inequality ◽

Chicago Area ◽

Large Data Set ◽

Last Mile

Ridesourcing advocates and companies promise many benefits to cities, such as increased accessibility, a solution to the last-mile transit problem, and even reduced need for automobiles. However, an important body of research has indicated that ridesourcing is more heavily used by more privileged consumers and in more affluent and whiter neighborhoods. Questions have also emerged about the effects of ridesourcing on public transportation. This study builds on a mobility disparities perspective by analyzing ridesourcing in the context of urban inequality, including gentrification and displacement. Using a large data set from the Chicago area, this study shows that ridesourcing is associated with areas that have seen rising rents and have become whiter and more educated. The results also show that ridesourcing is more prevalent in areas that are accessible by public transportation. Although the causal relationship between ridesourcing and gentrification is complex, the study suggests a new direction in the literature that embeds the analysis of ridesourcing in the broader frameworks of unequal urban development and neoliberalization. The study also suggests policy approaches that could help to reduce some of the connections between ridesourcing and urban inequity.

Collecting and evaluating large volumes of bibliographic metadata aggregated in the WorldCat database: a proposed methodology to overcome challenges

The Electronic Library ◽

10.1108/el-11-2020-0316 ◽

2021 ◽

Vol ahead-of-print (ahead-of-print) ◽

Author(s):

Vyacheslav I. Zavalin ◽

Shawne D. Miksa

Keyword(s):

Data Collection ◽

Large Scale ◽

Large Data ◽

Study Data ◽

Lessons Learned ◽

Future Research ◽

Data Set ◽

Content Type ◽

Large Data Set ◽

Smart Data

Purpose: This paper aims to discuss the challenges encountered in collecting, cleaning and analyzing the large data set of bibliographic metadata records in machine-readable cataloging (MARC 21) format. Possible solutions are presented. Design/methodology/approach: This mixed-method study relied on content analysis and social network analysis. The study examined subject representation in MARC 21 metadata records created in 2020 in WorldCat – the largest international database of “big smart data.” The methodological challenges that were encountered and their solutions are examined. Findings: In this general review paper with a focus on methodological issues, the discussion of challenges is followed by a discussion of solutions developed and tested as part of this study. Data collection, processing, analysis and visualization are addressed separately. Lessons learned and conclusions related to challenges and solutions for the design of a large-scale study evaluating MARC 21 bibliographic metadata from WorldCat are given. Overall recommendations for the design and implementation of future research are suggested. Originality/value: There are no previous publications that address the challenges and solutions of data collection and analysis of WorldCat’s “big smart data” in the form of MARC 21 data. This is the first study to use a large data set to systematically examine MARC 21 library metadata records created after the most recent addition of new fields and subfields to the MARC 21 Bibliographic Format standard in 2019, based on Resource Description and Access rules. It is also the first to focus its analyses on the networks formed by subject terms shared by MARC 21 bibliographic records in a data set extracted from the heterogeneous centralized database WorldCat.

A Step Closer Toward Precision Medicine by Leveraging a Deep Learning Model to Detect Knee Osteoarthritis: Our Experience with a Large Data Set of 6,571 Patients

10.1055/s-0041-1731546 ◽

2021 ◽

Author(s):

S. Naren ◽

A. Kharat ◽

P. Ajmera

Keyword(s):

Deep Learning ◽

Knee Osteoarthritis ◽

Precision Medicine ◽

Large Data ◽

Learning Model ◽

Data Set ◽

Large Data Set ◽

Deep Learning Model

Climatology of nutrient distributions in the South China Sea based on a large data set derived from a new algorithm

Progress In Oceanography ◽

10.1016/j.pocean.2021.102586 ◽

2021 ◽

pp. 102586

Author(s):

Chuanjun Du ◽

Ruoying He ◽

Zhiyu Liu ◽

Tao Huang ◽

Lifang Wang ◽

...

Keyword(s):

South China Sea ◽

South China ◽

Large Data ◽

The South China Sea ◽

The South ◽

Data Set ◽

China Sea ◽

Large Data Set

Breakup reactions and their ambiguities

The European Physical Journal A ◽

10.1140/epja/s10050-021-00446-3 ◽

2021 ◽

Vol 57 (4) ◽

Author(s):

M. Gómez-Ramos ◽

A. Obertelli ◽

Y. L. Sun

Keyword(s):

Reaction Mechanism ◽

Large Data ◽

Theoretical Framework ◽

Data Set ◽

Current Discussion ◽

Large Data Set ◽

Knockout Reactions

We review the ambiguities in the nuclear information extracted from breakup reactions, focusing on those originating from the description of the reaction mechanism and the overall ambiguity inherent to their interpretation in terms of shell occupancies. We present the current discussion about nucleon knockout reactions and how the understanding of the reaction mechanism would help reduce uncertainties. For the former, we consider the case of $^{11}$Li, due to the existing large data set. For the latter, we recall the paradigmatic example of the electro-dissociation of the deuteron to address the question of the scale and scheme dependence from the theoretical framework used for the interpretation.

A Comparison of Residential Apartment Rent Price Predictions Using a Large Data Set: Kriging Versus Deep Neural Network

Geographical Analysis ◽

10.1111/gean.12283 ◽

2021 ◽

Author(s):

Hajime Seya ◽

Daiki Shiroi

Keyword(s):

Neural Network ◽

Deep Neural Network ◽

Large Data ◽

Data Set ◽

Large Data Set

Dividend Behavior of Indian Firms: New Evidence from Large Data Set

Journal of Asia-Pacific Business ◽

10.1080/10599231.2021.1866396 ◽

2021 ◽

pp. 1-35

Author(s):

Debasis Pahi ◽

Inder Sekhar Yadav

Keyword(s):

Large Data ◽

Data Set ◽

Large Data Set ◽

New Evidence
