selectivity estimation Latest Research Papers

Efficient Selectivity Estimation for Relation-Tree Joins in Multi-Model Databases

10.1109/bigdata52589.2021.9672047 ◽

2021 ◽

Author(s):

Linli Qi ◽

Peiquan Jin ◽

Shouhong Wan

Keyword(s):

Selectivity Estimation

Consistent and Flexible Selectivity Estimation for High-Dimensional Data

Proceedings of the 2021 International Conference on Management of Data ◽

10.1145/3448016.3452772 ◽

2021 ◽

Author(s):

Yaoshu Wang ◽

Chuan Xiao ◽

Jianbin Qin ◽

Rui Mao ◽

Makoto Onizuka ◽

...

Keyword(s):

High Dimensional Data ◽

High Dimensional ◽

Selectivity Estimation

LATEST: Learning-Assisted Selectivity Estimation Over Spatio-Textual Streams

2021 IEEE 37th International Conference on Data Engineering (ICDE) ◽

10.1109/icde51399.2021.00142 ◽

2021 ◽

Author(s):

Mayur Patil ◽

Amr Magdy

Keyword(s):

Selectivity Estimation

Selectivity estimation with density-model-based multidimensional histogram

Knowledge and Information Systems ◽

10.1007/s10115-021-01547-7 ◽

2021 ◽

Author(s):

Meifan Zhang ◽

Hongzhi Wang

Keyword(s):

Density Model ◽

Selectivity Estimation ◽

Model Based ◽

Multidimensional Histogram

Astrid

Proceedings of the VLDB Endowment ◽

10.14778/3436905.3436907 ◽

2020 ◽

Vol 14 (4) ◽

pp. 471-484

Author(s):

Suraj Shetiya ◽

Saravanan Thirumuruganathan ◽

Nick Koudas ◽

Gautam Das

Keyword(s):

Deep Learning ◽

Objective Function ◽

Pattern Matching ◽

Language Processing ◽

Language Model ◽

Language Models ◽

Selectivity Estimation ◽

Statistical Correlations ◽

Benchmark Datasets ◽

Traditional Approaches

Accurate selectivity estimation for string predicates is a long-standing research challenge in databases. Supporting pattern matching on strings (such as prefix, substring, and suffix) makes this problem much more challenging, thereby necessitating a dedicated study. Traditional approaches often build pruned summary data structures such as tries followed by selectivity estimation using statistical correlations. However, this produces insufficiently accurate cardinality estimates resulting in the selection of sub-optimal plans by the query optimizer. Recently proposed deep learning based approaches leverage techniques from natural language processing such as embeddings to encode the strings and use it to train a model. While this is an improvement over traditional approaches, there is a large scope for improvement. We propose Astrid, a framework for string selectivity estimation that synthesizes ideas from traditional and deep learning based approaches. We make two complementary contributions. First, we propose an embedding algorithm that is query-type (prefix, substring, and suffix) and selectivity aware. Consider three strings 'ab', 'abc' and 'abd' whose prefix frequencies are 1000, 800 and 100 respectively. Our approach would ensure that the embedding for 'ab' is closer to 'abc' than 'abd'. Second, we describe how neural language models could be used for selectivity estimation. While they work well for prefix queries, their performance for substring queries is sub-optimal. We modify the objective function of the neural language model so that it could be used for estimating selectivities of pattern matching queries. We also propose a novel and efficient algorithm for optimizing the new objective function. We conduct extensive experiments over benchmark datasets and show that our proposed approaches achieve state-of-the-art results.

Selectivity Estimation for Relation-Tree Joins

32nd International Conference on Scientific and Statistical Database Management ◽

10.1145/3400903.3400921 ◽

2020 ◽

Author(s):

Chao Zhang ◽

Jiaheng Lu

Keyword(s):

Selectivity Estimation

Deep Learning Models for Selectivity Estimation of Multi-Attribute Queries

Proceedings of the 2020 ACM SIGMOD International Conference on Management of Data ◽

10.1145/3318464.3389741 ◽

2020 ◽

Author(s):

Shohedul Hasan ◽

Saravanan Thirumuruganathan ◽

Jees Augustine ◽

Nick Koudas ◽

Gautam Das

Keyword(s):

Deep Learning ◽

Learning Models ◽

Selectivity Estimation

Selectivity of a fishing gear used in the catch of Anomalocardia flexuosa in the Northeast of Brazil

Ciência Rural ◽

10.1590/0103-8478cr20191022 ◽

2020 ◽

Vol 50 (8) ◽

Author(s):

Severino Adriano de Oliveira Lima ◽

Humber Agrelli Andrade ◽

Alfredo Olivera Gálvez

Keyword(s):

Logistic Regression ◽

Bayesian Approach ◽

Reference Value ◽

The State ◽

Fishing Gear ◽

Selectivity Estimation ◽

The World ◽

The Bayesian Approach

ABSTRACT: A type of dredge was introduced as fishing gear along the extractive bank of Mangue Seco - PE from which the largest annual catch of Anomalocardia flexuosa in the world is extracted. This study was carried out with the objective of estimating the selectivity of the new fishing gear and quantitatively evaluating the length classes most compromised by the catches, especially considering 20 mm as the reference value. Specimens larger than this size are most likely to be mature. For the selectivity estimation, the methodology using codends (16 or 20 mm) and small meshed cover (2 mm) was used. To estimate the selectivity parameters, a logistic regression and the Bayesian approach were used. The transition between the state in which the specimen is invulnerable to the fishing gear and vulnerable occurs between 10 and 18 mm, using a 16 mm mesh, and using a 20 mm mesh, this transition is between 14 and 20 mm. Dredgers with 16 mm and 20 mm mesh compromise a large proportion of specimens smaller than 20 mm. If the intention is to protect this part of the population, measures such as total restriction of the 16 mm mesh and use of the 20 mm mesh should be necessary only in the months of less catching incidences, or increasing the mesh to 25 mm.

Selectivity Estimation with Attribute Value Dependencies Using Linked Bayesian Networks

Lecture Notes in Computer Science - Transactions on Large-Scale Data- and Knowledge-Centered Systems XLVI ◽

10.1007/978-3-662-62386-2_6 ◽

2020 ◽

pp. 154-188

Author(s):

Max Halford ◽

Philippe Saint-Pierre ◽

Franck Morvan

Keyword(s):

Bayesian Networks ◽

Selectivity Estimation ◽

Attribute Value

Euler++: Improved Selectivity Estimation for Rectangular Spatial Records

2019 IEEE International Conference on Big Data (Big Data) ◽

10.1109/bigdata47090.2019.9006498 ◽

2019 ◽

Author(s):

A. B. Siddique ◽

Ahmed Eldawy ◽

Vagelis Hristidis

Keyword(s):

Selectivity Estimation

selectivity estimation
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Efficient Selectivity Estimation for Relation-Tree Joins in Multi-Model Databases

Consistent and Flexible Selectivity Estimation for High-Dimensional Data

LATEST: Learning-Assisted Selectivity Estimation Over Spatio-Textual Streams

Selectivity estimation with density-model-based multidimensional histogram

Astrid

Selectivity Estimation for Relation-Tree Joins

Deep Learning Models for Selectivity Estimation of Multi-Attribute Queries

Selectivity of a fishing gear used in the catch of Anomalocardia flexuosa in the Northeast of Brazil

Selectivity Estimation with Attribute Value Dependencies Using Linked Bayesian Networks

Euler++: Improved Selectivity Estimation for Rectangular Spatial Records

Export Citation Format

selectivity estimationRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Efficient Selectivity Estimation for Relation-Tree Joins in Multi-Model Databases

Consistent and Flexible Selectivity Estimation for High-Dimensional Data

LATEST: Learning-Assisted Selectivity Estimation Over Spatio-Textual Streams

Selectivity estimation with density-model-based multidimensional histogram

Astrid

Selectivity Estimation for Relation-Tree Joins

Deep Learning Models for Selectivity Estimation of Multi-Attribute Queries

Selectivity of a fishing gear used in the catch of Anomalocardia flexuosa in the Northeast of Brazil

Selectivity Estimation with Attribute Value Dependencies Using Linked Bayesian Networks

Euler++: Improved Selectivity Estimation for Rectangular Spatial Records

selectivity estimation
Recently Published Documents