Bipartite Random Graphs and Cuckoo Hashing

This paper extends the analysis of Cuckoo Hashing by Devroye and Morin (2003). In particular, we make several asymptotic results much more precise. We show that the probability that the construction of a hash table succeeds is asymptotically $1-c(\varepsilon)/m+O(1/m^2)$ for some explicit $c(\varepsilon)$, where $m$ denotes the size of each of the two tables, $n=m(1-\varepsilon)$ is the number of keys, and $\varepsilon \in (0,1)$. The analysis rests on a generating function approach to the so-called Cuckoo Graph, a random bipartite graph. We apply a double saddle point method to obtain asymptotic results covering tree sizes, the number of cycles, and the probability that no complex component occurs.
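The success criterion in this abstract can be illustrated with a small Monte Carlo sketch. It assumes the usual model in which every key picks one uniformly random cell in each of the two tables, so that a key is an edge of a bipartite graph on $2m$ nodes, and it treats a construction as successful exactly when no connected component has more edges than nodes (no complex component). The function names `has_complex_component` and `estimate_success` are illustrative and do not come from the paper.

```python
import random

def has_complex_component(m, edges):
    """Return True if some component of the cuckoo graph has more edges than nodes.

    Nodes 0..m-1 are the cells of the first table, m..2m-1 those of the second.
    Each edge (i, j) is a key whose candidate cells are i (table 1) and j (table 2).
    """
    parent = list(range(2 * m))
    node_count = [1] * (2 * m)
    edge_count = [0] * (2 * m)

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for i, j in edges:
        a, b = find(i), find(m + j)
        if a != b:  # merge the two components
            parent[b] = a
            node_count[a] += node_count[b]
            edge_count[a] += edge_count[b]
        edge_count[a] += 1
        if edge_count[a] > node_count[a]:
            return True
    return False

def estimate_success(m, eps, trials=1000, seed=0):
    """Empirical fraction of random key sets of size n = m(1 - eps) without a complex component."""
    rng = random.Random(seed)
    n = int(m * (1 - eps))
    successes = 0
    for _ in range(trials):
        edges = [(rng.randrange(m), rng.randrange(m)) for _ in range(n)]
        if not has_complex_component(m, edges):
            successes += 1
    return successes / trials
```

Calling `estimate_success(m, eps)` for growing `m` gives a rough empirical counterpart to the expansion $1-c(\varepsilon)/m+O(1/m^2)$; the sketch does not compute $c(\varepsilon)$ itself.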

A further analysis of Cuckoo Hashing with a Stash and Random Graphs of Excess r

Discrete Mathematics & Theoretical Computer Science ◽

10.46298/dmtcs.495 ◽

2010 ◽

Vol. 12 no. 3 (Analysis of Algorithms) ◽

Author(s):

Reinhard Kutzelnigg

Keyword(s):

Analysis Of Algorithms ◽

Hash Table ◽

Saddle Point Method ◽

Access Time ◽

Worst Case ◽

Cuckoo Hashing ◽

Original Algorithm ◽

Significant Probability ◽

Table Data

Cuckoo hashing is a hash table data structure offering constant access time, even in the worst case. As a drawback, the construction fails with small but practically significant probability. However, Kirsch et al. (2008) showed that a constant-sized additional memory, the so-called stash, is sufficient to reduce the failure rate drastically. So far, however, this has required a modified insertion procedure that spends additional running time searching for an admissible key. As a major contribution of this paper, we show that the same bounds on the failure probability hold even without this search process, and thus the performance increases. Second, we extend the analysis to simplified cuckoo hashing, a variant of the original algorithm offering increased performance. Further, we derive some explicit asymptotic approximations concerning the number of usual and bipartite graphs, respectively, related to the data structures. Using these results, we obtain much more precise asymptotic expansions of the success rate. These calculations are based on a generating function approach and the saddle point method. Finally, we provide numerical results to support the theoretical analysis.
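To make the data structure concrete, here is a minimal sketch of cuckoo hashing with a stash. It is a generic textbook-style version, not the exact procedure analysed in the paper: the class name `CuckooHashWithStash`, the salted use of Python's built-in `hash`, and the `max_steps` cutoff are all assumptions made for illustration. When the kick-out loop gives up, the key currently in hand goes to the stash without any search for a better candidate.

```python
import random

class CuckooHashWithStash:
    """Two tables of size m plus a small constant-size stash (keys must not be None)."""

    def __init__(self, m, stash_size=2, max_steps=100, seed=0):
        rng = random.Random(seed)
        self.m = m
        self.tables = [[None] * m, [None] * m]
        # Hypothetical hash functions: Python's hash() of a (salt, key) pair.
        self.salts = [rng.getrandbits(64), rng.getrandbits(64)]
        self.stash = []
        self.stash_size = stash_size
        self.max_steps = max_steps

    def _cell(self, t, key):
        return hash((self.salts[t], key)) % self.m

    def lookup(self, key):
        # At most two table probes plus a scan of the constant-size stash.
        return (self.tables[0][self._cell(0, key)] == key
                or self.tables[1][self._cell(1, key)] == key
                or key in self.stash)

    def insert(self, key):
        if self.lookup(key):
            return
        t = 0
        for _ in range(self.max_steps):
            pos = self._cell(t, key)
            # Place the key and pick up whatever occupied the cell before.
            key, self.tables[t][pos] = self.tables[t][pos], key
            if key is None:
                return
            t = 1 - t  # the evicted key tries its cell in the other table
        # No search for an "admissible" key: stash whichever key is in hand.
        if len(self.stash) >= self.stash_size:
            raise RuntimeError("stash overflow: a rehash would be needed")
        self.stash.append(key)
```

A lookup still costs a constant number of probes, since the stash has constant size.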

Asymptotics of the Stirling numbers of the first kind revisited: A saddle point approach

Discrete Mathematics & Theoretical Computer Science ◽

10.46298/dmtcs.501 ◽

2010 ◽

Vol. 12 no. 2 ◽

Author(s):

Guy Louchard

Keyword(s):

Saddle Point ◽

Generating Function ◽

Central Region ◽

Integral Formula ◽

Stirling Numbers ◽

Saddle Point Method ◽

Asymptotic Results ◽

Central Regions ◽

Cauchy’s Integral Formula

Using the saddle point method, we obtain asymptotic results in central and non-central regions from the generating function of the Stirling numbers of the first kind $\left[{n \atop j}\right]$ and Cauchy's integral formula. In the central region, we revisit the celebrated Goncharov theorem with more precision. In the region $j = n - n^{\alpha}$, $\alpha > 1/2$, we analyze the dependence of $\left[{n \atop j}\right]$ on $\alpha$.
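For orientation, the standard starting point for such an analysis can be written out as follows. This is the classical generating function of the unsigned Stirling numbers of the first kind together with Cauchy's formula; the normalization and the contour used in the paper itself may differ, and $\psi$ (the digamma function) is introduced here only for illustration.

```latex
\sum_{j=0}^{n} \left[{n \atop j}\right] z^{j}
  = z(z+1)\cdots(z+n-1)
  = \frac{\Gamma(z+n)}{\Gamma(z)},
\qquad
\left[{n \atop j}\right]
  = \frac{1}{2\pi i} \oint \frac{\Gamma(z+n)}{\Gamma(z)}\,\frac{dz}{z^{j+1}} .

% A saddle point z^* of the integrand is a zero of the derivative of its logarithm:
\psi(z^*+n) - \psi(z^*) - \frac{j+1}{z^*} = 0 .
```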

The Saddle Point Method for the Integral of the Absolute Value of the Brownian Motion

Discrete Mathematics & Theoretical Computer Science ◽

10.46298/dmtcs.3332 ◽

2003 ◽

Vol DMTCS Proceedings vol. AC,... (Proceedings) ◽

Author(s):

Leonid Tolmatz

Keyword(s):

Distribution Function ◽

Brownian Motion ◽

Saddle Point ◽

Point Method ◽

Saddle Point Method ◽

Tail Asymptotics ◽

Absolute Value ◽

Exact Tail Asymptotics ◽

The Absolute

The distribution function of the integral of the absolute value of the Brownian motion was expressed by L. Takács in the form of various series. In the present paper we determine the exact tail asymptotics of this distribution function. The proposed method is applicable to a variety of other Wiener functionals as well.

Subgraphs of Dense Random Graphs with Specified Degrees

Combinatorics Probability Computing ◽

10.1017/s0963548311000034 ◽

2011 ◽

Vol 20 (3) ◽

pp. 413-433 ◽

Cited By ~ 11

Author(s):

Brendan D. McKay

Keyword(s):

Saddle Point ◽

Random Graph ◽

Random Graphs ◽

Degree Sequence ◽

Bipartite Graphs ◽

Average Degree ◽

Point Method ◽

Saddle Point Method ◽

Induced Subgraph ◽

Basic Facts

Let $d = (d_1, d_2, \ldots, d_n)$ be a vector of non-negative integers with even sum. We prove some basic facts about the structure of a random graph with degree sequence $d$, including the probability of a given subgraph or induced subgraph. Although there are many results of this kind, they are restricted to the sparse case with only a few exceptions. Our focus is instead on the case where the average degree is approximately a constant fraction of $n$. Our approach is the multidimensional saddle-point method. This extends the enumerative work of McKay and Wormald (1990) and is analogous to the theory developed for bipartite graphs by Greenhill and McKay (2009).

Joint String Complexity for Markov Sources

Discrete Mathematics & Theoretical Computer Science ◽

10.46298/dmtcs.3001 ◽

2012 ◽

Vol DMTCS Proceedings vol. AQ,... (Proceedings) ◽

Author(s):

Philippe Jacquet ◽

Wojciech Szpankowski

Keyword(s):

Saddle Point ◽

Saddle Points ◽

Singularity Analysis ◽

Point Method ◽

Saddle Point Method ◽

Genome Sequences ◽

Markov Source ◽

The Common ◽

Markov Sources

String complexity is defined as the cardinality of the set of all distinct words (factors) of a given string. For two strings, we define $\textit{joint string complexity}$ as the cardinality of the set of words that are common to both strings. We also relax this definition and introduce $\textit{joint semi-complexity}$, restricted to the common words appearing at least twice in both strings. String complexity has a number of applications, from capturing the richness of a language to finding similarities between two genome sequences. In this paper we analyze joint complexity and joint semi-complexity when both strings are generated by a Markov source. The problem turns out to be quite challenging, requiring subtle singularity analysis and the saddle point method over infinitely many saddle points, leading to novel oscillatory phenomena with single and double periodicities.
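The definitions in this abstract can be checked on small inputs with a brute-force sketch. It assumes that only non-empty factors are counted and that occurrences may overlap; the function names are illustrative. The approach is quadratic in the string length, so it is meant for toy examples rather than genome-scale data.

```python
def factors(s):
    """Set of all distinct non-empty substrings (factors) of s."""
    return {s[i:j] for i in range(len(s)) for j in range(i + 1, len(s) + 1)}

def string_complexity(s):
    """Number of distinct factors of s."""
    return len(factors(s))

def joint_complexity(x, y):
    """Number of distinct factors common to x and y."""
    return len(factors(x) & factors(y))

def count_occurrences(s, w):
    """Number of (possibly overlapping) occurrences of w in s."""
    return sum(1 for i in range(len(s) - len(w) + 1) if s.startswith(w, i))

def joint_semi_complexity(x, y):
    """Number of common factors that occur at least twice in both x and y."""
    common = factors(x) & factors(y)
    return sum(1 for w in common
               if count_occurrences(x, w) >= 2 and count_occurrences(y, w) >= 2)
```

For example, `"aab"` and `"abb"` share the factors `a`, `ab` and `b`, so their joint complexity under these conventions is 3.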

Asymptotics of the Stirling numbers of the second kind revisited

Applicable Analysis and Discrete Mathematics ◽

10.2298/aadm130612011l ◽

2013 ◽

Vol 7 (2) ◽

pp. 193-210 ◽

Cited By ~ 2

Author(s):

Guy Louchard

Keyword(s):

Saddle Point ◽

Generating Function ◽

Central Region ◽

Integral Formula ◽

Stirling Numbers ◽

Saddle Point Method ◽

Asymptotic Results ◽

Analytic Combinatorics ◽

Central Regions ◽

Cauchy’s Integral Formula

Using the saddle point method and multiseries expansions, we obtain asymptotic results in central and non-central regions from the generating function of the Stirling numbers of the second kind $\left\{{n \atop m}\right\}$ and Cauchy's integral formula. In the central region, we revisit the celebrated Gaussian theorem with more precision. In the region $m = n - n^{a}$, $1 > a > 1/2$, we analyze the dependence of $\left\{{n \atop m}\right\}$ on $a$. An extension of a result of Moser and Wyman to the full range of $m$ is also provided. This paper fits within the framework of Analytic Combinatorics.
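As a small numerical companion, the Stirling numbers of the second kind can be tabulated exactly and compared with the explicit sum obtained by extracting coefficients from the classical exponential generating function $(e^z-1)^m/m!$. The sketch below is independent of the paper's asymptotic method; the function names are illustrative.

```python
from math import comb, factorial

def stirling2_table(n_max):
    """Table S[n][m] of Stirling numbers of the second kind for 0 <= m <= n <= n_max."""
    S = [[0] * (n_max + 1) for _ in range(n_max + 1)]
    S[0][0] = 1
    for n in range(1, n_max + 1):
        for m in range(1, n + 1):
            # Element n either joins one of m existing blocks or opens a new block.
            S[n][m] = m * S[n - 1][m] + S[n - 1][m - 1]
    return S

def stirling2_explicit(n, m):
    """Coefficient extraction from (e^z - 1)^m / m!: (1/m!) * sum_k (-1)^(m-k) C(m,k) k^n."""
    total = sum((-1) ** (m - k) * comb(m, k) * k ** n for k in range(m + 1))
    return total // factorial(m)
```

Exact values of this kind are useful for checking any asymptotic approximation at moderate $n$.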

The saddle-point method, and the method of stationary phase (section 6.3)

Recombination in Semiconductors ◽

10.1017/cbo9780511470769.016 ◽

1992 ◽

pp. 538-539

Keyword(s):

Stationary Phase ◽

Saddle Point ◽

Point Method ◽

Saddle Point Method ◽

Method Of Stationary Phase

A Note on the Asymptotic Behavior of the Heights in $b$-Tries for $b$ Large

The Electronic Journal of Combinatorics ◽

10.37236/1517 ◽

2000 ◽

Vol 7 (1) ◽

Cited By ~ 7

Author(s):

Charles Knessl ◽

Wojciech Szpankowski

Keyword(s):

Asymptotic Behavior ◽

Saddle Point ◽

Generating Functions ◽

Single Point ◽

Extreme Value ◽

Limiting Distribution ◽

Point Method ◽

Saddle Point Method ◽

Numerical Verification ◽

Height Distribution

We study the limiting distribution of the height in a generalized trie in which external nodes are capable of storing up to $b$ items (the so-called $b$-tries). We assume that such a tree is built from $n$ random strings (items) generated by an unbiased memoryless source. In this paper, we discuss the case when $b$ and $n$ are both large. We shall identify five regions of the height distribution that should be compared to three regions obtained for fixed $b$. We prove that for most $n$, the limiting distribution is concentrated at the single point $k_1=\lfloor \log_2 (n/b)\rfloor +1$ as $n,b\to \infty$. We observe that this is quite different from the height distribution for fixed $b$, in which case the limiting distribution is of an extreme value type concentrated around $(1+1/b)\log_2 n$. We derive our results by analytic methods, namely generating functions and the saddle point method. We also present some numerical verification of our results.
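A simple simulation sketch can make the quantity $k_1$ tangible. It assumes that the height of a $b$-trie is the smallest depth $k$ at which every length-$k$ prefix is shared by at most $b$ keys (conventions in the literature may differ by one), and it models the unbiased memoryless source with uniformly random `bits`-bit integers. The names `btrie_height` and `simulate` are illustrative.

```python
import math
import random
from collections import Counter

def btrie_height(keys, b, bits):
    """Smallest depth k such that every length-k binary prefix is shared by at most b keys."""
    if not keys:
        return 0
    for k in range(bits + 1):
        # Group the keys by their leading k bits.
        buckets = Counter(key >> (bits - k) for key in keys)
        if max(buckets.values()) <= b:
            return k
    raise ValueError("more than b keys agree on all bits; increase `bits`")

def simulate(n, b, bits=64, seed=0):
    """Return (observed height, k_1) for n random keys and bucket capacity b."""
    rng = random.Random(seed)
    keys = [rng.getrandbits(bits) for _ in range(n)]
    k1 = math.floor(math.log2(n / b)) + 1
    return btrie_height(keys, b, bits), k1
```

Comparing the two returned values over many seeds and large $n$, $b$ gives an informal check of the concentration statement; it is no substitute for the analysis.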

Sorting using complete subintervals and the maximum number of runs in a randomly evolving sequence: Extended abstract.

Discrete Mathematics & Theoretical Computer Science ◽

10.46298/dmtcs.3548 ◽

2007 ◽

Vol DMTCS Proceedings vol. AH,... (Proceedings) ◽

Author(s):

Svante Janson

Keyword(s):

Order Term ◽

Combinatorial Problem ◽

Random Order ◽

Priority Queues ◽

Sorting Algorithm ◽

Asymptotic Results ◽

First Order ◽

Asymptotically Normal ◽

Space Requirements

We study the space requirements of a sorting algorithm where only items that at the end will be adjacent are kept together. This is equivalent to the following combinatorial problem: Consider a string of fixed length $n$ that starts as a string of 0's, and then evolves by changing each 0 to 1, with the $n$ changes done in random order. What is the maximal number of runs of 1's? We give asymptotic results for the distribution and mean. It turns out that, as in many problems involving a maximum, the maximum is asymptotically normal, with fluctuations of order $n^{1/2}$, and to the first order well approximated by the number of runs at the instant when the expectation is maximized, in this case when half the elements have changed to 1; there is also a second order term of order $n^{1/3}$. We also treat some variations, including priority queues and sock-sorting.
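The combinatorial problem stated here is easy to simulate. The sketch below tracks the number of runs of 1's incrementally as positions are switched on in a given order and records the maximum; the function names are illustrative, and the simulation only complements the asymptotic results.

```python
import random

def max_runs(order):
    """Maximum number of runs of 1's while positions are set to 1 in the given order."""
    n = len(order)
    is_one = [False] * (n + 2)  # sentinel cells at both ends stay 0
    runs = best = 0
    for pos in order:
        i = pos + 1
        left, right = is_one[i - 1], is_one[i + 1]
        if not left and not right:
            runs += 1  # a new isolated run appears
        elif left and right:
            runs -= 1  # two neighbouring runs merge into one
        # with exactly one neighbour set, an existing run is extended
        is_one[i] = True
        best = max(best, runs)
    return best

def simulate_max_runs(n, seed=0):
    """Maximum number of runs for a uniformly random order of the n changes."""
    rng = random.Random(seed)
    order = list(range(n))
    rng.shuffle(order)
    return max_runs(order)
```

When half of the positions are 1, a given position starts a run with probability about $1/4$, so values of `simulate_max_runs(n)` near $n/4$ are to be expected for large $n$.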

Enumerating alternating tree families

Discrete Mathematics & Theoretical Computer Science ◽

10.46298/dmtcs.3624 ◽

2008 ◽

Vol DMTCS Proceedings vol. AJ,... (Proceedings) ◽

Author(s):

Markus Kuba ◽

Alois Panholzer

Keyword(s):

Asymptotic Results ◽

Root Node ◽

Ordered Trees ◽

Exponential Generating Function ◽

Random Node ◽

Enumeration Problems ◽

Labelled Trees

We study two enumeration problems for $\textit{up-down alternating trees}$, i.e., rooted labelled trees $T$, where the labels $v_1, v_2, v_3, \ldots$ on every path starting at the root of $T$ satisfy $v_1 < v_2 > v_3 < v_4 > \cdots$. First, we consider various tree families of interest in combinatorics (such as unordered, ordered, $d$-ary and Motzkin trees) and study the number $T_n$ of different up-down alternating labelled trees of size $n$. For all tree families considered, we obtain an implicit characterization of the exponential generating function $T(z)$, leading to asymptotic results for the coefficients $T_n$ for various tree families. Second, we consider the particular family of up-down alternating labelled ordered trees and study the influence of such an alternating labelling on the average shape of the trees by analyzing the parameters $\textit{label of the root node}$, $\textit{degree of the root node}$ and $\textit{depth of a random node}$ in a random tree of size $n$. This leads to exact enumeration results and limiting distribution results.
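For very small sizes, the numbers $T_n$ can be obtained by exhaustive search, which gives a sanity check on the definition. The sketch below handles only the unordered family: it encodes a rooted labelled tree by a parent map on the labels $1,\ldots,n$ and tests the alternating condition edge by edge. It is a brute-force illustration with illustrative function names, not the generating function method of the paper.

```python
from itertools import product

def is_alternating_tree(root, parent):
    """Check that `parent` (non-root node -> parent) is a tree rooted at `root`
    whose labels satisfy v1 < v2 > v3 < ... along every path from the root."""
    depth = {root: 0}

    def node_depth(v, seen):
        if v in depth:
            return depth[v]
        if v in seen:
            return None  # cycle: the parent map is not a tree
        seen.add(v)
        d = node_depth(parent[v], seen)
        if d is None:
            return None
        depth[v] = d + 1
        return depth[v]

    for v in parent:
        d = node_depth(v, set())
        if d is None:
            return False
        p = parent[v]
        # Labels go up on edges into odd depths and down on edges into even depths.
        if d % 2 == 1 and not v > p:
            return False
        if d % 2 == 0 and not v < p:
            return False
    return True

def count_alternating_unordered(n):
    """Number of up-down alternating rooted labelled unordered trees on labels 1..n."""
    labels = range(1, n + 1)
    count = 0
    for root in labels:
        others = [v for v in labels if v != root]
        for parents in product(labels, repeat=len(others)):
            if is_alternating_tree(root, dict(zip(others, parents))):
                count += 1
    return count
```

The search visits $n \cdot n^{n-1}$ parent maps, so it is only practical for single-digit $n$.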
