Michael Drmota: The Random Bisection Problem and the Distribution of the Height of Binary Search Trees

Consider an interval of length which is cut into two halves of length and , where is uniformly distributed on . Next, each of these two fragments is cut again into two parts (independently of the other and previous cuts). After the -th step, there are fragments whose lengths are correlated random variables. Given the initial length , the Random Bisection Problem is to determine the probability $P_k(x,\ell)$ that each of the fragments is shorter (or equal) than $\ell$ . The result (we want to present) says that there exists a monotonically decreasing function $\Psi(y)$ such that

$\begin{displaymath} P_k(x,l) = \Psi\left(\frac{x}{\ell x_k}\right) + o(1), \end{displaymath}$

where

is a increasing sequence with $x_k= e^{k/c + \frac {3}{2(c-1)}\log k + O(1)}$ and $c=4.31107\ldots$ is the (largest real) solution of the equation $c\log \left( \frac{2e}c\right) = 1.$

Interestingly, there is a close connection to the distribution of the height of binary search trees. There we have with the same function $\Psi(y)$

$\begin{displaymath} {\bf Pr}[H_{n}\le k] = \Psi(n/y_k) + o(1), \end{displaymath}$

where

is another increasing sequence with

Back to the Index

Please send comments and corrections to Thomas Klausner.