What is extendible hashing?

Directories and buckets are two key terms in this algorithm. Buckets hold the hashed data, while directories hold pointers to these buckets. Each directory has a unique ID.

The following points explain how the algorithm works:

Initialize the bucket depths (the number of times a bucket has overflown) and the global depth (the maximum of the bucket depths of the directories).
Convert data into a binary representation.
Consider the "global depth" number of least significant bits (LSBs), that is, the rightmost bits of the data's binary representation.
Map the data according to the ID of a directory.
Check for the following conditions if a bucket overflows (if the number of elements in a bucket exceeds the set limit):

Global depth == bucket depth: Split the bucket into two and increment both the global depth (which doubles the number of directories) and the bucket's depth. Re-hash the elements that were present in the split bucket.
Global depth > bucket depth: Split the bucket into two and increment the bucket depth only. Re-hash the elements that were present in the split bucket.

Repeat the steps above for each element.
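
The steps above can be sketched in Python as follows. This is a minimal illustration rather than a reference implementation: the names Bucket, ExtendibleHashTable, directory_id, and _split are hypothetical, keys are assumed to be distinct non-negative integers, and the table is assumed to start with two directories at global depth 1.

```python
class Bucket:
    def __init__(self, depth):
        self.depth = depth  # bucket (local) depth
        self.keys = []      # hashed data held by this bucket


class ExtendibleHashTable:
    def __init__(self, bucket_limit=3):
        self.bucket_limit = bucket_limit
        self.global_depth = 1
        # Two initial directories (IDs 0 and 1), each pointing to its own bucket.
        self.directory = [Bucket(1), Bucket(1)]

    def directory_id(self, key):
        # Keep only the `global_depth` least significant bits of the key.
        return key & ((1 << self.global_depth) - 1)

    def insert(self, key):
        bucket = self.directory[self.directory_id(key)]
        bucket.keys.append(key)
        # A split can leave every key in one half, so repeat until the
        # bucket holding the new key fits (assumes distinct keys).
        while len(bucket.keys) > self.bucket_limit:
            self._split(bucket)
            bucket = self.directory[self.directory_id(key)]

    def _split(self, bucket):
        if bucket.depth == self.global_depth:
            # Global depth == bucket depth: double the directory. Entry
            # i + 2**d shares its low d bits with entry i, so it starts
            # out pointing to the same bucket.
            self.directory = self.directory + self.directory
            self.global_depth += 1
        # In both cases the bucket depth grows by one.
        bucket.depth += 1
        sibling = Bucket(bucket.depth)
        new_bit = 1 << (bucket.depth - 1)
        # Directories whose newly considered bit is 1 move to the sibling.
        for i, target in enumerate(self.directory):
            if target is bucket and i & new_bit:
                self.directory[i] = sibling
        # Re-hash the elements that were present in the split bucket.
        old_keys, bucket.keys = bucket.keys, []
        for k in old_keys:
            self.directory[self.directory_id(k)].keys.append(k)


table = ExtendibleHashTable(bucket_limit=3)
for key in [28, 4, 19, 1, 22, 16, 12, 0, 5, 7]:
    table.insert(key)

print("global depth:", table.global_depth)
for i, b in enumerate(table.directory):
    print(format(i, f"0{table.global_depth}b"), "depth", b.depth, b.keys)
```

In this sketch, several directory entries may point to the same bucket object. That is how a bucket whose depth is smaller than the global depth is represented.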

Working through the steps above makes it evident why this method is considered so flexible and dynamic.

Example

Let's take the following example to see how this hashing method works, where:

Data = {28,4,19,1,22,16,12,0,5,7}
Bucket limit = 3

Convert the data into binary representation:

28 = 11100
4 = 00100
19 = 10011
1 = 00001
22 = 10110
16 = 10000
12 = 01100
0 = 00000
5 = 00101
7 = 00111
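
The same conversion can be reproduced with a short Python snippet. It is only a convenience sketch: the variable names are illustrative, and the 5-bit width is chosen to match the list above.

```python
data = [28, 4, 19, 1, 22, 16, 12, 0, 5, 7]

# Zero-padded 5-bit binary string for each value.
binary = {value: format(value, "05b") for value in data}

for value, bits in binary.items():
    print(f"{value} = {bits}")
```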

The remaining steps begin from the following initial state:

Initialize the hash table with two initial directories and buckets. Set the global depth and the bucket depths to 1.
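
As a rough illustration of this starting point, the snippet below shows which of the two directories each value would map to at global depth 1, using only its last bit. The helper name directory_id is hypothetical, and the bucket limit is deliberately ignored here to show the raw mapping.

```python
def directory_id(key, global_depth):
    # Keep only the `global_depth` least significant bits of the key.
    return key & ((1 << global_depth) - 1)


data = [28, 4, 19, 1, 22, 16, 12, 0, 5, 7]

# Group the values by directory ID at global depth 1 (IDs 0 and 1).
initial = {0: [], 1: []}
for key in data:
    initial[directory_id(key, 1)].append(key)

print(initial)
```

Both groups hold more than three values, so with a bucket limit of 3 both buckets must eventually overflow and split as the values are inserted one by one.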

Advantages	Disadvantages
Less costly data retrieval.	Memory wastage due to certain buckets containing more data than the others.
The dynamic approach avoids data loss.	Complicated coding.
