SQL join statement or command is often used to fetch data from multiple tables. SQL joins are used to join a row of data from multiple tables on the basis of a common column (field) between them. Multiple types of joins are used in the SQL listed below.
SQL inner join
SQL outer join
SQL left join
SQL right join
SQL semi join
SQL anti join
The name Hash join
comes from the hash function()
. This hash join is useful for middle to large inputs, but it is not efficient for every small set. Hash join requires at least one equi join(=)
, and it supports all joins (left/ right semi/ anti join). Hash join is the only physical operator that needs memory. Hash join consists of 2 phases.
Building or blocking phase
Probe or non-blocking phase
The Hash join
is one of the three available joins for joining two tables. However, it is not only about joining. Hash join is used to find the matching in two tables with a hash table, several joins are available, like nested loop join, but the hash join is more efficient than the nested loop join.
In the first phase, sever creates a hash table in the memory. In this hash table, rows of the input will be stored using join. Hash join attributes are used as hash table keys. This build is called build input. Let’s assume countries
is designated as the build input. The hash join condition is countries.country_id
, which belongs to the build input. It will be used as the key in the hash table. Once all the rows are stored in the hash table, the build phase is completed.
During the probe phase, the server reads rows from the probe input (persons
in our illustration). For individual rows, the server probes the hash table for comparing rows using the value from persons.country_id
as the lookup key. For the particular match, a joined row is sent to the client. In the end, the server scanned each input only once, using constant time lookup to find matching rows between the two inputs.
Let’s see the working of hash join with the help of the following query as an example:
SELECTperson_name, country_nameFROMpersons JOIN countries ON persons.country_id = countries.country_id;
Line 1: The SELECT
clause is used to select the columns from the table we want to display.
Line 2: The person_name
and country_name
are the column names in the tables.
Line 3: The FROM
clause is used to mention the required table name.
Line 4: The JOIN
clause is used to join two related connected tables
on a specific matching condition for both tables.
Unlock your potential: SQL joins series, all in one place!
To continue your exploration of SQL joins, check out our series of Answers below:
Different types of SQL joins
Understand the different types of SQL joins used for combining data from multiple tables.
What is inner join in SQL?
Learn how inner joins return matched rows from both tables in SQL.
What is outer join in SQL?
Explore outer joins and how they return unmatched rows as well.
What is a self join in SQL?
Discover how self joins allow a table to join with itself.
What is a natural join in SQL?
Learn how natural joins match columns with the same name in SQL tables.
What is a hash join in SQL?
Understand hash joins, which are used for large-scale data matching and combining.
What is the SQL CROSS JOIN?
Explore the SQL CROSS JOIN to combine every row from one table with every row from another.
What is an equi join in SQL?
Learn how equi joins match rows based on the equality of values between tables.
How to join 3 or more tables in SQL
Master the technique of joining multiple tables in a SQL query for complex data retrieval.
Free Resources