Tips and Traps¶
Bucketed column is only supported in Hive table at this time.
A Hive table can have both partition and bucket columns.
Suppose
t1
andt2
are 2 bucketed tables and with the number of bucketsb1
andb2
respecitvely. For bucket optimization to kick in when joining them:- The 2 tables must be bucketed on the same keys/columns. - Must joining on the bucket keys/columns. - `b1` is a multiple of `b2` or `b2` is a multiple of `b1`.