hipdf.DataFrame.partition_by_hash



Applies to Linux

DataFrame.partition_by_hash(columns, nparts, keep_index=True)

Partition the dataframe by the hashed value of data in columns.

Parameters

columns : sequence of str

The names of the columns to be hashed. Must have at least one name.

nparts : int

Number of output partitions.

keep_index : boolean, default True

Whether to keep the index (True) or drop it (False) in the output partitions.

Returns

partitioned: list of DataFrame
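
The partitioning behavior described above can be illustrated with a minimal pure-Python sketch. This is not hipDF's implementation: the helper name partition_rows_by_hash, the use of zlib.crc32 as the hash function, and the list-of-dicts row layout are all assumptions made for illustration. It only shows the general idea that rows whose key columns hold equal values land in the same partition, and that exactly nparts partitions come back (some may be empty).

```python
from zlib import crc32


def partition_rows_by_hash(rows, columns, nparts):
    """Hypothetical helper: split rows (a list of dicts) into nparts lists.

    The hash function (crc32 over the repr of the key tuple) is an
    assumption for illustration; hipDF's internal hash may differ.
    """
    if not columns:
        raise ValueError("columns must contain at least one name")
    partitions = [[] for _ in range(nparts)]
    for row in rows:
        # Build a deterministic byte key from the selected columns.
        key = repr(tuple(row[c] for c in columns)).encode("utf-8")
        # The hash modulo nparts selects the destination partition.
        partitions[crc32(key) % nparts].append(row)
    return partitions


rows = [
    {"key": "a", "value": 1},
    {"key": "b", "value": 2},
    {"key": "a", "value": 3},
    {"key": "c", "value": 4},
]
parts = partition_rows_by_hash(rows, ["key"], nparts=3)

# The hipDF call with the signature documented on this page would be:
#   parts = df.partition_by_hash(["key"], nparts=3, keep_index=True)
```

In this sketch, both rows with key "a" end up in the same partition, while which partition index they land in depends on the hash function used.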