hipdf.DataFrame.partition_by_hash

hipdf.DataFrame.partition_by_hash#

21 min read time

Applies to Linux

DataFrame.partition_by_hash(columns, nparts: int, keep_index: bool = True) list[DataFrame]#

Partition the dataframe by the hashed value of data in columns.

Parameters#

columnssequence of str

The names of the columns to be hashed. Must have at least one name.

npartsint

Number of output partitions

keep_indexboolean

Whether to keep the index or drop it

Returns#

partitioned: list of DataFrame