Repartition a Spark DataFrame
R/sdf_interface.R
sdf_repartition
Description
Repartition a Spark DataFrame
Usage
sdf_repartition(x, partitions = NULL, partition_by = NULL)
Arguments
Arguments | Description |
---|---|
x | A spark_connection , ml_pipeline , or a tbl_spark . |
partitions | number of partitions |
partition_by | vector of column names used for partitioning, only supported for Spark 2.0+ |