---
title: "Generate random samples from some distribution"
execute:
  eval: true
  freeze: true
---

## spark_statistical_routines

## Description
Generator methods that create a single-column Spark dataframe of i.i.d. samples drawn from a given distribution.



## Arguments
|Argument|Description|
|---|---|
| sc | A Spark connection. |
| n | Sample size (default: 1000). |
| num_partitions | Number of partitions in the resulting Spark dataframe (default: default parallelism of the Spark cluster). |
| seed | Random seed (default: a random long integer). |
| output_col | Name of the output column containing sample values (default: "x"). |
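
## Examples
This page does not name the individual generators, so the following is only a minimal sketch of how the shared arguments fit together. It assumes that sparklyr's `sdf_rnorm()` is one of the generators covered here and that a local Spark installation is available; the partition count, seed, and column name `"z"` are arbitrary illustrative values.

```r
library(sparklyr)

# Assumption: Spark is installed locally (e.g. via spark_install()).
sc <- spark_connect(master = "local")

# Assumption: sdf_rnorm() is one of the generators documented on this page.
# Request 1000 standard-normal samples in a 4-partition Spark dataframe
# whose single column is named "z". Fixing the seed makes the draw repeatable.
samples <- sdf_rnorm(
  sc,
  n = 1000,
  num_partitions = 4,
  seed = 42,
  output_col = "z"
)

# Inspect the result on the Spark side before disconnecting.
n_rows <- sdf_nrow(samples)
n_parts <- sdf_num_partitions(samples)
col_names <- colnames(samples)

spark_disconnect(sc)
```

Leaving `num_partitions` and `seed` unset falls back to the defaults listed in the table above, in which case repeated calls are not expected to return the same values.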