Frequent Pattern Mining – FPGrowth
R/ml_fpm_fpgrowth.R
ml_fpgrowth
Description
A parallel FP-growth algorithm to mine frequent itemsets.
Usage
ml_fpgrowth(
x, items_col = "items",
min_confidence = 0.8,
min_support = 0.3,
prediction_col = "prediction",
uid = random_string("fpgrowth_"),
...
)
ml_association_rules(model)
ml_freq_itemsets(model)
Arguments
Arguments | Description |
---|---|
x | A spark_connection , ml_pipeline , or a tbl_spark . |
items_col | Items column name. Default: “items” |
min_confidence | Minimal confidence for generating Association Rule. min_confidence will not affect the mining for frequent itemsets, but will affect the association rules generation. Default: 0.8 |
min_support | Minimal support level of the frequent pattern. [0.0, 1.0]. Any pattern that appears more than (min_support * size-of-the-dataset) times will be output in the frequent itemsets. Default: 0.3 |
prediction_col | Prediction column name. |
uid | A character string used to uniquely identify the ML estimator. |
… | Optional arguments; currently unused. |
model | A fitted FPGrowth model returned by ml_fpgrowth() |