library(graphframes)
# checkpoint directory is required for gf_connected_components()
spark_set_checkpoint_dir(sc, tempdir())
<- gf_friends(sc)
g gf_connected_components(g)
Connected components
R/gf_connected_components.R
gf_connected_components
Description
Computes the connected component membership of each vertex and returns a DataFrame of vertex information with each vertex assigned a component ID.
Usage
gf_connected_components(x, broadcast_threshold = 1000000L,
algorithm = c("graphframes", "graphx"), checkpoint_interval = 2L,
...)
Arguments
Arguments | Description |
---|---|
x | An object coercable to a GraphFrame (typically, a gf_graphframe ). |
broadcast_threshold | Broadcast threshold in propagating component assignments. |
algorithm | One of ‘graphframes’ or ‘graphx’. |
checkpoint_interval | Checkpoint interval in terms of number of iterations. |
… | Optional arguments, currently not used. |