Create a Spark Configuration for Livy
R/livy_connection.R
livy_config
Description
Create a Spark Configuration for Livy
Usage
livy_config(
  config = spark_config(),
  username = NULL,
  password = NULL,
  negotiate = FALSE,
  custom_headers = list(`X-Requested-By` = "sparklyr"),
  proxy = NULL,
  curl_opts = NULL,
  ...
)
Arguments
Arguments | Description |
---|---|
config | Optional base configuration |
username | The username to use in the Authorization header |
password | The password to use in the Authorization header |
negotiate | Whether to authenticate with the gssnegotiate method. Defaults to FALSE. |
custom_headers | List of custom headers to append to HTTP requests. Defaults to list("X-Requested-By" = "sparklyr"). |
proxy | Either NULL or a proxy specified by httr::use_proxy(). Defaults to NULL. |
curl_opts | List of CURL options (e.g., verbose, connecttimeout, dns_cache_timeout; see httr::httr_options() for a list of valid options). NOTE: these options apply to libcurl only and are separate from HTTP headers and Livy session parameters. |
... | Additional Livy session parameters |
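As a rough sketch of how curl_opts might be passed, the call below sets two of the libcurl options named in the table. It assumes the sparklyr package is installed; the 10-second timeout is an arbitrary illustrative value.

```r
library(sparklyr)

# libcurl options only; these are not HTTP headers or Livy session parameters.
# The connect timeout of 10 seconds is an illustrative assumption.
cfg_curl <- livy_config(
  curl_opts = list(verbose = TRUE, connecttimeout = 10)
)
```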
Details
Extends a Spark spark_config() configuration with settings for Livy. For instance, username and password define the basic authentication settings for a Livy session.
The default value of custom_headers is set to list("X-Requested-By" = "sparklyr") in order to facilitate connection to Livy servers with CSRF protection enabled.
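A minimal sketch of a basic-authentication configuration, assuming sparklyr is installed, might look like the following. The user name, the LIVY_PASSWORD environment variable, and the server URL are hypothetical placeholders, not values defined by sparklyr.

```r
library(sparklyr)

# Placeholder credentials (assumptions): "livy_user" and the LIVY_PASSWORD
# environment variable are illustrative names only.
cfg_auth <- livy_config(
  username = "livy_user",
  password = Sys.getenv("LIVY_PASSWORD"),
  # Same as the default; written out to show where custom headers go.
  custom_headers = list(`X-Requested-By` = "sparklyr")
)

# Connecting needs a running Livy server; the URL below is hypothetical.
# sc <- spark_connect(master = "http://localhost:8998",
#                     method = "livy", config = cfg_auth)
```

Reading the password from the environment keeps it out of the script itself; any other secure source would serve equally well.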
Additional parameters for Livy sessions are:
Parameter | Description |
---|---|
proxy_user | User to impersonate when starting the session |
jars | Jars to be used in this session |
py_files | Python files to be used in this session |
files | Files to be used in this session |
driver_memory | Amount of memory to use for the driver process |
driver_cores | Number of cores to use for the driver process |
executor_memory | Amount of memory to use per executor process |
executor_cores | Number of cores to use for each executor |
num_executors | Number of executors to launch for this session |
archives | Archives to be used in this session |
queue | The name of the YARN queue to which the session is submitted |
name | The name of this session |
heartbeat_timeout | Timeout in seconds after which the session is orphaned |
conf | Spark configuration properties (map of key=value) |
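The session parameters listed above are passed through the ... argument. The sketch below assumes sparklyr is installed; all resource sizes and the session name are made-up values for illustration.

```r
library(sparklyr)

# Session parameters go through `...`; every value here is an
# illustrative assumption, not a recommended setting.
cfg_session <- livy_config(
  driver_memory   = "2G",
  executor_memory = "4G",
  executor_cores  = 2,
  num_executors   = 4,
  name            = "sparklyr-session"
)
```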
Note that queue is supported only by Livy 0.4.0 or newer. If you are using an older version, specify the queue via config (e.g. config = spark_config(spark.yarn.queue = "my_queue")).
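One way to set the queue through the base configuration is sketched below. It assigns the property on the list returned by spark_config() rather than using the inline form above, and "my_queue" is a placeholder queue name. It assumes sparklyr is installed.

```r
library(sparklyr)

# Set the YARN queue on the base Spark configuration.
config <- spark_config()
config[["spark.yarn.queue"]] <- "my_queue"  # placeholder queue name

# livy_config() extends the base configuration, so the setting carries over.
cfg_queue <- livy_config(config = config)
```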
Value
Named list with configuration data