Class: Google::Cloud::Dataproc::V1::PySparkJob

Inherits:

Object

Object
Google::Cloud::Dataproc::V1::PySparkJob

show all

Defined in:: lib/google/cloud/dataproc/v1/doc/google/cloud/dataproc/v1/jobs.rb

Overview

A Cloud Dataproc job for running Apache PySpark applications on YARN.

Instance Attribute Summary collapse

#archive_uris ⇒ Array<String>
Optional.
#args ⇒ Array<String>
Optional.
#file_uris ⇒ Array<String>
Optional.
#jar_file_uris ⇒ Array<String>
Optional.
#logging_config ⇒ Google::Cloud::Dataproc::V1::LoggingConfig
Optional.
#main_python_file_uri ⇒ String
Required.
#properties ⇒ Hash{String => String}
Optional.
#python_file_uris ⇒ Array<String>
Optional.

Instance Attribute Details

#archive_uris ⇒ `Array<String>`

Returns Optional. HCFS URIs of archives to be extracted in the working directory of .jar, .tar, .tar.gz, .tgz, and .zip.

Returns:

(Array<String>) —
Optional. HCFS URIs of archives to be extracted in the working directory of .jar, .tar, .tar.gz, .tgz, and .zip.

180	# File 'lib/google/cloud/dataproc/v1/doc/google/cloud/dataproc/v1/jobs.rb', line 180 class PySparkJob; end

#args ⇒ `Array<String>`

Returns Optional. The arguments to pass to the driver. Do not include arguments, such as +--conf+, that can be set as job properties, since a collision may occur that causes an incorrect job submission.

Returns:

(Array<String>) —
Optional. The arguments to pass to the driver. Do not include arguments, such as +--conf+, that can be set as job properties, since a collision may occur that causes an incorrect job submission.

180	# File 'lib/google/cloud/dataproc/v1/doc/google/cloud/dataproc/v1/jobs.rb', line 180 class PySparkJob; end

#file_uris ⇒ `Array<String>`

Returns Optional. HCFS URIs of files to be copied to the working directory of Python drivers and distributed tasks. Useful for naively parallel tasks.

Returns:

(Array<String>) —
Optional. HCFS URIs of files to be copied to the working directory of Python drivers and distributed tasks. Useful for naively parallel tasks.

180	# File 'lib/google/cloud/dataproc/v1/doc/google/cloud/dataproc/v1/jobs.rb', line 180 class PySparkJob; end

#jar_file_uris ⇒ `Array<String>`

Returns Optional. HCFS URIs of jar files to add to the CLASSPATHs of the Python driver and tasks.

Returns:

(Array<String>) —
Optional. HCFS URIs of jar files to add to the CLASSPATHs of the Python driver and tasks.

180	# File 'lib/google/cloud/dataproc/v1/doc/google/cloud/dataproc/v1/jobs.rb', line 180 class PySparkJob; end

#logging_config ⇒ `Google::Cloud::Dataproc::V1::LoggingConfig`

Returns Optional. The runtime log config for job execution.

Returns:

(Google::Cloud::Dataproc::V1::LoggingConfig) —
Optional. The runtime log config for job execution.

180	# File 'lib/google/cloud/dataproc/v1/doc/google/cloud/dataproc/v1/jobs.rb', line 180 class PySparkJob; end

#main_python_file_uri ⇒ `String`

Returns Required. The HCFS URI of the main Python file to use as the driver. Must be a .py file.

Returns:

(String) —
Required. The HCFS URI of the main Python file to use as the driver. Must be a .py file.

180	# File 'lib/google/cloud/dataproc/v1/doc/google/cloud/dataproc/v1/jobs.rb', line 180 class PySparkJob; end

#properties ⇒ `Hash{String => String}`

Returns Optional. A mapping of property names to values, used to configure PySpark. Properties that conflict with values set by the Cloud Dataproc API may be overwritten. Can include properties set in /etc/spark/conf/spark-defaults.conf and classes in user code.

Returns:

(Hash{String => String}) —
Optional. A mapping of property names to values, used to configure PySpark. Properties that conflict with values set by the Cloud Dataproc API may be overwritten. Can include properties set in /etc/spark/conf/spark-defaults.conf and classes in user code.

180	# File 'lib/google/cloud/dataproc/v1/doc/google/cloud/dataproc/v1/jobs.rb', line 180 class PySparkJob; end

#python_file_uris ⇒ `Array<String>`

Returns Optional. HCFS file URIs of Python files to pass to the PySpark framework. Supported file types: .py, .egg, and .zip.

Returns:

(Array<String>) —
Optional. HCFS file URIs of Python files to pass to the PySpark framework. Supported file types: .py, .egg, and .zip.

180	# File 'lib/google/cloud/dataproc/v1/doc/google/cloud/dataproc/v1/jobs.rb', line 180 class PySparkJob; end

Class: Google::Cloud::Dataproc::V1::PySparkJob

Overview

Instance Attribute Summary collapse

Instance Attribute Details

#archive_uris ⇒ Array<String>

#args ⇒ Array<String>

#file_uris ⇒ Array<String>

#jar_file_uris ⇒ Array<String>

#logging_config ⇒ Google::Cloud::Dataproc::V1::LoggingConfig

#main_python_file_uri ⇒ String

#properties ⇒ Hash{String => String}

#python_file_uris ⇒ Array<String>

#archive_uris ⇒ `Array<String>`

#args ⇒ `Array<String>`

#file_uris ⇒ `Array<String>`

#jar_file_uris ⇒ `Array<String>`

#logging_config ⇒ `Google::Cloud::Dataproc::V1::LoggingConfig`

#main_python_file_uri ⇒ `String`

#properties ⇒ `Hash{String => String}`

#python_file_uris ⇒ `Array<String>`