Class: Google::Cloud::Dataproc::V1::PySparkJob
- Inherits:
 - Object
- Defined in:
 - lib/google/cloud/dataproc/v1/doc/google/cloud/dataproc/v1/jobs.rb
 
Overview
A Cloud Dataproc job for running Apache PySpark applications on YARN.
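As a rough orientation, the attributes documented on this page can be laid out as a plain Ruby Hash. This is only a sketch: the keys are the attribute names listed below, every URI and value is a made-up placeholder, and whether the client accepts a Hash like this in place of a PySparkJob instance is an assumption this page does not confirm.

```ruby
# Sketch only: keys mirror the attribute names on this page.
# All URIs and values are hypothetical placeholders.
pyspark_job = {
  main_python_file_uri: "gs://my-bucket/jobs/word_count.py", # Required; must be a .py file
  args: ["gs://my-bucket/input/", "gs://my-bucket/output/"], # passed to the driver
  python_file_uris: ["gs://my-bucket/libs/helpers.py"],      # .py, .egg, or .zip
  jar_file_uris: [],
  file_uris: [],
  archive_uris: [],
  properties: { "spark.executor.memory" => "4g" }            # String keys and String values
}

puts pyspark_job[:main_python_file_uri]
```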
Instance Attribute Summary
- #archive_uris ⇒ Array<String>
  Optional.
- #args ⇒ Array<String>
  Optional.
- #file_uris ⇒ Array<String>
  Optional.
- #jar_file_uris ⇒ Array<String>
  Optional.
- #logging_config ⇒ Google::Cloud::Dataproc::V1::LoggingConfig
  Optional.
- #main_python_file_uri ⇒ String
  Required.
- #properties ⇒ Hash{String => String}
  Optional.
- #python_file_uris ⇒ Array<String>
  Optional.
 
Instance Attribute Details
#archive_uris ⇒ Array<String>
Returns Optional. HCFS URIs of archives to be extracted in the working directory. Supported file types: .jar, .tar, .tar.gz, .tgz, and .zip.
      # File 'lib/google/cloud/dataproc/v1/doc/google/cloud/dataproc/v1/jobs.rb', line 180
      class PySparkJob; end
  
#args ⇒ Array<String>
Returns Optional. The arguments to pass to the driver. Do not include arguments, such as --conf, that can be set as job properties, since a collision may occur that causes an incorrect job submission.
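The collision warning can be checked on the client side before a job is submitted. The sketch below uses a hypothetical helper, `conflicting_args`, which is not part of the library. It assumes that `--conf` is the only flag worth checking, since it is the only one this description names.

```ruby
# Hypothetical helper, not part of google-cloud-dataproc.
# Assumption: --conf is the only flag checked, because it is the only
# one the attribute description names explicitly.
PROPERTY_FLAGS = ["--conf"].freeze

# Returns the driver arguments that should be set as job properties instead.
# Matches both "--conf" (value in the next argument) and "--conf=key=value".
def conflicting_args(args)
  args.select do |arg|
    PROPERTY_FLAGS.any? { |flag| arg == flag || arg.start_with?("#{flag}=") }
  end
end

args = ["--input", "gs://my-bucket/data.csv", "--conf", "spark.executor.memory=4g"]
bad = conflicting_args(args)
warn "Set these through properties instead: #{bad.inspect}" unless bad.empty?
```

Only the flag itself is reported; the value that follows a bare `--conf` would also need to be moved into properties.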
  
#file_uris ⇒ Array<String>
Returns Optional. HCFS URIs of files to be copied to the working directory of Python drivers and distributed tasks. Useful for naively parallel tasks.
  
#jar_file_uris ⇒ Array<String>
Returns Optional. HCFS URIs of jar files to add to the CLASSPATHs of the Python driver and tasks.
  
#logging_config ⇒ Google::Cloud::Dataproc::V1::LoggingConfig
Returns Optional. The runtime log config for job execution.
  
#main_python_file_uri ⇒ String
Returns Required. The HCFS URI of the main Python file to use as the driver. Must be a .py file.
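Because this attribute is required and restricted to .py files, a quick local check can catch mistakes before submission. The following is a minimal sketch; `valid_main_python_file_uri?` is a hypothetical helper, not something the library provides, and it inspects only the file extension, not the URI scheme or whether the file exists.

```ruby
# Hypothetical helper, not part of the library.
# Returns true only when the URI is a non-empty String ending in ".py".
def valid_main_python_file_uri?(uri)
  return false unless uri.is_a?(String) && !uri.empty?

  File.extname(uri) == ".py"
end

puts valid_main_python_file_uri?("gs://my-bucket/jobs/main.py")  # placeholder URI
puts valid_main_python_file_uri?("gs://my-bucket/jobs/main.zip") # placeholder URI
```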
  
#properties ⇒ Hash{String => String}
Returns Optional. A mapping of property names to values, used to configure PySpark. Properties that conflict with values set by the Cloud Dataproc API may be overwritten. Can include properties set in /etc/spark/conf/spark-defaults.conf and classes in user code.
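Since the attribute is typed as Hash{String => String}, numeric or Symbol entries need converting before they are assigned. A minimal sketch follows, using a hypothetical helper named `stringify_properties`; the property names are ordinary Spark settings chosen for illustration.

```ruby
# Hypothetical helper, not part of the library.
# Converts every key and value to a String so the result matches
# the documented Hash{String => String} type.
def stringify_properties(props)
  props.each_with_object({}) do |(key, value), out|
    out[key.to_s] = value.to_s
  end
end

properties = stringify_properties(
  "spark.executor.cores" => 2,
  :"spark.executor.memory" => "4g"
)
p properties
```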
  
#python_file_uris ⇒ Array<String>
Returns Optional. HCFS file URIs of Python files to pass to the PySpark framework. Supported file types: .py, .egg, and .zip.
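The supported file types can be screened locally. This sketch uses a hypothetical helper, `partition_python_file_uris`, that is not part of the library; it compares extensions exactly as written, so treating them as case-sensitive is an assumption.

```ruby
# Hypothetical helper, not part of the library.
# The extension list is taken from the attribute description above.
SUPPORTED_PYTHON_FILE_TYPES = %w[.py .egg .zip].freeze

# Splits URIs into [supported, unsupported] by file extension.
def partition_python_file_uris(uris)
  uris.partition { |uri| SUPPORTED_PYTHON_FILE_TYPES.include?(File.extname(uri)) }
end

# Placeholder URIs for illustration only.
supported, unsupported = partition_python_file_uris([
  "gs://my-bucket/libs/helpers.py",
  "gs://my-bucket/libs/deps.zip",
  "gs://my-bucket/libs/notes.txt"
])
warn "Unsupported file types: #{unsupported.inspect}" unless unsupported.empty?
```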