class Google::Apis::DataprocV1::PySparkJob

A Dataproc job for running Apache PySpark (spark.apache.org/docs/0.9.0/ python-programming-guide.html) applications on YARN.

Attributes

archive_uris[RW]

Optional. HCFS URIs of archives to be extracted into the working directory of each executor. Supported file types: .jar, .tar, .tar.gz, .tgz, and .zip. Corresponds to the JSON property `archiveUris` @return [Array<String>]

args[RW]

Optional. The arguments to pass to the driver. Do not include arguments, such as –conf, that can be set as job properties, since a collision may occur that causes an incorrect job submission. Corresponds to the JSON property `args` @return [Array<String>]

file_uris[RW]

Optional. HCFS URIs of files to be placed in the working directory of each executor. Useful for naively parallel tasks. Corresponds to the JSON property `fileUris` @return [Array<String>]

jar_file_uris[RW]

Optional. HCFS URIs of jar files to add to the CLASSPATHs of the Python driver and tasks. Corresponds to the JSON property `jarFileUris` @return [Array<String>]

logging_config[RW]

The runtime logging config of the job. Corresponds to the JSON property `loggingConfig` @return [Google::Apis::DataprocV1::LoggingConfig]

main_python_file_uri[RW]

Required. The HCFS URI of the main Python file to use as the driver. Must be a .py file. Corresponds to the JSON property `mainPythonFileUri` @return [String]

properties[RW]

Optional. A mapping of property names to values, used to configure PySpark. Properties that conflict with values set by the Dataproc API may be overwritten. Can include properties set in /etc/spark/conf/spark-defaults.conf and classes in user code. Corresponds to the JSON property `properties` @return [Hash<String,String>]

python_file_uris[RW]

Optional. HCFS file URIs of Python files to pass to the PySpark framework. Supported file types: .py, .egg, and .zip. Corresponds to the JSON property `pythonFileUris` @return [Array<String>]

Public Class Methods

new(**args) click to toggle source
# File lib/google/apis/dataproc_v1/classes.rb, line 2887
def initialize(**args)
   update!(**args)
end

Public Instance Methods

update!(**args) click to toggle source

Update properties of this object

# File lib/google/apis/dataproc_v1/classes.rb, line 2892
def update!(**args)
  @archive_uris = args[:archive_uris] if args.key?(:archive_uris)
  @args = args[:args] if args.key?(:args)
  @file_uris = args[:file_uris] if args.key?(:file_uris)
  @jar_file_uris = args[:jar_file_uris] if args.key?(:jar_file_uris)
  @logging_config = args[:logging_config] if args.key?(:logging_config)
  @main_python_file_uri = args[:main_python_file_uri] if args.key?(:main_python_file_uri)
  @properties = args[:properties] if args.key?(:properties)
  @python_file_uris = args[:python_file_uris] if args.key?(:python_file_uris)
end