How to know the deploy mode of a PySpark application?


Solution 1

If you are running an interactive shell, e.g. pyspark (from the CLI or via an IPython notebook), you are running in client mode by default. You can easily verify that pyspark, or any other interactive shell, cannot be run in cluster mode:

$ pyspark --master yarn --deploy-mode cluster
Python 2.7.11 (default, Mar 22 2016, 01:42:54)
[GCC Intel(R) C++ gcc 4.8 mode] on linux2
Type "help", "copyright", "credits" or "license" for more information.
Error: Cluster deploy mode is not applicable to Spark shells.

$ spark-shell --master yarn --deploy-mode cluster
Error: Cluster deploy mode is not applicable to Spark shells.

Examining the contents of the bin/pyspark file may be instructive, too - here is the final line (which is the actual executable):

$ pwd
/home/ctsats/spark-1.6.1-bin-hadoop2.6
$ cat bin/pyspark
[...]
exec "${SPARK_HOME}"/bin/spark-submit pyspark-shell-main --name "PySparkShell" "$@"

In other words, pyspark is actually a script run by spark-submit and given the name PySparkShell, under which you can find it in the Spark History Server UI; and since it is invoked that way, it honors whatever arguments (or defaults) are passed to its spark-submit command.
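A quick way to see this from inside a running pyspark shell is to read the app name and deploy mode off the SparkContext configuration (a minimal sketch; sc is the context the shell predefines):

# Inside a running pyspark shell, where `sc` is predefined.
conf = sc.getConf()
print(conf.get("spark.app.name"))           # 'PySparkShell'
print(conf.get("spark.submit.deployMode"))  # 'client' for any interactive shell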

Solution 2

Use sc.deployMode:

scala> sc.deployMode
res0: String = client

scala> sc.version
res1: String = 2.1.0-SNAPSHOT

sc.deployMode is not available in PySpark, so there you can check spark.submit.deployMode instead (shown here from the Scala shell; the PySpark equivalent follows):

scala> sc.getConf.get("spark.submit.deployMode")
res0: String = client
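In PySpark the same lookup goes through the SparkConf object (a minimal sketch; sc is the SparkContext the pyspark shell predefines):

# `sc` is the SparkContext predefined by the pyspark shell.
sc.getConf().get("spark.submit.deployMode")  # returns u'client' in client mode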

Solution 3

As of Spark 2+, the following works:

for item in spark.sparkContext.getConf().getAll():
    print(item)

(u'spark.submit.deployMode', u'client') # will be one of the items in the list.
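If you only need the deploy mode rather than the full configuration dump, you can fetch the single key directly (same Spark 2+ assumption, with spark being the SparkSession the shell predefines):

# `spark` is the SparkSession predefined by the pyspark shell (Spark 2+).
print(spark.sparkContext.getConf().get("spark.submit.deployMode"))  # 'client' or 'cluster'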
Author: makansij, a PhD student at University of Southern California.

Updated on June 16, 2022

Comments

  • makansij, almost 2 years ago:

    I am trying to fix an issue with running out of memory, and I want to know whether I need to change these settings in the default configuration file (spark-defaults.conf) in the Spark home folder, or whether I can set them in code (a sketch of the in-code approach appears after these comments).

    I saw this question, PySpark: java.lang.OutofMemoryError: Java heap space, and it says that it depends on whether I'm running in client mode. I'm running Spark on a standalone cluster and monitoring it.

    But how do I figure out whether I'm running Spark in client mode?

  • Jacek Laskowski, almost 8 years ago:
    The OP asked how to know the deploy mode of a running Spark application.
  • desertnaut, almost 8 years ago:
    And you consider this a reason for downvoting? In the end, my answer does address the question, which is how to know... The question is also clearly about the PySpark API, not Scala; nevertheless, I upvoted your answer because I learned something (and that is my main criterion for upvoting...).
  • Jacek Laskowski, almost 8 years ago:
    Thanks @desertnaut. I appreciate the upvote. My downvote was to mark your answer as slightly off-base: you didn't really answer the question (I may not have either, but I left the OP with some homework :))
  • desertnaut, almost 8 years ago:
    Thanks for the reply. I would argue, though, that "slightly off-base" answers do not deserve downvotes...
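Postscript on the OP's side question about setting memory options in code rather than in spark-defaults.conf: a hedged sketch follows (the app name and values are illustrative, not from the thread). One caveat worth knowing: in client mode, spark.driver.memory cannot be raised from within the application, because the driver JVM has already started by the time your code runs; for that setting, spark-defaults.conf or spark-submit --driver-memory is the reliable route.

from pyspark import SparkConf, SparkContext

# Illustrative values only. Executor memory can be set in code like this;
# driver memory in client mode must be set via spark-defaults.conf or
# `spark-submit --driver-memory`, since the driver JVM is already running.
conf = (SparkConf()
        .setAppName("memory-config-example")   # hypothetical app name
        .set("spark.executor.memory", "4g"))   # example value
sc = SparkContext(conf=conf)
print(sc.getConf().get("spark.executor.memory"))  # '4g'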