
How to get the spark.executor.memory size of a Spark session?

Contributor
I am using Spark 2.3.2.3.1.0.0-78.
I tried to use:
spark_session.sparkContext._conf.get('spark.executor.memory')
but it only returned None.

Can someone help me, please?

1 ACCEPTED SOLUTION

Master Collaborator

Hi @sonnh

If you do not specify spark.executor.memory when launching spark-submit, spark-shell, or pyspark, it defaults to 1g.

Launch the PySpark shell, passing the executor memory explicitly:

[root@local ~]# pyspark --conf spark.executor.memory=1g
>>> spark.conf.get("spark.executor.memory")
u'1g'

 

 
Launch the PySpark shell without passing the executor memory:

[root@local ~]# pyspark
>>> spark.conf.get("spark.executor.memory")
py4j.protocol.Py4JJavaError: An error occurred while calling o66.get. :
java.util.NoSuchElementException: spark.executor.memory ...

 

Note: when the value is not set explicitly, the key is absent from the launched shell's session configuration, so spark.conf.get() cannot find spark.executor.memory, even though the executors still run with the built-in 1g default.
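
To read the value without triggering that exception, you can supply a fallback default when looking it up; a minimal sketch, assuming the same PySpark session as above:

# Supplying a fallback avoids the NoSuchElementException when the key is unset.
executor_memory = spark.conf.get("spark.executor.memory", "1g")

# The lower-level SparkConf API accepts a default as well; the question's
# sparkContext._conf.get(...) returned None only because no default was supplied.
executor_memory = spark.sparkContext.getConf().get("spark.executor.memory", "1g")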

 

Reference: https://spark.apache.org/docs/latest/configuration.html


4 REPLIES


Contributor

Thanks @RangaReddy. My purpose is to collect a series of pages from an RDBMS and compare their size with the JVM heap memory. Do you find this approach acceptable? I believe it could help alleviate the small-files issue on HDFS. I'm facing difficulties in calculating the size of the DataFrame; there seems to be no straightforward way to accomplish it.
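
A rough sketch of that comparison, assuming a hypothetical memory_to_bytes helper (not a Spark API) and the configuration lookup shown in the accepted answer:

# Hypothetical helper: convert a Spark memory string such as "1g" or "512m" into
# bytes so it can be compared with the volume of data fetched from the RDBMS.
def memory_to_bytes(mem):
    units = {"k": 1024, "m": 1024 ** 2, "g": 1024 ** 3, "t": 1024 ** 4}
    mem = mem.strip().lower()
    if mem and mem[-1] in units:
        return int(float(mem[:-1]) * units[mem[-1]])
    return int(mem)  # plain byte count

# Executor heap available to each JVM, falling back to Spark's 1g default.
executor_heap_bytes = memory_to_bytes(spark.conf.get("spark.executor.memory", "1g"))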

Master Collaborator

To calculate the DataFrame size, you can use the SizeEstimator class (org.apache.spark.util.SizeEstimator).
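
A minimal PySpark sketch of that approach, assuming an existing DataFrame df and a hypothetical output path; SizeEstimator is a JVM-side (Scala) utility, so it is reached through the py4j gateway, and the number it returns should be treated as a rough estimate only:

# Estimate the in-memory size of the JVM DataFrame object. Caveat: estimate()
# measures the object it is given, so on df._jdf it reflects the DataFrame/plan
# object rather than the full dataset.
size_bytes = spark.sparkContext._jvm.org.apache.spark.util.SizeEstimator.estimate(df._jdf)

# One way to use the estimate against the small-files problem: aim for roughly
# 128 MB per output file (an assumed target; tune it to your HDFS block size).
target_file_bytes = 128 * 1024 * 1024
num_partitions = max(1, int(size_bytes / target_file_bytes))
df.coalesce(num_partitions).write.mode("overwrite").parquet("/tmp/output_path")  # hypothetical path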