<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Fail to start pyspark session in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Fail-to-start-pyspark-session/m-p/367128#M239801</link>
    <description>&lt;P&gt;Hi all, I am exploring the features in my CDP cluster.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I added the Spark service to the cluster. When I try to run pyspark in a terminal to study Spark, I get the following error:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Type "help", "copyright", "credits" or "license" for more information.&lt;BR /&gt;Warning: Ignoring non-Spark config property: hdfs&lt;BR /&gt;Warning: Ignoring non-Spark config property: ExitCodeException&lt;BR /&gt;Warning: Ignoring non-Spark config property: at&lt;BR /&gt;Setting default log level to "WARN".&lt;BR /&gt;To adjust logging level use sc.setLogLevel(newLevel). For SparkR, use setLogLevel(newLevel).&lt;BR /&gt;23/03/29 02:47:40 WARN conf.HiveConf: HiveConf of name hive.masking.algo does not exist&lt;BR /&gt;23/03/29 02:47:43 WARN conf.HiveConf: HiveConf of name hive.masking.algo does not exist&lt;BR /&gt;23/03/29 02:47:49 ERROR spark.SparkContext: Error initializing SparkContext.&lt;BR /&gt;java.io.FileNotFoundException: File file:/home/asl/2023-03-28 23:17:30,775 WARN [TGT Renewer for asl@MY.CLOUDERA.LAB] security.UserGroupInformation (UserGroupInformation.java:run(1026)) - Exception encountered while running the renewal command for asl@MY.CLOUDERA.LAB. 
(TGT end time:1680069424000, renewalFailures: 0, renewalFailuresTotal: 1) does not exist&lt;BR /&gt;at org.apache.hadoop.fs.RawLocalFileSystem.deprecatedGetFileStatus(RawLocalFileSystem.java:755)&lt;BR /&gt;at org.apache.hadoop.fs.RawLocalFileSystem.getFileLinkStatusInternal(RawLocalFileSystem.java:1044)&lt;BR /&gt;at org.apache.hadoop.fs.RawLocalFileSystem.getFileStatus(RawLocalFileSystem.java:745)&lt;BR /&gt;at org.apache.hadoop.fs.FilterFileSystem.getFileStatus(FilterFileSystem.java:456)&lt;BR /&gt;at org.apache.spark.deploy.history.EventLogFileWriter.requireLogBaseDirAsDirectory(EventLogFileWriters.scala:76)&lt;BR /&gt;at org.apache.spark.deploy.history.SingleEventLogFileWriter.start(EventLogFileWriters.scala:220)&lt;BR /&gt;at org.apache.spark.scheduler.EventLoggingListener.start(EventLoggingListener.scala:84)&lt;BR /&gt;at org.apache.spark.SparkContext.&amp;lt;init&amp;gt;(SparkContext.scala:536)&lt;BR /&gt;at org.apache.spark.api.java.JavaSparkContext.&amp;lt;init&amp;gt;(JavaSparkContext.scala:58)&lt;BR /&gt;at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)&lt;BR /&gt;at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:62)&lt;BR /&gt;at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)&lt;BR /&gt;at java.lang.reflect.Constructor.newInstance(Constructor.java:423)&lt;BR /&gt;at py4j.reflection.MethodInvoker.invoke(MethodInvoker.java:247)&lt;BR /&gt;at py4j.reflection.ReflectionEngine.invoke(ReflectionEngine.java:357)&lt;BR /&gt;at py4j.Gateway.invoke(Gateway.java:238)&lt;BR /&gt;at py4j.commands.ConstructorCommand.invokeConstructor(ConstructorCommand.java:80)&lt;BR /&gt;at py4j.commands.ConstructorCommand.execute(ConstructorCommand.java:69)&lt;BR /&gt;at py4j.GatewayConnection.run(GatewayConnection.java:238)&lt;BR /&gt;at java.lang.Thread.run(Thread.java:748)&lt;BR /&gt;23/03/29 02:47:49 WARN cluster.YarnSchedulerBackend$YarnSchedulerEndpoint: 
Attempted to request executors before the AM has registered!&lt;BR /&gt;23/03/29 02:47:49 WARN spark.SparkContext: Another SparkContext is being constructed (or threw an exception in its constructor). This may indicate an error, since only one SparkContext may be running in this JVM (see SPARK-2243). The other SparkContext was created at:&lt;BR /&gt;org.apache.spark.api.java.JavaSparkContext.&amp;lt;init&amp;gt;(JavaSparkContext.scala:58)&lt;BR /&gt;sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)&lt;BR /&gt;sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:62)&lt;BR /&gt;sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)&lt;BR /&gt;java.lang.reflect.Constructor.newInstance(Constructor.java:423)&lt;BR /&gt;py4j.reflection.MethodInvoker.invoke(MethodInvoker.java:247)&lt;BR /&gt;py4j.reflection.ReflectionEngine.invoke(ReflectionEngine.java:357)&lt;BR /&gt;py4j.Gateway.invoke(Gateway.java:238)&lt;BR /&gt;py4j.commands.ConstructorCommand.invokeConstructor(ConstructorCommand.java:80)&lt;BR /&gt;py4j.commands.ConstructorCommand.execute(ConstructorCommand.java:69)&lt;BR /&gt;py4j.GatewayConnection.run(GatewayConnection.java:238)&lt;BR /&gt;java.lang.Thread.run(Thread.java:748)&lt;BR /&gt;23/03/29 02:47:49 WARN conf.HiveConf: HiveConf of name hive.masking.algo does not exist&lt;BR /&gt;23/03/29 02:47:54 ERROR spark.SparkContext: Error initializing SparkContext.&lt;BR /&gt;java.io.FileNotFoundException: File file:/home/asl/2023-03-28 23:17:30,775 WARN [TGT Renewer for asl@MY.CLOUDERA.LAB] security.UserGroupInformation (UserGroupInformation.java:run(1026)) - Exception encountered while running the renewal command for asl@MY.CLOUDERA.LAB. 
(TGT end time:1680069424000, renewalFailures: 0, renewalFailuresTotal: 1) does not exist&lt;BR /&gt;at org.apache.hadoop.fs.RawLocalFileSystem.deprecatedGetFileStatus(RawLocalFileSystem.java:755)&lt;BR /&gt;at org.apache.hadoop.fs.RawLocalFileSystem.getFileLinkStatusInternal(RawLocalFileSystem.java:1044)&lt;BR /&gt;at org.apache.hadoop.fs.RawLocalFileSystem.getFileStatus(RawLocalFileSystem.java:745)&lt;BR /&gt;at org.apache.hadoop.fs.FilterFileSystem.getFileStatus(FilterFileSystem.java:456)&lt;BR /&gt;at org.apache.spark.deploy.history.EventLogFileWriter.requireLogBaseDirAsDirectory(EventLogFileWriters.scala:76)&lt;BR /&gt;at org.apache.spark.deploy.history.SingleEventLogFileWriter.start(EventLogFileWriters.scala:220)&lt;BR /&gt;at org.apache.spark.scheduler.EventLoggingListener.start(EventLoggingListener.scala:84)&lt;BR /&gt;at org.apache.spark.SparkContext.&amp;lt;init&amp;gt;(SparkContext.scala:536)&lt;BR /&gt;at org.apache.spark.api.java.JavaSparkContext.&amp;lt;init&amp;gt;(JavaSparkContext.scala:58)&lt;BR /&gt;at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)&lt;BR /&gt;at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:62)&lt;BR /&gt;at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)&lt;BR /&gt;at java.lang.reflect.Constructor.newInstance(Constructor.java:423)&lt;BR /&gt;at py4j.reflection.MethodInvoker.invoke(MethodInvoker.java:247)&lt;BR /&gt;at py4j.reflection.ReflectionEngine.invoke(ReflectionEngine.java:357)&lt;BR /&gt;at py4j.Gateway.invoke(Gateway.java:238)&lt;BR /&gt;at py4j.commands.ConstructorCommand.invokeConstructor(ConstructorCommand.java:80)&lt;BR /&gt;at py4j.commands.ConstructorCommand.execute(ConstructorCommand.java:69)&lt;BR /&gt;at py4j.GatewayConnection.run(GatewayConnection.java:238)&lt;BR /&gt;at java.lang.Thread.run(Thread.java:748)&lt;BR /&gt;23/03/29 02:47:54 WARN cluster.YarnSchedulerBackend$YarnSchedulerEndpoint: 
Attempted to request executors before the AM has registered!&lt;BR /&gt;/opt/cloudera/parcels/CDH-7.1.8-1.cdh7.1.8.p0.30990532/lib/spark/python/pyspark/shell.py:45: UserWarning: Failed to initialize Spark session.&lt;BR /&gt;warnings.warn("Failed to initialize Spark session.")&lt;BR /&gt;Traceback (most recent call last):&lt;BR /&gt;File "/opt/cloudera/parcels/CDH-7.1.8-1.cdh7.1.8.p0.30990532/lib/spark/python/pyspark/shell.py", line 41, in &amp;lt;module&amp;gt;&lt;BR /&gt;spark = SparkSession._create_shell_session()&lt;BR /&gt;File "/opt/cloudera/parcels/CDH-7.1.8-1.cdh7.1.8.p0.30990532/lib/spark/python/pyspark/sql/session.py", line 583, in _create_shell_session&lt;BR /&gt;return SparkSession.builder.getOrCreate()&lt;BR /&gt;File "/opt/cloudera/parcels/CDH-7.1.8-1.cdh7.1.8.p0.30990532/lib/spark/python/pyspark/sql/session.py", line 173, in getOrCreate&lt;BR /&gt;sc = SparkContext.getOrCreate(sparkConf)&lt;BR /&gt;File "/opt/cloudera/parcels/CDH-7.1.8-1.cdh7.1.8.p0.30990532/lib/spark/python/pyspark/context.py", line 369, in getOrCreate&lt;BR /&gt;SparkContext(conf=conf or SparkConf())&lt;BR /&gt;File "/opt/cloudera/parcels/CDH-7.1.8-1.cdh7.1.8.p0.30990532/lib/spark/python/pyspark/context.py", line 136, in __init__&lt;BR /&gt;conf, jsc, profiler_cls)&lt;BR /&gt;File "/opt/cloudera/parcels/CDH-7.1.8-1.cdh7.1.8.p0.30990532/lib/spark/python/pyspark/context.py", line 198, in _do_init&lt;BR /&gt;self._jsc = jsc or self._initialize_context(self._conf._jconf)&lt;BR /&gt;File "/opt/cloudera/parcels/CDH-7.1.8-1.cdh7.1.8.p0.30990532/lib/spark/python/pyspark/context.py", line 308, in _initialize_context&lt;BR /&gt;return self._jvm.JavaSparkContext(jconf)&lt;BR /&gt;File "/opt/cloudera/parcels/CDH-7.1.8-1.cdh7.1.8.p0.30990532/lib/spark/python/lib/py4j-0.10.7-src.zip/py4j/java_gateway.py", line 1525, in __call__&lt;BR /&gt;answer, self._gateway_client, None, self._fqn)&lt;BR /&gt;File 
"/opt/cloudera/parcels/CDH-7.1.8-1.cdh7.1.8.p0.30990532/lib/spark/python/lib/py4j-0.10.7-src.zip/py4j/protocol.py", line 328, in get_return_value&lt;BR /&gt;format(target_id, ".", name), value)&lt;BR /&gt;Py4JJavaError: An error occurred while calling None.org.apache.spark.api.java.JavaSparkContext.&lt;BR /&gt;: java.io.FileNotFoundException: File file:/home/asl/2023-03-28 23:17:30,775 WARN [TGT Renewer for asl@MY.CLOUDERA.LAB] security.UserGroupInformation (UserGroupInformation.java:run(1026)) - Exception encountered while running the renewal command for asl@MY.CLOUDERA.LAB. (TGT end time:1680069424000, renewalFailures: 0, renewalFailuresTotal: 1) does not exist&lt;BR /&gt;at org.apache.hadoop.fs.RawLocalFileSystem.deprecatedGetFileStatus(RawLocalFileSystem.java:755)&lt;BR /&gt;at org.apache.hadoop.fs.RawLocalFileSystem.getFileLinkStatusInternal(RawLocalFileSystem.java:1044)&lt;BR /&gt;at org.apache.hadoop.fs.RawLocalFileSystem.getFileStatus(RawLocalFileSystem.java:745)&lt;BR /&gt;at org.apache.hadoop.fs.FilterFileSystem.getFileStatus(FilterFileSystem.java:456)&lt;BR /&gt;at org.apache.spark.deploy.history.EventLogFileWriter.requireLogBaseDirAsDirectory(EventLogFileWriters.scala:76)&lt;BR /&gt;at org.apache.spark.deploy.history.SingleEventLogFileWriter.start(EventLogFileWriters.scala:220)&lt;BR /&gt;at org.apache.spark.scheduler.EventLoggingListener.start(EventLoggingListener.scala:84)&lt;BR /&gt;at org.apache.spark.SparkContext.&amp;lt;init&amp;gt;(SparkContext.scala:536)&lt;BR /&gt;at org.apache.spark.api.java.JavaSparkContext.&amp;lt;init&amp;gt;(JavaSparkContext.scala:58)&lt;BR /&gt;at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)&lt;BR /&gt;at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:62)&lt;BR /&gt;at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)&lt;BR /&gt;at java.lang.reflect.Constructor.newInstance(Constructor.java:423)&lt;BR 
/&gt;at py4j.reflection.MethodInvoker.invoke(MethodInvoker.java:247)&lt;BR /&gt;at py4j.reflection.ReflectionEngine.invoke(ReflectionEngine.java:357)&lt;BR /&gt;at py4j.Gateway.invoke(Gateway.java:238)&lt;BR /&gt;at py4j.commands.ConstructorCommand.invokeConstructor(ConstructorCommand.java:80)&lt;BR /&gt;at py4j.commands.ConstructorCommand.execute(ConstructorCommand.java:69)&lt;BR /&gt;at py4j.GatewayConnection.run(GatewayConnection.java:238)&lt;BR /&gt;at java.lang.Thread.run(Thread.java:748)&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I can't figure out the cause of this issue. Please kindly help me out of this. Thank you.&lt;/P&gt;</description>
    <pubDate>Wed, 29 Mar 2023 06:59:42 GMT</pubDate>
    <dc:creator>BrianChan</dc:creator>
    <dc:date>2023-03-29T06:59:42Z</dc:date>
    <item>
      <title>Fail to start pyspark session</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Fail-to-start-pyspark-session/m-p/367128#M239801</link>
      <description>&lt;P&gt;Hi all, I am exploring the features in my CDP cluster.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I added the Spark service to the cluster. When I try to run pyspark in a terminal to study Spark, I get the following error:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Type "help", "copyright", "credits" or "license" for more information.&lt;BR /&gt;Warning: Ignoring non-Spark config property: hdfs&lt;BR /&gt;Warning: Ignoring non-Spark config property: ExitCodeException&lt;BR /&gt;Warning: Ignoring non-Spark config property: at&lt;BR /&gt;Setting default log level to "WARN".&lt;BR /&gt;To adjust logging level use sc.setLogLevel(newLevel). For SparkR, use setLogLevel(newLevel).&lt;BR /&gt;23/03/29 02:47:40 WARN conf.HiveConf: HiveConf of name hive.masking.algo does not exist&lt;BR /&gt;23/03/29 02:47:43 WARN conf.HiveConf: HiveConf of name hive.masking.algo does not exist&lt;BR /&gt;23/03/29 02:47:49 ERROR spark.SparkContext: Error initializing SparkContext.&lt;BR /&gt;java.io.FileNotFoundException: File file:/home/asl/2023-03-28 23:17:30,775 WARN [TGT Renewer for asl@MY.CLOUDERA.LAB] security.UserGroupInformation (UserGroupInformation.java:run(1026)) - Exception encountered while running the renewal command for asl@MY.CLOUDERA.LAB. 
(TGT end time:1680069424000, renewalFailures: 0, renewalFailuresTotal: 1) does not exist&lt;BR /&gt;at org.apache.hadoop.fs.RawLocalFileSystem.deprecatedGetFileStatus(RawLocalFileSystem.java:755)&lt;BR /&gt;at org.apache.hadoop.fs.RawLocalFileSystem.getFileLinkStatusInternal(RawLocalFileSystem.java:1044)&lt;BR /&gt;at org.apache.hadoop.fs.RawLocalFileSystem.getFileStatus(RawLocalFileSystem.java:745)&lt;BR /&gt;at org.apache.hadoop.fs.FilterFileSystem.getFileStatus(FilterFileSystem.java:456)&lt;BR /&gt;at org.apache.spark.deploy.history.EventLogFileWriter.requireLogBaseDirAsDirectory(EventLogFileWriters.scala:76)&lt;BR /&gt;at org.apache.spark.deploy.history.SingleEventLogFileWriter.start(EventLogFileWriters.scala:220)&lt;BR /&gt;at org.apache.spark.scheduler.EventLoggingListener.start(EventLoggingListener.scala:84)&lt;BR /&gt;at org.apache.spark.SparkContext.&amp;lt;init&amp;gt;(SparkContext.scala:536)&lt;BR /&gt;at org.apache.spark.api.java.JavaSparkContext.&amp;lt;init&amp;gt;(JavaSparkContext.scala:58)&lt;BR /&gt;at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)&lt;BR /&gt;at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:62)&lt;BR /&gt;at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)&lt;BR /&gt;at java.lang.reflect.Constructor.newInstance(Constructor.java:423)&lt;BR /&gt;at py4j.reflection.MethodInvoker.invoke(MethodInvoker.java:247)&lt;BR /&gt;at py4j.reflection.ReflectionEngine.invoke(ReflectionEngine.java:357)&lt;BR /&gt;at py4j.Gateway.invoke(Gateway.java:238)&lt;BR /&gt;at py4j.commands.ConstructorCommand.invokeConstructor(ConstructorCommand.java:80)&lt;BR /&gt;at py4j.commands.ConstructorCommand.execute(ConstructorCommand.java:69)&lt;BR /&gt;at py4j.GatewayConnection.run(GatewayConnection.java:238)&lt;BR /&gt;at java.lang.Thread.run(Thread.java:748)&lt;BR /&gt;23/03/29 02:47:49 WARN cluster.YarnSchedulerBackend$YarnSchedulerEndpoint: 
Attempted to request executors before the AM has registered!&lt;BR /&gt;23/03/29 02:47:49 WARN spark.SparkContext: Another SparkContext is being constructed (or threw an exception in its constructor). This may indicate an error, since only one SparkContext may be running in this JVM (see SPARK-2243). The other SparkContext was created at:&lt;BR /&gt;org.apache.spark.api.java.JavaSparkContext.&amp;lt;init&amp;gt;(JavaSparkContext.scala:58)&lt;BR /&gt;sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)&lt;BR /&gt;sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:62)&lt;BR /&gt;sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)&lt;BR /&gt;java.lang.reflect.Constructor.newInstance(Constructor.java:423)&lt;BR /&gt;py4j.reflection.MethodInvoker.invoke(MethodInvoker.java:247)&lt;BR /&gt;py4j.reflection.ReflectionEngine.invoke(ReflectionEngine.java:357)&lt;BR /&gt;py4j.Gateway.invoke(Gateway.java:238)&lt;BR /&gt;py4j.commands.ConstructorCommand.invokeConstructor(ConstructorCommand.java:80)&lt;BR /&gt;py4j.commands.ConstructorCommand.execute(ConstructorCommand.java:69)&lt;BR /&gt;py4j.GatewayConnection.run(GatewayConnection.java:238)&lt;BR /&gt;java.lang.Thread.run(Thread.java:748)&lt;BR /&gt;23/03/29 02:47:49 WARN conf.HiveConf: HiveConf of name hive.masking.algo does not exist&lt;BR /&gt;23/03/29 02:47:54 ERROR spark.SparkContext: Error initializing SparkContext.&lt;BR /&gt;java.io.FileNotFoundException: File file:/home/asl/2023-03-28 23:17:30,775 WARN [TGT Renewer for asl@MY.CLOUDERA.LAB] security.UserGroupInformation (UserGroupInformation.java:run(1026)) - Exception encountered while running the renewal command for asl@MY.CLOUDERA.LAB. 
(TGT end time:1680069424000, renewalFailures: 0, renewalFailuresTotal: 1) does not exist&lt;BR /&gt;at org.apache.hadoop.fs.RawLocalFileSystem.deprecatedGetFileStatus(RawLocalFileSystem.java:755)&lt;BR /&gt;at org.apache.hadoop.fs.RawLocalFileSystem.getFileLinkStatusInternal(RawLocalFileSystem.java:1044)&lt;BR /&gt;at org.apache.hadoop.fs.RawLocalFileSystem.getFileStatus(RawLocalFileSystem.java:745)&lt;BR /&gt;at org.apache.hadoop.fs.FilterFileSystem.getFileStatus(FilterFileSystem.java:456)&lt;BR /&gt;at org.apache.spark.deploy.history.EventLogFileWriter.requireLogBaseDirAsDirectory(EventLogFileWriters.scala:76)&lt;BR /&gt;at org.apache.spark.deploy.history.SingleEventLogFileWriter.start(EventLogFileWriters.scala:220)&lt;BR /&gt;at org.apache.spark.scheduler.EventLoggingListener.start(EventLoggingListener.scala:84)&lt;BR /&gt;at org.apache.spark.SparkContext.&amp;lt;init&amp;gt;(SparkContext.scala:536)&lt;BR /&gt;at org.apache.spark.api.java.JavaSparkContext.&amp;lt;init&amp;gt;(JavaSparkContext.scala:58)&lt;BR /&gt;at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)&lt;BR /&gt;at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:62)&lt;BR /&gt;at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)&lt;BR /&gt;at java.lang.reflect.Constructor.newInstance(Constructor.java:423)&lt;BR /&gt;at py4j.reflection.MethodInvoker.invoke(MethodInvoker.java:247)&lt;BR /&gt;at py4j.reflection.ReflectionEngine.invoke(ReflectionEngine.java:357)&lt;BR /&gt;at py4j.Gateway.invoke(Gateway.java:238)&lt;BR /&gt;at py4j.commands.ConstructorCommand.invokeConstructor(ConstructorCommand.java:80)&lt;BR /&gt;at py4j.commands.ConstructorCommand.execute(ConstructorCommand.java:69)&lt;BR /&gt;at py4j.GatewayConnection.run(GatewayConnection.java:238)&lt;BR /&gt;at java.lang.Thread.run(Thread.java:748)&lt;BR /&gt;23/03/29 02:47:54 WARN cluster.YarnSchedulerBackend$YarnSchedulerEndpoint: 
Attempted to request executors before the AM has registered!&lt;BR /&gt;/opt/cloudera/parcels/CDH-7.1.8-1.cdh7.1.8.p0.30990532/lib/spark/python/pyspark/shell.py:45: UserWarning: Failed to initialize Spark session.&lt;BR /&gt;warnings.warn("Failed to initialize Spark session.")&lt;BR /&gt;Traceback (most recent call last):&lt;BR /&gt;File "/opt/cloudera/parcels/CDH-7.1.8-1.cdh7.1.8.p0.30990532/lib/spark/python/pyspark/shell.py", line 41, in &amp;lt;module&amp;gt;&lt;BR /&gt;spark = SparkSession._create_shell_session()&lt;BR /&gt;File "/opt/cloudera/parcels/CDH-7.1.8-1.cdh7.1.8.p0.30990532/lib/spark/python/pyspark/sql/session.py", line 583, in _create_shell_session&lt;BR /&gt;return SparkSession.builder.getOrCreate()&lt;BR /&gt;File "/opt/cloudera/parcels/CDH-7.1.8-1.cdh7.1.8.p0.30990532/lib/spark/python/pyspark/sql/session.py", line 173, in getOrCreate&lt;BR /&gt;sc = SparkContext.getOrCreate(sparkConf)&lt;BR /&gt;File "/opt/cloudera/parcels/CDH-7.1.8-1.cdh7.1.8.p0.30990532/lib/spark/python/pyspark/context.py", line 369, in getOrCreate&lt;BR /&gt;SparkContext(conf=conf or SparkConf())&lt;BR /&gt;File "/opt/cloudera/parcels/CDH-7.1.8-1.cdh7.1.8.p0.30990532/lib/spark/python/pyspark/context.py", line 136, in __init__&lt;BR /&gt;conf, jsc, profiler_cls)&lt;BR /&gt;File "/opt/cloudera/parcels/CDH-7.1.8-1.cdh7.1.8.p0.30990532/lib/spark/python/pyspark/context.py", line 198, in _do_init&lt;BR /&gt;self._jsc = jsc or self._initialize_context(self._conf._jconf)&lt;BR /&gt;File "/opt/cloudera/parcels/CDH-7.1.8-1.cdh7.1.8.p0.30990532/lib/spark/python/pyspark/context.py", line 308, in _initialize_context&lt;BR /&gt;return self._jvm.JavaSparkContext(jconf)&lt;BR /&gt;File "/opt/cloudera/parcels/CDH-7.1.8-1.cdh7.1.8.p0.30990532/lib/spark/python/lib/py4j-0.10.7-src.zip/py4j/java_gateway.py", line 1525, in __call__&lt;BR /&gt;answer, self._gateway_client, None, self._fqn)&lt;BR /&gt;File 
"/opt/cloudera/parcels/CDH-7.1.8-1.cdh7.1.8.p0.30990532/lib/spark/python/lib/py4j-0.10.7-src.zip/py4j/protocol.py", line 328, in get_return_value&lt;BR /&gt;format(target_id, ".", name), value)&lt;BR /&gt;Py4JJavaError: An error occurred while calling None.org.apache.spark.api.java.JavaSparkContext.&lt;BR /&gt;: java.io.FileNotFoundException: File file:/home/asl/2023-03-28 23:17:30,775 WARN [TGT Renewer for asl@MY.CLOUDERA.LAB] security.UserGroupInformation (UserGroupInformation.java:run(1026)) - Exception encountered while running the renewal command for asl@MY.CLOUDERA.LAB. (TGT end time:1680069424000, renewalFailures: 0, renewalFailuresTotal: 1) does not exist&lt;BR /&gt;at org.apache.hadoop.fs.RawLocalFileSystem.deprecatedGetFileStatus(RawLocalFileSystem.java:755)&lt;BR /&gt;at org.apache.hadoop.fs.RawLocalFileSystem.getFileLinkStatusInternal(RawLocalFileSystem.java:1044)&lt;BR /&gt;at org.apache.hadoop.fs.RawLocalFileSystem.getFileStatus(RawLocalFileSystem.java:745)&lt;BR /&gt;at org.apache.hadoop.fs.FilterFileSystem.getFileStatus(FilterFileSystem.java:456)&lt;BR /&gt;at org.apache.spark.deploy.history.EventLogFileWriter.requireLogBaseDirAsDirectory(EventLogFileWriters.scala:76)&lt;BR /&gt;at org.apache.spark.deploy.history.SingleEventLogFileWriter.start(EventLogFileWriters.scala:220)&lt;BR /&gt;at org.apache.spark.scheduler.EventLoggingListener.start(EventLoggingListener.scala:84)&lt;BR /&gt;at org.apache.spark.SparkContext.&amp;lt;init&amp;gt;(SparkContext.scala:536)&lt;BR /&gt;at org.apache.spark.api.java.JavaSparkContext.&amp;lt;init&amp;gt;(JavaSparkContext.scala:58)&lt;BR /&gt;at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)&lt;BR /&gt;at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:62)&lt;BR /&gt;at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)&lt;BR /&gt;at java.lang.reflect.Constructor.newInstance(Constructor.java:423)&lt;BR 
/&gt;at py4j.reflection.MethodInvoker.invoke(MethodInvoker.java:247)&lt;BR /&gt;at py4j.reflection.ReflectionEngine.invoke(ReflectionEngine.java:357)&lt;BR /&gt;at py4j.Gateway.invoke(Gateway.java:238)&lt;BR /&gt;at py4j.commands.ConstructorCommand.invokeConstructor(ConstructorCommand.java:80)&lt;BR /&gt;at py4j.commands.ConstructorCommand.execute(ConstructorCommand.java:69)&lt;BR /&gt;at py4j.GatewayConnection.run(GatewayConnection.java:238)&lt;BR /&gt;at java.lang.Thread.run(Thread.java:748)&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I can't figure out the cause of this issue. Please kindly help me out of this. Thank you.&lt;/P&gt;</description>
      <pubDate>Wed, 29 Mar 2023 06:59:42 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Fail-to-start-pyspark-session/m-p/367128#M239801</guid>
      <dc:creator>BrianChan</dc:creator>
      <dc:date>2023-03-29T06:59:42Z</dc:date>
    </item>
    <item>
      <title>Re: Fail to start pyspark session</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Fail-to-start-pyspark-session/m-p/367149#M239811</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/103713"&gt;@BrianChan&lt;/a&gt;,&amp;nbsp;&lt;SPAN&gt;This is a known CM issue: the CM Agent generates a bad&amp;nbsp;&lt;/SPAN&gt;&lt;STRONG&gt;spark-defaults.conf&lt;/STRONG&gt;&lt;SPAN&gt;&amp;nbsp;after a user (or application) logs in to a host as root, runs kinit as another user (for example,&amp;nbsp;hdfs), and leaves that ticket cache around.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;To avoid the issue, do the following:&lt;/P&gt;&lt;OL&gt;&lt;LI&gt;Navigate to&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;STRONG&gt;Cloudera Manager&lt;/STRONG&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&amp;gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;STRONG&gt;Spark&lt;/STRONG&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&amp;gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;STRONG&gt;Configuration&lt;/STRONG&gt;.&lt;/LI&gt;&lt;LI&gt;Add&amp;nbsp;the following to the Spark Client Advanced Configuration Snippet (Safety Valve) for spark-conf/spark-defaults.conf.&lt;BR /&gt;Make sure the value is correct for your cluster:&lt;PRE&gt;spark.eventLog.dir=hdfs://nameserviceXYZ/user/spark/applicationHistory&lt;/PRE&gt;&lt;/LI&gt;&lt;LI&gt;Deploy the client configuration.&lt;/LI&gt;&lt;/OL&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;If this helps to resolve the issue, please accept this as a solution. Thanks.&lt;/P&gt;</description>
      <pubDate>Wed, 29 Mar 2023 11:12:57 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Fail-to-start-pyspark-session/m-p/367149#M239811</guid>
      <dc:creator>nikhilm</dc:creator>
      <dc:date>2023-03-29T11:12:57Z</dc:date>
    </item>
    <item>
      <title>Re: Fail to start pyspark session</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Fail-to-start-pyspark-session/m-p/367236#M239836</link>
      <description>&lt;P&gt;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/78238"&gt;@nikhilm&lt;/a&gt;&amp;nbsp;Thank you for your reply.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;May I know what I should enter for&amp;nbsp;nameserviceXYZ? Please give me an example if possible.&lt;/P&gt;</description>
      <pubDate>Thu, 30 Mar 2023 03:58:04 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Fail-to-start-pyspark-session/m-p/367236#M239836</guid>
      <dc:creator>BrianChan</dc:creator>
      <dc:date>2023-03-30T03:58:04Z</dc:date>
    </item>
    <item>
      <title>Re: Fail to start pyspark session</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Fail-to-start-pyspark-session/m-p/367271#M239844</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/103713"&gt;@BrianChan&lt;/a&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;If HDFS HA is enabled on your cluster, you can get the nameservice name from the hdfs-site.xml file (the dfs.nameservices property).&lt;/P&gt;&lt;P&gt;If HDFS HA is not enabled, you can simply specify the path as below:&lt;/P&gt;&lt;PRE&gt;spark.eventLog.dir=/user/spark/applicationHistory&lt;/PRE&gt;</description>
      <pubDate>Thu, 30 Mar 2023 09:22:17 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Fail-to-start-pyspark-session/m-p/367271#M239844</guid>
      <dc:creator>RangaReddy</dc:creator>
      <dc:date>2023-03-30T09:22:17Z</dc:date>
    </item>
    <item>
      <title>Re: Fail to start pyspark session</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Fail-to-start-pyspark-session/m-p/367388#M239887</link>
      <description>&lt;P&gt;Thank you&amp;nbsp;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/78612"&gt;@RangaReddy&lt;/a&gt;, I managed to solve the problem using your advice. Thank you very much.&lt;/P&gt;</description>
      <pubDate>Fri, 31 Mar 2023 02:25:28 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Fail-to-start-pyspark-session/m-p/367388#M239887</guid>
      <dc:creator>BrianChan</dc:creator>
      <dc:date>2023-03-31T02:25:28Z</dc:date>
    </item>
    <item>
      <title>Re: Fail to start pyspark session</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Fail-to-start-pyspark-session/m-p/367389#M239888</link>
      <description>&lt;P&gt;Thank you&amp;nbsp;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/78238"&gt;@nikhilm&lt;/a&gt;, your advice works.&lt;/P&gt;</description>
      <pubDate>Fri, 31 Mar 2023 02:26:29 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Fail-to-start-pyspark-session/m-p/367389#M239888</guid>
      <dc:creator>BrianChan</dc:creator>
      <dc:date>2023-03-31T02:26:29Z</dc:date>
    </item>
  </channel>
</rss>

