<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Spark job in cdp 7.2.18 RangerRaz not generating logs in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Spark-job-in-cdp-7-2-18-RangerRaz-not-generating-logs/m-p/406181#M252522</link>
    <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/102515"&gt;@AyazHussain&lt;/a&gt;&amp;nbsp;,&lt;BR /&gt;&lt;BR /&gt;I have updated/added the suggested changes in Ranger policies still the issue is occuring.&lt;BR /&gt;Please find the attched screenshots for reference.&lt;BR /&gt;&lt;BR /&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="image.png" style="width: 999px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/44993iDE19BF91A27BCF75/image-size/large?v=v2&amp;amp;px=999" role="button" title="image.png" alt="image.png" /&gt;&lt;/span&gt;&lt;BR /&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="image.png" style="width: 999px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/44994iBE47B700346388B4/image-size/large?v=v2&amp;amp;px=999" role="button" title="image.png" alt="image.png" /&gt;&lt;/span&gt;&lt;BR /&gt;&lt;BR /&gt;Regards,&lt;BR /&gt;Sushant&lt;/P&gt;</description>
    <pubDate>Mon, 14 Apr 2025 13:44:41 GMT</pubDate>
    <dc:creator>Jaguar</dc:creator>
    <dc:date>2025-04-14T13:44:41Z</dc:date>
    <item>
      <title>Spark job in cdp 7.2.18 RangerRaz not generating logs</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Spark-job-in-cdp-7-2-18-RangerRaz-not-generating-logs/m-p/405162#M252422</link>
      <description>&lt;P&gt;in CDP Public Cloud 7218 RangerRaz Cluster.Below spark job is not generating logs though it Run Successfully.&lt;/P&gt;&lt;P&gt;[cloudbreak@ajay7218razdh-master0 ~]$ spark3-submit \&lt;BR /&gt;&amp;gt; --master yarn \&lt;BR /&gt;&amp;gt; --deploy-mode cluster \&lt;BR /&gt;&amp;gt; --conf "spark.hadoop.hadoop.security.authentication=KERBEROS" \&lt;BR /&gt;&amp;gt; --conf "spark.hadoop.hadoop.security.authorization=true" \&lt;BR /&gt;&amp;gt; --conf "spark.hadoop.fs.s3a.delegation.token.binding=org.apache.knox.gateway.cloud.idbroker.s3a.IDBDelegationTokenBinding" \&lt;BR /&gt;&amp;gt; --conf "spark.hadoop.fs.s3a.idb.auth.token.enabled=true" \&lt;BR /&gt;&amp;gt; --conf "spark.hadoop.fs.s3a.aws.credentials.provider=org.apache.hadoop.fs.s3a.auth.IAMInstanceCredentialsProvider" \&lt;BR /&gt;&amp;gt; --conf "spark.hadoop.fs.s3a.security.credential.provider.path=jceks://hdfs/user/infa/knox_credentials.jceks" \&lt;BR /&gt;&amp;gt; --conf "spark.hadoop.fs.s3a.endpoint=s3.amazonaws.com" \&lt;BR /&gt;&amp;gt; --conf "spark.hadoop.fs.s3a.impl=org.apache.hadoop.fs.s3a.S3AFileSystem" \&lt;BR /&gt;&amp;gt; --conf "spark.driver.extraJavaOptions=-Djavax.net.debug=ssl:handshake" \&lt;BR /&gt;&amp;gt; --conf "spark.executor.extraJavaOptions=-Djavax.net.debug=ssl:handshake" \&lt;BR /&gt;&amp;gt; --class org.apache.spark.examples.SparkPi \&lt;BR /&gt;&amp;gt; /opt/cloudera/parcels/CDH-7.2.18-1.cdh7.2.18.p0.51297892/jars/spark-examples_2.12-3.4.1.7.2.18.0-641.jar 100&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 01 Apr 2025 05:08:15 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Spark-job-in-cdp-7-2-18-RangerRaz-not-generating-logs/m-p/405162#M252422</guid>
      <dc:creator>satvaddi</dc:creator>
      <dc:date>2025-04-01T05:08:15Z</dc:date>
    </item>
    <item>
      <title>Re: Spark job in cdp 7.2.18 RangerRaz not generating logs</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Spark-job-in-cdp-7-2-18-RangerRaz-not-generating-logs/m-p/405356#M252463</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/99032"&gt;@satvaddi&lt;/a&gt;&amp;nbsp;,&lt;BR /&gt;&lt;BR /&gt;Please follow the below actions to setup the policies in RAZ for Spark. Spark doesnt have any plugin of its own so the data accessed on S3 will be logged. Other than that the table metadata will be logged from HMS.&lt;/P&gt;&lt;P class="p"&gt;Running the&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;create external table&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;[***table definition***]&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;location ‘s3a://bucket/data/logs/tabledata’&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;command in Hive requires the following Ranger policies:&lt;/P&gt;&lt;UL class="ul"&gt;&lt;LI&gt;An S3 policy in the&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN class="ph filepath"&gt;cm_s3&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;repo on&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN class="ph filepath"&gt;s3a://bucket/data/logs/tabledata&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;for&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;EM&gt;hive&lt;/EM&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;user to perform recursive read/write.&lt;/LI&gt;&lt;LI&gt;An S3 policy in the&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN class="ph filepath"&gt;cm_s3&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;repo on&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN class="ph filepath"&gt;s3a://bucket/data/logs/tabledata&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;for the end user.&lt;/LI&gt;&lt;LI&gt;A Hive URL authorization policy in the Hadoop SQL repo on&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN class="ph filepath"&gt;s3a://bucket/data/logs/tabledata&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;for the end user.&lt;/LI&gt;&lt;/UL&gt;&lt;P class="p"&gt;Access to the same external table location using Spark shell requires an S3 policy (Ranger policy) in the&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN class="ph filepath"&gt;cm_s3&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;repo on&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN class="ph filepath"&gt;s3a://bucket/data/logs/tabledata&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;for the end user.&lt;/P&gt;</description>
      <pubDate>Thu, 03 Apr 2025 05:07:25 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Spark-job-in-cdp-7-2-18-RangerRaz-not-generating-logs/m-p/405356#M252463</guid>
      <dc:creator>AyazHussain</dc:creator>
      <dc:date>2025-04-03T05:07:25Z</dc:date>
    </item>
    <item>
      <title>Re: Spark job in cdp 7.2.18 RangerRaz not generating logs</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Spark-job-in-cdp-7-2-18-RangerRaz-not-generating-logs/m-p/406181#M252522</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/102515"&gt;@AyazHussain&lt;/a&gt;&amp;nbsp;,&lt;BR /&gt;&lt;BR /&gt;I have updated/added the suggested changes in Ranger policies still the issue is occuring.&lt;BR /&gt;Please find the attched screenshots for reference.&lt;BR /&gt;&lt;BR /&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="image.png" style="width: 999px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/44993iDE19BF91A27BCF75/image-size/large?v=v2&amp;amp;px=999" role="button" title="image.png" alt="image.png" /&gt;&lt;/span&gt;&lt;BR /&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="image.png" style="width: 999px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/44994iBE47B700346388B4/image-size/large?v=v2&amp;amp;px=999" role="button" title="image.png" alt="image.png" /&gt;&lt;/span&gt;&lt;BR /&gt;&lt;BR /&gt;Regards,&lt;BR /&gt;Sushant&lt;/P&gt;</description>
      <pubDate>Mon, 14 Apr 2025 13:44:41 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Spark-job-in-cdp-7-2-18-RangerRaz-not-generating-logs/m-p/406181#M252522</guid>
      <dc:creator>Jaguar</dc:creator>
      <dc:date>2025-04-14T13:44:41Z</dc:date>
    </item>
    <item>
      <title>Re: Spark job in cdp 7.2.18 RangerRaz not generating logs</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Spark-job-in-cdp-7-2-18-RangerRaz-not-generating-logs/m-p/406296#M252527</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/112085"&gt;@Jaguar&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;&lt;P&gt;Can you please get the RM logs and grep with Ranger in RM and check that.&lt;BR /&gt;Do you have the cm_yarn service plugin setup in Ranger?&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 15 Apr 2025 08:25:07 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Spark-job-in-cdp-7-2-18-RangerRaz-not-generating-logs/m-p/406296#M252527</guid>
      <dc:creator>AyazHussain</dc:creator>
      <dc:date>2025-04-15T08:25:07Z</dc:date>
    </item>
    <item>
      <title>Re: Spark job in cdp 7.2.18 RangerRaz not generating logs</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Spark-job-in-cdp-7-2-18-RangerRaz-not-generating-logs/m-p/406361#M252531</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/99032"&gt;@satvaddi&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;&lt;P&gt;If you are running in a Ranger RAZ enabled environment you don't need all these settings:&lt;BR /&gt;&lt;SPAN&gt;&amp;gt; --conf "spark.hadoop.hadoop.security.authentication=KERBEROS" \&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;gt; --conf "spark.hadoop.hadoop.security.authorization=true" \&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;gt; --conf "spark.hadoop.fs.s3a.delegation.token.binding=org.apache.knox.gateway.cloud.idbroker.s3a.IDBDelegationTokenBinding" \&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;gt; --conf "spark.hadoop.fs.s3a.idb.auth.token.enabled=true" \&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;gt; --conf "spark.hadoop.fs.s3a.aws.credentials.provider=org.apache.hadoop.fs.s3a.auth.IAMInstanceCredentialsProvider" \&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;gt; --conf "spark.hadoop.fs.s3a.security.credential.provider.path=jceks://hdfs/user/infa/knox_credentials.jceks" \&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;gt; --conf "spark.hadoop.fs.s3a.endpoint=s3.amazonaws.com" \&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;&amp;gt; --conf "spark.hadoop.fs.s3a.impl=org.apache.hadoop.fs.s3a.S3AFileSystem" \&lt;/SPAN&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;To me it looks like you are bypassing Raz by setting this parameter:&lt;BR /&gt;&lt;SPAN&gt;&amp;gt; --conf "spark.hadoop.fs.s3a.aws.credentials.provider=org.apache.hadoop.fs.s3a.auth.IAMInstanceCredentialsProvider" \&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;This, I would check whether the instance profile (IAM Role attached to the cluster) does not have too much privileges. Like access to data. This should be controlled in Ranger instead.&lt;/P&gt;</description>
      <pubDate>Tue, 15 Apr 2025 18:15:08 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Spark-job-in-cdp-7-2-18-RangerRaz-not-generating-logs/m-p/406361#M252531</guid>
      <dc:creator>DanielR</dc:creator>
      <dc:date>2025-04-15T18:15:08Z</dc:date>
    </item>
  </channel>
</rss>

