<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Is there any chance to use Spark 3 on CDH 6.x cluster? in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Is-there-any-chance-to-use-Spark-3-on-CDH-6-x-cluster/m-p/372358#M241225</link>
    <description>&lt;P&gt;I've successfully setup Spark 3.3.0 on CDH 6.2 (we used YARN). Here are some important step&lt;/P&gt;&lt;P&gt;1. Back up the current spark come from Cloudera package (v2.4.0 I think) at&amp;nbsp;/opt/cloudera/parcels/CDH-6.2.0-1.cdh6.2.0.p0.967373/lib/spark&lt;/P&gt;&lt;P&gt;2. Download the spark version from Spark homepage, for ex "spark-3.3.0-bin-hadoop3.tgz". Extract, delete old spark folder and replace with new spark folder (rename it to "spark") at&amp;nbsp;/opt/cloudera/parcels/CDH-6.2.0-1.cdh6.2.0.p0.967373/lib/spark&lt;/P&gt;&lt;P&gt;3. Copy all the config files from old spark conf folder to the new spark conf folder&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="quangbilly79_0-1686292386444.png" style="width: 400px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/37742i82CA48F8D83B6098/image-size/medium?v=v2&amp;amp;px=400" role="button" title="quangbilly79_0-1686292386444.png" alt="quangbilly79_0-1686292386444.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;4. Copy the Yarn-related config file into spark conf folder too&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="quangbilly79_1-1686292423641.png" style="width: 400px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/37743i9B1217AAEF91D5C8/image-size/medium?v=v2&amp;amp;px=400" role="button" title="quangbilly79_1-1686292423641.png" alt="quangbilly79_1-1686292423641.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;4.1. Copy file &lt;STRONG&gt;spark-3.3.0-yarn-shuffle.jar&amp;nbsp;&lt;/STRONG&gt;from spark/yarn to spark/jars folder&lt;/P&gt;&lt;P&gt;5. Make some modifications to spark-default.conf file, mostly disable log and point to the right jar folder&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="quangbilly79_2-1686292516879.png" style="width: 400px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/37744iDA2718BC08D2FCEF/image-size/medium?v=v2&amp;amp;px=400" role="button" title="quangbilly79_2-1686292516879.png" alt="quangbilly79_2-1686292516879.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;6. Modify some yarn config like below (yarn-site.xml)&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="quangbilly79_3-1686292576948.png" style="width: 400px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/37745i580DF66A6556377C/image-size/medium?v=v2&amp;amp;px=400" role="button" title="quangbilly79_3-1686292576948.png" alt="quangbilly79_3-1686292576948.png" /&gt;&lt;/span&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="quangbilly79_4-1686292594363.png" style="width: 400px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/37746i630FF60CB47C75E5/image-size/medium?v=v2&amp;amp;px=400" role="button" title="quangbilly79_4-1686292594363.png" alt="quangbilly79_4-1686292594363.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;7. Restart the cluster and run spark-shell command. Run some queries for testing. You could modify the yarn-site.xml file in the spark conf folder directly to make sure.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Fri, 09 Jun 2023 06:41:14 GMT</pubDate>
    <dc:creator>quangbilly79</dc:creator>
    <dc:date>2023-06-09T06:41:14Z</dc:date>
    <item>
      <title>Is there any chance to use Spark 3 on CDH 6.x cluster?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Is-there-any-chance-to-use-Spark-3-on-CDH-6-x-cluster/m-p/372112#M241156</link>
      <description>&lt;P&gt;My manager forces me to find a way to install and use Spark 3 on CDH 6.x cluster. Is there any change?&lt;/P&gt;&lt;P&gt;When I did some research, I found out that only CDP 7. supports Spark 3, and CDH 6.x only support Spark 2. But my manager said that you don't need to install Spark through Cloudera Manager, you can install Spark 3 separately (by downloading a tar from the internet or sth like that) and then find a way to make that Spark service connect with Cloudera service like Hive, HDFS,... (by copying the hive-site, hdfs-site,... to spark conf folder maybe?)&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;So does anyone have any experience with this? My manager is insane!!!!&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 05 Jun 2023 09:05:35 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Is-there-any-chance-to-use-Spark-3-on-CDH-6-x-cluster/m-p/372112#M241156</guid>
      <dc:creator>quangbilly79</dc:creator>
      <dc:date>2023-06-05T09:05:35Z</dc:date>
    </item>
    <item>
      <title>Re: Is there any chance to use Spark 3 on CDH 6.x cluster?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Is-there-any-chance-to-use-Spark-3-on-CDH-6-x-cluster/m-p/372358#M241225</link>
      <description>&lt;P&gt;I've successfully setup Spark 3.3.0 on CDH 6.2 (we used YARN). Here are some important step&lt;/P&gt;&lt;P&gt;1. Back up the current spark come from Cloudera package (v2.4.0 I think) at&amp;nbsp;/opt/cloudera/parcels/CDH-6.2.0-1.cdh6.2.0.p0.967373/lib/spark&lt;/P&gt;&lt;P&gt;2. Download the spark version from Spark homepage, for ex "spark-3.3.0-bin-hadoop3.tgz". Extract, delete old spark folder and replace with new spark folder (rename it to "spark") at&amp;nbsp;/opt/cloudera/parcels/CDH-6.2.0-1.cdh6.2.0.p0.967373/lib/spark&lt;/P&gt;&lt;P&gt;3. Copy all the config files from old spark conf folder to the new spark conf folder&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="quangbilly79_0-1686292386444.png" style="width: 400px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/37742i82CA48F8D83B6098/image-size/medium?v=v2&amp;amp;px=400" role="button" title="quangbilly79_0-1686292386444.png" alt="quangbilly79_0-1686292386444.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;4. Copy the Yarn-related config file into spark conf folder too&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="quangbilly79_1-1686292423641.png" style="width: 400px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/37743i9B1217AAEF91D5C8/image-size/medium?v=v2&amp;amp;px=400" role="button" title="quangbilly79_1-1686292423641.png" alt="quangbilly79_1-1686292423641.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;4.1. Copy file &lt;STRONG&gt;spark-3.3.0-yarn-shuffle.jar&amp;nbsp;&lt;/STRONG&gt;from spark/yarn to spark/jars folder&lt;/P&gt;&lt;P&gt;5. Make some modifications to spark-default.conf file, mostly disable log and point to the right jar folder&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="quangbilly79_2-1686292516879.png" style="width: 400px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/37744iDA2718BC08D2FCEF/image-size/medium?v=v2&amp;amp;px=400" role="button" title="quangbilly79_2-1686292516879.png" alt="quangbilly79_2-1686292516879.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;6. Modify some yarn config like below (yarn-site.xml)&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="quangbilly79_3-1686292576948.png" style="width: 400px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/37745i580DF66A6556377C/image-size/medium?v=v2&amp;amp;px=400" role="button" title="quangbilly79_3-1686292576948.png" alt="quangbilly79_3-1686292576948.png" /&gt;&lt;/span&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="quangbilly79_4-1686292594363.png" style="width: 400px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/37746i630FF60CB47C75E5/image-size/medium?v=v2&amp;amp;px=400" role="button" title="quangbilly79_4-1686292594363.png" alt="quangbilly79_4-1686292594363.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;7. Restart the cluster and run spark-shell command. Run some queries for testing. You could modify the yarn-site.xml file in the spark conf folder directly to make sure.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 09 Jun 2023 06:41:14 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Is-there-any-chance-to-use-Spark-3-on-CDH-6-x-cluster/m-p/372358#M241225</guid>
      <dc:creator>quangbilly79</dc:creator>
      <dc:date>2023-06-09T06:41:14Z</dc:date>
    </item>
  </channel>
</rss>

