<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: HDPCD Spark Certification in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/HDPCD-Spark-Certification/m-p/158702#M121093</link>
    <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/3090/qiwang.html" nodeid="3090"&gt;@Qi Wang&lt;/A&gt;  Thanks a lot..that really helps.&lt;/P&gt;</description>
    <pubDate>Tue, 01 Nov 2016 21:28:12 GMT</pubDate>
    <dc:creator>madhusudhanbabu</dc:creator>
    <dc:date>2016-11-01T21:28:12Z</dc:date>
    <item>
      <title>HDPCD Spark Certification</title>
      <link>https://community.cloudera.com/t5/Support-Questions/HDPCD-Spark-Certification/m-p/158696#M121087</link>
      <description>&lt;P&gt;1. Which version of the HDP sandbox is being used?&lt;/P&gt;&lt;P&gt;2. Which version of Spark is being used?&lt;/P&gt;&lt;P&gt;3. What IDE options are available for Python during the exam? Apart from the pyspark shell, is there an IDE such as IPython or Zeppelin? Is there any IDE with auto-suggestion from which we can submit jobs to the cluster? Please advise.&lt;/P&gt;&lt;P&gt;4. I have read in a few posts in the Hortonworks community that we may use either Spark RDDs or Spark DataFrames to accomplish the tasks. Please confirm.&lt;/P&gt;&lt;P&gt;5. What is the pass percentage on average?&lt;/P&gt;</description>
      <pubDate>Tue, 01 Nov 2016 01:12:01 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/HDPCD-Spark-Certification/m-p/158696#M121087</guid>
      <dc:creator>madhusudhanbabu</dc:creator>
      <dc:date>2016-11-01T01:12:01Z</dc:date>
    </item>
    <item>
      <title>Re: HDPCD Spark Certification</title>
      <link>https://community.cloudera.com/t5/Support-Questions/HDPCD-Spark-Certification/m-p/158697#M121088</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/3090/qiwang.html" nodeid="3090"&gt;@Qi Wang&lt;/A&gt;: Could you please help with the above queries?&lt;/P&gt;</description>
      <pubDate>Tue, 01 Nov 2016 01:23:45 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/HDPCD-Spark-Certification/m-p/158697#M121088</guid>
      <dc:creator>madhusudhanbabu</dc:creator>
      <dc:date>2016-11-01T01:23:45Z</dc:date>
    </item>
    <item>
      <title>Re: HDPCD Spark Certification</title>
      <link>https://community.cloudera.com/t5/Support-Questions/HDPCD-Spark-Certification/m-p/158698#M121089</link>
      <description>&lt;P&gt;The test environment is on AMS virtual. When I took the test, it was HDP 2.3; I am not sure which version is used now. You could use the current sandbox for practice. Spark is something later than 1.4, probably 1.5, but the material covered is all basic RDD and DataFrame usage that is not closely tied to newer versions. The test environment has no IDE. You use either gedit or vi, based on your preference, and debug with spark-shell or pyspark.&lt;/P&gt;&lt;P&gt;A couple of notes on the exam:&lt;/P&gt;&lt;P&gt;1. Know the RDD and DataFrame APIs well. Go through all the docs on the test web page.&lt;/P&gt;&lt;P&gt;2. Know how to import and export RDDs/DataFrames from/to CSV files.&lt;/P&gt;&lt;P&gt;3. There is no restriction on how you finish a task, so choose the technique you are most familiar with, either the API or Spark SQL.&lt;/P&gt;&lt;P&gt;4. The test environment is quite slow to respond, so be patient with it and leave enough time for the tasks.&lt;/P&gt;&lt;P&gt;Good luck taking the exam.&lt;/P&gt;</description>
      <pubDate>Tue, 01 Nov 2016 01:33:47 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/HDPCD-Spark-Certification/m-p/158698#M121089</guid>
      <dc:creator>qiwang</dc:creator>
      <dc:date>2016-11-01T01:33:47Z</dc:date>
    </item>
    <item>
      <title>Re: HDPCD Spark Certification</title>
      <link>https://community.cloudera.com/t5/Support-Questions/HDPCD-Spark-Certification/m-p/158699#M121090</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/3090/qiwang.html" nodeid="3090"&gt;@Qi Wang&lt;/A&gt; &lt;/P&gt;&lt;P&gt;Thank you for your prompt response. Could you please help with the queries below?&lt;/P&gt;&lt;P&gt;The current sandbox version is HDP 2.5, and the supported Spark version is 1.6.2.&lt;/P&gt;&lt;P&gt;1. In the sandbox I downloaded, only vi is available; there is no gedit. Do we need to install gedit?&lt;/P&gt;&lt;P&gt;2. I have learned that the Apache Spark documentation and the Hortonworks Spark documentation are available during the exam.&lt;/P&gt;&lt;P&gt;Apache Spark documentation: &lt;A href="https://spark.apache.org/docs"&gt;https://spark.apache.org/docs&lt;/A&gt;. Is this the right link?&lt;/P&gt;&lt;P&gt;Hortonworks Spark documentation: What is the link to the Hortonworks Spark documentation?&lt;/P&gt;&lt;P&gt;Thanks in advance.&lt;/P&gt;</description>
      <pubDate>Tue, 01 Nov 2016 02:32:48 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/HDPCD-Spark-Certification/m-p/158699#M121090</guid>
      <dc:creator>madhusudhanbabu</dc:creator>
      <dc:date>2016-11-01T02:32:48Z</dc:date>
    </item>
    <item>
      <title>Re: HDPCD Spark Certification</title>
      <link>https://community.cloudera.com/t5/Support-Questions/HDPCD-Spark-Certification/m-p/158700#M121091</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/3090/qiwang.html" nodeid="3090"&gt;@Qi Wang&lt;/A&gt; &lt;/P&gt;&lt;P&gt;3. Is there any way to enable IntelliSense/auto-completion for Spark in Python in the HDP environment, either in vi/gedit or in the pyspark shell?&lt;/P&gt;&lt;P&gt;Thanks in advance.&lt;/P&gt;</description>
      <pubDate>Tue, 01 Nov 2016 02:46:23 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/HDPCD-Spark-Certification/m-p/158700#M121091</guid>
      <dc:creator>madhusudhanbabu</dc:creator>
      <dc:date>2016-11-01T02:46:23Z</dc:date>
    </item>
    <item>
      <title>Re: HDPCD Spark Certification</title>
      <link>https://community.cloudera.com/t5/Support-Questions/HDPCD-Spark-Certification/m-p/158701#M121092</link>
      <description>&lt;P&gt;The exam is set up on Ubuntu, with a CentOS VM running HDP. gedit is on the Ubuntu side. I guess you could install gedit in your own environment, but it is very easy to use, so no need to worry. If you really want to try the test environment, try the HDPCD practice exam, which is very similar.&lt;/P&gt;&lt;P&gt;&lt;A href="http://hortonworks.com/wp-content/uploads/2015/02/HDPCD-PracticeExamGuide1.pdf"&gt;http://hortonworks.com/wp-content/uploads/2015/02/HDPCD-PracticeExamGuide1.pdf&lt;/A&gt;&lt;/P&gt;&lt;P&gt;The documentation will be accessible during the exam. For Apache, it is the link you gave: &lt;A href="http://spark.apache.org/docs"&gt;http://spark.apache.org/docs&lt;/A&gt;. The Hortonworks documentation is under &lt;A href="http://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.5.0/index.html"&gt;http://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.5.0/index.html&lt;/A&gt;. Stick with the Apache documentation, as the exam does not really cover anything Hortonworks-specific.&lt;/P&gt;&lt;P&gt;There is no way to change the exam environment. You have very limited permissions.&lt;/P&gt;</description>
      <pubDate>Tue, 01 Nov 2016 08:42:49 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/HDPCD-Spark-Certification/m-p/158701#M121092</guid>
      <dc:creator>qiwang</dc:creator>
      <dc:date>2016-11-01T08:42:49Z</dc:date>
    </item>
    <item>
      <title>Re: HDPCD Spark Certification</title>
      <link>https://community.cloudera.com/t5/Support-Questions/HDPCD-Spark-Certification/m-p/158702#M121093</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/3090/qiwang.html" nodeid="3090"&gt;@Qi Wang&lt;/A&gt;  Thanks a lot..that really helps.&lt;/P&gt;</description>
      <pubDate>Tue, 01 Nov 2016 21:28:12 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/HDPCD-Spark-Certification/m-p/158702#M121093</guid>
      <dc:creator>madhusudhanbabu</dc:creator>
      <dc:date>2016-11-01T21:28:12Z</dc:date>
    </item>
    <item>
      <title>Re: HDPCD Spark Certification</title>
      <link>https://community.cloudera.com/t5/Support-Questions/HDPCD-Spark-Certification/m-p/158703#M121094</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/3090/qiwang.html" nodeid="3090"&gt;@Qi Wang&lt;/A&gt; &lt;/P&gt;&lt;P&gt;In Spark 1.6.*, RDDs and DataFrames have functions to write only to the formats below:&lt;/P&gt;&lt;P&gt;rdd.saveAsTextFile / saveAsSequenceFile&lt;/P&gt;&lt;P&gt;df.write.orc / json / parquet / text / saveAsTable&lt;/P&gt;&lt;P&gt;Query: I am sure we cannot download other CSV packages (e.g. Databricks) during the test. Is there any way to write the output file in CSV format? Please advise.&lt;/P&gt;&lt;P&gt;Thanks in advance.&lt;/P&gt;</description>
      <pubDate>Thu, 03 Nov 2016 23:52:25 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/HDPCD-Spark-Certification/m-p/158703#M121094</guid>
      <dc:creator>madhusudhanbabu</dc:creator>
      <dc:date>2016-11-03T23:52:25Z</dc:date>
    </item>
    <item>
      <title>Re: HDPCD Spark Certification</title>
      <link>https://community.cloudera.com/t5/Support-Questions/HDPCD-Spark-Certification/m-p/158704#M121095</link>
      <description>&lt;P&gt;Yes, there is. See my answer on another thread:&lt;/P&gt;&lt;P&gt;&lt;A href="https://community.hortonworks.com/questions/46772/how-to-save-dataframe-as-text-file.html#answer-46773" target="_blank"&gt;https://community.hortonworks.com/questions/46772/how-to-save-dataframe-as-text-file.html#answer-46773&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 03 Nov 2016 23:58:59 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/HDPCD-Spark-Certification/m-p/158704#M121095</guid>
      <dc:creator>qiwang</dc:creator>
      <dc:date>2016-11-03T23:58:59Z</dc:date>
    </item>
    <item>
      <title>Re: HDPCD Spark Certification</title>
      <link>https://community.cloudera.com/t5/Support-Questions/HDPCD-Spark-Certification/m-p/158705#M121096</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/3090/qiwang.html" nodeid="3090"&gt;@Qi Wang&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Is it safe to assume that the Databricks package will be available during the test to read and write CSV files?&lt;/P&gt;&lt;OL&gt;
&lt;LI&gt;pyspark --packages com.databricks:spark-csv_2.10:1.4.0&lt;/LI&gt;&lt;LI&gt;df.write.format("com.databricks.spark.csv").option("header","true").save("file.csv")&lt;/LI&gt;&lt;/OL&gt;</description>
      <pubDate>Sun, 06 Nov 2016 11:49:58 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/HDPCD-Spark-Certification/m-p/158705#M121096</guid>
      <dc:creator>madhusudhanbabu</dc:creator>
      <dc:date>2016-11-06T11:49:58Z</dc:date>
    </item>
    <item>
      <title>Re: HDPCD Spark Certification</title>
      <link>https://community.cloudera.com/t5/Support-Questions/HDPCD-Spark-Certification/m-p/158706#M121097</link>
      <description>&lt;P&gt;I am not sure about that. I did not use that library.&lt;/P&gt;</description>
      <pubDate>Sun, 06 Nov 2016 23:03:09 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/HDPCD-Spark-Certification/m-p/158706#M121097</guid>
      <dc:creator>qiwang</dc:creator>
      <dc:date>2016-11-06T23:03:09Z</dc:date>
    </item>
    <item>
      <title>Re: HDPCD Spark Certification</title>
      <link>https://community.cloudera.com/t5/Support-Questions/HDPCD-Spark-Certification/m-p/158707#M121098</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/3090/qiwang.html" nodeid="3090"&gt;@Qi Wang&lt;/A&gt; &lt;/P&gt;&lt;P&gt;Thank you. Apologies for repeating the same question.&lt;/P&gt;&lt;P&gt;Which library did you use to write to a CSV file? I am planning to take the exam in Python. Please advise.&lt;/P&gt;</description>
      <pubDate>Mon, 07 Nov 2016 06:08:43 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/HDPCD-Spark-Certification/m-p/158707#M121098</guid>
      <dc:creator>madhusudhanbabu</dc:creator>
      <dc:date>2016-11-07T06:08:43Z</dc:date>
    </item>
    <item>
      <title>Re: HDPCD Spark Certification</title>
      <link>https://community.cloudera.com/t5/Support-Questions/HDPCD-Spark-Certification/m-p/158708#M121099</link>
      <description>&lt;P&gt;I wouldn't assume the package is available. Better to find a way to do that in Python.&lt;/P&gt;</description>
      <pubDate>Thu, 10 Nov 2016 08:42:03 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/HDPCD-Spark-Certification/m-p/158708#M121099</guid>
      <dc:creator>qiwang</dc:creator>
      <dc:date>2016-11-10T08:42:03Z</dc:date>
    </item>
    <item>
      <title>Re: HDPCD Spark Certification</title>
      <link>https://community.cloudera.com/t5/Support-Questions/HDPCD-Spark-Certification/m-p/158709#M121100</link>
      <description>&lt;P&gt;Hi Wang,&lt;/P&gt;&lt;P&gt;Thanks for all the tips you shared. They are really helpful.&lt;/P&gt;&lt;P&gt;Could you please tell me how many tasks are provided and how many we have to complete to clear the test?&lt;/P&gt;&lt;P&gt;Also, it would be a great help if you could share the questions asked, if you remember them.&lt;/P&gt;&lt;P&gt;As there is no practice test available for HDPCD-Spark, I don't have any clue about the pattern of tasks.&lt;/P&gt;&lt;P&gt;Thank you&lt;/P&gt;&lt;P&gt;Himansu&lt;/P&gt;</description>
      <pubDate>Thu, 30 Mar 2017 18:42:57 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/HDPCD-Spark-Certification/m-p/158709#M121100</guid>
      <dc:creator>hmsvigle</dc:creator>
      <dc:date>2017-03-30T18:42:57Z</dc:date>
    </item>
    <item>
      <title>Re: HDPCD Spark Certification</title>
      <link>https://community.cloudera.com/t5/Support-Questions/HDPCD-Spark-Certification/m-p/158710#M121101</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/3090/qiwang.html" nodeid="3090"&gt;@Qi Wang&lt;/A&gt; &lt;/P&gt;&lt;P&gt;Hello Sir,&lt;/P&gt;&lt;P&gt;Thanks for your input.&lt;/P&gt;&lt;P&gt;Just one question about the exam pattern: please let us know how many questions are asked in the exam and what the passing criteria are.&lt;/P&gt;&lt;P&gt;Thank you.&lt;/P&gt;</description>
      <pubDate>Fri, 22 Dec 2017 23:17:17 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/HDPCD-Spark-Certification/m-p/158710#M121101</guid>
      <dc:creator>himanishkopalle</dc:creator>
      <dc:date>2017-12-22T23:17:17Z</dc:date>
    </item>
  </channel>
</rss>

