Member since
04-09-2017
12
Posts
0
Kudos Received
0
Solutions
01-05-2020
06:17 AM
Hi, Hope this below links helps in deciding the Configurations apart from the previous comments https://blog.cloudera.com/how-to-tune-your-apache-spark-jobs-part-2/ https://blog.cloudera.com/how-to-tune-your-apache-spark-jobs-part-1/ Thanks AKR
... View more
04-12-2017
06:39 AM
Thank you @Romil Choksi..
As i could't see any mention about RAM in FAQ and while installing sanbox we came across RAM constraint so i was just confused is minimum ram required for EXAM also?
... View more
04-12-2017
05:15 PM
No problem Anand, good luck with your exam!
... View more
04-10-2017
07:17 AM
@Anand Pawar Its kind of tricky here. You can have the header when storing in HDFS. While processing the data for analysis you should remember that file contains header and it should be skipped orelse it will cause errors. As mentioned above if you use skip header properties it will be skipped by default in hive. However the base data lying underneath the hive table will contain header which can be used for any further processing. In simple when storing it you can have header but when processing the data you should not have header. If you feel it satisfies your question then accept the answer.
... View more