Member since
09-24-2015
105
Posts
82
Kudos Received
9
Solutions
My Accepted Solutions
Title | Views | Posted |
---|---|---|
2119 | 04-11-2016 08:30 PM | |
1745 | 03-11-2016 04:08 PM | |
1744 | 12-21-2015 09:51 PM | |
1021 | 12-18-2015 10:43 PM | |
8627 | 12-08-2015 03:01 PM |
11-19-2015
03:20 AM
Are there any concerns with turning on HDFS snapshots on the Apps/Hive/Warehouse directory? I want to make sure the directory is recoverable if someone were to accidentally rmr it (or if anything else happened). https://hadoop.apache.org/docs/stable/hadoop-project-dist/hadoop-hdfs/HdfsSnapshots.html
... View more
Labels:
- Labels:
-
Apache Hadoop
-
Apache Hive
11-18-2015
02:37 AM
Not yet, we are upgrading to HDP 2.3.2 next week and then will retest it. I'll keep you all posted.
... View more
11-17-2015
07:02 PM
1 Kudo
@Saptak Sen Why try Spark 1.5.1 when you can try 1.5.2? Spark is sort of like a client, so you can easily run multiple versions of Spark on one cluster. Simply download Spark from their the apache website - https://spark.apache.org/downloads.html - and invoke it to run it (can configure Zeppelin to use it or just run via the CLI).
... View more
11-17-2015
05:24 PM
1 Kudo
@snukavarapu It depends on how big the files are you are loading into HDFS. If the files are very big, a bigger block size would provide higher throughput. Bigger blocks would also equate to less blocks in HDFS which would reduce the load on the namenode. You can specify blocksize for particular files too: hadoop fs -D fs.local.block.size=134217728 -put local_name remote_location
... View more
11-09-2015
02:11 AM
Hi, Are there any concerns about running all services (hdfs, hive, ambari, etc) as the same user? In this case, 'root'? Thanks,
... View more
Labels:
- Labels:
-
Apache Ambari
10-29-2015
04:16 PM
2 Kudos
Hi All, Does anyone have any steps for installing Flink on HDP? And any tutorials/examples of using Flink? Maybe with Hive ORC tables? Thanks,
... View more
Labels:
- Labels:
-
Apache Spark
10-27-2015
03:24 AM
Hi, How do you reduce the level of logging for YARN?
... View more
Labels:
- Labels:
-
Apache YARN
10-25-2015
03:15 PM
1 Kudo
Hi, Are there any Atlas tutorials or examples? I don't see any on the Hortonworks website. Thanks,
... View more
Labels:
- Labels:
-
Apache Atlas
10-24-2015
02:06 PM
1 Kudo
Hi All, While ORC and Parquet are both columnar data stores that are supported in HDP, I was wondering if there was additional guidance on when to use one over the other? Or things to consider before choosing which format to use? Thanks, Andrew
... View more
Labels:
- Labels:
-
Apache Hadoop
-
Apache Hive