- Subscribe to RSS Feed
- Mark Question as New
- Mark Question as Read
- Float this Question for Current User
- Bookmark
- Subscribe
- Mute
- Printer Friendly Page
Can we get better performance for hive queries by using SSD?
- Labels:
-
Apache Hive
Created ‎10-20-2015 11:49 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
One of my client is using Azure based IaaS for their HDP cluster. They are open to using more expensive storage to get better performance.
Is it recommended to use SSD for some of the data in hive tables, to get that boost in performance? Also what are the steps to make your temporary storage to point to SSD, that is used by Tez/MR jobs?
Created ‎10-21-2015 10:29 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
This thread has all the details needed:
Created ‎10-20-2015 03:13 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
I understood that yarn.nodemanager.local-dirs would be the setting to point to SSD to get better performance on shuffle and other temporary usage. I also would like to confirm it.
Created ‎10-21-2015 10:29 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
This thread has all the details needed:
