Member since 01-25-2016
345 Posts
86 Kudos Received
25 Solutions
My Accepted Solutions
Title | Views | Posted
---|---|---
 | 5007 | 10-20-2017 06:39 PM
 | 3536 | 03-30-2017 06:03 AM
 | 2591 | 02-16-2017 04:55 PM
 | 16101 | 02-01-2017 04:38 PM
 | 1146 | 01-24-2017 08:36 PM
08-10-2016
03:29 PM
@thejas I tested with an insert script, so the test basically measured writes.
08-10-2016
02:19 PM
Sample Hive script:
CREATE EXTERNAL TABLE test.temp3 (
  cat_0 bigint,
  cat_1 bigint,
  cat_2 bigint,
  cat_3 bigint,
  cat_4 bigint,
  cat_5 bigint,
  cat_6 bigint,
  cat_7 bigint,
  cat_8 bigint,
  cat_9 bigint
)
row format delimited fields terminated by ','
stored as ORC
location '/test/'
tblproperties ("orc.compress"="ZLIB");
08-10-2016
02:13 PM
1 Kudo
@subhash parise Here is the link for more information: https://community.hortonworks.com/content/kbentry/49252/performance-comparison-bw-orc-snappy-and-zlib-in-h.html
08-09-2016
04:21 AM
@saswati sahu Just FYI: ORC has a built-in index, so you do not need to create a separate index if you are using ORC tables.
08-05-2016
11:40 PM
1 Kudo
Test conducted on:
1) HDP 2.3.4
2) Data size: 1.4 GB
3) The cluster was idle and not running any other jobs.
Conclusion: ZLIB compresses more than SNAPPY, but SNAPPY jobs complete quicker than ZLIB jobs.
Results (charts): Execution Time in Sec; Disk Usage Comparison.
08-03-2016
06:37 AM
Basically, you increased your YARN memory from 32 GB to 64 GB, which means you increased the memory available to all containers. A container is the unit in which YARN allocates CPU and RAM to submitted jobs. You increased the YARN container size, but what about the Tez container size?
--> Ideally, the Tez container size should be a multiple of the YARN minimum container size (yarn.scheduler.minimum-allocation-mb).
--> Ideally, we can allocate two containers per disk and per CPU core.
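The rule of thumb above can be sketched as a small calculation. This is only a minimal Python sketch under stated assumptions, not the official HDP sizing script: the function name `estimate_containers` and the node specs (16 cores, 8 disks, 2048 MB minimum container) are hypothetical, and it uses the "two containers per disk and per CPU" rule from this post, while the HDP guide may use slightly different factors.

```python
def estimate_containers(cores, disks, ram_for_yarn_mb, min_container_mb):
    """Rule-of-thumb container count and size for a single node.

    Assumption (from the post): at most two containers per CPU core
    and two per disk, and never more than the RAM can hold.
    """
    by_ram = ram_for_yarn_mb // min_container_mb
    containers = max(1, min(2 * cores, 2 * disks, by_ram))
    # Each container gets an equal share, but never less than the minimum.
    ram_per_container_mb = max(min_container_mb, ram_for_yarn_mb // containers)
    return containers, ram_per_container_mb


# Hypothetical node: 16 cores, 8 disks, 2048 MB minimum container size.
print(estimate_containers(16, 8, 32 * 1024, 2048))  # (16, 2048)
print(estimate_containers(16, 8, 64 * 1024, 2048))  # (16, 4096)
```

On this hypothetical node, doubling YARN memory from 32 GB to 64 GB does not add containers, because the disk count is the limit. It doubles the size of each container instead, so the Tez container size would need to be revisited as well.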
08-03-2016
06:22 AM
@Arunkumar Dhanakumar Here is the link for calculating YARN memory: https://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.3.0/bk_installing_manually_book/content/determine-hdp-memory-config.html
1) How many data nodes do you have?
2) How many disks do you have in each data node?
3) Did you install HBase?
4) How many cores do you have on each data node?
5) What is the RAM size on each data node?
08-02-2016
10:47 PM
2 Kudos
@kavitha velaga
hive.server2.enable.doAs
= true --> Hive scripts run as the end user instead of the hive user.
= false --> All jobs run as the hive user.
hive.server2.enable.impersonation --> This is an old property and is no longer in use.
For more info on Hive impersonation: http://hortonworks.com/blog/best-practices-for-hive-authorization-using-apache-ranger-in-hdp-2-2/
07-28-2016
04:12 AM
@ccasano
As per the HDP documentation, Hue requires RHEL 6. Here is the link: https://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.4.0/bk_installing_manually_book/content/before_you_begin.html
07-21-2016
09:46 PM
Can you run the command below and paste the output?
show partitions <table name>;