Member since
12-10-2015
58
Posts
24
Kudos Received
6
Solutions
My Accepted Solutions
Title | Views | Posted |
---|---|---|
1759 | 02-17-2016 04:12 AM | |
2946 | 02-03-2016 05:15 AM | |
1633 | 01-27-2016 09:13 AM | |
4133 | 01-27-2016 07:00 AM | |
2099 | 01-02-2016 03:29 PM |
12-31-2015
05:59 AM
@Artem Ervits Now i changed sqoop command like this sqoop-export --connect jdbc:oracle:thin:@ipaddress:orcl --username username -P --table EMP --columns EMPNO,ENAME,JOB,MGR --export-dir /sqooptest/export -m 1 --direct Even , No result.It behaves same as i mentioned. Current sqoop version: 15/12/31 11:18:33 INFO sqoop.Sqoop: Running Sqoop version: 1.4.4.2.1.7.0-784 see the following error once. sqoop database hanging error
... View more
12-30-2015
06:28 AM
1 Kudo
Sqoop version: 1.4.4.2.1.7.0-784 The following is the sqoop command that i used to export simple(comma separated) records . sqoop-export --connect jdbc:oracle:thin:@ipaddress:1521:orcl --username user --password password --table EMP --columns EMPNO,ENAME,JOB,MGR --export-dir /sqooptest/export -m 1 --batch The above command stucks at 95% and doesn't get completed.
... View more
Labels:
- Labels:
-
Apache Sqoop
12-28-2015
08:44 AM
Yupp...i got the solution . Map join hint will no longer be valid for some queries. Drop the hint in those cases. Hive will automatically try to convert join to map-join with config hive.auto.convert.join set to true.The only reason for the user to specify a mapjoin currently is if they want it to be converted to a bucketed-mapjoin or a sort-merge bucketed mapjoin.
Eventually, that should also go away, but that may take some time to stabilize.Me myself find out the solution. https://issues.apache.org/jira/browse/HIVE-3784
... View more
12-28-2015
06:42 AM
select t1.student_id, t2.course_name from table_one t1 join table_two t2
on (t1.course_id=t2.course_id); The above one is the basic query i am dealing with.Where t1 and t2 are less in size 160 bytes and 50 bytes. When i run the above query ,Map only join is running as usual as "hive.auto.convert.join=true" by default in 0.14. Now, I made "set hive.auto.convert.join=false;".I run the query .Then query chooses "Default or map reduce join " as i taught. Now i changed query like select /*+ MAPJOIN(t2) */ t1.student_id,t2.course_name from table_one t1 join table_two t2
on (t1.course_id=t2.course_id); i.e I am giving hint through query.But In this case also,query chooses default join only.So i have 2 queries(still auto conversion set false) 1) How to give hint through query,so that query should select map only join.(when auto conversion set false). 2)What is the configuration parameter to know size where tables can fit it.So that map only join run. or How to know the size where tables can fit with in to run map only join.
... View more
Labels:
- Labels:
-
Apache Hive
12-27-2015
04:23 PM
@Artem Ervits Thank you.Really we don't about this fix.We encountered above mentioned problem in Ambari 1.7 with HDP 2.2. I guess,HDP certification exam is provided with HDP 2.2. So i think ,we can get above mentioned problem.
... View more
12-27-2015
02:01 PM
I know some of the admin commands to check cluster detail.I am planning to write HDP practical as well as real exam.Other than admin commands ,Is there any way to check AWS ???
... View more
Labels:
12-27-2015
01:54 PM
1 Kudo
As @rich said, We need to use org.apache.hive.hcatalog.pig.HCatLoader We should not use org.apache.hcatalog.pig.HCatLoader (We have this package in so many materials like Pig Programming )
... View more
12-27-2015
06:25 AM
@Aron, Initially getsplits method splits the data based on the blocks in HDFS.But it was changed so that splitting is based on stripes of orc file. https://issues.apache.org/jira/browse/HIVE-5102 The above link provides the complete link for details and source code for OrcInputformat and getSplit Method
... View more
12-25-2015
09:19 AM
2 Kudos
Upto now, I am thinking like Normal join is simple map/reduce job where two tables are two inputs .Based on columns given in equality expression ,Key and values will be decided in map side and they are grouped when comes to reducer side. But when i read following lines, i confused a little bit about "Difference between Map side join and Normal join". In every map/reduce stage of the join, the last table in the sequence is streamed through the reducers where as the others are buffered. Therefore, it helps to reduce the memory needed in the reducer for buffering the rows for a particular value of the join key by organizing the tables such that the largest tables appear last in the sequence. e.g. in SELECT a.val, b.val, c.val FROM a JOIN b ON (a.key = b.key1) JOIN c ON (c.key = b.key1) Can any one explain above lines in normal join in details??? Once explain stream table and buffering in normal join??
... View more
Labels:
- Labels:
-
Apache Hive
12-24-2015
05:11 AM
@Artem Ervits Is StringConcat function that you defined working for you??? I am encountering following error. Caused by: java.lang.NoSuchMethodException: java.lang.String.concat()
at java.lang.Class.getMethod(Class.java:1665)
at org.apache.pig.builtin.Invoker.<init>(Invoker.java:88)
at org.apache.pig.builtin.GenericInvoker.<init>(GenericInvoker.java:90)
at org.apache.pig.builtin.InvokeForString.<init>(InvokeForString.java:40)
... 39 more I am not sure whether Pig Dynamic Invokes work for functions other than "Static".
... View more