About bsuresh

bsuresh · ‎12-31-2015

@Artem Ervits I have now changed the sqoop command to this:
sqoop-export --connect jdbc:oracle:thin:@ipaddress:orcl --username username -P --table EMP --columns EMPNO,ENAME,JOB,MGR --export-dir /sqooptest/export -m 1 --direct
Even so, there is no change. It behaves the same as I described before. Current Sqoop version:
15/12/31 11:18:33 INFO sqoop.Sqoop: Running Sqoop version: 1.4.4.2.1.7.0-784
Please see the following error. [screenshot: sqoop database hanging error]

bsuresh · ‎12-30-2015

Sqoop version: 1.4.4.2.1.7.0-784
The following is the sqoop command that I used to export simple (comma-separated) records:
sqoop-export --connect jdbc:oracle:thin:@ipaddress:1521:orcl --username user --password password --table EMP --columns EMPNO,ENAME,JOB,MGR --export-dir /sqooptest/export -m 1 --batch
The above command gets stuck at 95% and never completes.

bsuresh · ‎12-28-2015

Yup, I found the solution myself. From the JIRA: Map join hint will no longer be valid for some queries. Drop the hint in those cases. Hive will automatically try to convert join to map-join with config hive.auto.convert.join set to true. The only reason for the user to specify a mapjoin currently is if they want it to be converted to a bucketed-mapjoin or a sort-merge bucketed mapjoin. Eventually, that should also go away, but that may take some time to stabilize. https://issues.apache.org/jira/browse/HIVE-3784

bsuresh · ‎12-28-2015

The basic query I am dealing with is:
select t1.student_id, t2.course_name from table_one t1 join table_two t2 on (t1.course_id=t2.course_id);
Here t1 and t2 are small: 160 bytes and 50 bytes. When I run this query, a map-only join runs as usual, because hive.auto.convert.join=true by default in 0.14. I then ran "set hive.auto.convert.join=false;" and ran the query again. As I expected, the query chose the default (map/reduce) join. Next I changed the query to:
select /*+ MAPJOIN(t2) */ t1.student_id,t2.course_name from table_one t1 join table_two t2 on (t1.course_id=t2.course_id);
That is, I am giving the hint through the query. But in this case too, the query chooses the default join. So I have 2 questions (with auto conversion still set to false):
1) How do I give the hint through the query so that it selects a map-only join (when auto conversion is set to false)?
2) Which configuration parameter sets the size below which tables qualify for a map-only join? In other words, how do I know how small the tables must be for a map-only join to run?
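For anyone reading along, a map-only join can be pictured roughly like this. It is a minimal Python sketch, not Hive's implementation; the function name map_side_join and the sample rows are made up for illustration. The small table is loaded into an in-memory hash table and the other table is streamed past it, so no reduce phase is needed. As far as I know, the size threshold for automatic conversion is controlled by hive.mapjoin.smalltable.filesize, and hints are ignored by default because of hive.ignore.mapjoin.hint; please verify both against your Hive version.

```python
from collections import defaultdict


def map_side_join(small_table, big_table):
    """Inner-join (course_id, course_name) rows with (student_id, course_id) rows."""
    # Build phase: load the small table into an in-memory hash table.
    courses = defaultdict(list)
    for course_id, course_name in small_table:
        courses[course_id].append(course_name)

    # Probe phase: stream the other table one row at a time.
    # Every lookup is local to the mapper, so no shuffle or reducer is involved.
    for student_id, course_id in big_table:
        for course_name in courses.get(course_id, []):
            yield student_id, course_name


# Hypothetical sample data shaped like table_one and table_two in the query.
table_one = [(1, 10), (2, 20), (3, 10), (4, 30)]  # (student_id, course_id)
table_two = [(10, "Hive"), (20, "Pig")]           # (course_id, course_name)

joined = list(map_side_join(table_two, table_one))
print(joined)
```

The whole approach depends on the hashed table fitting in memory, which is why Hive gates it behind a size limit.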

bsuresh · ‎12-27-2015

@Artem Ervits Thank you. We really didn't know about this fix. We encountered the above-mentioned problem in Ambari 1.7 with HDP 2.2. I believe the HDP certification exam is given on HDP 2.2, so I think we may run into the same problem there.

bsuresh · ‎12-27-2015

I know some of the admin commands for checking cluster details. I am planning to take the HDP practice exam as well as the real exam. Other than admin commands, is there any way to check this on AWS?

bsuresh · ‎12-27-2015

As @rich said, we need to use org.apache.hive.hcatalog.pig.HCatLoader. We should not use org.apache.hcatalog.pig.HCatLoader (this package name appears in a lot of material, such as Pig Programming).

bsuresh · ‎12-27-2015

@Aron, Initially the getSplits method split the data based on the blocks in HDFS, but it was changed so that splitting is based on the stripes of the ORC file. https://issues.apache.org/jira/browse/HIVE-5102 The link above has the full details and the source code for OrcInputFormat and its getSplits method.

bsuresh · ‎12-25-2015

Up to now, I have been thinking of a normal join as a simple map/reduce job where the two tables are the two inputs. Based on the columns given in the equality expression, the keys and values are decided on the map side, and they are grouped when they reach the reducer side. But when I read the following lines, I got a little confused about the difference between a map-side join and a normal join:
In every map/reduce stage of the join, the last table in the sequence is streamed through the reducers whereas the others are buffered. Therefore, it helps to reduce the memory needed in the reducer for buffering the rows for a particular value of the join key by organizing the tables such that the largest tables appear last in the sequence. e.g. in SELECT a.val, b.val, c.val FROM a JOIN b ON (a.key = b.key1) JOIN c ON (c.key = b.key1)
Can anyone explain the above lines in detail for a normal join? In particular, please explain what streaming a table and buffering mean in a normal join.
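One way to picture the streaming and buffering described in the quoted lines is the toy sketch below. It is Python, not Hive code, and the function name reduce_side_join, the row tagging, and the sample rows are all assumptions made for illustration. Each row is tagged with its table's position in the join, rows are grouped by join key (standing in for the shuffle), and in the reducer the rows of every table except the last are held in memory while the rows of the last table are combined and emitted one at a time.

```python
from collections import defaultdict
from itertools import product


def reduce_side_join(tables):
    """Inner-join lists of (key, value) rows on key; the last table is streamed."""
    # Map phase: tag each row with its table index and group by join key.
    shuffled = defaultdict(list)
    for tag, table_rows in enumerate(tables):
        for key, value in table_rows:
            shuffled[key].append((tag, value))

    last = len(tables) - 1
    results = []
    for key in sorted(shuffled):
        # Within one key, rows reach the reducer ordered by table tag,
        # so every row of the last table arrives after all the others.
        ordered = sorted(shuffled[key], key=lambda row: row[0])
        buffers = [[] for _ in range(last)]  # one buffer per non-last table
        for tag, value in ordered:
            if tag < last:
                buffers[tag].append(value)   # buffered in reducer memory
            else:
                # Streamed: joined against the buffers and emitted at once,
                # so rows of the last table are never held in memory.
                for combo in product(*buffers):
                    results.append(combo + (value,))
    return results


# Hypothetical rows for tables a, b and c from the quoted query.
a = [(1, "a1"), (2, "a2")]
b = [(1, "b1"), (1, "b2"), (2, "b3")]
c = [(1, "c1"), (2, "c2"), (3, "c3")]

rows = reduce_side_join([a, b, c])
print(rows)
```

Memory use per key grows with the buffered tables only, which is why the quoted advice puts the largest table last.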

bsuresh · ‎12-24-2015

@Artem Ervits Is the StringConcat function that you defined working for you? I am getting the following error:
Caused by: java.lang.NoSuchMethodException: java.lang.String.concat()
at java.lang.Class.getMethod(Class.java:1665)
at org.apache.pig.builtin.Invoker.<init>(Invoker.java:88)
at org.apache.pig.builtin.GenericInvoker.<init>(GenericInvoker.java:90)
at org.apache.pig.builtin.InvokeForString.<init>(InvokeForString.java:40)
... 39 more
I am not sure whether Pig dynamic invokers work for methods other than static ones.

Last Visited	‎05-03-2017 01:58 PM

Member Since	‎12-10-2015 05:13 AM
Posts	58
Kudos received	24

Cloudera Community

Re: How to start HWI (hive web interface) on HDP 2...

Re: Sqoop export hangs at 95% for oracle Database

Re: How to substractduration of 1min from a ISO t...

Re: Passing parameters to pig script from file iss...

Re: Error executing DISTINCT Function in Pig

Re: Sqoop export hangs at 95% for oracle Database

Sqoop export hangs at 95% for oracle Database

Re: Why hive query with hint cannot be converted i...

Why hive query with hint cannot be converted into ...

Re: pig -useHCatalog not loading all the jars requ...

What are the simple ways to know Hadoop Environmen...

Re: pig -useHCatalog not loading all the jars requ...

Re: For ORC File what determines the number of map...

How Default join will be performed???

Re: Example for Dynamic Invokers in Pig??