Member since: 12-10-2015
Posts: 58
Kudos Received: 24
Solutions: 6
My Accepted Solutions
Title | Views | Posted
---|---|---
 | 395 | 02-17-2016 04:12 AM
 | 733 | 02-03-2016 05:15 AM
 | 353 | 01-27-2016 09:13 AM
 | 1036 | 01-27-2016 07:00 AM
 | 387 | 01-02-2016 03:29 PM
11-09-2016
12:11 PM
1 Kudo
I want to ingest data from the local file system into HDFS. I have chosen the spooling directory source for that, but I need to write a custom event deserializer to read Excel files. How do I write one? Any help?
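If it helps, here is a minimal sketch of a spooling directory source configuration that plugs in a custom deserializer. The agent/source names, the spool directory, and the class com.example.flume.ExcelEventDeserializer$Builder are placeholders (a custom class you would still have to write, e.g. on top of Apache POI), not a tested setup:
# hypothetical agent and source names; channel wiring omitted
agent.sources = excel-src
agent.sources.excel-src.type = spooldir
agent.sources.excel-src.spoolDir = /var/spool/excel
# point the source at a custom EventDeserializer.Builder implementation
agent.sources.excel-src.deserializer = com.example.flume.ExcelEventDeserializer$Builder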
... View more
Labels:
11-08-2016
06:35 AM
1 Kudo
I am just wondering whether anybody has come across a scenario where you need to import or read data from Excel into Hadoop. Is there such a thing as a Flume Excel source? By the way, I know I can convert the Excel file to CSV and then deal with it; I am really just trying to explore Flume sources a bit further here.
... View more
Labels:
02-22-2016
01:42 PM
I already have a table with a union type column in Hive. My question is how to access that column so that I can get at the different data types inside it. For a struct we can access a field like s.x; how do we access the data types inside a union type?
... View more
02-22-2016
12:10 PM
1 Kudo
I have a column in a Hive table of type UNIONTYPE<int, double, array<string>, struct<a:int,b:string>>. Selecting that column gives results like the following. 1) How do I get a particular tag/column type, e.g. the struct (tag 3)? 2) If the selected type is a struct, how do I access the fields of that struct?
{0:1}
{1:2.0}
{2:["three","four"]}
{3:{"a":5,"b":"five"}}
{2:["six","seven"]}
{3:{"a":8,"b":"eight"}}
{0:9}
{1:10.0}
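For context, Hive's UNIONTYPE support has historically been incomplete: as far as I know there is no built-in syntax (like s.x for structs) to extract the tag or the value of a union member, so workarounds usually involve a custom UDF or avoiding unions in query-facing tables. A minimal, hypothetical sketch only to illustrate how the {tag:value} output maps to the declared positions:
-- hypothetical table named union_demo
CREATE TABLE union_demo (u UNIONTYPE<int, double, array<string>, struct<a:int,b:string>>);
-- selecting the column prints {tag:value}; tag 0 is the int branch, tag 3 the struct branch, e.g. {3:{"a":5,"b":"five"}}
SELECT u FROM union_demo;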
... View more
- Tags:
- Data Processing
- Hive
Labels:
02-17-2016
04:12 AM
I completed this task by downloading the hwi.*.war file from Hive 0.12, as I didn't find it in 0.13 or 0.14.
... View more
02-15-2016
08:41 AM
1 Kudo
What are the prerequisites for starting HWI (Hive Web Interface) on HDP 2.2?
... View more
Labels:
02-09-2016
04:02 PM
I installed the Hive ODBC Driver for HDP 2.2 on my Windows 7 machine and am trying to connect to Hive through ODBC (Hadoop is installed on CentOS). I encountered the following error. The configs are all default; for example, authentication for HiveServer2 is "none" (the default). Is there anything I missed? I followed the Hortonworks document. I gave the server IP, and the port is 10000. I assume HiveServer2 is running because the following beeline command works:
beeline -u jdbc:hive2://ip:10000
... View more
Labels:
02-05-2016
02:13 PM
@Artem Ervits Yes Artem, I know about casting, but this column is not accepting any type. See the following:
lead_result = foreach gprd {
C1 = order req_cols by time ASC;
generate flatten(org.apache.pig.piggybank.evaluation.Stitch(C1, org.apache.pig.piggybank.evaluation.Over(C1.time, 'lead', 0, 1, 1, 0))) as (year,month,day,time,cust_id1,cust_id2,page_url,visit_num,next_url_hit_time:bytearray);
};
change_col_type = foreach lead_result generate next_url_hit_time as next_url:chararray;
This is the first time I am facing this issue, and the part in bold is completely new to me.
2016-01-16 09:04:54,994 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 1031: Incompatable field schema: declared is "next_url:chararray", infered is "next_url_hit_time:NULL"
2016-01-16 09:04:54,994 [main] WARN org.apache.pig.tools.grunt.Grunt - There is no log file to write to.
2016-01-16 09:04:54,994 [main] ERROR org.apache.pig.tools.grunt.Grunt - Failed to parse: Pig script failed to parse: <line 19, column 18> pig script failed to validate: org.apache.pig.impl.logicalLayer.FrontendException: ERROR 1031: Incompatable field schema: declared is "next_url:chararray", infered is "next_url_hit_time:NULL"
at org.apache.pig.parser.QueryParserDriver.parse(QueryParserDriver.java:199)
at org.apache.pig.PigServer$Graph.validateQuery(PigServer.java:1707)
at org.apache.pig.PigServer$Graph.registerQuery(PigServer.java:1680)
at org.apache.pig.PigServer.registerQuery(PigServer.java:623)
at org.apache.pig.tools.grunt.GruntParser.processPig(GruntParser.java:1063)
at org.apache.pig.tools.pigscript.parser.PigScriptParser.parse(PigScriptParser.java:501)
at org.apache.pig.tools.grunt.GruntParser.parseStopOnError(GruntParser.java:230)
at org.apache.pig.tools.grunt.GruntParser.parseStopOnError(GruntParser.java:205)
at org.apache.pig.tools.grunt.Grunt.run(Grunt.java:66)
at org.apache.pig.Main.run(Main.java:558)
at org.apache.pig.Main.main(Main.java:170)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:606)
at org.apache.hadoop.util.RunJar.run(RunJar.java:221)
at org.apache.hadoop.util.RunJar.main(RunJar.java:136)
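One possible workaround (an untested sketch, not a confirmed fix): leave the column produced by Over untyped in the AS clause and cast it explicitly, by position, in a separate FOREACH. The alias next_url and the position $8 are assumptions based on the schema above:
lead_result = foreach gprd {
    C1 = order req_cols by time ASC;
    -- no type declared here for the column produced by Over
    generate flatten(org.apache.pig.piggybank.evaluation.Stitch(C1, org.apache.pig.piggybank.evaluation.Over(C1.time, 'lead', 0, 1, 1, 0)));
};
-- explicit cast from the untyped (bytearray) field; $0 .. $7 keeps the original columns
change_col_type = foreach lead_result generate $0 .. $7, (chararray)$8 as next_url;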
... View more
02-05-2016
01:56 PM
@Artem Ervits Thank you for the reply. Before applying Over, the schema looks like:
req_cols: {year: int,month: int,day: int,time: int,cust_id1: chararray,cust_id2: chararray,post_page_url: bytearray,visit_num: int}
After I apply lead with Over, we get one more column, say "next_url_hit_time" ($8), which is exactly where I am facing the issue. See the following code:
lead_result = foreach gprd {
C1 = order req_cols by time ASC;
generate flatten(org.apache.pig.piggybank.evaluation.Stitch(C1, org.apache.pig.piggybank.evaluation.Over(C1.time, 'lead', 0, 1, 1, 0))) as (year,month,day,time,cust_id1,cust_id2,page_url,visit_num,next_url_hit_time:chararray);
};
The above one generates an error like:
grunt> lead_result = foreach gprd {
>> C1 = order req_cols by time ASC;
>> generate flatten(org.apache.pig.piggybank.evaluation.Stitch(C1, org.apache.pig.piggybank.evaluation.Over(C1.time, 'lead', 0, 1, 1, 0))) as (year,month,day,time,cust_id1,cust_id2,page_url,visit_num,next_url_hit_time:chararray);
>> };
2016-01-16 08:47:53,566 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 1031: Incompatable field schema: declared is "next_url_hit_time:chararray", infered is ":NULL"
2016-01-16 08:47:53,566 [main] WARN org.apache.pig.tools.grunt.Grunt - There is no log file to write to.
2016-01-16 08:47:53,567 [main] ERROR org.apache.pig.tools.grunt.Grunt - Failed to parse: Pig script failed to parse: <line 12, column 14> pig script failed to validate: org.apache.pig.impl.logicalLayer.FrontendException: ERROR 1031: Incompatable field schema: declared is "next_url_hit_time:chararray", infered is ":NULL"
at org.apache.pig.parser.QueryParserDriver.parse(QueryParserDriver.java:199)
at org.apache.pig.PigServer$Graph.validateQuery(PigServer.java:1707)
at org.apache.pig.PigServer$Graph.registerQuery(PigServer.java:1680)
at org.apache.pig.PigServer.registerQuery(PigServer.java:623)
at org.apache.pig.tools.grunt.GruntParser.processPig(GruntParser.java:1063)
at org.apache.pig.tools.pigscript.parser.PigScriptParser.parse(PigScriptParser.java:501)
at org.apache.pig.tools.grunt.GruntParser.parseStopOnError(GruntParser.java:230)
at org.apache.pig.tools.grunt.GruntParser.parseStopOnError(GruntParser.java:205)
at org.apache.pig.tools.grunt.Grunt.run(Grunt.java:66)
at org.apache.pig.Main.run(Main.java:558)
at org.apache.pig.Main.main(Main.java:170)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:606)
at org.apache.hadoop.util.RunJar.run(RunJar.java:221)
at org.apache.hadoop.util.RunJar.main(RunJar.java:136)
But the one below executes OK:
lead_result = foreach gprd {
C1 = order req_cols by time ASC;
generate flatten(org.apache.pig.piggybank.evaluation.Stitch(C1, org.apache.pig.piggybank.evaluation.Over(C1.time, 'lead', 0, 1, 1, 0))) as (year,month,day,time,cust_id1,cust_id2,page_url,visit_num,next_url_hit_time);
};
Except bytearray, it is not accepting any other type. What I actually need is to cast the column generated by the lead function.
... View more
02-05-2016
01:02 PM
1 Kudo
given_data = load '/clickstream/total_hitdata/05/hit_data.tsv' using PigStorage('\t');
filtered = FILTER given_data by ($133!=0);
req_cols = foreach filtered generate GetYear(ToDate((chararray)$25,'yyyy-MM-dd HH:mm:ss','GMT')) as year:int,GetMonth(ToDate((chararray)$25,'yyyy-MM-dd HH:mm:ss','GMT')) as month:int,GetDay(ToDate((chararray)$25,'yyyy-MM-dd HH:mm:ss','GMT')) as day:int,($161-1400000000) as time,$343 as cust_id1:chararray,$344 as cust_id2:chararray,$256 as post_page_url,$466 as visit_num:int;
gprd = group req_cols by (year,month,day,cust_id1,cust_id2,visit_num);
lead_result = foreach gprd {
C1 = order req_cols by time ASC;
generate flatten(org.apache.pig.piggybank.evaluation.Stitch(C1, org.apache.pig.piggybank.evaluation.Over(C1.time, 'lead', 0, 1, 1, 0)));
};
In the lead_result relation I used the 'lead' function according to my requirement. $8 is the column generated by the lead function alongside the old schema, but I am unable to cast it to any type. I get the following error when I try to cast it to chararray with the alias my:
<line 57, column 4> pig script failed to validate: org.apache.pig.impl.logicalLayer.FrontendException: ERROR 1031: Incompatable field schema: declared is "my:chararray", infered is ":NULL"
The overall schema is:
lead_result: {stitched::year: int,stitched::month: int,stitched::day: int,stitched::time: int,stitched::cust_id1: chararray,stitched::cust_id2: chararray,stitched::post_page_url: bytearray,stitched::visit_num: int,NULL}
... View more
Labels:
02-03-2016
07:01 AM
As @Gangadhar Kadam said, it has a problem in 0.13 but works fine in 0.14.
... View more
02-03-2016
05:15 AM
1 Kudo
The configuration variable "sqoop.export.records.per.statement" can be set to 1 as a workaround for this problem. https://issues.apache.org/jira/browse/SQOOP-314
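As an illustration (connection details are placeholders mirroring the original command, not verified), the property can be passed as a generic -D option right after the tool name:
sqoop export -Dsqoop.export.records.per.statement=1 \
  --connect jdbc:oracle:thin:@ipaddress:1521:orcl \
  --username user -P \
  --table EMP --columns EMPNO,ENAME,JOB,MGR \
  --export-dir /sqooptest/export -m 1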
... View more
01-29-2016
03:35 PM
Yeah, @Artem Ervits, I got your point. Simple but logical.
... View more
01-29-2016
10:31 AM
Hi All, according to my requirement I need a script like the following:
A = load '/bsuresh/sample' USING PigStorage(',') as (id,name,sal,deptid);
B = GROUP A by deptid;
C = foreach B {
D = A.name,A.sal;--two fields
E = DISTINCT D;
generate group,COUNT(E);
};
};
In relation 'D' I am extracting two fields, which is exactly where I am facing the error. If I change the script like the following, it works fine:
C = foreach B {
D = A.name; -- one field
E = DISTINCT D;
generate group,COUNT(E);
};
But I need the count based on the distinct of two columns. Can anyone help me?
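For reference, a small sketch of how projecting two fields from the bag is usually written, using the A.(f1, f2) form inside the nested block (same relation and field names as above; treat it as an untested sketch):
C = foreach B {
    -- project both fields of the bag in a single expression
    D = A.(name, sal);
    E = DISTINCT D;
    generate group, COUNT(E);
};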
... View more
- Tags:
- Data Processing
- Pig
Labels:
01-27-2016
09:16 AM
Use an ISO time pattern instead of the dd-MMM-yyyy pattern from my code.
... View more
01-27-2016
09:13 AM
2 Kudos
All the comments mentioned here are correct; here is a small example:
emp = load 'data' using PigStorage(',') as (empno,ename ,job,mgr,hiredate ,sal,comm,deptno);
each_date = foreach emp generate ToDate(hiredate,'dd-MMM-yyyy') as mydate;
subt = foreach each_date generate mydate,SubtractDuration(mydate,'PT1M');
dump subt;
... View more
01-27-2016
07:00 AM
3 Kudos
I guess it is not considering the param file. Try this:
pig -param_file=hdfs://ip-XXX-XX-XX-XXX.ec2.internal:8020/home/hadoop/adh_time /home/hadoop/test.pig
When writing -param_file at the end, I encountered the same issue too.
... View more
01-05-2016
10:08 AM
1 Kudo
I have a number of files with names in formats like 1) filename+date.fileformat and 2) filename.fileformat. Now I need to copy only the files which have a number before the . (dot).
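A minimal sketch of one approach, assuming the HDFS shell's glob support for character ranges and placeholder source/target paths:
# copies only files whose name has a digit immediately before the dot, e.g. filename20160105.csv
hdfs dfs -cp '/source/dir/*[0-9].*' /target/dir/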
... View more
Labels:
01-05-2016
04:37 AM
@Kuldeep Kulkarni Is there any way to do it directly in the grunt shell using "set"? e.g. set exectype=tez;
... View more
01-05-2016
04:21 AM
I would like to know the different ways to enable HCatalog and Tez when writing Pig scripts.
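For example (a sketch; the script path is a placeholder, and Tez mode depends on the Pig version), both can be turned on from the command line when launching a script:
pig -useHCatalog -x tez /path/to/script.pig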
... View more
- Tags:
- Data Processing
- Pig
Labels:
01-04-2016
02:15 PM
@Benjamin Leonhardi Thank you. Yes, the script sets the environment variable and then executes the Pig launcher in $PIG_HOME, like:
exec /usr/hdp/2.2.8.0-3150/pig/bin/pig.distro "$@"
... View more
01-04-2016
01:53 PM
@Benjamin Leonhardi I used dump after illustrate, so I got the error; the problem is with the "illustrate" command. Actually I have a habit of using illustrate after every Pig command I run in the grunt shell to check the output.
... View more
01-04-2016
09:28 AM
I would like to print or read PIG_HOME from the terminal. Is there any way?
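A minimal sketch, assuming PIG_HOME is exported in the shell environment; otherwise the launcher location can give a hint:
echo $PIG_HOME
# or locate the pig launcher script and look at the install directory it points to
which pig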
... View more
- Tags:
- Data Processing
- Pig
Labels:
01-04-2016
08:56 AM
Apache Pig version 0.12.1.2.1.7.0-784. I have data where one of the fields has no value, like:
2015,,08
2015,,09
2015,,11
2015,,04
2015,,05
Now I run the Pig commands like:
grunt> given_input = load '/pigtest/flightdelays/' using PigStorage(',') as (year,month,day);
grunt> ori = foreach given_input generate month;
grunt> illustrate ori;
This generates an error like: Caused by: java.lang.RuntimeException: No (valid) input data found!
When I replace the loader with CSVExcelStorage like:
grunt> given_input = load '/pigtest/flightdelays/' using org.apache.pig.piggybank.storage.CSVExcelStorage(',') as (year,month,day);
grunt> ori = foreach given_input generate month;
grunt> illustrate ori;
I get output like:
-------------------------------------------------------------------------------
| given_input | year:bytearray | month:bytearray | day:bytearray |
-------------------------------------------------------------------------------
| | 2015 | | 05 |
-------------------------------------------------------------------------------
--------------------------------
| ori | month:bytearray |
--------------------------------
| | |
--------------------------------
So, I would like to know: 1) What is the problem with PigStorage? 2) Is it a loader problem or a Pig version problem? 3) If I want to use PigStorage here, how should I do it? Not only illustrate; even dump behaves the same.
... View more
- Tags:
- Data Processing
- Pig
Labels:
01-02-2016
03:29 PM
1 Kudo
@Vidya SK DISTINCT in Pig is a relational operator, so it applies to relations rather than to individual fields. Consider the following:
given_input = load '/given/path' using PigStorage(',') as (col1,col2,col3);
Consider these situations.
1) Suppose I want to keep only the unique values of col1. Then:
unique_col1 = foreach given_input generate col1;
unique_values = DISTINCT unique_col1;
(DISTINCT only operates on relations, i.e. unique_col1.) Suppose col1 contains data like:
hortonworks
hortonworks
cloudera
then you get:
cloudera
hortonworks
2) Suppose I want to keep only the unique combinations of col1 and col2. Then:
unique_two_fields = foreach given_input generate col1,col2;
unique_values = DISTINCT unique_two_fields;
(DISTINCT only operates on relations.) Suppose col1 and col2 contain data like:
hortonworks,cloudera
hortonworks,cloudera
hortonworks,hortonworks
then you get:
hortonworks,cloudera
hortonworks,hortonworks
Like this, first project the data you want to make unique into one relation and then apply the DISTINCT operator. If you want to perform any aggregations, then use GROUP and apply the aggregations.
... View more
12-31-2015
06:05 AM
@Guilherme Braccialli Yup, it's working.
... View more
12-31-2015
05:59 AM
@Artem Ervits Now I changed the sqoop command to this:
sqoop-export --connect jdbc:oracle:thin:@ipaddress:orcl --username username -P --table EMP --columns EMPNO,ENAME,JOB,MGR --export-dir /sqooptest/export -m 1 --direct
Even so, no result; it behaves the same as I mentioned. Current sqoop version:
15/12/31 11:18:33 INFO sqoop.Sqoop: Running Sqoop version: 1.4.4.2.1.7.0-784
See the following error once: sqoop database hanging error
... View more
12-31-2015
05:42 AM
@hrongali I tried it that way as well:
set hive.auto.convert.sortmerge.join=true;
set hive.optimize.bucketmapjoin=true;
set hive.optimize.bucketmapjoin.sortedmerge=true;
set hive.enforce.bucketing=true;
set hive.enforce.sorting=true;
set hive.auto.convert.join=true;
Now, when I make the tables sorted as well as bucketed, I get the following error:
hive> explain select * FROM bucket_small a JOIN bucket_big b ON a.key = b.key;
FAILED: SemanticException [Error 10135]: Sort merge bucketed join could not be performed. If you really want to perform the operation, either set hive.optimize.bucketmapjoin.sortedmerge=false, or set hive.enforce.sortmergebucketmapjoin=false.
When I set the following property to false and run the explain again, a map join is generated:
hive> set hive.optimize.bucketmapjoin.sortedmerge=false;
... View more
12-30-2015
06:28 AM
1 Kudo
Sqoop version: 1.4.4.2.1.7.0-784. The following is the sqoop command that I used to export simple (comma-separated) records:
sqoop-export --connect jdbc:oracle:thin:@ipaddress:1521:orcl --username user --password password --table EMP --columns EMPNO,ENAME,JOB,MGR --export-dir /sqooptest/export -m 1 --batch
The above command gets stuck at 95% and never completes.
... View more
Labels:
12-28-2015
12:20 PM
I have two tables, bucket_small and bucket_big, as follows:
hive> show create table bucket_big;
OK
CREATE TABLE `bucket_big`(
`id` int,
`student_id` string,
`student_name` string,
`course_id` int)
CLUSTERED BY (
course_id)
INTO 4 BUCKETS
ROW FORMAT SERDE
'org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe'
STORED AS INPUTFORMAT
'org.apache.hadoop.mapred.TextInputFormat'
OUTPUTFORMAT
'org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat'
LOCATION
'hdfs://hdp1.stratapps.com:8020/apps/hive/warehouse/bucket_big'
TBLPROPERTIES (
'COLUMN_STATS_ACCURATE'='true',
'numFiles'='4',
'numRows'='10',
'rawDataSize'='148',
'totalSize'='158',
'transient_lastDdlTime'='1451302285')
Time taken: 0.166 seconds, Fetched: 23 row(s)
hive>
hive> show create table bucket_small;
OK
CREATE TABLE `bucket_small`(
`course_id` int,
`course_name` string)
CLUSTERED BY (
course_id)
INTO 2 BUCKETS
ROW FORMAT SERDE
'org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe'
STORED AS INPUTFORMAT
'org.apache.hadoop.mapred.TextInputFormat'
OUTPUTFORMAT
'org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat'
LOCATION
'hdfs://hdp1.stratapps.com:8020/apps/hive/warehouse/bucket_small'
TBLPROPERTIES (
'COLUMN_STATS_ACCURATE'='true',
'numFiles'='2',
'numRows'='6',
'rawDataSize'='39',
'totalSize'='45',
'transient_lastDdlTime'='1451302349')
Time taken: 0.172 seconds, Fetched: 21 row(s)
And I inserted data as follows:
hive>insert overwrite table bucket_big select *from table_one;
hive>insert overwrite table bucket_small select *from table_two;
Now the tables have buckets as I expected, and I set the following configurations:
hive>set hive.auto.convert.join=true;
hive>set hive.auto.convert.sortmerge.join=true;
hive>set hive.optimize.bucketmapjoin = true;
hive>set hive.optimize.bucketmapjoin.sortedmerge = true;
When I run the following query,
hive> explain select /*+ MAPJOIN(a) */ b.student_id,a.course_name FROM bucket_small a JOIN bucket_big b ON a.course_id = b.course_id;
OK
STAGE DEPENDENCIES:
Stage-4 is a root stage
Stage-3 depends on stages: Stage-4
Stage-0 depends on stages: Stage-3
STAGE PLANS:
Stage: Stage-4
Map Reduce Local Work
Alias -> Map Local Tables:
a
Fetch Operator
limit: -1
Alias -> Map Local Operator Tree:
a
TableScan
alias: a
filterExpr: course_id is not null (type: boolean)
Statistics: Num rows: 6 Data size: 39 Basic stats: COMPLETE Column stats: NONE
Filter Operator
predicate: course_id is not null (type: boolean)
Statistics: Num rows: 3 Data size: 19 Basic stats: COMPLETE Column stats: NONE
HashTable Sink Operator
condition expressions:
0 {course_name}
1 {student_id}
keys:
0 course_id (type: int)
1 course_id (type: int)
Stage: Stage-3
Map Reduce
Map Operator Tree:
TableScan
alias: b
filterExpr: course_id is not null (type: boolean)
Statistics: Num rows: 10 Data size: 148 Basic stats: COMPLETE Column stats: NONE
Filter Operator
predicate: course_id is not null (type: boolean)
Statistics: Num rows: 5 Data size: 74 Basic stats: COMPLETE Column stats: NONE
Map Join Operator
-----------------
condition map:
Inner Join 0 to 1
condition expressions:
0 {course_name}
1 {student_id}
keys:
0 course_id (type: int)
1 course_id (type: int)
outputColumnNames: _col1, _col6
Statistics: Num rows: 5 Data size: 81 Basic stats: COMPLETE Column stats: NONE
Select Operator
expressions: _col6 (type: string), _col1 (type: string)
outputColumnNames: _col0, _col1
Statistics: Num rows: 5 Data size: 81 Basic stats: COMPLETE Column stats: NONE
File Output Operator
compressed: false
Statistics: Num rows: 5 Data size: 81 Basic stats: COMPLETE Column stats: NONE
table:
input format: org.apache.hadoop.mapred.TextInputFormat
output format: org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat
serde: org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe
Local Work:
Map Reduce Local Work
Stage: Stage-0
Fetch Operator
limit: -1
Processor Tree:
ListSink
Time taken: 0.299 seconds, Fetched: 70 row(s)
hive>
It is generating only a plain map join, not a bucket map join or SMB map join. What is the problem?
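For reference, SMB (sort-merge-bucket) map joins generally require the bucketed tables to also be sorted on the join key, with bucketing/sorting enforced when the data is inserted. A hypothetical sketch of what the big table's DDL would look like with SORTED BY added (abbreviated from the definition above, not a verified fix):
CREATE TABLE bucket_big_sorted (
  id int,
  student_id string,
  student_name string,
  course_id int)
CLUSTERED BY (course_id)
SORTED BY (course_id ASC)
INTO 4 BUCKETS;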
... View more
- Tags:
- Data Processing
- Hive
Labels: