About balavignesh_nag

radhouane_touj · ‎07-29-2017

Hello guys, where can I modify the tweet? I got the same error with the same tweet (in the sentiment analysis tutorial). Can I find it quickly by its ID, and if so, where?

MattWho · ‎03-27-2017

@Bala Vignesh N V The latest documentation for Apache NiFi can be found here: https://nifi.apache.org/docs.html You will want to look in the "Getting Started" section for installing on a Linux-based platform. Thank you, Matt

fmm_pires · ‎03-27-2017

Thanks, this will help.

balavignesh_nag · ‎03-23-2017

Thanks @Deepesh. You are right: the default compression is ZLIB, and that causes the difference in compression.

tli · ‎03-27-2017

Hi @Bala Vignesh N V, setting hive.execution.engine to spark will kick off Spark jobs for the SQL query, but that's not supported by Hortonworks. The better way is to use the Spark Thrift Server plus beeline to run queries: https://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.4.0/bk_installing_manually_book/content/starting_sts.html You can create Hive tables and execute a query (which submits a Spark job under the hood), and the query result set is generated based on SparkContext. Is that what you need?

rdevprasad1 · ‎03-21-2017

Amazing answer!

bpreachuk · ‎09-09-2017

Hi @Bala Vignesh N V. I know this is an old question, but I have encountered this recently. This answer may help someone else as well...

The issue you had is most likely caused by specifying " COLLECTION ITEMS TERMINATED BY ',' ".

When the table is defined like this (with COLLECTION ITEMS TERMINATED BY comma):

    -- Create a Dummy table to use in the insert query - like an Oracle DUAL table
    create table dummy_TBL (col1 int) ;
    insert into dummy_TBL (col1) values(1) ;

    create table data_TBL (id int, name string, address struct<city:string,State:string>)
    ROW FORMAT DELIMITED
    FIELDS TERMINATED BY ','
    COLLECTION ITEMS TERMINATED BY ','
    STORED AS TEXTFILE;

    insert into table data_TBL
    select 1, 'Bala', named_struct('city','Tampa','State','FL')
    from dummy_TBL limit 1;

The address.state value is NULL:

    +--------------+----------------+--------------------------------+--+
    | data_tbl.id  | data_tbl.name  |        data_tbl.address        |
    +--------------+----------------+--------------------------------+--+
    | 1            | Bala           | {"city":"Tampa","state":null}  |
    +--------------+----------------+--------------------------------+--+

But when you define the table like this (without COLLECTION ITEMS TERMINATED BY comma):

    create table data_TBL (id int, name string, address struct<city:string,State:string>)
    ROW FORMAT DELIMITED
    FIELDS TERMINATED BY ','
    STORED AS TEXTFILE;

    insert into table data_TBL
    select 1, 'Bala', named_struct('city','Tampa','State','FL')
    from dummy_TBL limit 1;

The address.state value is correct:

    +--------------+----------------+--------------------------------+--+
    | data_tbl.id  | data_tbl.name  |        data_tbl.address        |
    +--------------+----------------+--------------------------------+--+
    | 1            | Bala           | {"city":"Tampa","state":"FL"}  |
    +--------------+----------------+--------------------------------+--+

I hope this helps.
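To make the likely cause more concrete, here is a small Python sketch of what probably happens when the field delimiter and the collection delimiter are the same character. It is a simplified model of a delimited text row format, not Hive's actual SerDe code; the function names are made up for illustration. The only Hive-specific fact assumed is that the default collection delimiter is Ctrl-B ('\002').

```python
# Simplified model of a delimited text row format; NOT Hive's real SerDe.
# serialize/deserialize are hypothetical helper names for illustration.
FIELD_DELIM = ","

def serialize(row_id, name, address, item_delim):
    """Join struct members with the collection delimiter and
    columns with the field delimiter."""
    struct_text = item_delim.join(address)
    return FIELD_DELIM.join([str(row_id), name, struct_text])

def deserialize(line, item_delim, struct_fields=("city", "state")):
    """Split on the field delimiter first, then split the third column on
    the collection delimiter. Missing struct members become None (NULL)."""
    columns = line.split(FIELD_DELIM)
    members = columns[2].split(item_delim)
    members += [None] * (len(struct_fields) - len(members))
    address = dict(zip(struct_fields, members))
    return int(columns[0]), columns[1], address

CTRL_B = "\x02"  # Hive's default collection items delimiter

# Same delimiter for fields and collection items: 'FL' lands in a 4th column.
clash = deserialize(serialize(1, "Bala", ("Tampa", "FL"), ","), ",")
# Default collection delimiter: the struct stays inside the third column.
default = deserialize(serialize(1, "Bala", ("Tampa", "FL"), CTRL_B), CTRL_B)

print(clash)    # state is None
print(default)  # state is 'FL'
```

In this model the comma-delimited row is written as 1,Bala,Tampa,FL, so the reader sees four columns and the struct column contains only Tampa, which would explain the NULL state.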

ssanthosh · ‎03-14-2017

@Bala Vignesh N V "Also, when the reducers perform the sum operation, will each reducer work only on the key-value pairs fed from its own mapper job?" -- No. Each reducer operates on the outputs of all the mappers, for the keys assigned to it.
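A toy Python model of the shuffle may help here. It is only a sketch: the partitioner is a made-up stand-in (Hadoop's default partitions on a hash of the key), and run_job is a hypothetical name. The point it shows is that routing depends on the key alone, so one reducer receives values for its keys from every mapper.

```python
from collections import defaultdict

def run_job(mapper_outputs, num_reducers):
    """mapper_outputs: one list of (key, value) pairs per mapper.
    Returns one {key: sum} dict per reducer."""
    # Shuffle: route every pair to a reducer chosen from its key alone,
    # so the mapper that produced the pair plays no part in the routing.
    reducer_inputs = [defaultdict(list) for _ in range(num_reducers)]
    for pairs in mapper_outputs:
        for key, value in pairs:
            # Toy partitioner (hypothetical), deterministic for this demo.
            partition = sum(key.encode()) % num_reducers
            reducer_inputs[partition][key].append(value)
    # Reduce: each reducer sums the values of the keys it owns.
    return [{k: sum(v) for k, v in grouped.items()} for grouped in reducer_inputs]

mappers = [
    [("a", 1), ("b", 2)],  # output of mapper 0
    [("a", 3), ("b", 4)],  # output of mapper 1
]
results = run_job(mappers, num_reducers=2)
print(results)  # [{'b': 6}, {'a': 4}]
```

Both mappers contribute to the sum for "a" and to the sum for "b", even though each key is handled by exactly one reducer.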

aervits · ‎03-04-2017

col1 is already an int, based on the schema in your load statement. You can check with: describe data; If you want to change the type with GENERATE, you can do so like this: X = FOREACH A GENERATE c1 AS x1:int;

balaram38489 · ‎03-03-2017

You can use combiners in this situation. Increasing the number of reducers is another solution.
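As a rough illustration of why a combiner helps, the Python sketch below pre-sums each mapper's output before the shuffle. It assumes the job is an associative, commutative aggregation such as a sum or count (a combiner is only safe in that case); the input data and function names are made up.

```python
from collections import Counter

def map_words(lines):
    """Map step: emit a (word, 1) pair for every word."""
    return [(word, 1) for line in lines for word in line.split()]

def combine(pairs):
    """Combiner: pre-sum the pairs on the mapper side, before the shuffle."""
    totals = Counter()
    for key, value in pairs:
        totals[key] += value
    return list(totals.items())

def reduce_sum(shuffled_pairs):
    """Reduce step: final sum per key across all mappers."""
    totals = Counter()
    for key, value in shuffled_pairs:
        totals[key] += value
    return dict(totals)

splits = [["a a a b"], ["a b b b"]]  # made-up input, one split per mapper
raw = [map_words(s) for s in splits]
combined = [combine(p) for p in raw]

# Number of records that would cross the network in each case.
shuffled_without = sum(len(p) for p in raw)
shuffled_with = sum(len(p) for p in combined)

without_result = reduce_sum([kv for p in raw for kv in p])
with_result = reduce_sum([kv for p in combined for kv in p])
print(shuffled_without, shuffled_with, with_result)  # 8 4 {'a': 4, 'b': 4}
```

The final sums are identical, but the reducers receive half as many records in this toy case.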

Last Visited	‎10-03-2019 09:01 AM

Member Since	‎05-02-2017 01:47 PM
Posts	360
Kudos received	64

Cloudera Community

Re: what is the best way to get ftp file to hdfs c...

Re: when yarn communicates with the namenodes when...

Re: [TEZ] are partition, sort and shuffle built-in...

Re: CASE statement Error in Beeline HIVE

Re: hive query to display Week of the timestamp an...

Re: hive Query error

Re: Install NiFi In ubuntu

Re: Data Lake Architecture

Re: Data Compression Doesn't work in ORC with SNAP...

Re: Using RDD in hive

Re: How do you generate dates using HiveQL? Is the...

Re: Struct datatype insert in hive

Re: Distribution of key,value in mappers and Reduc...

Re: defining datatype in Pig

Re: Generation of Key value pair