About csguna

csguna · ‎03-13-2017

csguna · ‎03-13-2017

there is typo in the configuration . agent.sinks.agent-sink.channels = agent-chan to agent.sinks.agent-sink.channel = agent-chan

csguna · ‎03-13-2017

There are few things that needs to be take care when dealing with flume configuration. when u define source . agent.sources = sr1 when u define sink agent.sinks = sink1 sink2 ... when u define channels agent.channels = ch1 ch1 in your configuration there is a typo . agent.sinks.agent-sink.channels = agent-chan change it to agent.sinks.agent-sink.channel = agent-chan You can configure an agent with zero or more sinks , but each sink can read events exactly from one channel . also you have to configure one channel for sink , if not it will be removed.

csguna · ‎03-08-2017

Indeed . To sum up , the below stated are the default compression codec - Hive - default Compression is DeflateCodec Impala - default Compression is Snappy Thanks mate

csguna · ‎03-08-2017

I think snappy by default . refer this link - https://www.cloudera.com/documentation/enterprise/5-6-x/topics/impala_parquet.html Could you please correct me if I am wrong . Thanks

csguna · ‎03-08-2017

1) If we create a table (both hive and impala)and just specify stored as parquet . Will that be snappy compressed by default in CDH? Currently the default compression is - Snappy with Impala tables. 2) If not how do i identify a parquet table with snappy compression and parquet table without snappy compression?. describe formated tableName Note - but you will always see the compression as NO because the compression data format is not stored in metadata of the table , the best way is to do dfs -ls -r to the table location and see the file format for compression. 3) Also how to specify snappy compression for table level whiel creating and also at global level, even if nobody specified at table level (all table stored as parquet should be snappy compressed). CREATE TABLE external_parquet (c1 INT, c2 STRING) STORED AS PARQUET LOCATION ' ' or Session basis SET hive.exec.compress.output=true; SET mapred.output.compression.codec=org.apache.hadoop.io.compress.SnappyCodec; SET mapred.output.compression.type=BLOCK; Globally - i,e file is executed when you launch the hive shell Put the above in location in CDH /etc/hive/conf.cloudera.hive1 if dont find one you can always create .hiverc file Please refer this link for more Create Table properties https://www.cloudera.com/documentation/enterprise/5-6-x/topics/impala_create_table.html

csguna · ‎03-07-2017

You may not have appropriate Jar in your class path thats the reaon it is throwing java.lang.NoClassDefFoundError i belive you are missing the in the httpclient-4.2.jar in your Java application classpath. When you extra the jar you could see the below class. org.apache.http.client.utils.URIUtils.class

csguna · ‎03-02-2017

I belive the problem might be in this configuration file . did you change the localhost into your "hostname" - in Server_host in the below configuration. /etc/cloudera-scm-agent/config.ini server_host=localhost change it to server_host= - to the host were you installed CM then sudo service cloudera-scm-server-db start $ sudo service cloudera-scm-server start this should help you to connect to CM via browser

csguna · ‎03-01-2017

in mapred-site.xml mapreduce.map.memory.mb = mapreduce.task.io.sort.mb =

csguna · ‎02-24-2017

Use the event desearlizer You can use BlobDeserializer - if you want to parse the whole file inside one event. or You can use Line - one event per line of text input. Refer the link https://flume.apache.org/FlumeUserGuide.html#event-deserializers

Online	Offline
Last Visited	‎10-28-2024 06:24 AM

Member Since	‎05-16-2016 09:33 PM
Last Visited	‎10-28-2024 06:24 AM
Posts	785
Kudos received	112

Cloudera Community

Re: Kerberos / Sentry Integration

Re: How to upgrade Hive from 2.1 to 3.0 via CDH 6....

Re: How does nameservice id works for HA, how does...

Re: What license does the express edition fall und...

Re: Sqoop2 over Sqoop1 in CDH6

Re: Parquet table snappy compressed by default

Re: i am also having this error please let me know...

Re: Flume ingestion error ( need solution)

Re: Parquet table snappy compressed by default

Re: Parquet table snappy compressed by default

Re: Parquet table snappy compressed by default

Re: java.lang.NoClassDefFoundError: org/apache/htt...

Re: Cloudera Manager Server Panel doesn't work pro...

Re: How to see Mapreduce Spill Disk Activity

Re: CSV files stored in partition to HDFS