Member since
04-08-2019
115
Posts
97
Kudos Received
9
Solutions
My Accepted Solutions
Title | Views | Posted |
---|---|---|
4190 | 04-16-2016 03:39 AM | |
2264 | 04-14-2016 11:13 AM | |
3917 | 04-13-2016 12:31 PM | |
4917 | 04-08-2016 03:47 AM | |
3922 | 04-07-2016 05:05 PM |
11-20-2015
12:06 PM
@gopal, Yes. The input JSON is all in single line. Enabled debugging, to see if the select * on hive console is using a different Serde than the one defined in the Table. however it looks thats not the case, Enabled the debug option in hive cli
serialization.lib=org.openx.data.jsonserde.JsonSerDe,
... View more
11-20-2015
11:59 AM
@Ancil McBarnett, i am using the 2nd one here. Which is quite popular and well tested as it appers
... View more
11-20-2015
11:58 AM
@Artem Ervits. Have used the last create external statement . If i do select * from json_serde_table ; i get a JSON output.but not in a valid format.
... View more
11-19-2015
08:19 PM
1 Kudo
I am using https://github.com/rcongiu/Hive-JSON-Serde to query the JSON data via hive. As part of testing, I am using an external table to query the JSON plain text file in HDFS. i am able to query the data from hive using select, However when i do select * from JSON_EXTERNAL_TABLE limit 1, the output is an Invalid JSON though the message in HDFS is a valid JSON. Is this an expected one ?
... View more
Labels:
- Labels:
-
Apache Hive
11-11-2015
04:08 PM
1 Kudo
@Sourygna Luangsay. We used syslogtcp for our project. Which is struggling for between 500-1000 events / seconds. Looks like multiport_syslogtcp uses Apache Mina (https://mina.apache.org/) having High-performance asynchronous TCP library, which provides better throughput on multicore machines even when using single TCP port.
... View more
10-29-2015
07:07 PM
Thanks @David Schorow. I am looking for some guidance on the current release. As we are doing it right now.
... View more
10-29-2015
04:53 PM
2 Kudos
I am working Chef blueprint deployment where in blueprint json memory values gets adjusted based on the system memory available. the target system system memory can be from 8 GB to 145GB. HDP will be used for running OpenSOC.(Storm,Kafka,HBase,HDFS) For Example Kafka,KAFKA Heap > 5GB would not give any benefit whatever the source memory is. So looking for automic sizing of the componetns(Hbase,Storm,AMS,HDFS) where more memory would help the system to perform better . This is for HDP 2.2.8 and Ambari 2.1.2.
... View more
Labels:
- Labels:
-
Apache Ambari
10-27-2015
05:47 PM
Hi Neeraj.I am mainly looking at log4j stuff.
... View more
10-27-2015
05:46 PM
Thanks a lot for all the pointers.
... View more
10-27-2015
09:55 AM
Kafka error logs are getting filled bringing the kafka down.Looking for options to purge the old kafka errors logs. Logs that are getting fillled are server.log.**. kafka.out.
... View more
Labels:
- Labels:
-
Apache Kafka