Member since
11-18-2014
196
Posts
18
Kudos Received
8
Solutions
My Accepted Solutions
Title | Views | Posted |
---|---|---|
8668 | 03-16-2016 05:54 AM | |
4005 | 02-05-2016 04:49 AM | |
2851 | 01-08-2016 06:55 AM | |
16316 | 09-29-2015 01:31 AM | |
1732 | 05-06-2015 01:50 AM |
01-17-2017
05:42 PM
Yes. Use a spark-hbase-connector.
... View more
09-08-2016
03:03 AM
After some more testing I found that the following command is working: split '<namespace>:<table_name>', 'NEW_SPLIT_VALUE' I just need to call it once per "pre-split" value I need.
... View more
09-02-2016
11:59 AM
Hey Alina, I have tried this with CDH5.3.10 (a bit newer than your version) and with the new CDH releases, and round seems to be functioning as expected. Connected to: Apache Hive (version 0.13.1-cdh5.3.10) Driver: Hive JDBC (version 0.13.1-cdh5.3.10) Transaction isolation: TRANSACTION_REPEATABLE_READ Beeline version 0.13.1-cdh5.3.10 by Apache Hive 0: jdbc:hive2://localhost:10000/default> show databases; +--------------------------------------------------------------------------------------------------+--+ | database_name | +--------------------------------------------------------------------------------------------------+--+ | cloudera_manager_metastore_canary_test_db_hive_1_hivemetastore_03567ed21d1b892110ff1cd925ae25bd | | default | +--------------------------------------------------------------------------------------------------+--+ 2 rows selected (1.916 seconds) 0: jdbc:hive2://localhost:10000/default> use default; No rows affected (0.127 seconds) 0: jdbc:hive2://localhost:10000/default> show tables; +------------+--+ | tab_name | +------------+--+ | sample_07 | | sample_08 | +------------+--+ 2 rows selected (0.153 seconds) 0: jdbc:hive2://localhost:10000/default> select round((20456079/100000),5); +------------+--+ | _c0 | +------------+--+ | 204.56079 | +------------+--+ 1 row selected (30.771 seconds) 0: jdbc:hive2://localhost:10000/default> select round((20456079/100000),1); +--------+--+ | _c0 | +--------+--+ | 204.6 | +--------+--+ 1 row selected (23.981 seconds) 0: jdbc:hive2://localhost:10000/default> select round((20456079/100000),3); +----------+--+ | _c0 | +----------+--+ | 204.561 | +----------+--+ 1 row selected (23.257 seconds) 0: jdbc:hive2://localhost:10000/default> select round(150,3); +------+--+ | _c0 | +------+--+ | 150 | +------+--+ 1 row selected (22.973 seconds) 0: jdbc:hive2://localhost:10000/default> Against CDH5.8, same results. The big difference between the 2 releases is the time of execution. It consistently take 23-30 seconds on CDH5.3.10 where as it takes about 0.1 seconds on CDH5.8.0. It is possible that my enviroment for CDH5.3.10 is not so kosher. I do not have a CDH5.3.4 environment anymore. But let me know if you would like me to try it with CDH5.3.4. Hope this helps.
... View more
05-17-2016
10:06 AM
Hi Romainr, I am using cdh5.5.2 with Hue 3.9.0. As part of Hive Config, I have set Hive Auxiliary JARs Directory as '/usr/lib/hive/lib' which contains the Json Serde Jar. Further I have - 1) Using Hive Command Line - Created a table in Hive, with [ ROW FORMAT SERDE 'com.cloudera.hive.serde.JSONSerDe' ] 2) Loaded sample data to Hive table 3) Ran 'select * tableName;' successfully. 4) When I ran the same in < Hue - Hive Editor >, I am facing the following error- FAILED: RuntimeException MetaException(message:java.lang.ClassNotFoundException Class com.cloudera.hive.serde.JSONSerDe not found) Can you please advise ?
... View more
05-16-2016
12:09 AM
Can you please help me in this? http://community.cloudera.com/t5/Advanced-Analytics-Apache-Spark/Pros-and-Cons-of-fetching-data-using-SparkSQL-vs-Hive-in-Spark/m-p/40855#U40855
... View more
04-12-2016
08:17 AM
Thank you! Indeed, I recreated all the tables... since I have the trash disabled, I had nothing in trash... However, this is a very complete reply. Thank you!
... View more
04-12-2016
08:12 AM
I'm note sure that I can change all the sources in order to post to all my Flume agents, but this is an interesting solution. Thank you!
... View more
03-14-2016
09:52 AM
1 Kudo
Hi Alina, you can specify query options by appending them to the connection string for the JDBC driver, separated by semicolon. For example jdbc:impala://me.cloudera.com:21050/default;appx_count_distinct=1 Settings you add there will be active for all queries during that JDBC connection. If you want to change defaults for the impalad daemon lifetime you can do so by adjusting IMPALA_SERVER_ARGS as described here: http://www.cloudera.com/documentation/enterprise/5-4-x/topics/impala_config_options.html Cheers, Lars
... View more
03-02-2016
11:24 PM
1 Kudo
What action triggered the stacktrace? The stacktrace is from deep within Spring and suggests system level issue, e.g. out of memory. A few things to check: - server log (/var/log/cloudera-scm-server/cloudera-scm-server.log) - management daemon logs (/var/log/cloudera-scm-firehose/*.log - check "Hosts"->"All Hosts" for memory pressure. The "Resources" tab of individual Host page may help as well
... View more
02-22-2016
02:29 AM
There was a difference in the amount of average load. Since it's computed when we do the command, it may vary.. I forgot to mention one question , what is the difference in the information that I get in the commands status and status 'replication'?
... View more