Member since
07-01-2015
460
Posts
78
Kudos Received
43
Solutions
My Accepted Solutions
Title | Views | Posted |
---|---|---|
1365 | 11-26-2019 11:47 PM | |
1312 | 11-25-2019 11:44 AM | |
9538 | 08-07-2019 12:48 AM | |
2198 | 04-17-2019 03:09 AM | |
3528 | 02-18-2019 12:23 AM |
09-20-2018
12:57 PM
Thank you 🙂 that resolved it for me.
... View more
09-20-2018
12:07 PM
I think that this log line highlights the error. PluginExceptionCondition{type=ERROR, exceptionInfo={message=Encountered AWS exception, awsErrorCode=InvalidParameterValue, awsErrorMessage=Duplicate tag key 'Name' specified.}} Are you trying to add a "Name" tag to your instances? Cloudera Altus Director uses the "Name" tag, but can be re-configured to use a different tag instead. See https://www.cloudera.com/documentation/director/latest/topics/director_aws_custom_tag_names.html for instructions on how to do so.
... View more
09-20-2018
12:05 PM
Try to use https://github.com/rcongiu/Hive-JSON-Serde#jsonserde---a-readwrite-serde-for-json-data
... View more
09-20-2018
12:02 PM
Hi, I have tested the correlated and uncorrelated subqueries in Hive CDH 5.13 and you are right. CORRELATED: First the data and test in Impala: [10.197.0.0:21000] > create table work.tbla ( id int, s string );
[10.197.0.0:21000] > create table work.tblb ( id int, s string );
[10.197.0.0:21000] > insert into work.tbla select 100, '2008-01-01' union select 200, '2009-12-01';
Modified 2 row(s) in 3.69s
[10.197.0.0:21000] > insert into work.tblb select 100, '2008-01-01' union select 200, '2009-12-01';
Modified 2 row(s) in 3.63s
[10.197.0.0:21000] > select * from work.tbla a where exists ( select null from work.tblb b where a.id = b.id and b.s > '2009-10-01' );
+-----+------------+
| id | s |
+-----+------------+
| 200 | 2009-12-01 |
+-----+------------+ Now in Hive: 0: jdbc:hive2://xxx.com> explain select * from work.tbla a where exists ( select null from work.tblb b where a.id = b.id and b.s > '2009-10-01' );
..
INFO : OK
+----------------------------------------------------+--+
| Explain |
+----------------------------------------------------+--+
| STAGE DEPENDENCIES: |
| Stage-4 is a root stage |
| Stage-3 depends on stages: Stage-4 |
| Stage-0 depends on stages: Stage-3 |
| |
| STAGE PLANS: |
| Stage: Stage-4 |
| Map Reduce Local Work |
| Alias -> Map Local Tables: |
| sq_1:b |
| Fetch Operator |
| limit: -1 |
| Alias -> Map Local Operator Tree: |
| sq_1:b |
| TableScan |
| alias: b |
| filterExpr: ((s > '2009-10-01') and id is not null) (type: boolean) |
| Statistics: Num rows: 1 Data size: 30 Basic stats: COMPLETE Column stats: NONE |
| Filter Operator |
| predicate: ((s > '2009-10-01') and id is not null) (type: boolean) |
| Statistics: Num rows: 1 Data size: 30 Basic stats: COMPLETE Column stats: NONE |
| Select Operator |
| expressions: id (type: int) |
| outputColumnNames: _col1 |
| Statistics: Num rows: 1 Data size: 30 Basic stats: COMPLETE Column stats: NONE |
| Group By Operator |
| keys: _col1 (type: int) |
| mode: hash |
| outputColumnNames: _col0 |
| Statistics: Num rows: 1 Data size: 30 Basic stats: COMPLETE Column stats: NONE |
| HashTable Sink Operator |
| keys: |
| 0 id (type: int) |
| 1 _col0 (type: int) |
| |
| Stage: Stage-3 |
| Map Reduce |
| Map Operator Tree: |
| TableScan |
| alias: a |
| filterExpr: id is not null (type: boolean) |
| Statistics: Num rows: 1 Data size: 30 Basic stats: COMPLETE Column stats: NONE |
| Filter Operator |
| predicate: id is not null (type: boolean) |
| Statistics: Num rows: 1 Data size: 30 Basic stats: COMPLETE Column stats: NONE |
| Map Join Operator |
| condition map: |
| Left Semi Join 0 to 1 |
| keys: |
| 0 id (type: int) |
| 1 _col0 (type: int) |
| outputColumnNames: _col0, _col1 |
| Statistics: Num rows: 1 Data size: 33 Basic stats: COMPLETE Column stats: NONE |
| File Output Operator |
| compressed: false |
| Statistics: Num rows: 1 Data size: 33 Basic stats: COMPLETE Column stats: NONE |
| table: |
| input format: org.apache.hadoop.mapred.TextInputFormat |
| output format: org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat |
| serde: org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe |
| Local Work: |
| Map Reduce Local Work |
| |
| Stage: Stage-0 |
| Fetch Operator |
| limit: -1 |
| Processor Tree: |
| ListSink |
| |
+----------------------------------------------------+--+
69 rows selected (0.311 seconds) UNCORRELATED - Hive fails: 0: jdbc:hive2://xxxx.com> explain select * from work.tbla a where exists ( select null from work.tblb b where b.s > '2009-10-01' );
Error: Error while compiling statement: FAILED: SemanticException Line 1:65 Invalid SubQuery expression ''2009-10-01'' in definition of SubQuery sq_1 [
exists ( select null from work.tblb b where b.s > '2009-10-01' )
] used as sq_1 at Line 1:39: For Exists/Not Exists operator SubQuery must be Correlated. (state=42000,code=40000) Impala ok [10.197.0.0:21000] > select * from work.tbla a where exists ( select null from work.tblb b where b.s > '2009-10-01' );
+-----+------------+
| id | s |
+-----+------------+
| 100 | 2008-01-01 |
| 200 | 2009-12-01 |
+-----+------------+
Fetched 2 row(s) in 0.19s
... View more
09-18-2018
12:37 PM
@Tomas79, If Hue is shut down from Cloudera Manager, those health checks should cease automatically. If Hue exited unexpectedly you may get these alerts The agent log will tell us much more about the story since it will show what it knew of the Hue process from a start/stop perspective and also it will show any issues getting results from the Hue UI.
... View more
09-18-2018
11:08 AM
@sid2707, Very cool and thanks for sharing your solution!
... View more
09-18-2018
08:49 AM
@gerasimos, What are your goals for your failover or backup strategy? BDR schedules only replicate on a schedule in one direction. "Active-Active" concepts may not truly apply to CDH depending on what you mean by that.
... View more
09-13-2018
09:40 AM
Here is an example of that log entry. There are tons of them in my logs. I'm on Streamsets 3.4.2 and CDH 5.14. I would love to understand what the root cause of this is. Removing server 053a1bbcc6b243b0a9c90f37b336fac1 from this tablet's cache 747423b5bf834fbb9a6508aae8eb1f63 AsyncKuduClient *admin 0 New I/O worker #965
... View more
09-13-2018
07:06 AM
agree to @Tomas79 , in restarting the services for new certs to come into effect
... View more
09-12-2018
06:17 AM
1 Kudo
For example for 4GB map and reduce memory set this via Hive set mapreduce.map.memory.mb = 4096 set mapreduce.reduce.memory.mb = 4096
... View more