About Tomas79

PD · ‎09-20-2018

Thank you 🙂 that resolved it for me.

dhan · ‎09-20-2018

I think that this log line highlights the error. PluginExceptionCondition{type=ERROR, exceptionInfo={message=Encountered AWS exception, awsErrorCode=InvalidParameterValue, awsErrorMessage=Duplicate tag key 'Name' specified.}} Are you trying to add a "Name" tag to your instances? Cloudera Altus Director uses the "Name" tag, but can be re-configured to use a different tag instead. See https://www.cloudera.com/documentation/director/latest/topics/director_aws_custom_tag_names.html for instructions on how to do so.

Tomas79 · ‎09-20-2018

Try to use https://github.com/rcongiu/Hive-JSON-Serde#jsonserde---a-readwrite-serde-for-json-data

Tomas79 · ‎09-20-2018

Hi, I have tested the correlated and uncorrelated subqueries in Hive CDH 5.13 and you are right. CORRELATED: First the data and test in Impala: [10.197.0.0:21000] > create table work.tbla ( id int, s string ); [10.197.0.0:21000] > create table work.tblb ( id int, s string ); [10.197.0.0:21000] > insert into work.tbla select 100, '2008-01-01' union select 200, '2009-12-01'; Modified 2 row(s) in 3.69s [10.197.0.0:21000] > insert into work.tblb select 100, '2008-01-01' union select 200, '2009-12-01'; Modified 2 row(s) in 3.63s [10.197.0.0:21000] > select * from work.tbla a where exists ( select null from work.tblb b where a.id = b.id and b.s > '2009-10-01' ); +-----+------------+ | id | s | +-----+------------+ | 200 | 2009-12-01 | +-----+------------+ Now in Hive: 0: jdbc:hive2://xxx.com> explain select * from work.tbla a where exists ( select null from work.tblb b where a.id = b.id and b.s > '2009-10-01' ); .. INFO : OK +----------------------------------------------------+--+ | Explain | +----------------------------------------------------+--+ | STAGE DEPENDENCIES: | | Stage-4 is a root stage | | Stage-3 depends on stages: Stage-4 | | Stage-0 depends on stages: Stage-3 | | | | STAGE PLANS: | | Stage: Stage-4 | | Map Reduce Local Work | | Alias -> Map Local Tables: | | sq_1:b | | Fetch Operator | | limit: -1 | | Alias -> Map Local Operator Tree: | | sq_1:b | | TableScan | | alias: b | | filterExpr: ((s > '2009-10-01') and id is not null) (type: boolean) | | Statistics: Num rows: 1 Data size: 30 Basic stats: COMPLETE Column stats: NONE | | Filter Operator | | predicate: ((s > '2009-10-01') and id is not null) (type: boolean) | | Statistics: Num rows: 1 Data size: 30 Basic stats: COMPLETE Column stats: NONE | | Select Operator | | expressions: id (type: int) | | outputColumnNames: _col1 | | Statistics: Num rows: 1 Data size: 30 Basic stats: COMPLETE Column stats: NONE | | Group By Operator | | keys: _col1 (type: int) | | mode: hash | | outputColumnNames: _col0 | | Statistics: Num rows: 1 Data size: 30 Basic stats: COMPLETE Column stats: NONE | | HashTable Sink Operator | | keys: | | 0 id (type: int) | | 1 _col0 (type: int) | | | | Stage: Stage-3 | | Map Reduce | | Map Operator Tree: | | TableScan | | alias: a | | filterExpr: id is not null (type: boolean) | | Statistics: Num rows: 1 Data size: 30 Basic stats: COMPLETE Column stats: NONE | | Filter Operator | | predicate: id is not null (type: boolean) | | Statistics: Num rows: 1 Data size: 30 Basic stats: COMPLETE Column stats: NONE | | Map Join Operator | | condition map: | | Left Semi Join 0 to 1 | | keys: | | 0 id (type: int) | | 1 _col0 (type: int) | | outputColumnNames: _col0, _col1 | | Statistics: Num rows: 1 Data size: 33 Basic stats: COMPLETE Column stats: NONE | | File Output Operator | | compressed: false | | Statistics: Num rows: 1 Data size: 33 Basic stats: COMPLETE Column stats: NONE | | table: | | input format: org.apache.hadoop.mapred.TextInputFormat | | output format: org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat | | serde: org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe | | Local Work: | | Map Reduce Local Work | | | | Stage: Stage-0 | | Fetch Operator | | limit: -1 | | Processor Tree: | | ListSink | | | +----------------------------------------------------+--+ 69 rows selected (0.311 seconds) UNCORRELATED - Hive fails: 0: jdbc:hive2://xxxx.com> explain select * from work.tbla a where exists ( select null from work.tblb b where b.s > '2009-10-01' ); Error: Error while compiling statement: FAILED: SemanticException Line 1:65 Invalid SubQuery expression ''2009-10-01'' in definition of SubQuery sq_1 [ exists ( select null from work.tblb b where b.s > '2009-10-01' ) ] used as sq_1 at Line 1:39: For Exists/Not Exists operator SubQuery must be Correlated. (state=42000,code=40000) Impala ok [10.197.0.0:21000] > select * from work.tbla a where exists ( select null from work.tblb b where b.s > '2009-10-01' ); +-----+------------+ | id | s | +-----+------------+ | 100 | 2008-01-01 | | 200 | 2009-12-01 | +-----+------------+ Fetched 2 row(s) in 0.19s

bgooley · ‎09-18-2018

@Tomas79, If Hue is shut down from Cloudera Manager, those health checks should cease automatically. If Hue exited unexpectedly you may get these alerts The agent log will tell us much more about the story since it will show what it knew of the Hue process from a start/stop perspective and also it will show any issues getting results from the Hue UI.

bgooley · ‎09-18-2018

@sid2707, Very cool and thanks for sharing your solution!

bgooley · ‎09-18-2018

@gerasimos, What are your goals for your failover or backup strategy? BDR schedules only replicate on a schedule in one direction. "Active-Active" concepts may not truly apply to CDH depending on what you mean by that.

supahcraig · ‎09-13-2018

Here is an example of that log entry. There are tons of them in my logs. I'm on Streamsets 3.4.2 and CDH 5.14. I would love to understand what the root cause of this is. Removing server 053a1bbcc6b243b0a9c90f37b336fac1 from this tablet's cache 747423b5bf834fbb9a6508aae8eb1f63 AsyncKuduClient *admin 0 New I/O worker #965

Johnny_Bach · ‎09-13-2018

agree to @Tomas79 , in restarting the services for new certs to come into effect

Tomas79 · ‎09-12-2018

For example for 4GB map and reduce memory set this via Hive set mapreduce.map.memory.mb = 4096 set mapreduce.reduce.memory.mb = 4096

Online	Offline
Last Visited	‎01-14-2021 05:46 AM

Member Since	‎07-01-2015 06:03 AM
Last Visited	‎01-14-2021 05:46 AM
Posts	460
Kudos received	79

Cloudera Community

Re: Read service-wide configuration values via API

Re: Cloudera Altus - create CM with existing postg...

Re: Spark job getting failed with Jupyter notebook

Re: Create Parameterized view Impala

Re: Unable to access NameNode in cross realm trust...

Re: Impala String Date - greater than or less than...

Re: Cloudera Director failing in aws

Re: InvalidStorageDescriptorException: Impala

Re: Support uncorrelated subqueries in the WHERE c...

Re: HUE - ERROR in handle_mutual_auth

Re: Rack topology script not working ERROR: script...

Re: Setup a CDH cluster for BDR purposes

Re: Getting Row Error for Primary key error on kud...

Re: Renewed TLS Certificates - but browser still ...

Re: select count(*) inside hive giving error