Member since
09-25-2016
34
Posts
1
Kudos Received
2
Solutions
My Accepted Solutions
Title | Views | Posted |
---|---|---|
8449 | 08-24-2017 09:36 AM | |
4578 | 08-17-2017 08:57 AM |
08-02-2018
11:50 PM
Is there a way we can refresh multiple partitions in a single statement?. What I am trying to execute is something like "refresh <table_name> partition ( col in ('val1','val2',val3'......))". So that I do not have to fire multiple refresh queries. If it is not present, are there plans to bring this feature anytime in the near future?.
... View more
10-26-2017
03:55 AM
You might want to try map parquet tables by index, rather than column name: SET parquet.column.index.access=true;
... View more
09-18-2017
06:07 AM
2 Kudos
You can pip install impala-shell from github: $ pip install git+git://github.com/dknupp/impala-shell.git afterwards you can run impala-shell to connect to your cluster: $ impala-shell -i my_impalad.foo.cloudera.com For more information go to: https://github.com/dknupp/impala-shell
... View more
09-15-2017
09:43 AM
Finding logs manually in machine sound very brute force; I was thinking more of an API or CLI option to find logs Anyway the main issue we're trying to solve is access to logs to all developers in prod environment. Our node managers are behind the bars and not accessible ( any port or web ) to develoeprs and it's unlikely to happen. So we're trying to find a way to proxy the logs. I discovered that there is a jobhistory proxy to look at completed jobs / yarn apps but I coudln't get it working for running app. Is there any trick / way to access running app's logs like above ? http://resourcemanager.xyz.com:19888/jobhistory/logs//dataNode.com:8041/container_id_000001/container_id_000001/root
... View more
09-07-2017
12:03 PM
1 Kudo
@Telematics, In Cloudera Manager, edit Proxy User Configuration What did you enter in the field? It should look like this, for example: joe=alice,bob;hue=*;admin=* See the Description of Proxy User Configuration in Cloudera Manager (click the question mark next to the property) -Ben
... View more
08-24-2017
04:01 PM
I'm afraid Impala is not yet able to recognize that only two partitions need to be scanned. We're aware of the gap and that specific optimization is tracked by: https://issues.apache.org/jira/browse/IMPALA-2108 For now, you can manually rewrite your query as suggested in the JIRA as follows: select id, yyyymmdd, group_id, test from dwh.table where ((id='1a' and yyyymmdd=20170815 and group_id=1) OR (id='2b' and yyyymmdd=20170811 and group_id=2)) AND ((yyyymmdd=20170811 and group_id=2) OR (yyyymmdd=20170815 and group_id=1)) or alternatively, use a union: select id, yyyymmdd, group_id, test from dwh.table where id='1a' and yyyymmdd=20170815 and group_id=1 union all select id, yyyymmdd, group_id, test from dwh.table where id='2b' and yyyymmdd=20170811 and group_id=2
... View more
08-24-2017
09:36 AM
Using cloudera manager goto Sentry->Configurations Add users/groups to following property to allow them create/show roles. Smaller fonts are property name in the configuration file while regular fonts are display name of the property in the CM. Admin Groups sentry.service.admin.group Allowed Connecting Users sentry.service.allow.connect
... View more
08-16-2017
05:34 PM
This is a known bug [1] fixed in the upcoming 2.10.0 release. [1] https://issues.apache.org/jira/browse/IMPALA-5657
... View more