Member since
09-25-2015
356
Posts
382
Kudos Received
62
Solutions
My Accepted Solutions
Title | Views | Posted |
---|---|---|
1259 | 11-03-2017 09:16 PM | |
1059 | 10-17-2017 09:48 PM | |
1693 | 09-18-2017 08:33 PM | |
1920 | 08-04-2017 04:14 PM | |
2034 | 05-19-2017 06:53 AM |
10-13-2015
06:36 PM
1 Kudo
http://hortonworks.com/products/releases/hdp-2-2/#add_ons
... View more
10-13-2015
03:56 PM
4 Kudos
Can you try the following connection url (observe the / after the <ZOOKEEPER QUORUM>)? jdbc:hive2://<ZOOKEEPER QUORUM>/;serviceDiscoveryMode=zooKeeper;zooKeeperNamespace=hiveserver Above is for binary mode, for http mode jdbc:hive2://<ZOOKEEPER QUORUM>/;serviceDiscoveryMode=zooKeeper;zooKeeperNamespace=hiveserver;transportMode=http;httpPath=cliservice For secure environments you will additionally have to add the hive principal, eg. jdbc:hive2://<ZOOKEEPER QUORUM>/;serviceDiscoveryMode=zooKeeper;zooKeeperNamespace=hiveserver;principal=hive/_HOST@EXAMPLE.COM;transportMode=http;httpPath=cliservice
... View more
10-13-2015
03:33 PM
2 Kudos
These permissions should be enough for the oozie user on Oracle: GRANT SELECT_CATALOG_ROLE TO <OOZIE USER>;
GRANT CONNECT, RESOURCE TO <OOZIE USER>; For the other databases MySQL and PostgresQL I agree with @David Streever that the permission needs to be set to just Oozie database. Additionally we want to limit the access from just the oozie host. On MySQL this can be: GRANT ALL PRIVILEGES ON <OOZIE DB>.* TO '<OOZIE USER>'@'<OOZIE HOST>'
... View more
10-12-2015
05:42 PM
1 Kudo
Look at the Oozie coordinator documentation, the timezone value will be "America/New_York" and start would be something like "2015-10-12T08:00Z". Alternatively you can do a cron like thing, see the example in the following blog. Based on that it can be something like: <coordinator-app name="weekdays-at-eight-pm"
frequency="0 20 * * 2-6"
start="${start}" end="${end}" timezone="America/New_York"
xmlns="uri:oozie:coordinator:0.2">
...
</coordinator-app>
... View more
10-12-2015
04:36 PM
4 Kudos
There is an open intermittent issue where Hive View fails handling submission of multiple statements in a single query/submit. Is that the case here?
... View more
10-09-2015
09:09 PM
Can you shed more details on what type of configuration change was applied using Ambari that triggered the hive cli hang?
... View more
10-09-2015
07:18 PM
1 Kudo
Check out http://sqoop.apache.org/docs/1.4.2/SqoopUserGuide.html#_large_objects
... View more
10-09-2015
06:44 PM
2 Kudos
We support both Oracle 11g r2 and Oracle 12c.
... View more
10-09-2015
04:46 PM
1 Kudo
Its not that MR gets resources, Tez just pre-allocates resources, it starts a YARN application as soon as you invoke the Hive CLI. In case of MR a YARN application is launched only when its required by the executing query. This is a performance optimization in Hive on Tez saving the user the overhead of launching a AM everytime a new query is run in the same Hive CLI session.
... View more
10-09-2015
03:43 PM
5 Kudos
One thing to check is if they are using tez. If they are, it could be possible that they do not have any capacity on their YARN cluster to launch AM. So the Hive CLI waits indefinitely to launch an AM. One thing they can do to alleviate this is to launch hive cli as "hive -hiveconf hive.execution.engine=mr".
... View more
10-09-2015
03:27 PM
Yes, that is not documented as its not a common configuration/practice.
... View more
10-08-2015
07:49 PM
2 Kudos
Row based security can be achieved through SQL Standard Based Hive Authorization.You can create a view with the filter from the original table and then GRANT permissions to role or individual user.
... View more
10-08-2015
02:22 PM
1 Kudo
Yes, check out http://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.3.0/bk_yarn_resource_mgt/content/configuring_node_labels.html
... View more
10-08-2015
02:18 PM
1 Kudo
Yes, those are the right ones, we are in the process of pushing to the web, should be available in a day or two.
... View more
10-07-2015
11:30 PM
1 Kudo
From the code it looks like ColumnPruner is always applied by optimizer and there is no way to exclude it after HIVE-4113. It would be good to get more details on the scenario where you ran into this. Have you tried this query with Hive v0.14 or later?
... View more
10-07-2015
05:09 PM
1 Kudo
Coming very soon.
... View more
10-07-2015
05:08 PM
1 Kudo
For that you will need to get access to javax.security.auth.kerberos.KerberosTicket, the class in Hadoop UesrGroupInformation provides API to manage that, however the closest method to deal with expirations is checkTGTAndReloginFromKeytab which is likely not what you want.
... View more
10-07-2015
04:51 PM
1 Kudo
The trick will be to manage separate hive configs for each HiveServer2 instance. You will need to use different values for authentication properties and the port on which to start.
... View more
10-07-2015
03:24 PM
1 Kudo
One way you can achieve the transformation of your CSV data to ORC would be to do the following: 1. Register your CSV GZ data as a text table, something like: create table <tablename>_txt (...) location '...'; 2. Create equivalent ORC table create table <tablename>_orc (...) stored as orc; 3. Populate the data into equivalent ORC table insert overwrite table <tablename>_orc select * from <tablename>_txt; I have used this in the past and worked for me.
... View more
10-07-2015
02:37 AM
Could be a real bug. What is HDP/hive version you are using?
... View more
10-07-2015
12:41 AM
1 Kudo
You can delete Hive tables by calling "drop table <tablename> purge;", this will skip the trash. If this is for testing purposes you can temporarily set fs.trash.interval to 0 and restart namenode. This will globally disable trash collection on HDFS so should only be employed during testing. On your last question about the support of TDE feature, it was available starting HDP 2.3.
... View more
10-06-2015
06:37 PM
1 Kudo
Note that when running more that one Hiveserver2 instances registered on zookeeper you also get load balancing when client uses the ZK info in JDBC url. ZooKeeper responds to client requests by randomly passing a link to one of the active HS2 instances.
... View more
10-06-2015
05:30 PM
One more thing you can try is to give the absolute path to the hive-site.xml in job-xml tag.
... View more
10-06-2015
12:56 AM
Any name other than hive-site.xml should have potentially worked. Will update here if I think of anything else. One question (maybe a stupid one) but have you uploaded this on HDFS after changes?
... View more
10-05-2015
11:38 PM
1 Kudo
I think it should have worked, anyways a workaround you could try is to rename hive-site.xml to oozie-hive-site.xml under script directory and use that path in the workflow xml.
... View more
10-05-2015
10:50 PM
1 Kudo
I doubt if you can build rpm on your own easily as HDP uses custom packaging scheme /usr/hdp/<version>/<component>. I don't think Maven is used to build these. @ashish@hortonworks.com, @gkesavan@hortonworks.com can you confirm?
... View more
10-05-2015
04:49 PM
2 Kudos
Apache Drill supports JSON as self describing data format, you can find the usage here. In Hive, HCatalog supports JSON as serde format for reading and writing data into tables.
... View more
10-04-2015
11:39 PM
2 Kudos
You start the datanodes as root user in secure setup. Did you do that?
... View more
10-04-2015
11:20 PM
1 Kudo
One possibility this can happen if you previously were on Hive 0.12 and you upgraded the binaries to Hive 0.13 and started metastore, likely datanucleus.autoCreateSchema was set to true. This may have already upgraded some of the tables including the table DBS. Now when you run the schemaTool it fails.
... View more
10-04-2015
10:22 PM
3 Kudos
Flume provides a Host interceptor that will inserts a header with key host or a configured key whose value is the hostname or IP address of the host, based on configuration. If you want to set a specific key value in the header you can use the Static Interceptor. You should be able to use %{host} in a latter sink. If the host name that you want to add is the hostname where the agent is running and its an HDFS sink you can use %{host} directly without the need of interceptor, see here.
... View more
- « Previous
- Next »