Member since
12-27-2016
73
Posts
34
Kudos Received
5
Solutions
My Accepted Solutions
Title | Views | Posted |
---|---|---|
24306 | 03-23-2018 09:21 PM | |
2063 | 02-05-2018 07:08 PM | |
8329 | 01-15-2018 07:21 PM | |
1882 | 12-01-2017 06:35 PM | |
5106 | 03-09-2017 06:21 PM |
01-31-2018
05:08 PM
1 Kudo
In SPARK-20901 `Feature Parity for ORC with Parquet`, you can see the
issue links marked as `is blocked by`. Among them, the following issues
are what you want to see for ORC library,
- SPARK-21422 Depend on Apache ORC 1.4.0 - SPARK-22300 Update ORC to 1.4.1
In addition to that, the following will convert Hive ORC table into Spark data sources tables to use Apache ORC 1.4.1. - SPARK-22279 Turn on spark.sql.hive.convertMetastoreOrc by default
... View more
01-16-2018
04:55 PM
As of now, Apache JIRA is `Maintenance in progress`. So, I cannot give you the link. The umbrella ORC JIRA is https://issues.apache.org/jira/browse/SPARK-20901.
... View more
01-16-2018
04:54 PM
If you can wait for it, Apache Spark 2.3 will be released with Apache ORC 1.4.1. There are many ORC patch in Hive. Apache Spark cannot sync it promptly. So, in Apache Spark, we decide to use the latest ORC 1.4.1 library instead of upgrading Hive 1.2.1 library. From Apache Spark 2.3, Hive ORC table is converted into ORC data sources tables by default and uses ORC 1.4.1 library to read it. Not only your issue but also vectorization on ORC are supported. Anyway, again, HDP 2.6.3+ is already shipped with ORC 1.4.1 with vectorization, too.
... View more
01-15-2018
07:21 PM
2 Kudos
Hi, @Rajiv Chodisetti . It's related to HIVE-13232 (fixed in Hive 1.3.0, 2.0.1, 2.1.0), but all Apache Spark still uses Hive 1.2.1 library. Could you try HDP 2.6.3+ (2.6.4 is the latest one). HDP Spark 2.2 has that fixed hive library.
... View more
01-02-2018
08:56 PM
Let's ping the maintainer of SHC. Ping, @wyang Could you help @Eric Hanson?
... View more
01-02-2018
08:37 PM
Hi, @Eric Hanson . SHC seems to work for both Spark 1.6.3 and Spark 2.2. Could you share your specific problem with SHC here?
... View more
12-05-2017
12:05 AM
I see. Yes, Ranger and Parquet does. I believe you can find a way for your requirements!
... View more
12-04-2017
05:12 PM
1 Kudo
In addition to that, STS supports Spark SQL syntax since v2.0.0. If you want to use Spark SQL Syntax with SQL 2003 support, it's a good choice. Also, you can use Spark-specific syntax like `CACHE TABLE`, too.
... View more