Member since
04-28-2022
1
Post
0
Kudos Received
0
Solutions
04-28-2022
04:56 AM
Hi! We have migrated from Spark 1.6 to 2.4 getting the follow error when saving a dataframe into a parquet file: org.apache.spark.sql.sources.DataSourceRegister: Provider org.apache.spark.sql.execution.datasources.parquet.ParquetFileFormat could not be instantiated When we executed this code: StructType schema = QueryResultsHelper . createStructType ( fields ) ; Dataset < Row > df = sqlContext .createDataFrame( rows , schema ) ; df .coalesce( 1 ).write().format( "parquet" ).mode( SaveMode.Append ).save( stringPath ) ; It fails when saving. I understand is due to the migration. Could you give me some advice to solve this? Thank you very much.
... View more
Labels:
- Labels:
-
Apache Impala
-
Apache Spark