About hades_63146

ggangadharan · ‎07-14-2023

If my understanding is correct, the schema is altered for different input files, which implies that the data itself lacks a structured schema. Given the frequent changes in the schema, it is advisable to store the data in a column-oriented system such as HBASE. The Same HBASE data can be accessed through spark using HBase-Spark Connector. Ref - https://docs.cloudera.com/cdp-private-cloud-base/7.1.8/accessing-hbase/topics/hbase-example-using-hbase-spark-connector.html

Online	Offline
Last Visited	‎02-06-2023 02:06 AM

Member Since	‎02-01-2023 08:30 AM
Last Visited	‎02-06-2023 02:06 AM
Posts	2

Cloudera Community

Re: Hive with spark table schema changes sensitivi...