06-13-2020
07:11 AM
Hi Royles,

I am using HiveQL for creating the table and altering it to add new columns. All the operations like MSCK REPAIR TABLE and adding partitions are done from HiveQL; we only read the table from Spark SQL.

After reading your reply, I tried to create the external table, run MSCK REPAIR, and ALTER TABLE to add new columns, all from Spark SQL. I got the results below:
1. No results from Spark when reading data from the table.
2. No results from the Hive shell when reading the table.
3. Looking at the TBLPROPERTIES, the Parquet schema does not match, so there are no results from HiveQL or from Spark.

The only solution I am following so far (for adding new columns to external tables) is:
1. Drop and recreate the table using HiveQL from the Hive shell with all columns (old + new).
2. Manually add the latest partition, which has data for all the new columns added since the table was first created.
3. Query the table from Spark, then check that the TBLPROPERTIES and Parquet schema are reflected and mapped to the Hive columns.
4. If the schemas do not match, e.g. testData in Parquet shows up as testdata in the Hive TBLPROPERTIES, we get null values from Spark.
5. If both schemas match, we can see results from Spark.
6. Then run MSCK REPAIR, which gives me results in both Spark 2.2 and 2.3.

But I feel there must be some other way of adding new columns instead of dropping and recreating the table.
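For reference, a minimal pyspark sketch of one alternative, assuming Spark 2.2+ with Hive support; spark is the shell's SparkSession, and db.events / testData are hypothetical names. Hive's ALTER TABLE ... ADD COLUMNS ... CASCADE plus Spark's spark.sql.hive.caseSensitiveInferenceMode setting may avoid the drop-and-recreate cycle, though this is only a sketch, not a verified fix for this exact setup:

    # Run once from the Hive shell instead of drop-and-recreate
    # (CASCADE pushes the new column into the existing partition metadata):
    #   ALTER TABLE db.events ADD COLUMNS (testData string) CASCADE;

    # Then, from pyspark: have Spark infer the case-sensitive schema from the
    # Parquet files and save it back into TBLPROPERTIES, so that testData is
    # not read as testdata (and returns values instead of nulls).
    spark.conf.set("spark.sql.hive.caseSensitiveInferenceMode", "INFER_AND_SAVE")

    # Pick up any newly written partition directories.
    spark.sql("MSCK REPAIR TABLE db.events")

    # Drop cached metadata so the next read sees the new column.
    spark.catalog.refreshTable("db.events")
    spark.table("db.events").printSchema()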
06-12-2019
11:31 PM
@Yeseswini Hive's VIEW is read-only, please see the doc below: https://cwiki.apache.org/confluence/display/Hive/LanguageManual+DDL#LanguageManualDDL-Create/Drop/AlterView
>> Views are read-only and may not be used as the target of LOAD/INSERT/ALTER.
Insert or update is not supported.
Thanks
Eric
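As a minimal illustration from the pyspark shell (db.orders and db.open_orders are hypothetical names): writes must target the base table, and the view then reflects them.

    spark.sql("CREATE TABLE IF NOT EXISTS db.orders (id INT, status STRING)")
    spark.sql("CREATE VIEW IF NOT EXISTS db.open_orders AS SELECT * FROM db.orders WHERE status = 'OPEN'")

    # This would fail, because the view is read-only:
    # spark.sql("INSERT INTO db.open_orders VALUES (1, 'OPEN')")

    # Insert into the base table instead; the view picks up the new row.
    spark.sql("INSERT INTO db.orders VALUES (1, 'OPEN')")
    spark.sql("SELECT * FROM db.open_orders").show()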
06-03-2019
01:46 PM
From the spark-shell or pyspark shell, use the commands below to access Hive database objects (note there is no trailing semicolon inside spark.sql; some Spark versions reject it with a parse error):

spark.sql("show databases")
spark.sql("select * from databasename.tablename")

or

spark.read.table("databasename.tablename")

You can run any query inside spark.sql and it will return results.
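As a usage example (databasename, tablename, and some_column are placeholders), both forms return a DataFrame that can be transformed further:

    df = spark.read.table("databasename.tablename")
    df.filter(df["some_column"].isNotNull()).show(10)  # some_column is a placeholder
    spark.sql("select count(*) from databasename.tablename").show()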
02-07-2019
06:21 AM
Hi All,
Daily we follow the steps below for a full load (a sketch of steps 1 and 2 follows after this list):
Step 1: Read data from Parquet files into PySpark, apply transformations, and save the data as Parquet files with partitioning and repartitioning.
Step 2: The DataFrame is saved as a new external table in Hive (under a temporary name; it has complex datatypes).
Step 3: Create or alter a view in Hive.
Step 4: The view is used by Tableau through Impala, and by PySpark for some other jobs.
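A minimal pyspark sketch of steps 1 and 2; the path, table name, and columns (load_ts, event_date, id, attrs) are all hypothetical:

    from pyspark.sql import functions as F

    # Step 1: read, transform, and write partitioned Parquet.
    df = spark.read.parquet("/data/source/")
    out = (df.withColumn("event_date", F.to_date(F.col("load_ts")))
             .repartition("event_date"))
    out.write.partitionBy("event_date").mode("overwrite").parquet("/data/staging/daily_load_tmp")

    # Step 2: expose the files as an external Hive table under a temporary name,
    # including a complex column (attrs) of the kind Impala cannot read.
    spark.sql("""
        CREATE EXTERNAL TABLE IF NOT EXISTS db.daily_load_tmp (id INT, attrs MAP<STRING,STRING>)
        PARTITIONED BY (event_date DATE)
        STORED AS PARQUET
        LOCATION '/data/staging/daily_load_tmp'
    """)
    spark.sql("MSCK REPAIR TABLE db.daily_load_tmp")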
The problem is that Impala cannot read complex datatypes, so the view throws an error. If we create the view in Impala instead, it ignores the complex datatypes when creating the view, and then we cannot read those columns from Spark.
We have options like creating two views on the same table (one in Hive and one in Impala), or renaming the newly created external table to the original table name. Currently we are renaming the table. Can someone suggest a better approach? A sketch of the two-view option is below.
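For reference, a minimal sketch of the two-view option from the pyspark shell (db.daily_load_tmp and the view names are hypothetical): one view keeps the complex columns for Spark/Hive, and a second view projects only scalar columns so Impala can evaluate it.

    # View for Spark/Hive consumers: exposes all columns, complex types included.
    spark.sql("""
        CREATE OR REPLACE VIEW db.daily_load_full AS
        SELECT * FROM db.daily_load_tmp
    """)

    # View for Tableau/Impala: projects only the scalar columns, so the view
    # definition never touches the complex datatypes.
    spark.sql("""
        CREATE OR REPLACE VIEW db.daily_load_flat AS
        SELECT id, event_date FROM db.daily_load_tmp
    """)

    # Impala caches metadata, so refresh it from impala-shell afterwards:
    #   INVALIDATE METADATA db.daily_load_flat;

The trade-off is keeping two view definitions in sync on every schema change, but it avoids renaming tables on every load.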
Thanks in advance
02-22-2018
01:19 AM
Thank you. It worked.