Member since: 11-22-2016
Posts: 50
Kudos Received: 3
Solutions: 1
My Accepted Solutions
Title | Views | Posted
---|---|---
 | 3181 | 01-17-2017 02:54 PM
03-13-2023
01:11 PM
You are a lifesaver. I had been struggling with this for 7-8 hours and my deadline to submit a case study was close. Thanks a lot!
06-13-2022
01:54 AM
@PriyalPotnis as this is an older post, you would have a better chance of receiving a resolution by starting a new thread. This will also be an opportunity to provide details specific to your environment that could aid others in assisting you with a more accurate answer to your question. You can link this thread as a reference in your new post.
11-29-2021
02:10 AM
I found another simple solution for this issue. First, find the faulty partition in the partition list with the command show partitions table_name; then rename the faulty partition to another name in the correct format for your partition. In my case, I used ALTER TABLE table_name PARTITION (date_flag='2021-11-25_bak') RENAME TO PARTITION (date_flag='2021-01-01');
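The two steps above as a HiveQL sketch (the table and partition names are the ones from this post; substitute your own):

```sql
-- Step 1: list all partitions to spot the faulty one
SHOW PARTITIONS table_name;

-- Step 2: rename the faulty partition back to a value in the correct format
ALTER TABLE table_name
  PARTITION (date_flag='2021-11-25_bak')
  RENAME TO PARTITION (date_flag='2021-01-01');
```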
12-14-2020
02:37 AM
This one-liner pipes a scan of the 'emp' table through the HBase shell and uses awk to cut a single field out of each output line: echo "scan 'emp'" | $HBASE_HOME/bin/hbase shell | awk -F'=' '{print $2}' | awk -F':' '{print $2}' | awk -F',' '{print $1}'
10-05-2017
07:06 AM
Yes, the metastore does store the column names in its database. To view the column names of your table: hive> show columns in table_name; or hive> set hive.cli.print.header=true; to print the header with query results.
05-18-2017
02:07 AM
1 Kudo
Thanks for sharing the code of your solution. I've also found that just making the HiveContext variable lazy works:

import org.apache.spark.SparkConf
import org.apache.spark.sql.Row
import org.apache.spark.streaming.{Seconds, StreamingContext}

val sparkConf = new SparkConf().setAppName("StreamHDFSdata")
sparkConf.set("spark.dynamicAllocation.enabled", "false")
val ssc = new StreamingContext(sparkConf, Seconds(5))
ssc.checkpoint("/user/hdpuser/checkpoint")
val sc = ssc.sparkContext
val smDStream = ssc.textFileStream("/user/hdpuser/data")
val smSplitted = smDStream.map( x => x.split(";") ).map( x => Row.fromSeq( x ) )
...
// lazy defers creation of the HiveContext until it is first used inside foreachRDD
lazy val sqlContext = new org.apache.spark.sql.hive.HiveContext(sc)
smSplitted.foreachRDD( rdd => {
  // use sqlContext here
} )
04-04-2017
01:50 AM
How did you solve it? What things does one have to check?
03-22-2017
02:44 PM
@DWinters How were you able to overcome this issue? Could you please post a sample solution?
01-17-2017
02:54 PM
Fixed it like below: df.withColumn("Timestamp_val", lit(current_timestamp)) The second argument to .withColumn() expects a named column, and val newDF = dataframe.withColumn("Timestamp_val", current_timestamp()) will not generate a named column; hence the exception.
02-08-2017
10:51 PM
1 Kudo
Want to point out a couple of things. date is a Hive keyword; to have a column called date you need to write `date` string. If you specify that a table is partitioned (in your case, partitioned by (`date` string)), that partition column is implicitly treated as the last column of the table, so there is no need to specify it in the schema again. CREATE EXTERNAL TABLE user ( userId BIGINT, type INT, level TINYINT ) PARTITIONED BY (`date` STRING) LOCATION '/external_table_path'; As @Steven O'Neill mentioned, when ingesting data into HDFS you should create sub-directories like date=2016-01-01 under your table location. Then your ALTER TABLE statement will work.
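The ALTER TABLE statement referred to above is not shown in the post; a minimal HiveQL sketch, assuming the table definition above and a date=2016-01-01 sub-directory already present under the table location:

```sql
-- Register the existing HDFS sub-directory as a partition of the external table
ALTER TABLE user ADD PARTITION (`date`='2016-01-01')
  LOCATION '/external_table_path/date=2016-01-01';

-- Verify the partition is now visible
SHOW PARTITIONS user;
```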