Reply
Highlighted
Contributor
Posts: 56
Registered: ‎07-15-2014
Accepted Solution

Deleting existing DataSets

I started learning kite sdk today and am finding some strange behaviour.

 

so I had created this "users" dataset before which I want to delete and re-create. to do this I do the following steps

 

1. kite-sdk delete users

 

In order to confirm if the delete is successful I do a 

 

kite-sdk show users

Hive table no found: default.users

 

OK so great users is deleted. Now I do a 

 

kite-dataset create "users" --schema users.avsc

kite-dataset show users

 

and I get to see a bunch of records.... but wait... I never imported any data yet. so how come kite is showing me data in the users schema?

 

as of now I have just created the schema (the step of kite-dataset csv-import) has not been done yet.

 

so where did the data come from?? is it the data from the previous users table??

 

capture.png

 

Where is this data coming from? I have not imported anything yet....

Cloudera Employee
Posts: 26
Registered: ‎07-08-2013

Re: Deleting existing DataSets

Kite uses the Hive API to drop the table when you tell it to delete the dataset and Hive should take care of dropping the table. Can you check the log of your HiveMetastoreServer to see if there was an error on that side?

 

To get past the error, you can remove the directory by hand.

Contributor
Posts: 56
Registered: ‎07-15-2014

Re: Deleting existing DataSets

I had to delete the directories in HDFS manually It could be that kite-dataset delete command only does a logical delete. this means it only removes the metadata.

 

anyways. doing kite-dataset delete and then a manual delete in HDFS works for me.

 

Announcements
The Kite SDK is a collection of docs, sample code, APIs, and tools to make Hadoop application development faster. Learn more at http://kitesdk.org.