Error on Tutorial "How to Process Data with Apache Hive" - Cannot load data into Hive tables

Explorer

Problem loading data to table in this tutorial

http://hortonworks.com/hadoop-tutorial/how-to-process-data-with-apache-hive/

This command

LOAD DATA INPATH '/user/admin/Batting.csv' OVERWRITE INTO TABLE temp_batting;

produces error

H110 Unable to submit statement. Error while compiling statement: FAILED: HiveAccessControlException Permission denied: user [admin] does not have [READ] privilege on [hdfs://sandbox.hortonworks.com:8020/user/admin/elecMonthly_Orc] [ERROR_STATUS]

I created both the user/admin and temp/admin folders. Using the hdfs superuser, I made admin the owner of the file, its folder, and even the parent folder, and I gave full permissions in HDFS. This is clearly shown in Ambari, but the error persists.

Can anyone help? Thanks

1 ACCEPTED SOLUTION

Explorer

It was because, when I thought I was creating an elecMonthly_Orc file, I had actually created an elecMonthly_Orc folder containing three files: _SUCCESS, part-r-r-00000-1a0c14e3-0dd0-42db-abc7-7f655a02f634.orc ... and another similar ORC file. The files within the elecMonthly_Orc directory were owned by Hive, which is why I got the permissions error.

I resolved it by running this from the command line as the hdfs superuser:

hadoop fs -chown admin:admin /user/admin/elecMonthly_Orc/*.*
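
One thing worth checking with the `*.*` pattern above: it only matches names that contain a dot, so a marker file such as `_SUCCESS` would likely keep its old owner. The sketch below illustrates this with ordinary local files in a scratch directory, on the assumption that HDFS glob matching treats `*` and a literal `.` the same way. The scratch directory, the helper `count_matches`, and the part file names are made up for the demonstration.

```shell
# Local illustration only: shows which names the pattern *.* matches.
# The scratch directory and file names are hypothetical stand-ins for
# the contents of /user/admin/elecMonthly_Orc.
scratch="$(mktemp -d)"
touch "$scratch/_SUCCESS" "$scratch/part-00000.orc" "$scratch/part-00001.orc"

# Count how many of the given arguments are existing paths.
count_matches() {
  n=0
  for f in "$@"; do
    if [ -e "$f" ]; then
      n=$((n + 1))
    fi
  done
  echo "$n"
}

# *.* needs a dot somewhere in the name, so _SUCCESS is not matched.
dotted_count="$(count_matches "$scratch"/*.*)"
# * matches every non-hidden entry, including _SUCCESS.
all_count="$(count_matches "$scratch"/*)"

echo "matched by *.*: $dotted_count"
echo "matched by *:   $all_count"

rm -rf "$scratch"
```

If the aim is to hand everything in the directory to admin, a recursive change on the directory itself avoids relying on the pattern, for example `hadoop fs -chown -R admin:admin /user/admin/elecMonthly_Orc` run as the hdfs superuser.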

Now I just have to figure out how to recombine ORC files in Hive!

14 REPLIES

@Aidan Condron

Can you please take a screenshot of the output of

hdfs dfs -ls /user/admin/elecMonthly_Orc
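
For anyone who prefers text over a screenshot, the listing can be filtered for entries that are not owned by the expected user. This is only a sketch: the function name `not_owned_by` is made up here, and it assumes the usual `hdfs dfs -ls` column layout (permissions, replication, owner, group, size, date, time, path) and paths without spaces.

```shell
# Print "owner path" for every listed entry whose owner differs from $1.
# Reads `hdfs dfs -ls` output on stdin. The "Found N items" header has
# fewer than 8 fields, so the NF check skips it.
not_owned_by() {
  awk -v want="$1" 'NF >= 8 && $3 != want { print $3, $NF }'
}

# Intended use (needs a working HDFS client, so it is left commented out):
#   hdfs dfs -ls /user/admin/elecMonthly_Orc | not_owned_by admin
```

An empty result would suggest that every file in the directory already belongs to the user given as the argument.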

@Aidan Condron what user are you running the Hive Statement as?

Explorer

Thanks Andrew. I think it was a mismatch between the Hive account running the statement and admin owning the file. Thanks for this, and sorry for the delay in replying.

@Aidan Condron Is it resolved?

Mentor

@Aidan Condron please accept the best answer below.

Are you using the latest sandbox? Try the Ambari Hive view, it has the Upload File action specifically for csv files.

Explorer

Thanks for answering so quickly!

[Attached screenshot: 1196-miqgf.png]

@Aidan Condron what about the directory above it? What's the output of:

hdfs dfs -ls /user/admin

It looks like the files are owned by the user 'Spark'. Which user is running the Hive statement?

Explorer

admin is running the statement, as per the tutorial.

I thought I had run hdfs chown on the files. They are shown in Ambari as owned by admin.

[Attached screenshot: 1197-5noch.png]

@Aidan Condron

Is temp_batting an external or internal table? My assumption is that it's an internal table, so when you run LOAD DATA INPATH, Hive tries to move the data into /apps/warehouse/database(default)/temp_batting, and the admin user doesn't have permission to move the file.

Can you please try running:

hdfs dfs -chmod -R 777 /user/admin/elecMonthly_Orc

and then running your LOAD DATA INPATH command again?
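
The question about the table type can be checked rather than assumed: `DESCRIBE FORMATTED` reports a "Table Type" row for the table. The helper below is a rough sketch that pulls that value out of the command's output. The function name `table_type` is made up, and the pattern assumes the value looks like `MANAGED_TABLE` or `EXTERNAL_TABLE`.

```shell
# Extract the table type from `DESCRIBE FORMATTED <table>` output on stdin.
# Prints MANAGED_TABLE for an internal table, EXTERNAL_TABLE for an external one.
table_type() {
  grep -i 'Table Type' | grep -o -E '[A-Z_]+_(TABLE|VIEW)' | head -n 1
}

# Intended use (needs a working Hive client, so it is left commented out):
#   hive -e 'DESCRIBE FORMATTED temp_batting;' | table_type
```

The same output also has a "Location" row, which shows the directory that the loaded file would end up in.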

Mentor

@Aidan Condron are you still having issues with this? Can you accept best answer or provide your own solution?

Mentor

@Aidan Condron can you convert your response to answer and we can mark it as best answer?

Contributor

Hey guys. The tutorial mentioned above has been updated and is also compatible with the latest Sandbox HDP 2.4. It addresses the issue of permissions. Here is the link: http://hortonworks.com/hadoop-tutorial/how-to-process-data-with-apache-hive/

When you get a chance, can you go through the tutorial on our new Sandbox?