Support Questions
Find answers, ask questions, and share your expertise

KiteSDK FileNotFoundException hdfs://sandbox.hortonworks.com:8020/tmp/crunch-...

KiteSDK FileNotFoundException hdfs://sandbox.hortonworks.com:8020/tmp/crunch-...

New Contributor

I try to insert data from csv file with the kitesdk in the hortonworks sandbox 2.3.2.

After fixing the missing ojdbc6.jar issue and missing mapreduce.tar.gz, I'm run in following error:

1 job failure(s) occurred:
org.kitesdk.tools.CopyTask: Kite(dataset:file:/tmp/f00edc6d-4862-40d0-ad48-d6973b3261... ID=1 (1/1)(1): java.io.FileNotFoundException: File does not exist: hdfs://sandbox.hortonworks.com:8020/tmp/crunch-615301065/p1/REDUCE
at org.apache.hadoop.hdfs.DistributedFileSystem$22.doCall(DistributedFileSystem.java:1309)
at 
org.apache.hadoop.hdfs.DistributedFileSystem$22.doCall(DistributedFileSystem.java:1301)
at org.apache.hadoop.fs.FileSystemLinkResolver.resolve(FileSystemLinkResolver.java:81)
at org.apache.hadoop.hdfs.DistributedFileSystem.getFileStatus(DistributedFileSystem.java:1301)
at org.apache.hadoop.fs.FileSystem.resolvePath(FileSystem.java:751)
at org.apache.hadoop.mapreduce.v2.util.MRApps.parseDistributedCacheArtifacts(MRApps.java:571)
at org.apache.hadoop.mapreduce.v2.util.MRApps.setupDistributedCache(MRApps.java:463)
at org.apache.hadoop.mapred.LocalDistributedCacheManager.setup(LocalDistributedCacheManager.java:93)
at org.apache.hadoop.mapred.LocalJobRunner$Job.<init>(LocalJobRunner.java:163)
at org.apache.hadoop.mapred.LocalJobRunner.submitJob(LocalJobRunner.java:731)
at org.apache.hadoop.mapreduce.JobSubmitter.submitJobInternal(JobSubmitter.java:240)
at org.apache.hadoop.mapreduce.Job$10.run(Job.java:1290)
at org.apache.hadoop.mapreduce.Job$10.run(Job.java:1287)
at java.security.AccessController.doPrivileged(Native Method)
at javax.security.auth.Subject.doAs(Subject.java:415)
at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1657)
at org.apache.hadoop.mapreduce.Job.submit(Job.java:1287)
at org.apache.crunch.hadoop.mapreduce.lib.jobcontrol.CrunchControlledJob.submit(CrunchControlledJob.java:329)
at org.apache.crunch.hadoop.mapreduce.lib.jobcontrol.CrunchJobControl.startReadyJobs(CrunchJobControl.java:204)
at org.apache.crunch.hadoop.mapreduce.lib.jobcontrol.CrunchJobControl.pollJobStatusAndStartNewOnes(CrunchJobControl.java:238)
at org.apache.crunch.impl.mr.exec.MRExecutor.monitorLoop(MRExecutor.java:112)
at org.apache.crunch.impl.mr.exec.MRExecutor.access$000(MRExecutor.java:55)
at org.apache.crunch.impl.mr.exec.MRExecutor$1.run(MRExecutor.java:83)
at java.lang.Thread.run(Thread.java:745)

Any idea whats going wrong here or what I'm missed?

Thanks in advanced.

3 REPLIES 3
Highlighted

Re: KiteSDK FileNotFoundException hdfs://sandbox.hortonworks.com:8020/tmp/crunch-...

Mentor

@cst you may need to follow up on the Crunch mailing list and/or the KiteSDK mailing list if there is such a thing? My guess would be it's security or permissions related.

Highlighted

Re: KiteSDK FileNotFoundException hdfs://sandbox.hortonworks.com:8020/tmp/crunch-...

Mentor

@cst are you still having issues with this? Can you accept best answer or provide your own solution?

Highlighted

Re: KiteSDK FileNotFoundException hdfs://sandbox.hortonworks.com:8020/tmp/crunch-...

Super Guru

http://hortonworks.com/hadoop-tutorial/how-to-process-data-with-apache-hive/

try pig, hive, nifi or other tools. It may be a permissions issue for KiteSDK. Try a newer Kite

Don't have an account?