<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: KiteSDK HDP 2.5 in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/KiteSDK-HDP-2-5/m-p/136260#M43768</link>
    <description>&lt;A href="https://community.cloudera.com/legacyfs/online/attachments/8658-logs.zip"&gt;logs.zip&lt;/A&gt;&lt;P&gt;I think I zipped up the requested logs. Thanks for your help, I'm somewhat new to hortonworks and  trying to flush out a POC. Here's the full text of my error message:&lt;/P&gt;&lt;P&gt;[hdfs@sandbox bin]$ ./kite-dataset csv-import /home/hdfs/bin/ingest/Payor_1_Claims.txt  Payor_1_Claims --delimiter '|'
1 job failure(s) occurred:
org.kitesdk.tools.CopyTask: Kite(dataset:file:/tmp/b138551a-23e0-49ee-a51e-d9dd0773f1... ID=1 (1/1)(1): java.io.FileNotFoundException: File does not exist: hdfs://sandbox.hortonworks.com:8020/tmp/crunch-380116631/p1/REDUCE
  at org.apache.hadoop.hdfs.DistributedFileSystem$25.doCall(DistributedFileSystem.java:1427)
  at org.apache.hadoop.hdfs.DistributedFileSystem$25.doCall(DistributedFileSystem.java:1419)
  at org.apache.hadoop.fs.FileSystemLinkResolver.resolve(FileSystemLinkResolver.java:81)
  at org.apache.hadoop.hdfs.DistributedFileSystem.getFileStatus(DistributedFileSystem.java:1419)
  at org.apache.hadoop.fs.FileSystem.resolvePath(FileSystem.java:766)
  at org.apache.hadoop.mapreduce.v2.util.MRApps.parseDistributedCacheArtifacts(MRApps.java:600)
  at org.apache.hadoop.mapreduce.v2.util.MRApps.setupDistributedCache(MRApps.java:490)
  at org.apache.hadoop.mapred.LocalDistributedCacheManager.setup(LocalDistributedCacheManager.java:93)
  at org.apache.hadoop.mapred.LocalJobRunner$Job.&amp;lt;init&amp;gt;(LocalJobRunner.java:163)
  at org.apache.hadoop.mapred.LocalJobRunner.submitJob(LocalJobRunner.java:731)
  at org.apache.hadoop.mapreduce.JobSubmitter.submitJobInternal(JobSubmitter.java:240)
  at org.apache.hadoop.mapreduce.Job$10.run(Job.java:1290)
  at org.apache.hadoop.mapreduce.Job$10.run(Job.java:1287)
  at java.security.AccessController.doPrivileged(Native Method)
  at javax.security.auth.Subject.doAs(Subject.java:415)
  at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1724)
  at org.apache.hadoop.mapreduce.Job.submit(Job.java:1287)
  at org.apache.crunch.hadoop.mapreduce.lib.jobcontrol.CrunchControlledJob.submit(CrunchControlledJob.java:329)
  at org.apache.crunch.hadoop.mapreduce.lib.jobcontrol.CrunchJobControl.startReadyJobs(CrunchJobControl.java:204)
  at org.apache.crunch.hadoop.mapreduce.lib.jobcontrol.CrunchJobControl.pollJobStatusAndStartNewOnes(CrunchJobControl.java:238)
  at org.apache.crunch.impl.mr.exec.MRExecutor.monitorLoop(MRExecutor.java:112)
  at org.apache.crunch.impl.mr.exec.MRExecutor.access$000(MRExecutor.java:55)
  at org.apache.crunch.impl.mr.exec.MRExecutor$1.run(MRExecutor.java:83)
  at java.lang.Thread.run(Thread.java:745)&lt;/P&gt;</description>
    <pubDate>Tue, 18 Oct 2016 23:40:43 GMT</pubDate>
    <dc:creator>daniel_rolls</dc:creator>
    <dc:date>2016-10-18T23:40:43Z</dc:date>
    <item>
      <title>KiteSDK HDP 2.5</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/KiteSDK-HDP-2-5/m-p/136258#M43766</link>
      <description>&lt;P&gt;I recently swapped sandboxes from HDP 2.4 to HDP 2.5 and I'm running into all sorts of issues with the KiteSDK. I created the directory /hdp/apps/2.5.0.0-1245/mapreduce/ and copied in mapreduce.tar.gz which got me a little further, but now I'm running into a "org.kitesdk.tools.CopyTask: Kite(dataset:file:/tmp/413a41a2-8813-4056-9433-3c5e073d80... ID=1 (1/1)(1): java.io.FileNotFoundException: File does not exist: hdfs://sandbox.hortonworks.com:8020/tmp/crunch-283520469/p1/REDUCE" that I can't seem to overcome. 
Has anyone successfully gotten KiteAPI to work on  HDP 2.5? I can't figure out what I'm  doing wrong here.

I'd be happy to go back to 2.4 but I can't seem to find a download for it.&lt;/P&gt;</description>
      <pubDate>Tue, 18 Oct 2016 04:10:54 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/KiteSDK-HDP-2-5/m-p/136258#M43766</guid>
      <dc:creator>daniel_rolls</dc:creator>
      <dc:date>2016-10-18T04:10:54Z</dc:date>
    </item>
    <item>
      <title>Re: KiteSDK HDP 2.5</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/KiteSDK-HDP-2-5/m-p/136259#M43767</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/13694/danielrolls.html" nodeid="13694" target="_blank"&gt;@Daniel Rolls&lt;/A&gt; I'd be happy to debug your issues if you could provide RM and nodemanager logs for the tasks that fail. As a side note, Sandbox archives are available from the same page as the one you use to download 2.5 release. You have to click on EXPAND button next to Hortonworks Sandbox Archive where you'll be able to find any previous release up to Sandbox 1.3&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="8656-sandbox.png" style="width: 1342px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/22030i12F93CEA1814E968/image-size/medium?v=v2&amp;amp;px=400" role="button" title="8656-sandbox.png" alt="8656-sandbox.png" /&gt;&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 19 Aug 2019 08:51:37 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/KiteSDK-HDP-2-5/m-p/136259#M43767</guid>
      <dc:creator>aervits</dc:creator>
      <dc:date>2019-08-19T08:51:37Z</dc:date>
    </item>
    <item>
      <title>Re: KiteSDK HDP 2.5</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/KiteSDK-HDP-2-5/m-p/136260#M43768</link>
      <description>&lt;A href="https://community.cloudera.com/legacyfs/online/attachments/8658-logs.zip"&gt;logs.zip&lt;/A&gt;&lt;P&gt;I think I zipped up the requested logs. Thanks for your help, I'm somewhat new to hortonworks and  trying to flush out a POC. Here's the full text of my error message:&lt;/P&gt;&lt;P&gt;[hdfs@sandbox bin]$ ./kite-dataset csv-import /home/hdfs/bin/ingest/Payor_1_Claims.txt  Payor_1_Claims --delimiter '|'
1 job failure(s) occurred:
org.kitesdk.tools.CopyTask: Kite(dataset:file:/tmp/b138551a-23e0-49ee-a51e-d9dd0773f1... ID=1 (1/1)(1): java.io.FileNotFoundException: File does not exist: hdfs://sandbox.hortonworks.com:8020/tmp/crunch-380116631/p1/REDUCE
  at org.apache.hadoop.hdfs.DistributedFileSystem$25.doCall(DistributedFileSystem.java:1427)
  at org.apache.hadoop.hdfs.DistributedFileSystem$25.doCall(DistributedFileSystem.java:1419)
  at org.apache.hadoop.fs.FileSystemLinkResolver.resolve(FileSystemLinkResolver.java:81)
  at org.apache.hadoop.hdfs.DistributedFileSystem.getFileStatus(DistributedFileSystem.java:1419)
  at org.apache.hadoop.fs.FileSystem.resolvePath(FileSystem.java:766)
  at org.apache.hadoop.mapreduce.v2.util.MRApps.parseDistributedCacheArtifacts(MRApps.java:600)
  at org.apache.hadoop.mapreduce.v2.util.MRApps.setupDistributedCache(MRApps.java:490)
  at org.apache.hadoop.mapred.LocalDistributedCacheManager.setup(LocalDistributedCacheManager.java:93)
  at org.apache.hadoop.mapred.LocalJobRunner$Job.&amp;lt;init&amp;gt;(LocalJobRunner.java:163)
  at org.apache.hadoop.mapred.LocalJobRunner.submitJob(LocalJobRunner.java:731)
  at org.apache.hadoop.mapreduce.JobSubmitter.submitJobInternal(JobSubmitter.java:240)
  at org.apache.hadoop.mapreduce.Job$10.run(Job.java:1290)
  at org.apache.hadoop.mapreduce.Job$10.run(Job.java:1287)
  at java.security.AccessController.doPrivileged(Native Method)
  at javax.security.auth.Subject.doAs(Subject.java:415)
  at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1724)
  at org.apache.hadoop.mapreduce.Job.submit(Job.java:1287)
  at org.apache.crunch.hadoop.mapreduce.lib.jobcontrol.CrunchControlledJob.submit(CrunchControlledJob.java:329)
  at org.apache.crunch.hadoop.mapreduce.lib.jobcontrol.CrunchJobControl.startReadyJobs(CrunchJobControl.java:204)
  at org.apache.crunch.hadoop.mapreduce.lib.jobcontrol.CrunchJobControl.pollJobStatusAndStartNewOnes(CrunchJobControl.java:238)
  at org.apache.crunch.impl.mr.exec.MRExecutor.monitorLoop(MRExecutor.java:112)
  at org.apache.crunch.impl.mr.exec.MRExecutor.access$000(MRExecutor.java:55)
  at org.apache.crunch.impl.mr.exec.MRExecutor$1.run(MRExecutor.java:83)
  at java.lang.Thread.run(Thread.java:745)&lt;/P&gt;</description>
      <pubDate>Tue, 18 Oct 2016 23:40:43 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/KiteSDK-HDP-2-5/m-p/136260#M43768</guid>
      <dc:creator>daniel_rolls</dc:creator>
      <dc:date>2016-10-18T23:40:43Z</dc:date>
    </item>
    <item>
      <title>Re: KiteSDK HDP 2.5</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/KiteSDK-HDP-2-5/m-p/136261#M43769</link>
      <description>&lt;P&gt;looks like a permissions issue&lt;/P&gt;&lt;P&gt;make sure that this exists and the current user has access&lt;/P&gt;&lt;P&gt;hdfs dfs -mkdir /tmp&lt;/P&gt;&lt;P&gt;hdfs dfs -chmod -R 777 /tmp&lt;/P&gt;&lt;P&gt;are you in as root?&lt;/P&gt;&lt;P&gt;&lt;A href="http://kitesdk.org/docs/1.0.0/Using-the-Kite-CLI-to-Create-a-Dataset.html"&gt;http://kitesdk.org/docs/1.0.0/Using-the-Kite-CLI-to-Create-a-Dataset.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Could be a kite error.&lt;/P&gt;&lt;P&gt;&lt;A href="http://mail-archives.apache.org/mod_mbox/crunch-dev/201303.mbox/%3CBLU0-SMTP1468E6756741B738043FB98ADF90@phx.gbl%3E"&gt;http://mail-archives.apache.org/mod_mbox/crunch-dev/201303.mbox/%3CBLU0-SMTP1468E6756741B738043FB98ADF90@phx.gbl%3E&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Nifi, Hive or Pig may be a better option.&lt;/P&gt;&lt;P&gt;&lt;A href="http://hortonworks.com/hadoop-tutorial/how-to-process-data-with-apache-hive/"&gt;http://hortonworks.com/hadoop-tutorial/how-to-process-data-with-apache-hive/&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Looks like it may be Kite, may need to upgrade that&lt;/P&gt;&lt;P&gt;&lt;A href="https://issues.cloudera.org/browse/KITE-874" target="_blank"&gt;https://issues.cloudera.org/browse/KITE-874&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 19 Oct 2016 02:51:44 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/KiteSDK-HDP-2-5/m-p/136261#M43769</guid>
      <dc:creator>TimothySpann</dc:creator>
      <dc:date>2016-10-19T02:51:44Z</dc:date>
    </item>
    <item>
      <title>Re: KiteSDK HDP 2.5</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/KiteSDK-HDP-2-5/m-p/136262#M43770</link>
      <description>&lt;P&gt;I'm not sure if it's the docker implementation of HDP 2.5 on sandbox or what the story is. I've got the most recent version of the Kite API installed.
Perms look good:&lt;/P&gt;&lt;P&gt;[root@sandbox ~]# hdfs dfs -ls / &lt;/P&gt;&lt;P&gt;Found 12 items &lt;/P&gt;&lt;P&gt;drwxrwxrwx  - yarn  hadoop  0 2016-10-17 19:51 /app-logs 
drwxr-xr-x  - hdfs  hdfs  0 2016-09-13 11:01 /apps

drwxr-xr-x  - yarn  hadoop  0 2016-09-13 10:56 /ats

drwxr-xr-x  - hdfs  hdfs  0 2016-09-13 11:08 /demo

drwxr-xr-x  - hdfs  hdfs  0 2016-09-13 10:56 /hdp

drwxr-xr-x  - mapred hdfs  0 2016-09-13 10:56 /mapred

drwxrwxrwx  - mapred hadoop  0 2016-09-13 10:56 /mr-history

drwxr-xr-x  - hdfs  hdfs  0 2016-10-12 15:05 /ranger

drwxrwxrwx  - spark  hadoop  0 2016-10-19 19:46 /spark-history

drwxrwxrwx  - spark  hadoop  0 2016-09-13 11:20 /spark2-history

drwxrwxrwx  - hdfs  hdfs  0 2016-10-17 17:23 /tmp

drwxr-xr-x  - hdfs  hdfs  0 2016-10-12 15:18 /user&lt;/P&gt;&lt;P&gt;[root@sandbox ~]# hdfs dfs -ls /tmp

Found 13 items &lt;/P&gt;&lt;P&gt;-rwxrwxrwx  3 raj_ops  hdfs  6676440 2016-10-17 17:23 /tmp/Payor_1_Claims.txt

-rwxrwxrwx  3 raj_ops  hdfs  2803 2016-10-17 17:23 /tmp/Payor_1_Eligibility.txt 
-rwxrwxrwx  3 raj_ops  hdfs  21015 2016-10-17 17:23 /tmp/Payor_1_Glucose_Results.txt

-rwxrwxrwx  3 raj_ops  hdfs  2317192 2016-10-17 17:22 /tmp/Payor_2_Additional_Dx_Codes.txt

-rwxrwxrwx  3 raj_ops  hdfs  7866129 2016-10-17 17:23 /tmp/Payor_2_Claims.txt

-rwxrwxrwx  3 raj_ops  hdfs  8626 2016-10-17 17:23 /tmp/Payor_2_Eligibility.txt

-rwxrwxrwx  3 raj_ops  hdfs  22969 2016-10-17 17:23 /tmp/Payor_2_Glucose_Results.txt

-rwxrwxrwx  3 raj_ops  hdfs  8474653 2016-10-17 17:23 /tmp/Payor_3_Claims.txt

-rwxrwxrwx  3 raj_ops  hdfs  995712 2016-10-17 17:23 /tmp/Payor_3_Dx_Codes.txt

-rwxrwxrwx  3 raj_ops  hdfs  88106 2016-10-17 17:23 /tmp/Payor_3_Eligibility.txt

-rwxrwxrwx  3 raj_ops  hdfs  23125 2016-10-17 17:23 /tmp/Payor_3_Glucose_Results.txt

drwxrwxrwx  - hdfs  hdfs  0 2016-09-13 10:56 /tmp/entity-file-history

drwxrwxrwx  - ambari-qa hdfs  0 2016-10-17 19:52 /tmp/hive&lt;/P&gt;</description>
      <pubDate>Thu, 20 Oct 2016 02:49:39 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/KiteSDK-HDP-2-5/m-p/136262#M43770</guid>
      <dc:creator>daniel_rolls</dc:creator>
      <dc:date>2016-10-20T02:49:39Z</dc:date>
    </item>
    <item>
      <title>Re: KiteSDK HDP 2.5</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/KiteSDK-HDP-2-5/m-p/136263#M43771</link>
      <description>&lt;P&gt;Fixed the error by using an earlier version of KiteAPI:&lt;/P&gt;&lt;P&gt;curl &lt;A href="http://central.maven.org/maven2/org/kitesdk/kite-tools/0.17.0/kite-tools-0.17.0-binary.jar" target="_blank"&gt;http://central.maven.org/maven2/org/kitesdk/kite-tools/0.17.0/kite-tools-0.17.0-binary.jar&lt;/A&gt; -o kite-dataset&lt;/P&gt;</description>
      <pubDate>Fri, 21 Oct 2016 23:29:06 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/KiteSDK-HDP-2-5/m-p/136263#M43771</guid>
      <dc:creator>daniel_rolls</dc:creator>
      <dc:date>2016-10-21T23:29:06Z</dc:date>
    </item>
    <item>
      <title>Re: KiteSDK HDP 2.5</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/KiteSDK-HDP-2-5/m-p/136264#M43772</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/13694/danielrolls.html" nodeid="13694"&gt;@Daniel Rolls&lt;/A&gt; thank you for the solution, I urge you to post an article on HCC describing your use case. I'm going to escalate this issue to Sandbox team. That way inconsistencies with versions of KiteSDK will be addressed going forward.&lt;/P&gt;</description>
      <pubDate>Fri, 21 Oct 2016 23:36:20 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/KiteSDK-HDP-2-5/m-p/136264#M43772</guid>
      <dc:creator>aervits</dc:creator>
      <dc:date>2016-10-21T23:36:20Z</dc:date>
    </item>
    <item>
      <title>Re: KiteSDK HDP 2.5</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/KiteSDK-HDP-2-5/m-p/136265#M43773</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/527/rmolina.html" nodeid="527"&gt;@rmolina&lt;/A&gt; &lt;A rel="user" href="https://community.cloudera.com/users/69/rcoss.html" nodeid="69"&gt;@Rafael Coss&lt;/A&gt; FYI&lt;/P&gt;</description>
      <pubDate>Fri, 21 Oct 2016 23:37:20 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/KiteSDK-HDP-2-5/m-p/136265#M43773</guid>
      <dc:creator>aervits</dc:creator>
      <dc:date>2016-10-21T23:37:20Z</dc:date>
    </item>
  </channel>
</rss>

