<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Importing sqoop data - cloudera live exercise 1 in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Importing-sqoop-data-cloudera-live-exercise-1/m-p/31690#M7256</link>
    <description>Your cluster has both Sqoop 1 and Sqoop 2, and both are managed by Cloudera Manager. Sqoop 2 is a service rather than a command-line tool, and that service is not started by default, but the command-line tool and its configuration are deployed on all the machines in the cluster. I think when you say you see only one, you're referring to the Sqoop 2 app in Hue? If so, then yes, the Sqoop app in Hue only supports Sqoop 2.&lt;BR /&gt;&lt;BR /&gt;To run the Sqoop command in the tutorial, you must be logged in to the manager node via SSH. You can use PuTTY (on Windows) or the ssh command (OpenSSH, on everything else). A page or two before the Sqoop step in the tutorial, there should be directions for getting the SSH credentials to your GoGrid cluster.</description>
    <pubDate>Thu, 10 Sep 2015 13:40:20 GMT</pubDate>
    <dc:creator>Sean</dc:creator>
    <dc:date>2015-09-10T13:40:20Z</dc:date>
    <item>
      <title>Importing sqoop data - cloudera live exercise 1</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Importing-sqoop-data-cloudera-live-exercise-1/m-p/31666#M7255</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;I'm very new to Hadoop as well as to distributed computing. I was trying to do the first exercise in the Cloudera Live demo and got stuck at the part where we have to import the data using Sqoop. First, I see that the cloud cluster (CDH 5.2.0) only offers Sqoop 2; second, I don't see any terminal in which to run the given command. Sorry if my question is very basic. Can someone please tell me where exactly to enter this Sqoop command? Thanks!&lt;/P&gt;</description>
      <pubDate>Fri, 16 Sep 2022 09:40:22 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Importing-sqoop-data-cloudera-live-exercise-1/m-p/31666#M7255</guid>
      <dc:creator>sank</dc:creator>
      <dc:date>2022-09-16T09:40:22Z</dc:date>
    </item>
    <item>
      <title>Re: Importing sqoop data - cloudera live exercise 1</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Importing-sqoop-data-cloudera-live-exercise-1/m-p/31690#M7256</link>
      <description>Your cluster has both Sqoop 1 and Sqoop 2, and both are managed by Cloudera Manager. Sqoop 2 is a service rather than a command-line tool, and that service is not started by default, but the command-line tool and its configuration are deployed on all the machines in the cluster. I think when you say you see only one, you're referring to the Sqoop 2 app in Hue? If so, then yes, the Sqoop app in Hue only supports Sqoop 2.&lt;BR /&gt;&lt;BR /&gt;To run the Sqoop command in the tutorial, you must be logged in to the manager node via SSH. You can use PuTTY (on Windows) or the ssh command (OpenSSH, on everything else). A page or two before the Sqoop step in the tutorial, there should be directions for getting the SSH credentials to your GoGrid cluster.</description>
      <pubDate>Thu, 10 Sep 2015 13:40:20 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Importing-sqoop-data-cloudera-live-exercise-1/m-p/31690#M7256</guid>
      <dc:creator>Sean</dc:creator>
      <dc:date>2015-09-10T13:40:20Z</dc:date>
    </item>
    <item>
      <title>Re: Importing sqoop data - cloudera live exercise 1</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Importing-sqoop-data-cloudera-live-exercise-1/m-p/31781#M7257</link>
      <description>&lt;P&gt;Got it! Thanks.&lt;/P&gt;&lt;P&gt;But now I'm getting the error below when running the following Sqoop import command.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;[root@g5157-cldramaster-01 ~]# sqoop import-all-tables \&lt;BR /&gt;&amp;gt; -m 3 \&lt;BR /&gt;&amp;gt; --connect jdbc:mysql://208.113.123.213:3306/retail_db \&lt;BR /&gt;&amp;gt; --username=retail_dba \&lt;BR /&gt;&amp;gt; --password=cloudera \&lt;BR /&gt;&amp;gt; --compression-codec=snappy \&lt;BR /&gt;&amp;gt; --as-parquetfile \&lt;BR /&gt;&amp;gt; --warehouse-dir=/user/hive/warehouse \&lt;BR /&gt;&amp;gt; --hive-import&lt;BR /&gt;Warning: /opt/cloudera/parcels/CDH-5.2.0-1.cdh5.2.0.p0.36/bin/../lib/sqoop/../accumulo does not exist! Accumulo imports will fail.&lt;BR /&gt;Please set $ACCUMULO_HOME to the root of your Accumulo installation.&lt;BR /&gt;15/09/13 20:47:04 INFO sqoop.Sqoop: Running Sqoop version: 1.4.5-cdh5.2.0&lt;BR /&gt;15/09/13 20:47:04 WARN tool.BaseSqoopTool: Setting your password on the command-line is insecure. Consider using -P instead.&lt;BR /&gt;15/09/13 20:47:04 INFO tool.BaseSqoopTool: Using Hive-specific delimiters for output. You can override&lt;BR /&gt;15/09/13 20:47:04 INFO tool.BaseSqoopTool: delimiters with --fields-terminated-by, etc.&lt;BR /&gt;15/09/13 20:47:04 WARN tool.BaseSqoopTool: It seems that you're doing hive import directly into default&lt;BR /&gt;15/09/13 20:47:04 WARN tool.BaseSqoopTool: hive warehouse directory which is not supported. Sqoop is&lt;BR /&gt;15/09/13 20:47:04 WARN tool.BaseSqoopTool: firstly importing data into separate directory and then&lt;BR /&gt;15/09/13 20:47:04 WARN tool.BaseSqoopTool: inserting data into hive. 
Please consider removing&lt;BR /&gt;15/09/13 20:47:04 WARN tool.BaseSqoopTool: --target-dir or --warehouse-dir into /user/hive/warehouse in&lt;BR /&gt;15/09/13 20:47:04 WARN tool.BaseSqoopTool: case that you will detect any issues.&lt;BR /&gt;15/09/13 20:47:04 INFO manager.SqlManager: Using default fetchSize of 1000&lt;BR /&gt;15/09/13 20:47:05 INFO tool.CodeGenTool: Beginning code generation&lt;BR /&gt;15/09/13 20:47:05 INFO manager.SqlManager: Executing SQL statement: SELECT t.* FROM `categories` AS t LIMIT 1&lt;BR /&gt;15/09/13 20:47:05 INFO manager.SqlManager: Executing SQL statement: SELECT t.* FROM `categories` AS t LIMIT 1&lt;BR /&gt;15/09/13 20:47:05 INFO orm.CompilationManager: HADOOP_MAPRED_HOME is /opt/cloudera/parcels/CDH/lib/hadoop-mapreduce&lt;BR /&gt;Note: /tmp/sqoop-root/compile/c621727433ffc0137ec3c8b84f7bd461/categories.java uses or overrides a deprecated API.&lt;BR /&gt;Note: Recompile with -Xlint:deprecation for details.&lt;BR /&gt;15/09/13 20:47:06 INFO orm.CompilationManager: Writing jar file: /tmp/sqoop-root/compile/c621727433ffc0137ec3c8b84f7bd461/categories.jar&lt;BR /&gt;15/09/13 20:47:06 WARN manager.MySQLManager: It looks like you are importing from mysql.&lt;BR /&gt;15/09/13 20:47:06 WARN manager.MySQLManager: This transfer can be faster! Use the --direct&lt;BR /&gt;15/09/13 20:47:06 WARN manager.MySQLManager: option to exercise a MySQL-specific fast path.&lt;BR /&gt;15/09/13 20:47:06 INFO manager.MySQLManager: Setting zero DATETIME behavior to convertToNull (mysql)&lt;BR /&gt;15/09/13 20:47:06 INFO mapreduce.ImportJobBase: Beginning import of categories&lt;BR /&gt;15/09/13 20:47:06 INFO Configuration.deprecation: mapred.jar is deprecated. 
Instead, use mapreduce.job.jar&lt;BR /&gt;15/09/13 20:47:07 INFO manager.SqlManager: Executing SQL statement: SELECT t.* FROM `categories` AS t LIMIT 1&lt;BR /&gt;15/09/13 20:47:07 WARN spi.Registration: Not loading URI patterns in org.kitesdk.data.spi.hive.Loader&lt;BR /&gt;15/09/13 20:47:07 ERROR sqoop.Sqoop: Got exception running Sqoop: org.kitesdk.data.DatasetNotFoundException: Unknown dataset URI: hive?dataset=null&lt;BR /&gt;org.kitesdk.data.DatasetNotFoundException: Unknown dataset URI: hive?dataset=null&lt;BR /&gt;at org.kitesdk.data.spi.Registration.lookupDatasetUri(Registration.java:109)&lt;BR /&gt;at org.kitesdk.data.Datasets.create(Datasets.java:189)&lt;BR /&gt;at org.kitesdk.data.Datasets.create(Datasets.java:240)&lt;BR /&gt;at org.apache.sqoop.mapreduce.ParquetJob.createDataset(ParquetJob.java:81)&lt;BR /&gt;at org.apache.sqoop.mapreduce.ParquetJob.configureImportJob(ParquetJob.java:70)&lt;BR /&gt;at org.apache.sqoop.mapreduce.DataDrivenImportJob.configureMapper(DataDrivenImportJob.java:112)&lt;BR /&gt;at org.apache.sqoop.mapreduce.ImportJobBase.runImport(ImportJobBase.java:262)&lt;BR /&gt;at org.apache.sqoop.manager.SqlManager.importTable(SqlManager.java:665)&lt;BR /&gt;at org.apache.sqoop.manager.MySQLManager.importTable(MySQLManager.java:102)&lt;BR /&gt;at org.apache.sqoop.tool.ImportTool.importTable(ImportTool.java:497)&lt;BR /&gt;at org.apache.sqoop.tool.ImportAllTablesTool.run(ImportAllTablesTool.java:105)&lt;BR /&gt;at org.apache.sqoop.Sqoop.run(Sqoop.java:143)&lt;BR /&gt;at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)&lt;BR /&gt;at org.apache.sqoop.Sqoop.runSqoop(Sqoop.java:179)&lt;BR /&gt;at org.apache.sqoop.Sqoop.runTool(Sqoop.java:218)&lt;BR /&gt;at org.apache.sqoop.Sqoop.runTool(Sqoop.java:227)&lt;BR /&gt;at org.apache.sqoop.Sqoop.main(Sqoop.java:236)&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Can you please help me with this? Thanks!&lt;/P&gt;</description>
      <pubDate>Mon, 14 Sep 2015 04:01:04 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Importing-sqoop-data-cloudera-live-exercise-1/m-p/31781#M7257</guid>
      <dc:creator>kisan</dc:creator>
      <dc:date>2015-09-14T04:01:04Z</dc:date>
    </item>
    <item>
      <title>Re: Importing sqoop data - cloudera live exercise 1</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Importing-sqoop-data-cloudera-live-exercise-1/m-p/31783#M7258</link>
      <description>There's a known issue in CDH 5.2.0 (that's since been fixed) that prevents Sqoop from importing Parquet datasets. I think you might be using a version of the tutorial that's not intended for your specific cluster (which would also explain why you missed the instructions for logging in on GoGrid, specifically). Are you using the tutorial hosted on Cloudera's website, perhaps? It's intended for a generic environment using a more recent version - I'd recommend using the one that's customized for the version of CDH on your cluster and running in GoGrid's environment.&lt;BR /&gt;&lt;BR /&gt;You should be able to find a link to the "Guidance Page" in your welcome email, and the tutorial is one of the resources linked to from that page. If you pick up where you are now in that version of the tutorial, you shouldn't have issues like this.</description>
      <pubDate>Mon, 14 Sep 2015 04:52:20 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Importing-sqoop-data-cloudera-live-exercise-1/m-p/31783#M7258</guid>
      <dc:creator>Sean</dc:creator>
      <dc:date>2015-09-14T04:52:20Z</dc:date>
    </item>
    <item>
      <title>Re: Importing sqoop data - cloudera live exercise 1</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Importing-sqoop-data-cloudera-live-exercise-1/m-p/31826#M7259</link>
      <description>&lt;P&gt;I did exactly what you suggested and used the Cloudera website tutorial instead of the GoGrid one. The Sqoop import worked fine with the Avro data format. Thanks a lot!&lt;/P&gt;</description>
      <pubDate>Tue, 15 Sep 2015 02:40:01 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Importing-sqoop-data-cloudera-live-exercise-1/m-p/31826#M7259</guid>
      <dc:creator>kisan</dc:creator>
      <dc:date>2015-09-15T02:40:01Z</dc:date>
    </item>
    <item>
      <title>Re: Importing sqoop data - cloudera live exercise 1</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Importing-sqoop-data-cloudera-live-exercise-1/m-p/41131#M7260</link>
      <description>&lt;P&gt;Hi, I am having the same problem in QuickStart VM 5.7, but QuickStart VM 5.5 works fine with the same command.&lt;/P&gt;&lt;P&gt;I suspect this is a bug in QuickStart VM 5.7.&lt;/P&gt;&lt;P&gt;Please provide a workaround for QuickStart VM 5.7.&lt;/P&gt;</description>
      <pubDate>Sat, 21 May 2016 08:39:33 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Importing-sqoop-data-cloudera-live-exercise-1/m-p/41131#M7260</guid>
      <dc:creator>LeeHadoop</dc:creator>
      <dc:date>2016-05-21T08:39:33Z</dc:date>
    </item>
    <item>
      <title>Re: Importing sqoop data - cloudera live exercise 1</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Importing-sqoop-data-cloudera-live-exercise-1/m-p/41147#M7261</link>
      <description>&lt;P&gt;Can you post the output from when you first ran the Sqoop job? It's very unlikely you're hitting the same root problem. If you try it again, be sure you add the --hive-overwrite option since some of the tables &amp;amp; metadata will have already been created.&lt;/P&gt;</description>
      <pubDate>Sat, 21 May 2016 17:27:55 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Importing-sqoop-data-cloudera-live-exercise-1/m-p/41147#M7261</guid>
      <dc:creator>Sean</dc:creator>
      <dc:date>2016-05-21T17:27:55Z</dc:date>
    </item>
    <item>
      <title>Re: Importing sqoop data - cloudera live exercise 1</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Importing-sqoop-data-cloudera-live-exercise-1/m-p/41150#M7262</link>
      <description>&lt;P&gt;Sorry, mistake on my side.&lt;/P&gt;&lt;P&gt;The error I got is different. It was caused by loading the examples before running exercise 1, which left an existing directory in HDFS that triggered the failure.&lt;/P&gt;</description>
      <pubDate>Sat, 21 May 2016 19:02:42 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Importing-sqoop-data-cloudera-live-exercise-1/m-p/41150#M7262</guid>
      <dc:creator>LeeHadoop</dc:creator>
      <dc:date>2016-05-21T19:02:42Z</dc:date>
    </item>
  </channel>
</rss>

