07:06 AM
I am getting an error when trying to import a table from an Oracle database into Hadoop using Sqoop with the --direct option. The error is: "ERROR manager.SqlManager: Error executing statement: java.sql.SQLSyntaxErrorException: ORA-00942: table or view does not exist
java.sql.SQLSyntaxErrorException: ORA-00942: table or view does not exist". When I remove --direct from the Sqoop command, the import runs. Is there any other property that must be added to the Sqoop command when using --direct? Thanks!
- Labels:
Apache Sqoop
04:35 PM
1 Kudo
Try passing it immediately after sqoop job. E.g.: sqoop job "-Dorg.apache.sqoop.splitter.allow_text_splitter=true" -- import ...
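A minimal sketch of that placement, assuming a saved job is being created: Sqoop's generic Hadoop options such as -D have to come right after the tool name (here, job), ahead of --create and the -- separator. The function name and its four arguments (job name, JDBC URL, table, split column) are hypothetical placeholders, and credentials are left out for brevity.

```shell
# Create a saved Sqoop job with the text-splitter property passed as a generic option.
# $1 = job name, $2 = JDBC URL, $3 = table, $4 = split column (all hypothetical placeholders).
create_text_split_job() {
  # Generic options (-D...) go directly after "sqoop job", before the job arguments.
  sqoop job \
    "-Dorg.apache.sqoop.splitter.allow_text_splitter=true" \
    --create "$1" \
    -- import \
    --connect "$2" \
    --table "$3" \
    --split-by "$4"
}
```

The same ordering applies to a plain import: the -D option follows sqoop import and precedes --connect and the other tool-specific arguments.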
08:24 AM
We are not using Sqoop2. Does the security guide apply to Sqoop too?
11:42 AM
I'm using HDP 2.5 and Sqoop 1.4.6. Full log:
$ sqoop import "-Dorg.apache.sqoop.splitter.allow_text_splitter=true" --connect --table tablename --username <username> -password <password> --hive-import --hive-table <hivetable> --split-by <col> -m 8
Warning: /usr/hdp/ does not exist! Accumulo imports will fail.
Please set $ACCUMULO_HOME -- --
16/10/21 07:25:01 INFO oracle.OraOopManagerFactory: Data Connector for Oracle and Hadoop is disabled.
16/10/21 07:25:01 INFO manager.SqlManager: Using default fetchSize of 1000
16/10/21 07:25:01 INFO tool.CodeGenTool: Beginning code generation
16/10/21 07:25:03 INFO manager.OracleManager: Time zone has been set to GMT
16/10/21 07:25:03 INFO manager.SqlManager: Executing SQL statement: SELECT t.* FROM "db"."tablename" t WHERE 1=0
16/10/21 07:25:05 INFO orm.CompilationManager: HADOOP_MAPRED_HOME is /usr/hdp/
Note: /tmp/sqoop-<username>/compile/163383944ed0d448144da421e24c5571/ uses or overrides a deprecated API.
Note: Recompile with -Xlint:deprecation for details.
16/10/21 07:25:06 INFO orm.CompilationManager: Writing jar file: /tmp/sqoop-<username>/compile/163383944ed0d448144da421e24c5571/db.tablename.jar
16/10/21 07:25:06 INFO mapreduce.ImportJobBase: Beginning import of db.tablename
16/10/21 07:25:06 INFO manager.OracleManager: Time zone has been set to GMT
16/10/21 07:25:08 INFO impl.TimelineClientImpl: Timeline service address:
16/10/21 07:25:08 INFO client.AHSProxy: Connecting to Application History server at
16/10/21 07:25:08 WARN ipc.Client: Failed to connect to server: retries get failed --
at org.apache.hadoop.ipc.ProtobufRpcEngine$Invoker.invoke(
at com.sun.proxy.$Proxy23.getNewApplication(Unknown Source)
at org.apache.hadoop.yarn.api.impl.pb.client.ApplicationClientProtocolPBClientImpl.getNewApplication(
-- -- --
16/10/21 07:25:12 INFO mapreduce.Job: Running job: job_1476174512012_0126
16/10/21 07:25:18 INFO mapreduce.Job: Job job_1476174512012_0126 running in uber mode : false
16/10/21 07:25:18 INFO mapreduce.Job: map 0% reduce 0%
16/10/21 07:25:25 INFO mapreduce.Job: map 10% reduce 0%
16/10/21 07:25:26 INFO mapreduce.Job: map 70% reduce 0%
16/10/21 07:25:27 INFO mapreduce.Job: map 90% reduce 0%
16/10/21 07:25:51 INFO mapreduce.Job: map 100% reduce 0%
16/10/21 07:25:51 INFO mapreduce.Job: Job job_1476174512012_0126 completed successfully
16/10/21 07:25:51 INFO mapreduce.Job: Counters: 30
File System Counters
FILE: Number of bytes read=0
FILE: Number of bytes written=1676345
FILE: Number of read operations=0
FILE: Number of large read operations=0
FILE: Number of write operations=0
HDFS: Number of bytes read=1483
HDFS: Number of bytes written=32451988
HDFS: Number of read operations=40
HDFS: Number of large read operations=0
HDFS: Number of write operations=20
Job Counters
Launched map tasks=10
Other local map tasks=10
Total time spent by all maps in occupied slots (ms)=81510
Total time spent by all reduces in occupied slots (ms)=0
Total time spent by all map tasks (ms)=81510
Total vcore-milliseconds taken by all map tasks=81510
Total megabyte-milliseconds taken by all map tasks=333864960
Map-Reduce Framework
Map input records=116058
Map output records=116058
Input split bytes=1483
Spilled Records=0
GC time elapsed (ms)=769
CPU time spent (ms)=27350
Physical memory (bytes) snapshot=4567121920
Virtual memory (bytes) snapshot=56302190592
Total committed heap usage (bytes)=5829558272
File Input Format Counters
Bytes Read=0
File Output Format Counters
Bytes Written=32451988
16/10/21 07:25:51 INFO mapreduce.ImportJobBase: Transferred 30.9486 MB in 42.8346 seconds (739.8552 KB/sec)
16/10/21 07:25:51 INFO mapreduce.ImportJobBase: Retrieved 116058 records.
16/10/21 07:25:51 INFO mapreduce.ImportJobBase: Publishing Hive/Hcat import job data to Listeners
16/10/21 07:25:51 INFO manager.SqlManager: Executing SQL statement: SELECT t.* FROM "db"."tablename" t WHERE 1=0
16/10/21 07:25:52 WARN hive.TableDefWriter: Column col1 had to be cast to a less precise type in Hive
16/10/21 07:25:52 INFO hive.HiveImport: Loading uploaded data into Hive
Logging initialized using configuration in jar:file:/usr/hdp/!/
Time taken: 1.168 seconds
Loading data to table hivedb.hivetable
Failed with exception java.util.ConcurrentModificationException
FAILED: Execution Error, return code 1 from org.apache.hadoop.hive.ql.exec.MoveTask
11:17 AM
Yes, I can access Oracle, and with Sqoop I can import to an HDFS directory by specifying --target-dir in sqoop import. I can access Hive too: I created a database and a table. In our cluster the Hive warehouse directory is /apps/hive/warehouse. Why would the username come into the warehouse directory path? I can't see any user IDs under the warehouse directory.
11:03 AM
Yes, I do have access to that table. I tried "insert overwrite table <managed_table> select * from ext_table;" and it worked. But I also tried loading data from the HDFS path (the same path that ext_table points to in the previous query) into managed_table, and that failed with the same error.
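The path that worked here can be sketched as a two-step load: import into an HDFS directory with Sqoop, then copy into the managed table from Hive with the insert overwrite query. This is only an illustration of the steps described above, not a confirmed fix for the MoveTask error. The function name and all five arguments are hypothetical, credentials are omitted, and an external table over the target directory is assumed to exist already.

```shell
# Two-step load: Sqoop into an HDFS directory, then INSERT OVERWRITE into the managed table.
# $1 = JDBC URL, $2 = Oracle table, $3 = HDFS target dir,
# $4 = external Hive table defined over that dir, $5 = managed Hive table (all hypothetical).
stage_then_load() {
  # Step 1: plain HDFS import, no --hive-import; stop if the import fails.
  sqoop import --connect "$1" --table "$2" --target-dir "$3" -m 1 || return 1
  # Step 2: let Hive rewrite the staged rows into the managed table.
  hive -e "INSERT OVERWRITE TABLE $5 SELECT * FROM $4;"
}
```

A call would look like: stage_then_load "$JDBC_URL" tablename /tmp/stage/tablename ext_table managed_table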
08:53 AM
I want to kerberize the Sqoop job. What is the process? What needs to be taken care of to run a Sqoop job in a Kerberos environment? I didn't find any documentation on this. Any help is much appreciated.
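As a rough sketch, assuming the client side only needs a valid Kerberos ticket in the caller's credential cache before Sqoop submits its MapReduce job: obtain the ticket with kinit, then run the import as usual. The function name, keytab path, and principal are hypothetical, and any cluster-side configuration is outside what this shows.

```shell
# Make sure a Kerberos ticket is available before launching Sqoop.
# $1 = keytab path, $2 = principal (both hypothetical placeholders).
ensure_ticket() {
  # klist -s prints nothing and exits 0 when the cache holds a readable, unexpired ticket.
  if klist -s; then
    return 0
  fi
  # Otherwise request a new ticket non-interactively from the keytab.
  kinit -kt "$1" "$2"
}
```

It would be chained in front of the job, for example: ensure_ticket /path/to/user.keytab user@EXAMPLE.COM && sqoop import ...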
- Labels:
Apache Sqoop
10:20 AM
I am trying to import an Oracle table into an HDFS directory, but I get the error "Generating splits for a textual index column allowed only in case of "-Dorg.apache.sqoop.splitter.allow_text_splitter=true" property passed as a parameter". I fixed the import by passing "-Dorg.apache.sqoop.splitter.allow_text_splitter=true" to sqoop import. But why do we need to set this property? I imported other tables without setting it. When should it be set?
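The error text itself names the trigger: Sqoop was asked to generate splits over a textual column. A sketch under that reading, assuming the other tables were split on non-text columns (the post does not say): pass the property only when the split column has a character type. The function names, their arguments, and the list of type names are illustrative assumptions, not taken from the Sqoop source.

```shell
# Return 0 when the given SQL type name is a character type (illustrative list only).
needs_text_splitter() {
  case "$1" in
    CHAR|NCHAR|VARCHAR|VARCHAR2|NVARCHAR2) return 0 ;;
    *) return 1 ;;
  esac
}

# $1 = JDBC URL, $2 = table, $3 = split column, $4 = split column's SQL type (all hypothetical).
import_table() {
  if needs_text_splitter "$4"; then
    # Textual split column: the property must be passed right after "sqoop import".
    sqoop import "-Dorg.apache.sqoop.splitter.allow_text_splitter=true" \
      --connect "$1" --table "$2" --split-by "$3" -m 8
  else
    # Non-text split column: no extra property.
    sqoop import --connect "$1" --table "$2" --split-by "$3" -m 8
  fi
}
```

Choosing a numeric or date column for --split-by, where the table has one, avoids the property altogether.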
- Labels:
Apache Sqoop
10:14 AM
1 Kudo
I am trying to import an Oracle table into Hive using the Sqoop --hive-import option. The import itself ran fine, but at the end it errored out with "Failed with exception java.util.ConcurrentModificationException
FAILED: Execution Error, return code 1 from org.apache.hadoop.hive.ql.exec.MoveTask". When I opened the Hive terminal, I could see the table created in the Hive database, but no records had been inserted. Below is the command:
sqoop import "-Dorg.apache.sqoop.splitter.allow_text_splitter=true" \
--connect <jdbc:oracle:thin:@connectionstring:portno> \
--table tablename --username <username> -password <Password> \
--hive-import \
--hive-table <hivedb.hivetable> \
--split-by <column> \
-m 8
Do I need to set any parameters? Or do Hive internal tables have such issues?
- Labels:
Apache Hive
Apache Sqoop