Sqoop import - Special characters
Labels: Apache Sqoop
Created 04-28-2017 03:48 AM
All,
I am importing data from DB2 using sqoop import. It worked fine for the most part, except for one table whose contents seem to include special characters (Control-M = ^M). During the import these characters are treated as newlines, so everything after one of them ends up on the next line in the imported files, which affects all the records after one bad record.
I can't figure out how to fix the imports. Is there an easy way?
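To make the symptom concrete, here is a minimal Python sketch (not Sqoop itself) of why an embedded ^M breaks line-oriented files, and of one possible cleanup. The sample rows and the `to_delimited` helper are hypothetical, made up purely for illustration; the only assumption about the real data is that a text column contains a carriage return (`\r`).

```python
# Hypothetical sample rows: the first text value contains an embedded
# carriage return (^M, "\r"), like the problem column described above.
rows = [
    (1, "first line\rsecond line"),
    (2, "clean value"),
]


def to_delimited(rows, clean=False):
    """Render rows as comma-delimited text, one record per line."""
    lines = []
    for row_id, text in rows:
        if clean:
            # Replace embedded CR/LF with a space so the record stays on one line.
            text = text.replace("\r", " ").replace("\n", " ")
        lines.append(f"{row_id},{text}")
    return "\n".join(lines) + "\n"


# str.splitlines() treats "\r" as a line boundary, as many line readers do.
raw_records = to_delimited(rows).splitlines()
clean_records = to_delimited(rows, clean=True).splitlines()

print(len(raw_records))    # 3: the bad row was split into two "records"
print(len(clean_records))  # 2: one line per source row
```

Sqoop documents a `--hive-drop-import-delims` option that drops `\n`, `\r` and `\01` from string fields on Hive imports, which is the same idea as the `clean=True` branch; whether it suits this particular import is an assumption to verify.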
Created 04-28-2017 05:37 AM
Sqoop should load data in UTF-8 by default. Run the following:
get db cfg for db_name
and check the value of Database_code_set. Then, in your mapred-site.xml, please try adding the following to mapreduce.map.java.opts:
-Ddb2.jcc.charsetDecoderEncoder=3
