Member since
07-21-2014
141
Posts
8
Kudos Received
3
Solutions
My Accepted Solutions
Title | Views | Posted |
---|---|---|
2769 | 02-01-2017 04:49 PM | |
2078 | 01-15-2015 01:57 PM | |
2725 | 01-05-2015 12:59 PM |
09-16-2015
10:21 PM
I got the oozie job to run after adding all the args under Parameters. thanks!
... View more
09-16-2015
09:56 PM
Thanks Harsha! I'm using Oozie editor in Hue 3.6 and do not see any option to add 'args'. I attempted to use 'parameters' but upon submitting got an exception regarding invalid xml. Please let me know how to add the args.
... View more
09-16-2015
06:43 PM
I've configured an Oozie workflow to run this Sqoop job: <sqoop xmlns="uri:oozie:sqoop-action:0.2"> <job-tracker>my-node:8032</job-tracker> <name-node>hdfs://my-node:8020</name-node> <command>import --connect "jdbc:mysql://<url>" --username "dummy" --password "pwd" --query "SELECT col1, col2, CAST(REPLACE(SUBSTR(datecol,1,7), '-', '') AS UNSIGNED) as dateyyymm FROM src_table WHERE \$CONDITIONS" --target-dir /user/hive/warehouse/hive_table --fetch-size 0 --hive-import --hive-drop-import-delims --hive-table hive_table -m1</command> <file>/tmp/hive-site.xml#hive-site.xml</file> </sqoop> The Sqoop command works fine if ran from command prompt, but I keep running into these errors using Oo: [uber-SubtaskRunner] ERROR org.apache.sqoop.tool.BaseSqoopTool - Error parsing arguments for import: [uber-SubtaskRunner] ERROR org.apache.sqoop.tool.BaseSqoopTool - Unrecognized argument: col1 [uber-SubtaskRunner] ERROR org.apache.sqoop.tool.BaseSqoopTool - Unrecognized argument: col2 [uber-SubtaskRunner] ERROR org.apache.sqoop.tool.BaseSqoopTool - Unrecognized argument: CAST(REPLACE(SUBSTR(datecol,1,7), [uber-SubtaskRunner] ERROR org.apache.sqoop.tool.BaseSqoopTool - Unrecognized argument: '-', [uber-SubtaskRunner] ERROR org.apache.sqoop.tool.BaseSqoopTool - Unrecognized argument: '') [uber-SubtaskRunner] ERROR org.apache.sqoop.tool.BaseSqoopTool - Unrecognized argument: AS [uber-SubtaskRunner] ERROR org.apache.sqoop.tool.BaseSqoopTool - Unrecognized argument: UNSIGNED) [uber-SubtaskRunner] ERROR org.apache.sqoop.tool.BaseSqoopTool - Unrecognized argument: as [uber-SubtaskRunner] ERROR org.apache.sqoop.tool.BaseSqoopTool - Unrecognized argument: dateyyymm .. ...
... View more
Labels:
- Labels:
-
Apache Oozie
06-19-2015
07:56 AM
Thanks for the info. I did the 'Deploy client configuration' but still do not see the hive-site.xml updated with the configuration I provided in the safety valve field via CM. Please let me know if there is any other way to verify the deployment. Thansk!
... View more
06-19-2015
02:03 AM
I'm facing same error as well when I add this configuration to Hue's hue_safety_valve.ini via CM: ~~~ [hbase] hbase_clusters=(Cluster|some-hbase-thrift-server:9090) ~~~~ * I've verified that the 'Enable HBase Thrift Server Framed Transport' is unchecked * thrift_transport is set to 'buffered' in Hue Is there a way to download the hbase-site.xml via CM? Please let me know if there is any other config to update. Thanks!
... View more
06-19-2015
12:21 AM
I've updated the configuration for Hive using CM under "Hive Service Advanced Configuration Snippet (Safety Valve) for hive-site.xml" providing this configuration: <property> <name>hbase.zookeeper.quorum</name> <value>some.zookeeper.node</ value> </property> After restarting the Hive service, I don't see this config in the /etc/hive/conf.cloudera.hive/hive-site.xml config file. It seems to already have this property set to some other value. Does adding a safety value with an existing property overwrite or append the value to the hive-site.xml? Thanks!
... View more
Labels:
04-20-2015
03:35 PM
I need to process nested JSON. How did you go about mapping the fields to the nested AVRO schema?
... View more
03-12-2015
04:11 PM
Sorry, was a copy/paste error. I did have a comma between pagename and year but got the error I pasted. Thanks!
... View more
03-12-2015
04:05 PM
I got a 'log' table which is currently partitioned by year, month and day. I'm looking to create a partitioned view on top of 'log' table but running into this error: ~~~~ hive> CREATE VIEW log_view PARTITIONED ON (pagename,year,month,day) AS SELECT pagename year,month,day,uid,properties FROM log; FAILED: SemanticException [Error 10093]: Rightmost columns in view output do not match PARTITIONED ON clause ~~ Whats the right way to create a partitioned views? I'm using Hive 0.13 in CDH 5.3.2. Thanks!
... View more
Labels:
01-15-2015
01:57 PM
Thanks Joey, looks like "/usr/bin/flume-ng agent ... -Duser.home=/user/xyz -c /path/to/jar1:/path/to/jar2" seem to have done the trick by prepending to the java.class.path.
... View more