<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Why Falcon pipeline is failing? in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Why-Falcon-pipeline-is-failing/m-p/111570#M30287</link>
    <description>&lt;P&gt;Thanks cnormile,&lt;/P&gt;&lt;P&gt;Could you please tell me,generally at which location we should put that serDe jar so that falcon/oozie will also pick that jar while running the data pipeline.&lt;/P&gt;&lt;P&gt;As I already mentioned that jar is present at both the locations(i.e, In user/oozie/share/lib/hive and usr/hdp/hdp-&amp;lt;version&amp;gt;/hive/lib) and restarted already hive and oozie both but stiill prompting same error.&lt;/P&gt;</description>
    <pubDate>Wed, 01 Jun 2016 09:30:23 GMT</pubDate>
    <dc:creator>Manus</dc:creator>
    <dc:date>2016-06-01T09:30:23Z</dc:date>
    <item>
      <title>Why Falcon pipeline is failing?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Why-Falcon-pipeline-is-failing/m-p/111568#M30285</link>
      <description>&lt;P&gt;Hello guys,&lt;/P&gt;&lt;P&gt;Currently,my requirement is to process data of 3 hive tables and store it's result into another hive table.Therefore,I have created 3 hive uri input feed ,1 hive uri output feed and 1 process which is accepting this 3 feed entity as input and genereting output in output feed.Process entity giving error as:&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;FAILED: RuntimeException MetaException(message:java.lang.ClassNotFoundException Classorg.openx.data.jsonserde.JsonSerDe not found) &lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;Actually i am getting this error in oozie when scheduled process entity gets failed&lt;/P&gt;&lt;P&gt;I understood that error is so simple and it's caused  because of serDe jar is missing somewhere within hive/lib or oozir/share/lib.&lt;/P&gt;&lt;P&gt;I tried the following solutions:&lt;/P&gt;&lt;P&gt;1) add a jar to hive lib folder or add it to oozie lib.&lt;/P&gt;&lt;P&gt;Jar:&lt;/P&gt;&lt;P&gt;json-serde-1.3.8-SNAPSHOT-jar- with-dependencies.jar;.&lt;/P&gt;&lt;P&gt;But still getting same error.if i add the jar through hive cli using ADD JAR command then query run smoothly there.&lt;/P&gt;&lt;P&gt;My xml are:&lt;/P&gt;&lt;P&gt;1) observationInputFeed.xml&lt;/P&gt;&lt;P&gt; &amp;lt;feed xmlns='uri:falcon:feed:0.1' name='observationInputFeed' description='This is observation table'&amp;gt;
  &amp;lt;tags&amp;gt;table=observation&amp;lt;/tags&amp;gt;
  &amp;lt;frequency&amp;gt;hours(1)&amp;lt;/frequency&amp;gt;
  &amp;lt;timezone&amp;gt;UTC&amp;lt;/timezone&amp;gt;
  &amp;lt;clusters&amp;gt;
    &amp;lt;cluster name='hiveCluster' type='source'&amp;gt;
      &amp;lt;validity start='2016-05-31T09:00Z' end='2016-06-06T09:00Z'/&amp;gt;
      &amp;lt;retention limit='days(1)' action='delete'/&amp;gt;
      &amp;lt;table uri='catalog:falconexample:observation1#ds=${YEAR}-${MONTH}-${DAY}-${HOUR}'/&amp;gt;
    &amp;lt;/cluster&amp;gt;
  &amp;lt;/clusters&amp;gt;
  &amp;lt;table uri='catalog:falconexample:observation1#ds=${YEAR}-${MONTH}-${DAY}-${HOUR}'/&amp;gt;
  &amp;lt;ACL owner='ambari-qa' group='users' permission='0755'/&amp;gt;
  &amp;lt;schema location='hcat' provider='hcat'/&amp;gt; &lt;/P&gt;&lt;P&gt;______________________________________________&lt;/P&gt;&lt;P&gt;2) patientInputFeed.xml&lt;/P&gt;&lt;P&gt;&amp;lt;feed xmlns='uri:falcon:feed:0.1' name='patientInputFeed' description='This is Patient table'&amp;gt;
  &amp;lt;tags&amp;gt;table=patient&amp;lt;/tags&amp;gt;
  &amp;lt;frequency&amp;gt;hours(1)&amp;lt;/frequency&amp;gt;
  &amp;lt;timezone&amp;gt;UTC&amp;lt;/timezone&amp;gt;
  &amp;lt;clusters&amp;gt;
    &amp;lt;cluster name='hiveCluster' type='source'&amp;gt;
      &amp;lt;validity start='2016-05-31T09:00Z' end='2016-06-06T09:00Z'/&amp;gt;
      &amp;lt;retention limit='days(1)' action='delete'/&amp;gt;
      &amp;lt;table uri='catalog:falconexample:patient1#ds=${YEAR}-${MONTH}-${DAY}-${HOUR}'/&amp;gt;
    &amp;lt;/cluster&amp;gt;
  &amp;lt;/clusters&amp;gt;
  &amp;lt;table uri='catalog:falconexample:patient1#ds=${YEAR}-${MONTH}-${DAY}-${HOUR}'/&amp;gt;
  &amp;lt;ACL owner='ambari-qa' group='users' permission='0755'/&amp;gt;
  &amp;lt;schema location='hcat' provider='hcat'/&amp;gt;
&amp;lt;/feed&amp;gt;&lt;/P&gt;&lt;P&gt;____________________________________________&lt;/P&gt;&lt;P&gt;3) patientprocessedOutputFeed.xml&lt;/P&gt;&lt;P&gt;&amp;lt;feed xmlns='uri:falcon:feed:0.1' name='patientprocessedOutputFeed' description='This is patientprocessed table'&amp;gt;
  &amp;lt;tags&amp;gt;table=observationInputFeed&amp;lt;/tags&amp;gt;
  &amp;lt;frequency&amp;gt;hours(1)&amp;lt;/frequency&amp;gt;
  &amp;lt;timezone&amp;gt;UTC&amp;lt;/timezone&amp;gt;
  &amp;lt;clusters&amp;gt;
    &amp;lt;cluster name='hiveCluster' type='source'&amp;gt;
      &amp;lt;validity start='2016-05-31T09:00Z' end='2016-06-06T09:00Z'/&amp;gt;
      &amp;lt;retention limit='days(1)' action='delete'/&amp;gt;
      &amp;lt;table uri='catalog:falconexample:Patient_proce#ds=${YEAR}-${MONTH}-${DAY}-${HOUR}'/&amp;gt;
    &amp;lt;/cluster&amp;gt;
  &amp;lt;/clusters&amp;gt;
  &amp;lt;table uri='catalog:falconexample:Patient_proce#ds=${YEAR}-${MONTH}-${DAY}-${HOUR}'/&amp;gt;
  &amp;lt;ACL owner='ambari-qa' group='users' permission='0755'/&amp;gt;
  &amp;lt;schema location='hcat' provider='hcat'/&amp;gt;
&amp;lt;/feed&amp;gt;&lt;/P&gt;&lt;P&gt;_____________________________________________&lt;/P&gt;&lt;P&gt;4) And 4th one is also very similar to above feed xml except table name.&lt;/P&gt;&lt;P&gt;______________________________________________&lt;/P&gt;&lt;P&gt;Please help my guys,&lt;/P&gt;&lt;P&gt; i dont understand why oozie throwing ClassNotFoundException &lt;STRONG&gt;message:java.lang.ClassNotFoundException &lt;/STRONG&gt;even though jar is present at proper location.&lt;/P&gt;</description>
      <pubDate>Wed, 01 Jun 2016 00:24:14 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Why-Falcon-pipeline-is-failing/m-p/111568#M30285</guid>
      <dc:creator>Manus</dc:creator>
      <dc:date>2016-06-01T00:24:14Z</dc:date>
    </item>
    <item>
      <title>Re: Why Falcon pipeline is failing?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Why-Falcon-pipeline-is-failing/m-p/111569#M30286</link>
      <description>&lt;P&gt;Try restarting Hive to pick up the jar.&lt;/P&gt;</description>
      <pubDate>Wed, 01 Jun 2016 00:52:28 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Why-Falcon-pipeline-is-failing/m-p/111569#M30286</guid>
      <dc:creator>cnormile</dc:creator>
      <dc:date>2016-06-01T00:52:28Z</dc:date>
    </item>
    <item>
      <title>Re: Why Falcon pipeline is failing?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Why-Falcon-pipeline-is-failing/m-p/111570#M30287</link>
      <description>&lt;P&gt;Thanks cnormile,&lt;/P&gt;&lt;P&gt;Could you please tell me,generally at which location we should put that serDe jar so that falcon/oozie will also pick that jar while running the data pipeline.&lt;/P&gt;&lt;P&gt;As I already mentioned that jar is present at both the locations(i.e, In user/oozie/share/lib/hive and usr/hdp/hdp-&amp;lt;version&amp;gt;/hive/lib) and restarted already hive and oozie both but stiill prompting same error.&lt;/P&gt;</description>
      <pubDate>Wed, 01 Jun 2016 09:30:23 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Why-Falcon-pipeline-is-failing/m-p/111570#M30287</guid>
      <dc:creator>Manus</dc:creator>
      <dc:date>2016-06-01T09:30:23Z</dc:date>
    </item>
    <item>
      <title>Re: Why Falcon pipeline is failing?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Why-Falcon-pipeline-is-failing/m-p/111571#M30288</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/10447/manoj-dhake.html" nodeid="10447"&gt;@Manoj Dhake&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Jar file is missing:&lt;/P&gt;&lt;P&gt; 0: jdbc:hive2://192.168.56.101:10000&amp;gt; ADD JAR /tmp/hive-json-serde-0.2.jar; 
No rows affected (0.231 seconds) &lt;/P&gt;&lt;P&gt; 0: jdbc:hive2://192.168.56.101:10000&amp;gt; select * from my_table;&lt;/P&gt;&lt;P&gt;&lt;A href="https://community.hortonworks.com/questions/35278/javalangclasscastexception-orgopenxdatajsonserdejs-1.html"&gt;Link&lt;/A&gt; might give you more example.&lt;/P&gt;</description>
      <pubDate>Wed, 01 Jun 2016 09:38:10 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Why-Falcon-pipeline-is-failing/m-p/111571#M30288</guid>
      <dc:creator>bandarusridhar1</dc:creator>
      <dc:date>2016-06-01T09:38:10Z</dc:date>
    </item>
    <item>
      <title>Re: Why Falcon pipeline is failing?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Why-Falcon-pipeline-is-failing/m-p/111572#M30289</link>
      <description>&lt;P&gt;Thank you Sri,&lt;/P&gt;&lt;P&gt;I followed the link which you have sent to me but still getting same error &lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;1) Caused by: org.apache.hadoop.hive.ql.metadata.HiveException: java.lang.ClassNotFoundException: Class org.openx.data.jsonserde.JsonSerDe not found.&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;Actually query  which am running using falcon is,&lt;/P&gt;&lt;P&gt;INSERT OVERWRITE TABLE  falconexample.Patient_proce PARTITION (${falcon_output_partitions_hive})
select p.id,p.gender, p.Age, p.birthdate, o.component[1].valuequantity.value, o.component[1].valuequantity.unit
from (select *, floor(datediff(to_date(from_unixtime(unix_timestamp())), to_date(birthdate)) / 365.25) as Age  FROM falconexample. patient1) p inner join falconexample.DiagnosticReport1 d on p.id = substr(d.subject.reference,9) inner join falconexample.Observation1 o on p.id = substr(o.subject.reference,9)
where p.Age&amp;gt;17 and p.Age&amp;lt;86 and o.component[1].valuequantity.value &amp;lt;140;&lt;/P&gt;&lt;P&gt;If I write statement in hive script  as "ADD JAR /user/oozie/share/lib/lib_20160503082834/hive/json-serde-1.3.8-SNAPSHOT-jar-with-dependencies.jar;" then I get an error like &lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;2) java.lang.IllegalArgumentException: /user/oozie/share/lib/lib_20160503082834/hive/json-serde-1.3.8-SNAPSHOT-jar-with-dependencies.jar does not exist.&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;and if I remove that statement then I get 1st error.&lt;/P&gt;&lt;P&gt;Following solutions I have tried to resolve this issue.&lt;/P&gt;&lt;P&gt;1)added serDe jar at oozie share lib folder.(HDFS location)&lt;/P&gt;&lt;P&gt;2)added serDe jar at hive lib(Local FS).&lt;/P&gt;&lt;P&gt;3)added serDe jat at "falcon.libpath" location(i.e,/apps/falcon/pigCluster/working/lib)(HDFS location)&lt;/P&gt;&lt;P&gt;4)added jar at /apps/falcon/pigCluster/staging/falcon/workflows/process/patientDataProcess/6b7edfbfe5bcfc50e0fc845f71cd9122_1464767974691/DEFAULT/lib(HDFS location)&lt;/P&gt;&lt;P&gt;every where I have put the jar but again getting 1st error.I dont know what is happening with this stuff?&lt;/P&gt;&lt;P&gt;why falcon does not getting that jar?&lt;/P&gt;&lt;P&gt;I am posting  log file which I found under falcon working direcoty &lt;A href="https://community.cloudera.com/legacyfs/online/attachments/4675-user-action-hive-failed.txt"&gt;user-action-hive-failed.txt&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 01 Jun 2016 17:51:15 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Why-Falcon-pipeline-is-failing/m-p/111572#M30289</guid>
      <dc:creator>Manus</dc:creator>
      <dc:date>2016-06-01T17:51:15Z</dc:date>
    </item>
    <item>
      <title>Re: Why Falcon pipeline is failing?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Why-Falcon-pipeline-is-failing/m-p/111573#M30290</link>
      <description>&lt;P&gt;Hello Guys,&lt;/P&gt;&lt;P&gt;The error is has been solved ,I have solved it by adding additional statement in hive script along with above query as&lt;/P&gt;&lt;P&gt;statement:-&lt;/P&gt;&lt;P&gt;add jar
hdfs://&amp;lt;hostname&amp;gt;:8020//user/oozie/share/lib/lib_20160503082834/hive/json-serde-1.3.8-SNAPSHOT-jar-with-dependencies.jar;&lt;/P&gt;</description>
      <pubDate>Wed, 01 Jun 2016 19:56:39 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Why-Falcon-pipeline-is-failing/m-p/111573#M30290</guid>
      <dc:creator>Manus</dc:creator>
      <dc:date>2016-06-01T19:56:39Z</dc:date>
    </item>
  </channel>
</rss>

