Member since
09-29-2015
57
Posts
49
Kudos Received
19
Solutions
My Accepted Solutions
Title | Views | Posted |
---|---|---|
1477 | 05-25-2017 06:03 PM | |
1332 | 10-19-2016 10:17 PM | |
1644 | 09-28-2016 08:41 PM | |
985 | 09-21-2016 05:46 PM | |
4597 | 09-06-2016 11:49 PM |
09-02-2016
05:17 AM
What limitations are we talking about here? Sorry, I don't understand your question. If you are asking about DIstCP options supported in HDFS Mirroirng, currently below options are supported maxMaps mapBandwidth Below additional options can be supported by using workaround given below: overwrite ignoreErrors skipChecksum removeDeletedFiles preserveBlockSize preserveReplicationNumber preservePermission Please modify the WF hdfs-replication-workflow.xml as below. After distcpMapBandwidth add below content <arg>-overwrite </arg>
<arg>${overwrite}</arg>
<arg>-ignoreErrors </arg>
<arg>${ignoreErrors}</arg>
<arg>-skipChecksum </arg>
<arg>${skipChecksum}</arg>
<arg>-removeDeletedFiles </arg>
<arg>${removeDeletedFiles}</arg>
<arg>-preserveBlockSize </arg>
<arg>${preserveBlockSize}</arg>
<arg>-preserveReplicationNumber </arg>
<arg>${preserveReplicationNumber}</arg>
<arg>-preservePermission </arg>
<arg>${preservePermission}</arg>
Pass below options in hdfs-replication.properties overwrite=false
ignoreErrors=false
skipChecksum=false
removeDeletedFiles=true
preserveBlockSize=true
preserveReplicationNumber=true
preservePermission=true
These will work OOTB as FeedReplicator already has support for this and hence no code change is required. Thanks!
... View more
09-01-2016
06:55 PM
2 Kudos
@Kyle Dunn: Falcon doesn't support those DistCP options and yes that would require a code change.
... View more
07-07-2016
07:04 PM
1 Kudo
@Dhaval Modi: Following is the prerequisites to use Hive Mirrroring Hive 1.2.0+ Oozie 4.2.0+* Falcon Hive Mirroring is not supported without those prerequisites. Thanks!
... View more
05-12-2016
07:02 PM
You cannot use ${DAY-1} in feed. IF you want to process previous day's data then you can achieve it using process by using yesterday EL expression. <inputs>
<input name="input" feed="SampleInput" start="yesterday(0,0)" end="today(-1,0)" />
</inputs>
... View more
05-12-2016
06:48 PM
As @Benjamin Leonhardi specified Falcon should honor the retry attempts in Retry policy. If its not working as expected please create a support issue. Thanks!
... View more
05-09-2016
06:42 PM
3 Kudos
@Piotr Pruski: As you mentioned Falcon piggy backs on DistCP under the hood to achieve replication. If another client is still writing to a source file, the copy will likely fail If the DistCP job fails then Falcon replication job fails too and status API/command can be used to get the finished status of the replication job. Same in case of success too. Also with FALCON-1313 support was added for email based notification for job status for Feeds and mirror recipes. Replication using snapshots is not yet supported in Falcon. This feature is added with FALCON-1861. Additional benefit is performance. It leverages HDFS snapshots which are very cost effective to create ( cost is O(1) excluding inode lookup time).
Once created, it is very efficient to find modifications relative to a snapshot and copy over these
modifications for disaster recovery (DR). This makes it's cost effective.
... View more
03-05-2016
01:21 AM
2 Kudos
@Pavel Benes Exception "javax.xml.bind.UnmarshalException: [org.xml.sax.SAXParseException; Premature end of file." can occur for various reasons. Its xml parser exception. Please ensure the entity xml generated correctly. Quick google search will point you to various reasons why "Premature end of file" exception can occur. Can you attach the entity xml generated when this exception occurs?! Thanks!
... View more
02-05-2016
05:54 PM
3 Kudos
You can refer this doc Atlas Falcon bridge.
... View more
02-02-2016
09:46 PM
2 Kudos
@Balu: I already replied with same analysis. I asked him to change the process start time to 2016-01 instead https://community.hortonworks.com/answers/12696/view.html
... View more
02-02-2016
01:05 AM
1 Kudo
@Nayan Paul: If you look at the oozie job launched when the falcon process entity is scheduled do you see any errors? Does running pig script outside Falcon works as intended? Can you please attach the Falcon, Oozie and MR logs to debug this issue?
... View more