Member since: 02-09-2016
Posts: 40
Kudos Received: 14
Solutions: 0
09-28-2016
09:16 PM
@Sowmya Ramesh thanks for your response. I'm not sure I understood it correctly. For example, with feed replication, if the first replication job is submitted at time T and is still in progress when another replication job is submitted at T+1 hour, do you mean that they complete one after the other, without any overlap, in FIFO order? What I am trying to understand is whether my feed replication / mirroring jobs would suffer any adverse effects if their scheduling is not handled properly, i.e. if they are scheduled so frequently that their executions overlap.
09-28-2016
03:34 PM
I'm trying to understand what happens if a scheduled Falcon replication is still running when the next one starts. For example, with an hourly replication schedule, if the run for hour T is still in progress, what happens when the next one starts at T+1 hour?
Labels: Apache Falcon
09-23-2016
10:52 AM
1 Kudo
Hi, I am wondering what the cluster topology should look like for Kafka alongside Hadoop. I presume Kafka brokers shouldn't be co-located with data nodes. Instead, they should probably be installed on nodes outside the Hadoop cluster (perhaps gateway / edge nodes), since Kafka serves as the landing area and the data is eventually pushed to one of the Hadoop storage engines. Is this thinking correct? Please validate my understanding.
Labels: Apache Hadoop, Apache Kafka
08-22-2016
02:02 PM
Hi, I am trying to run a very simple command, hdfs dfs -ls -t /, but it reports that -t is an illegal option. The documentation I found says -t is supported. FYI, I am using Hadoop 2.7.1. Any idea how to list the files / directories in HDFS sorted by time?
Labels: Apache Hadoop
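One possible workaround for the question above, as a minimal sketch: pipe the plain listing through sort. It assumes the default hdfs dfs -ls column layout (permissions, replication, owner, group, size, date, time, path) and a leading "Found N items" header line. The function name list_by_mtime is made up for illustration. The -t sort option seems to appear only in documentation for releases later than 2.7.1, which would explain the error, but that is worth verifying against the docs for your exact version.

```shell
# Sketch: list an HDFS directory sorted by modification time, oldest first.
# Assumes the default `hdfs dfs -ls` columns:
#   permissions replication owner group size date time path
# list_by_mtime is a hypothetical helper name, not part of Hadoop.
list_by_mtime() {
  # Drop the "Found N items" header, then sort on the date (field 6)
  # and time (field 7) columns; both sort correctly as plain text.
  hdfs dfs -ls "$1" | grep -v '^Found ' | sort -b -k6,6 -k7,7
}
```

Calling list_by_mtime / prints the root listing oldest first; adding -r to the sort command would put the newest entries first.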
08-03-2016
01:23 PM
Hi,
I am performing a basic check to see whether a file exists in HDFS, using the hdfs dfs -test command. But it doesn't seem to work correctly. The documentation says it returns 0 if the file exists, but I am not getting any output when the command is run.
Let me know what needs to be done to get this working.
Please see the attached screenshot.
Thanks
Labels: Apache Hadoop
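A hedged sketch of how the check above is usually read: hdfs dfs -test prints nothing, and the "0" in the documentation refers to the command's exit status, which the shell exposes through $? or an if statement. The -e flag tests for existence. The function name report_path and any path passed to it are illustrative, and since the screenshot is not available, this may not match the exact invocation that was tried.

```shell
# Sketch: `hdfs dfs -test -e PATH` produces no output; the answer is
# carried in the exit status (0 = path exists, non-zero = it does not).
# report_path is a hypothetical helper name, not part of Hadoop.
report_path() {
  if hdfs dfs -test -e "$1"; then
    echo "exists: $1"
  else
    echo "missing: $1"
  fi
}
```

The same status can be inspected directly by running the test command and then echo $? on the next line.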
07-27-2016
02:59 PM
Is there a Kerberized version of the HDP Sandbox image available that can be used for proof-of-concept purposes on AWS? I am planning to run two secured sandboxes on AWS and try out some functionality, so I am trying to understand the best way to go about this. Thanks
07-19-2016
11:42 PM
Thanks @Arpit Agarwal for your response. Is there a specific reason two branches are still maintained? Are they significantly different from one another, so that they need to be tracked and maintained separately? I presume HDP and many commercial distributions follow the 2.7.x line, so I am wondering who is using the 2.6.x series. Thanks in advance.
07-19-2016
10:59 AM
Thanks @rbiswas. Any idea how it works for other services like Hive, YARN, HBase, etc.?
07-19-2016
10:38 AM
2 Kudos
Hi, Any idea how Apache Hadoop versioning works? The Hadoop homepage on the Apache site lists 2.7.2 as the latest stable release (I believe 2.7.1 is part of HDP 2.4.2), but that was released in Jan 2016, while 2.6.4 was released in Feb 2016. Which is the current branch to follow, and when should the 2.6 release be used? Any idea what the release date for 2.8.0 is? Thanks
Labels: Apache Hadoop