Member since
05-11-2016
29
Posts
1
Kudos Received
3
Solutions
My Accepted Solutions
Title | Views | Posted |
---|---|---|
7287 | 09-25-2017 02:06 PM | |
12389 | 08-16-2016 01:41 PM | |
3357 | 05-18-2016 06:30 AM |
06-16-2016
08:59 AM
I am getting below error while running Flume agent to spool directory, 2016-06-16 10:07:35,927 INFO org.apache.flume.source.SpoolDirectorySource: SpoolDirectorySource source starting with directory: /export/home/user1/flume/input 2016-06-16 10:07:35,927 ERROR org.apache.flume.lifecycle.LifecycleSupervisor: Unable to start EventDrivenSourceRunner: { source:Spool Directory source sdir: { spoolDir: /export/home/user1/flume/input } } - Exception follows. java.lang.IllegalStateException: Directory does not exist: /export/home/user1/flume/input at com.google.common.base.Preconditions.checkState(Preconditions.java:145) at org.apache.flume.client.avro.ReliableSpoolingFileEventReader.<init>(ReliableSpoolingFileEventReader.java:145) at org.apache.flume.client.avro.ReliableSpoolingFileEventReader.<init>(ReliableSpoolingFileEventReader.java:77) at org.apache.flume.client.avro.ReliableSpoolingFileEventReader$Builder.build(ReliableSpoolingFileEventReader.java:669) at org.apache.flume.source.SpoolDirectorySource.start(SpoolDirectorySource.java:85) at org.apache.flume.source.EventDrivenSourceRunner.start(EventDrivenSourceRunner.java:44) at org.apache.flume.lifecycle.LifecycleSupervisor$MonitorRunnable.run(LifecycleSupervisor.java:251) at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471) at java.util.concurrent.FutureTask.runAndReset(FutureTask.java:304) at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$301(ScheduledThreadPoolExecutor.java:178) at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:293) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) at java.lang.Thread.run(Thread.java:745) - I am running Flume as service on Edge node. - Directory '/export/home/user1/flume/input' is on same Edge node, but this direcory is owned by user1 - I assume Flume agent on Edge node run by 'Flume' as user - May be error is because 'Flume' user not having r / w access on folder '/export/home/user1/flume/input' If my understanding is correct then I must create a spool directory where Flume as user have r/w access and other users posting files in that folder have at-least write permission. Please correct me if my understanding is not correct.
... View more
Labels:
- Labels:
-
Apache Flume
06-07-2016
01:08 PM
I am trying to read the data from SQL Server using REST API as a Flume source with JSON format. Not sure which Flume source I can use here (may be HTTP) ? If it is HTTP then where I should mention the API URL(http://api.xxxx.com.xxxx) in Flume source property ? Please advice.
... View more
Labels:
- Labels:
-
Apache Flume
05-26-2016
05:15 AM
Thank you for detail reply. I have initiated Flume as service on Edge node and its as expected.
... View more
05-25-2016
10:09 AM
I am executing Flumge-ng agnet command on Edge node. As you already explained in another post, I need to run Flume as service on Edge node to start / stop flume agent.
... View more
05-19-2016
11:27 AM
I can see Flume running on CM portal, it means we already have Flume as service on Cloudera Manager.
... View more
05-19-2016
11:19 AM
I guess Flume installed using parcels. I am running Flume-ng commands on edge node. Below are details, [@ ~]$ which flume-ng /usr/bin/flume-ng [@ ~]$ alternatives --display flume-ng flume-ng - status is auto. link currently points to /opt/cloudera/parcels/CDH-5.3.3-1.cdh5.3.3.p0.5/bin/fl ume-ng /opt/cloudera/parcels/CDH-5.3.3-1.cdh5.3.3.p0.5/bin/flume-ng - priority 10 Current `best' version is /opt/cloudera/parcels/CDH-5.3.3-1.cdh5.3.3.p0.5/bin/fl ume-ng. Also your will be very helpfull if provide details about setting up a flume service in CM. Thank you
... View more
05-19-2016
07:53 AM
Thank you for solution. In my case, I am reading logs from webserver and dumping in HDFS. Currently I am running agent on web server and edge node (this node is not part of cluster but all clients installed on it, so I can run flume agent here by manual flume-ng command) to push data to HDFS. What is difference in running Flume on edge node (like I am currently running) and running Flume on one of cluster node (as you suggested) ? Also I don’t know where to find the start and stop script, do I need to write my own ? We are using CDH - 5.3.3 and Flume 1.5.0 Any help appreciated
... View more
05-18-2016
12:27 PM
Hi, I have below questions related to Flume, On which node should Flume agent run ? On Edge node or one of Hadoop cluster node ? Do I need to run Flume agent using nohup in production as it may keep running until interrupted
... View more
Labels:
- Labels:
-
Apache Flume
-
Apache Hadoop
05-18-2016
06:30 AM
There were permission issue with Centrify where users permissions were masked with ACL like method.
... View more
05-17-2016
07:23 AM
Hi, Encountered below error while connecting to hive2 using SAS, ERROR: Unable to connect to the Hive server. ERROR: Error trying to establish connection. ERROR: Error in the LIBNAME statement. Same user can connect to hive but to different schema.
... View more
Labels:
- Labels:
-
Apache Hive
- « Previous
-
- 1
- 2
- Next »