About mhegedus

mhegedus · ‎07-13-2017

It is possible if the website publishes their streaming data via a public API and if you implement a custom Flume source to ingest that. In case of Twitter there is an API for that but you have to pay to use it. In case of quora or blogger I am not sure if it exists. An option could be to write code that reads RSS feeds and writes that to disk or hdfs but to do this you do not need Flume.

mhegedus · ‎07-09-2017

Flume does not have website scraping capabilities. One might guess that HttpSource can be used for tasks like this but HttpSource is just a http server running in Flume. You can push data to it and not the other way around. Check IMDB site, you can download data from Amazon S3 but you have to pay the data transfer fee: http://www.imdb.com/interfaces

mhegedus · ‎04-19-2017

You can edit flume.conf directly and the running agent will reconfigure itself without restart. The default location of the configuration file is: /etc/flume/conf/{agent_name}/flume.conf. However these changes will not be visible in Ambari and next time you restart Flume from Ambari then it will overwrite your manual changes with the stale config.

mhegedus · ‎02-08-2017

Execute the import command from bash. Seems like you were in hbase shell.

mhegedus · ‎01-04-2017

You have to send an array of JSONEvents otherwise the handler will fail to deserialize the events. An event must have at least a body and the body must be a string. You can also add optional headers. See the event specification in the user guide. import requests import json a = [{'body': 'my 1st event data'}, {'body': 'my 2nd event data'}] requests.post('http://localhost:44444', data=json.dumps(a)) You can also use GET method but still have to specify data to send.

Online	Offline
Last Visited	‎08-22-2017 02:09 PM

Member Since	‎10-30-2016 10:12 PM
Last Visited	‎08-22-2017 02:09 PM
Posts	20
Kudos received	15

Cloudera Community

Re: I want to fetch IMDB data using flume . I am s...

Re: Error while loading data into Hbase table

Re: is my url of http-source of flume agent correc...

Re: I want to fetch IMDB data using flume . I am s...

Re: I want to fetch IMDB data using flume . I am s...

Re: How to refresh flume agent configure without r...

Re: Error while loading data into Hbase table

Re: is my url of http-source of flume agent correc...