Member since: 12-10-2015
Posts: 76
Kudos Received: 30
Solutions: 4
My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
| | 2133 | 03-10-2021 08:35 AM |
| | 1550 | 07-25-2019 06:34 AM |
| | 3430 | 04-20-2016 10:03 AM |
| | 2617 | 04-11-2016 03:07 PM |
04-21-2016
02:12 PM
Hi, from the shell run `ambari-server status` to see if Ambari (the web UI) is running; if it's not running, run `ambari-server start`.
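The check above can be sketched as a small shell snippet. It assumes `ambari-server` is installed on the host (as on a standard Ambari node) and that `ambari-server status` exits non-zero when the server is down; the PATH guard is only for illustration:

```shell
#!/bin/sh
# Sketch: start Ambari only if it is not already running.
# Assumption: `ambari-server status` exits non-zero when the server is down.
if command -v ambari-server >/dev/null 2>&1; then
  if ambari-server status >/dev/null 2>&1; then
    echo "Ambari server is already running"
  else
    ambari-server start
  fi
else
  echo "ambari-server not found on PATH"
fi
```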
04-21-2016
02:05 PM
https://community.hortonworks.com/questions/28395/how-to-load-the-data-from-sql-server-to-hdfs-using.html#comment-28450
04-21-2016
09:36 AM
It is not totally true: with NiFi you can get "aloha.csv" from different sources (web, local file, HDFS, DB, Twitter, ...), enrich the data (for example, merge it with "byebye.csv") or modify it, and save the data (to Hive in different ways, to HDFS, to a local file, ...). If you want to analyze the personality of a Twitter user, you cannot use NiFi to calculate it; for that you can use Spark.
04-21-2016
07:55 AM
Hi, I saw that the HDF certification is marked as coming soon; when will it be available?
Labels:
- Certification
- Training
04-21-2016
07:36 AM
Hi, NiFi can communicate with Solr via PutSolrContentStream (documentation). If you want, NiFi also has a GetSolr processor. Spark gets/puts information from/to Solr via the API (documentation). I have never tried it (I promised myself I would), but if you want to create a streaming process from NiFi to Spark you can try, perhaps starting with this one.
04-20-2016
10:03 AM
1 Kudo
Hi @Rendiyono Wahyu Saputro, I'll answer point by point:
1. You can use whatever you prefer. I used NiFi in a similar project to get and model the data from Twitter.
2. If you use NiFi, you can process the data (the Twitter API returns a JSON file) to set attributes from the values of the JSON nodes. I suggest you index the data into Solr and use Spark to query Solr. After that you can process the data in Spark, for example to find the user with the most retweeted tweets.
3. On this page we used Storm to query and generate JSON, to minimize the traffic to the SolrCloud (we didn't have spare CPU), but if you want you can query Solr directly and return a JSON file to your app.

My idea is:
a. I register with your app
b. I enter my Twitter account
c. the app sends NiFi my personal information to follow me (adds my userName to the GetTwitter processor filter)
d. NiFi sends my tweets to Solr
e. Spark queries Solr for my username and processes my data
f. Spark sends the result to another Solr collection (coll_B)
g. the application requests the processed data from coll_B of Solr in JSON format

All the components I mentioned run on HDP in cluster mode. The size of the cluster depends on different factors:
- how many people will use your app? (many requests to Solr require a lot of CPU and RAM); for example, to ingest ~800 tweets (avg) we have 3 NiFi workers with 3 GB of Xmx
- how heavy is the data processing?
- must it be in real time?
These and more are all things to assess before deploying a cluster.
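For step g, the app's request to coll_B might look like the following sketch; the host, the field name `username`, and the user value are all assumptions for illustration, not part of the original design:

```shell
#!/bin/sh
# Sketch of the Solr query the app could issue against coll_B (step g above).
# SOLR_HOST, the "username" field, and TW_USER are hypothetical placeholders.
SOLR_HOST="localhost:8983"
TW_USER="some_twitter_user"
QUERY_URL="http://${SOLR_HOST}/solr/coll_B/select?q=username:${TW_USER}&wt=json"
echo "$QUERY_URL"
# The app would then fetch the JSON with, e.g.:  curl -s "$QUERY_URL"
```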
04-20-2016
08:00 AM
1 Kudo
Hi @karthik sai, what do you see in the ambari-metrics log (default path /var/log/ambari-metrics-collector/)? Since it does not seem to have been stopped by a user, there must be another reason.
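As a sketch, that log check could look like the snippet below. The directory is the default path from the post; the grep pattern and log file glob are just illustrative ways to surface errors, since the exact file names can differ per version:

```shell
#!/bin/sh
# Sketch: look for recent errors in the Ambari Metrics Collector logs.
# /var/log/ambari-metrics-collector is the default log directory mentioned above.
LOG_DIR="/var/log/ambari-metrics-collector"
if [ -d "$LOG_DIR" ]; then
  # Show the last few error/exception lines; file names may vary by version.
  grep -iE "error|exception" "$LOG_DIR"/*.log | tail -n 20
else
  echo "log directory not found: $LOG_DIR"
fi
```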
04-15-2016
07:47 AM
I ran this command and stopped and started the problematic NameNode; now everything is OK. Thank you.
04-14-2016
12:13 PM
This state still persists after restarting all HDFS components.
I didn't try to stop and start the DataNode, but it's strange that I have 3 DataNodes and not 4. It also shows me the following state: