Member since
04-23-2016
5
Posts
2
Kudos Received
0
Solutions
11-02-2016
04:57 AM
Hi, Yes, I made it work. Please use this as server name: http://(ip address of your cloudera VM):50070/webhdfs/v1/user/cloudera/(the name of the directory which you want to import) example: http://127.0.0.1:50070/webhdfs/v1/user/cloudera/sensordata
... View more
11-01-2016
08:59 PM
Hi, I am using Flume to ingest real time data into HDFS. The stored data can be analyzed in the terminal using Spark. But what I actually want is some sort of real time visualization/graphs of this data. Since Flume is continuously (or let's say every 5 seconds) ingesting new data into HDFS, I would like my visualization to update automatically in real time or near real time by extracting the data stored in HDFS. I have visualized the data by making Hive tables but that actually generates the table based on what is stored in a particular file at that moment and does not updated automatically if new data arrives. In other words, it is not real time visualization. Is there a way I can achieve the real time visualizations i.e. pie charts, bar graphs etc for my data that is being ingested into HDFS? As a second part of this question, Flume has a rool over time of 30 seconds which means it creates a new file every 30 seconds, I would want the visualizations to read the data stored in all the files in a particular directory rather than reading it from a particular file to generate the visualization. Thanks, -Riz
... View more
07-20-2016
04:30 AM
Hi Rusty M, Please read this post after my last post as a continuation of the last one. I saved a csv file in HDFS and tried to import it into Power BI. When I do that, Power BI considers it as a Binary file and when I try to open it, Power BI pops up an error message as shown below. Please guide.
... View more
07-20-2016
03:54 AM
Hi Rusty M, I am trying to do the same thing as you did. I am able to see all the files and directories in my HDFS when I connect Power BI to HDFS. But I cannot actually pull the data from those files. Power BI sees these files as binary files and for the queries only imports parameters like data executed, folder path etc and DOES NOT seem to import the data in the files. I have tried to import an xlsx file and a text file into Power BI which were stored in HDFS. Can you please guide me on how to import a excel xlsx file stored into the HDFS and import it into Power BI? Please do see the attached image below. To connect to the HDFS from Power BI, I am using the IP address of the Cloudera VM (I assume this is the correct way to connect to HDFS from Power BI). Here is the complete URL for connecting Power BI ti HDFS. http://<ip address of the VM>:50070/webhdfs/v1/user/cloudera/output_join11 http://<ip address of the VM>:50070/webhdfs/v1/user/cloudera/words.txt Is it some kind of a driver issue or is it something that I am missing. Please answer considering the fact that I am just a beginner trying to learn cloudera and Visualization tools. Thanks in advance.
... View more
07-14-2016
06:41 AM
2 Kudos
Hi, I am fairly new to the Data Sciences field. I am trying to connect Microsoft Power BI to HDFS, I have installed cloudera VM on my system and it works fine. When I connect Power BI to HDFS it asks for the server name to connect to. I read on various places that I need to put the namenode for HDFS in place of the server name in Power BI, but I am not sure how I can find my namenode? I tried to go to do the following as shown in the image below. But not sure if it is my namenode or is there something I am missing. Any help in how to find the namenode and putting the correct server name in Power BI will be appreciated.
... View more