Member since
09-24-2015
27
Posts
69
Kudos Received
3
Solutions
My Accepted Solutions
Title | Views | Posted |
---|---|---|
5219 | 12-04-2015 03:40 PM | |
27478 | 10-19-2015 01:56 PM | |
4098 | 09-29-2015 11:38 AM |
12-17-2019
07:48 AM
Hi All . here is more Details about above :- https://community.cloudera.com/t5/Support-Questions/HDInsight-Vs-HDP-Service-on-Azure-Vs-HDP-on-Azure-IaaS/m-p/166424 Thanks HadoopHelp
... View more
09-13-2019
08:06 AM
Great article, wonder if you could update it. SAP Data Services now had a new way to connect to Hadoop that does NOT require installing an SAPDS job server on the Hadoop edge node. Also can SLT read BW on Hana objects (ADSO, HANA view, CDS views, etc.) and connect direct to Hadoop? Thanks JDR
... View more
12-29-2016
07:30 AM
1 Kudo
For the beginners, like myself: If you added a new processor, or change the processor name, you will need to add or change the name in the .Processor file in <Home Dir>/Documents/nifi/ChakraProcessor/HWX/nifi-demo-processors/src/main/resources/META-INF/services. If you don't do this, the processor will not be loaded.
... View more
12-07-2016
03:19 AM
Hello Chakra!! What jar files need to be added for windows machine or Linux? I am on HDP 2.5, Hive, Beeline 1.2 Please see my question here on SOF, Please let me know. Thanks in advance!!
... View more
02-08-2016
04:29 PM
turn off the other services when you run HBase like metrics, hive, oozie, atlas, etc. I've run HBase without issues on Sandbox, perhaps you have a bad image? Try re-importing again. @Chakra
... View more
01-14-2016
04:20 PM
5 Kudos
1Overview Traditionally enterprises have been dealing
with data flows or data movement within their data centers. But as the world
has become more flattened and global presence of companies has become a norm,
enterprises are faced with the challenge of collecting and connecting data from
their global footprint. This problem was
daunting NSA a decade ago and they came up with a solution for this using a
product which was later named as Apache Nifi.
Apache nifi is a easy to use, powerful, and reliable system to process
and distribute data. Within Nifi, as you will see, I will be able to build a
global data flow with minimal to no
Coding. You can learn the details
about Nifi from Apache Nifi website. This is one of most well
documented Apache projects. The
focus of this article to just look at one specific feature within Nifi that I
believe no other software product does it as well as Nifi. And this feature is
“site to site” protocol data transfer. 2Business use case One of the classic business problem is to
push data from a location that has a small IT footprint, to the main data
center, where all the data is collected and connected. This small IT footprint
could be a oil rig at the middle of the ocean, a small bank location at a
remote mountain in a town, a sensor on a vehicle so on and so forth. So, your
business wants a mechanism to push the data generated at various location to
say Headquarters in a reliable fashion, with all the bells and whistles of an
enterprise data flow which means maintain lineage, secure, provenance, audit,
ease of operations etc. The
data that’s generated at my sources are of various formats such as txt, csv,
json, xml, audio, image etc.. and they could of various size ranges from few
MBs to GBs. I wanted to break these files into smaller chunks as I have a low
bandwidth at my source data centers and want to stich them together at the
destination and load that into my centralized Hadoop data lake. 3Solution Architecture Apache Nifi (aka Hortonworks Data Flow) is a
perfect tool to solve this problem. The overall architecture looks something
like Fig 1. We
have a Australian & Russian data center from where we want to move the data
to US Headquarters. We will have what we call edge instance of nifi that will
be sitting in Australian & Russian data center, that will act as a data
acquisition points. We will then have a Nifi processing cluster in US where we
will receive and process all these data coming from global location. We will
build this end to end flow without any coding but rather by just a drag and
drop GUI interface. 4Build the data flow Here are the high level steps to build the
overall data flow. Step1) Setup a Nifi instance at Australian
data center that will act as data acquisition instance. I will create a local
instance of Nifi that will act as my Australian data center. Step2) Setup Nifi instance on a CentOS based
virtual machine that will act as my Nifi data processing instance. This could
be cluster of Nifi as well but, in my case it will be just a single instance. Step3) Build Nifi data flow for the processing
instance. This will have an input port that will indicate that this instance
can accept data from other Nifi instances. Step4) Build Nifi data for the data
acquisition instance. This will have a “remote process group” that will talk to
the Nifi data processing instance via site-to-site protocol. Step5) Test out the overall flow. Attached is the document that provides detailed step by step instruction on how to set this up. data-flow-across-data-centers-v5.zip
... View more
Labels:
12-23-2015
06:50 PM
2 Kudos
The best you can do is export from a single component (i.e. table), take a screenshot of the dashboard or export the dashboard to load into another banana instance. The reason why you can't do an offline dashboard is because you would need the entire index. Dashboards typically contain summarized data and/or a subset of detailed records. In order for the dashboard to remain interactive (search, filter, faceting, etc) you would need the entire data set offline because it does all of the counts/aggregations on the fly.
... View more
07-28-2016
07:20 AM
So i found appropriate components but it doesnt convert the file properly, any idea? input file is a binary
... View more
11-23-2015
02:26 PM
Also of note, some community members have been able to write processors in Scala as well: https://github.com/jahhulbert-ccri/geomesa-nifi/blob/master/nifi-geomesa-nifi-processors/src/main/scala/org/locationtech/geomesa/nifi/GeoMesaIngestProcessor.scala
... View more