Salesforce integration with Hortonworks DataFlow (Apache NiFi)

The requirement is to extract data from Salesforce and ingest it into Hive. Is this a good use case for HDF?

The main requirement is to pull data from Salesforce. Which processors are appropriate here?

The InvokeHTTP processor and/or the ExtractText processor?

1 ACCEPTED SOLUTION

Master Mentor

You could use the InvokeHTTP processor to connect to a REST API and pull data. Once NiFi has the data, you could do things like extract parts of it into NiFi FlowFile attributes using the ExtractText processor.
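
To make the two steps concrete, here is a minimal Python sketch of what they amount to outside NiFi. It only builds the query URL that an InvokeHTTP processor could be pointed at and then pulls values out of a response body with regular expressions, loosely mimicking what ExtractText does. The instance URL, API version, attribute names, and sample response are assumptions for illustration, not values from this thread.

```python
import re
from urllib.parse import urlencode


def build_query_url(instance_url, api_version, soql):
    """Build a Salesforce REST query URL (the kind of URL InvokeHTTP would GET)."""
    return f"{instance_url}/services/data/{api_version}/query?{urlencode({'q': soql})}"


def extract_attributes(content, patterns):
    """Loosely mimic ExtractText: each named regex stores its first capture group."""
    attributes = {}
    for name, pattern in patterns.items():
        match = re.search(pattern, content)
        if match:
            attributes[name] = match.group(1)
    return attributes


# Hypothetical instance URL and API version; substitute your own org's values.
url = build_query_url(
    "https://example.my.salesforce.com", "v52.0", "SELECT Id, Name FROM Account"
)

# Hypothetical, heavily shortened response body.
sample_response = '{"totalSize": 2, "done": true, "records": []}'

# Hypothetical attribute names mapped to the regex that fills them.
attrs = extract_attributes(
    sample_response,
    {
        "sf.total.size": r'"totalSize":\s*(\d+)',
        "sf.done": r'"done":\s*(true|false)',
    },
)

print(url)
print(attrs)
```

In a real flow the URL would go into the InvokeHTTP configuration and each regex would become a property on ExtractText; the script is only meant to show the shape of the data at each step.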

8 REPLIES

Master Mentor

@Ancil McBarnett, the Salesforce API guide covers many different ways to do this. I'd use the REST processors in NiFi for that.

Rising Star

Matt, if you have a detailed document on importing data from Salesforce into Hadoop using NiFi, please share it.

New Member

The InvokeHTTP processor would work best. You would need to follow the Salesforce authentication guidelines for the REST API: here.
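
As a rough illustration of the authentication step, the Python sketch below builds (but does not send) an OAuth 2.0 token request against the Salesforce login endpoint, then shows the header a later data call would carry. It assumes the username-password flow, which is only one of the flows Salesforce documents and may not be enabled for your org; all credentials are placeholders.

```python
from urllib.parse import urlencode
from urllib.request import Request

# Standard production login host; sandboxes use a different host.
TOKEN_URL = "https://login.salesforce.com/services/oauth2/token"


def build_token_request(client_id, client_secret, username, password):
    """Build the POST request for the OAuth 2.0 username-password flow (assumed flow)."""
    body = urlencode(
        {
            "grant_type": "password",
            "client_id": client_id,
            "client_secret": client_secret,
            "username": username,
            "password": password,
        }
    ).encode("utf-8")
    return Request(
        TOKEN_URL,
        data=body,
        method="POST",
        headers={"Content-Type": "application/x-www-form-urlencoded"},
    )


def auth_header(access_token):
    """Header to attach to subsequent REST calls once a token has been issued."""
    return {"Authorization": f"Bearer {access_token}"}


# Placeholder credentials, for illustration only.
request = build_token_request(
    "my-client-id", "my-client-secret", "user@example.com", "my-password"
)

print(request.get_method(), request.full_url)
print(auth_header("PLACEHOLDER_TOKEN"))
```

In NiFi, one InvokeHTTP processor would typically make the token call and a second one would make the data call with the Authorization header set from the returned token.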

New Member

Salesforce requires a SOAP login to access the API. Linking to another discussion about WSDL and SOAP.

Explorer

I came across this thread. My colleague published a more recent option on the DataDirect blog, using Salesforce JDBC with Apache NiFi.


Connecting Salesforce with Hortonworks DataFlow (powered by Apache NiFi) gives you a flexible data integration path. Here's how:

  • NiFi Processors: Use processors like "InvokeHTTP" to call Salesforce APIs and retrieve data.
  • Real-Time or Batch: Move data in either direction (Salesforce to NiFi or vice versa), in real time or in batches.
  • Data Transformation: Cleanse, transform, and enrich data using NiFi's processors before storing it in your data lake.

This Salesforce integration lets you use Salesforce data for analytics, reporting, and deeper customer insights.
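
For the transformation step, a small Python sketch of the idea: take a Salesforce query response, drop the per-record metadata, and emit delimited rows that a Hive table could be loaded from. The sample payload and column list are made up for illustration, and in NiFi itself this would be done with processors rather than a script.

```python
import csv
import io
import json


def records_to_csv(response_json, columns):
    """Flatten the "records" array of a query response into CSV rows."""
    payload = json.loads(response_json)
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    for record in payload.get("records", []):
        # Only the requested columns are kept, so the "attributes" metadata is dropped.
        writer.writerow([record.get(column, "") for column in columns])
    return buffer.getvalue()


# Hypothetical response body with two Account records.
sample = json.dumps(
    {
        "totalSize": 2,
        "done": True,
        "records": [
            {"attributes": {"type": "Account"}, "Id": "001A", "Name": "Acme"},
            {"attributes": {"type": "Account"}, "Id": "001B", "Name": "Globex, Inc."},
        ],
    }
)

csv_text = records_to_csv(sample, ["Id", "Name"])
print(csv_text)
```

Values that contain the delimiter are quoted by the csv module, so the Hive table definition would need a SerDe or delimiter choice that matches.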