New Contributor
Posts: 1
Registered: ‎10-10-2018

Near/real-time Outlook email ingestion

[ Edited ]

Hello everyone,


I have a task that requires email ingestion as soon as is received in outlook, then extract some information by doing a search based on keywords and store the extracted information in hive : 


Near/real-time email ingestion ---> extract value --> Store into hive


I read that NIFI can do the job but isn't included in Cloudera.

My question is there any Cloudera service (Flume/Kafka/Spark ....) that can connect to outlook capture emails that satisfy certain criteria, or do I have to make a python code using imaplib and run it using Cron on each time interval.


any given hint is appreciated.

Posts: 34
Registered: ‎03-07-2017

Re: Near/real-time Outlook email ingestion