We have couple of use cases to implement related to source being a Mainframe system where data are kept as large files. These data needs to be offloaded to Hadoop both in Batch and Real Time mechanism for further downstream analytics.
Wanted to know on implementation of below processes :
1) Batch load of Mainframe data (both as files and from DB2) to Hadoop
2) Real Time ingestion mechanism (incremental) of Mainframe data (both as file and from DB2) to Hadoop.
Looking for your help to come with the architecture and ideas for implementation.
Thanks and Regards,