We have some of our data stored in a MySQL database, and we are looking to build or buy an ETL solution that will let us move that data into Hive. Can someone recommend best practices for doing this? If possible, we would like to move the data in real time.
NiFi is one option for what you need. You can find an example here of moving data from a generic SQL database to Hive "in real time". If you can describe your use case in more detail, the community can assist you better.
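To make the "near real-time" idea a bit more concrete, here is a rough sketch of the incremental-pull pattern that this kind of flow usually relies on: remember the largest key value seen so far and, on each poll, fetch only rows above it. This is only an illustration under assumptions, not the linked example or NiFi's own code. It uses Python's built-in `sqlite3` as a stand-in for MySQL so it runs anywhere, and the `orders` table, the `id` column, and the `fetch_new_rows` helper are all hypothetical names. Loading the fetched rows into Hive is left out.

```python
import sqlite3


def fetch_new_rows(conn, table, key_column, last_seen):
    """Return rows with key_column > last_seen, plus the new high-water mark.

    Identifiers cannot be bound as SQL parameters, so `table` and
    `key_column` must come from trusted configuration, never user input.
    """
    query = f"SELECT * FROM {table} WHERE {key_column} > ? ORDER BY {key_column}"
    cursor = conn.execute(query, (last_seen,))
    columns = [d[0] for d in cursor.description]
    rows = [dict(zip(columns, r)) for r in cursor.fetchall()]
    # If nothing new arrived, keep the previous mark unchanged.
    new_mark = rows[-1][key_column] if rows else last_seen
    return rows, new_mark


def demo():
    # In-memory SQLite stands in for the MySQL source (assumption).
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE orders (id INTEGER PRIMARY KEY, item TEXT)")
    conn.executemany("INSERT INTO orders (item) VALUES (?)", [("a",), ("b",)])

    # First poll picks up everything above the initial mark of 0.
    first, mark = fetch_new_rows(conn, "orders", "id", 0)

    # A new row arrives between polls; the second poll returns only that row.
    conn.execute("INSERT INTO orders (item) VALUES (?)", ("c",))
    second, mark = fetch_new_rows(conn, "orders", "id", mark)
    return first, second, mark
```

Keep in mind that polling on an ever-increasing column like this only picks up inserted rows. It will not notice deletes, and it will only see updates if the tracked column is something like a last-modified timestamp, so whether it is good enough depends on your use case.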