I started a new job and I will soon be working with Hive and Pyspark to pull from the company's big data lake. I have lots of experience with Python and SQL but not much with big data systems. Can anyone recommend any good books to help a data scientist understand how to work with Hadoop systems? Extra helpful if they go into detail on Hive and Spark