Get date from Filename
There are ways to get at the filename in MapReduce, but it's difficult: MapReduce by design abstracts filenames away. You have a few options:
1) Use a small preprocessing script (Python, Java, shell, whatever) OUTSIDE Hadoop that takes the date from the filename and adds it as a field to each row of each file. Easy, but not that scalable.
2) Write your own RecordReader that reads the file path from the input split and emits the date along with each record.
3) Pig seems to provide a PigStorage option called -tagsource that can do the same by adding the source filename as a field.
http://stackoverflow.com/questions/9751480/how-can-i-incorporate-the-current-input-filename-into-my-...
4) Hive has a hidden (virtual) column for the filename, INPUT__FILE__NAME, so you could use that to compute a date column.
https://cwiki.apache.org/confluence/display/Hive/LanguageManual+VirtualColumns
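
For option 1, a minimal Python sketch of such a preprocessing script might look like the following. It assumes the filename contains a date written as YYYY-MM-DD or YYYYMMDD and that rows are comma-delimited. Both are assumptions, and the function names date_from_filename and add_date_column are made up for illustration. Adjust the pattern to whatever your files are actually called.

```python
import re
from pathlib import Path

# Assumption: the filename contains a date such as 2016-03-01 or 20160301.
DATE_RE = re.compile(r"(\d{4})-?(\d{2})-?(\d{2})")


def date_from_filename(filename):
    """Return the date embedded in the file's base name as YYYY-MM-DD, or None."""
    # Only look at the base name so digits in parent directories are ignored.
    match = DATE_RE.search(Path(filename).name)
    if match is None:
        return None
    return "-".join(match.groups())


def add_date_column(src, dst, delimiter=","):
    """Copy src to dst, appending the date from src's filename to every row."""
    date = date_from_filename(src)
    if date is None:
        raise ValueError("no date found in filename: %s" % src)
    with open(src) as fin, open(dst, "w") as fout:
        for line in fin:
            fout.write(line.rstrip("\n") + delimiter + date + "\n")
```

You would run add_date_column over each input file before copying the results into HDFS. Since it runs on a single machine, it only makes sense for modest data volumes.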