- Subscribe to RSS Feed
- Mark Question as New
- Mark Question as Read
- Float this Question for Current User
- Bookmark
- Subscribe
- Mute
- Printer Friendly Page
Writing a Map reduce code with larger and smaller file
- Labels:
-
Apache Hadoop
Created ‎04-05-2018 10:04 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
I have a large file of 5 GB which has detailed information about an Employee and also, i have 1 small file with 2 MB which has only employee names. I want to extract the employee names from the smaller file and do analysis on larger file using employee name. How can I do this in Map reduce ?
Created ‎04-12-2018 08:17 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
---> Large file can be input to your Map Reduce program
----> Small file can be passed in distributed cache and can be loaded in List
----> Inside your mapper function, you can do comparisons(Input file vs List) or any other operation which you want.
Let me know if you need help in MR job.
Please post sample data for files and operation you want to perform.
Created ‎04-12-2018 08:17 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
---> Large file can be input to your Map Reduce program
----> Small file can be passed in distributed cache and can be loaded in List
----> Inside your mapper function, you can do comparisons(Input file vs List) or any other operation which you want.
Let me know if you need help in MR job.
Please post sample data for files and operation you want to perform.
Created ‎04-19-2018 12:40 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
@Rakesh AN If above information helped you, Could you please accept answer?
