- Subscribe to RSS Feed
- Mark Question as New
- Mark Question as Read
- Float this Question for Current User
- Bookmark
- Subscribe
- Mute
- Printer Friendly Page
Please i need assist with this project ; Please i need a guide to go about this. Thanks
- Labels:
-
MapReduce
Created 07-22-2018 07:14 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Problem Statement
A. Find out the top 5 categories with maximum number of videos uploaded.
B. Find out the top 10 rated videos.
C. Find out the most viewed videos.
Dataset
http://www.edureka.co/medias/6cchxi6to4
Dataset Description
Column1: Video id of 11 characters.
Column2: uploader of the video of string data type.
Column3: Interval between day of establishment of Youtube and the date of uploading of the video of integer data type.
Column4: Category of the video of String data type.
Column5: Length of the video of integer data type.
Column6: Number of views for the video of integer data type.
Column7: Rating on the video of float data type.
Column8: Number of ratings given on the video.
Column9: Number of comments on the videos in integer data type.
Column10: Related video ids with the uploaded video
PLease i need a quide to go about this. Any help will be appreciated.
Created 07-25-2018 12:11 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Hi ,
For example:
* load this data to hive,
* run queries such as: select category, count(*) number_of_videos from youtube_data order by number_of_videos desc limit 5;
Regards
Andrzej
Created 07-25-2018 12:11 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Hi ,
For example:
* load this data to hive,
* run queries such as: select category, count(*) number_of_videos from youtube_data order by number_of_videos desc limit 5;
Regards
Andrzej
