07-22-2018 07:14 PM
A. Find out the top 5 categories with the maximum number of videos uploaded.
B. Find out the top 10 rated videos.
C. Find out the most viewed videos.
Column1: Video id of 11 characters.
Column2: uploader of the video of string data type.
Column3: Interval between the day YouTube was established and the upload date of the video, of integer data type.
Column4: Category of the video of String data type.
Column5: Length of the video of integer data type.
Column6: Number of views for the video of integer data type.
Column7: Rating on the video of float data type.
Column8: Number of ratings given on the video.
Column9: Number of comments on the videos in integer data type.
Column10: Related video ids with the uploaded video
Please, I need a guide on how to go about this. Any help will be appreciated.
07-25-2018 12:11 PM
* load this data into Hive,
* run queries such as: select category, count(*) number_of_videos from youtube_data group by category order by number_of_videos desc limit 5;
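A rough sketch of both steps in HiveQL follows. It assumes the data is a tab-separated text file. The column names, the file path, and the integer type for Column8 are hypothetical (the question only describes the columns by position), and the limit of 10 for "most viewed" is also an assumption, since the question gives no number for that one. Only the table name youtube_data comes from the query above.

```sql
-- Hypothetical column names, in the same order as Column1..Column10.
CREATE TABLE IF NOT EXISTS youtube_data (
  video_id      STRING,  -- Column1: 11-character video id
  uploader      STRING,  -- Column2
  upload_age    INT,     -- Column3: days between YouTube's establishment and upload
  category      STRING,  -- Column4
  video_length  INT,     -- Column5
  view_count    INT,     -- Column6
  rating        FLOAT,   -- Column7
  rating_count  INT,     -- Column8 (type assumed; not stated in the question)
  comment_count INT,     -- Column9
  related_ids   STRING   -- Column10 (assumed to arrive as a single field)
)
ROW FORMAT DELIMITED
FIELDS TERMINATED BY '\t'   -- assumption: tab-separated input
STORED AS TEXTFILE;

-- Hypothetical HDFS path; replace with the real location of the file.
LOAD DATA INPATH '/tmp/youtube_data.txt' INTO TABLE youtube_data;

-- B. Top 10 rated videos.
SELECT video_id, rating
FROM youtube_data
ORDER BY rating DESC
LIMIT 10;

-- C. Most viewed videos (limit of 10 is an assumption).
SELECT video_id, view_count
FROM youtube_data
ORDER BY view_count DESC
LIMIT 10;
```

If many videos share the same top rating, adding rating_count as a second sort key (ORDER BY rating DESC, rating_count DESC) may give a more meaningful top 10.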