Youtube Analysis in Python

Youtube Analysis in Python

Abstract:

There is a tremendous growth and popularity of YouTube. It has the potential to touch billions of lives globally as the no. of YouTube users is growing day by day. YouTube, owned by Google, a video streaming site which has billions of users and 400 hours of videos are being uploaded every minute. Almost billions of videos are watched on YouTube every single day, generating mammoth amount of data daily. Since YouTube data is generally in unstructured form, there is an increased demand to store, process and analyze such real time Big Data. YouTubers can analyze their own channel performance with YouTube Analytics. But one can not analyze other channels. The proposed system uses MapReduce framework of Hadoop for processing and analyzing real time YouTube datasets. It will help in discovering how competitors are performing on YouTube. One can easily identify what content works best on YouTube. These analytical data can be represented in demographic form which can be used by individuals and organizations for making immediate actionable decisions so as to gain competitive advantages.