Build a Movie Recommendation API using Scikit-Learn, Flask and Heroku

what is a recommendation system ?

Content-Based recommendation works on the principle that if a user likes a certain item then we recommend the user a similar item based on the item’s features or attributes. So in our case, if a user likes a movie of a particular genre or an actor then we recommend a movie on similar lines to our user. So, if a user has watched the movie Joker then our recommendation system would predict movies similar to Joker or with the same cast as that of Joker if we consider the movie’s cast.

Source Datasets:-

TMDB 5000 Movies Dataset: https://www.kaggle.com/tmdb/tmdb-movie-metadata
The Indian Movie Database: https://www.kaggle.com/tmdb/tmdb-movie-metadata

Dataset Overview

we have 6477 movies in our dataset with the following attributes:-

cast: Top 3 actors/actresses of the movies genres: Top 3 genres of the movies movie_id: TMDb and IMDb IDs for Hollywood and Bollywood movies original_title: Title of the movies plot: Basic overview of the movie

cleaning folder Description:-

cleaningData.ipynb

cleaning/preprocessing the data set before getting into a recommendation

recommendation folder Description:-

recommendation.py

Here, we have our logic written for the recommendation algorithm. Consists a total of 5 functions

get_data():-

get_data() is used to fetch the data about the movies and return the dataset with it's attributes as the result for further preprocessing.

combine_data():-

combine_data() drops the columns not required for feature extraction and then combines the cast and genres column,finally returning the combine column as the result of this function.

It will be helpful for you to have a basic understanding of these topics before we move on to the next part :-

transform_data():-

transform_data() takes the value returned by combine_data() and the plot column from get_data() and applies CountVectorizer and TfidfVectorizer respectively and calculates the Cosine values.

recommend_movies():-

recommend_movies() takes four parameters i.e movie_name and return values of get_data(), combine_data(), transform_data as input and returns the top 20 movies similar to our input movie.

results():-

result() takes a movie's title as input and returns the top 20 recommendations using the recommend_movies() function.

Deployment folder Description:-

main.py

The Flask app where we use the recommendation.py as a module and return the results in JSON format.

requirements.txt and .gitignore and setup.sh

This has all the dependencies required to deploy our application on Heroku

Procfile

A Procfile is a file which describes how to run your application.

movie_dic.pkl and movie_ist.pkl

encoded data

Data set Folder :-

Contains the dataset we have used to build our recommendation system.

Finally the performance of my project :-

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
Deployment		Deployment
cleaning		cleaning
data sets		data sets
images		images
presentation		presentation
recommendation		recommendation
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Build a Movie Recommendation API using Scikit-Learn, Flask and Heroku

what is a recommendation system ?

Source Datasets:-

Dataset Overview

we have 6477 movies in our dataset with the following attributes:-

cleaning folder Description:-

cleaningData.ipynb

recommendation folder Description:-

recommendation.py

get_data():-

combine_data():-

It will be helpful for you to have a basic understanding of these topics before we move on to the next part :-

transform_data():-

recommend_movies():-

results():-

Deployment folder Description:-

main.py

requirements.txt and .gitignore and setup.sh

Procfile

movie_dic.pkl and movie_ist.pkl

Data set Folder :-

Finally the performance of my project :-

About

Releases

Packages

Languages

liliansteven/Movie-Recommendation-System-using-flask-and-Heroku

Folders and files

Latest commit

History

Repository files navigation

Build a Movie Recommendation API using Scikit-Learn, Flask and Heroku

what is a recommendation system ?

Source Datasets:-

Dataset Overview

we have 6477 movies in our dataset with the following attributes:-

cleaning folder Description:-

cleaningData.ipynb

recommendation folder Description:-

recommendation.py

get_data():-

combine_data():-

It will be helpful for you to have a basic understanding of these topics before we move on to the next part :-

transform_data():-

recommend_movies():-

results():-

Deployment folder Description:-

main.py

requirements.txt and .gitignore and setup.sh

Procfile

movie_dic.pkl and movie_ist.pkl

Data set Folder :-

Finally the performance of my project :-

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages