Amazon Product Recommender (MIT IDSS – Capstone Project)

As part of a machine learning capstone from MIT-IDSS, this project developed a product recommendation system using a real-world dataset of Amazon product reviews. The goal was to design a recommender that improves customer engagement by suggesting relevant items based on user behaviour.

The work began with cleaning and preprocessing the dataset — removing duplicates, handling missing values, and addressing sparse user–item interactions. Through exploratory analysis, cold-start issues were investigated and user and product activity distributions were examined, establishing minimum rating thresholds to reduce noise from infrequent interactions.

Multiple models were implemented and evaluated, including:

All models were evaluated using precision@k and recall@k. The final solution was an on-demand, deployable recommender using precomputed predictions for fast Streamlit inference.

Tech used: Python, Google-Colab, Streamlit | Surprise and Pickle (main libraries).

Project Links

GitHub Repository Live Streamlit App

Notebook Preview

Amazon recommender notebook screenshot 1 Amazon recommender notebook screenshot 2 Amazon recommender notebook screenshot 3 Amazon recommender notebook screenshot 4

Due to MIT-IDSS copyright restrictions, the notebook cannot be shared publicly. I am happy to discuss further details upon request.

Notebook screenshot enlarged