Lead Conversion Prediction (MIT IDSS – Capstone Project)

As part of an MIT-IDSS capstone project, a predictive analytics system was built to identify leads most likely to convert into paying customers for the fictional EdTech company ExtraaLearn.

The workflow began with data cleaning — removing duplicates, reconciling inconsistencies, validating ranges, and addressing anomalies — followed by univariate and bivariate exploratory analysis.

Feature engineering was performed to create engagement scores, interaction ratios, and normalised lead scores that better represent user behaviour, then multiple models were evaluated:

This was followed by hyperparameter tuning, threshold optimisation, and precision–recall tradeoff analysis. From the best-performing model, feature-importance insights were extracted and lead profiles were developed to support targeted marketing.

Tech Used: Python, Google Colab | Scikit-learn, Pandas and Seaborn (main libraries)

Notebook Preview

Lead conversion notebook screenshot 1 Lead conversion notebook screenshot 2 Lead conversion notebook screenshot 3 Lead conversion notebook screenshot 4

Due to MIT-IDSS copyright restrictions, the notebook cannot be shared publicly. I am happy to discuss further details upon request.

Notebook screenshot enlarged