IMDb data scraping + EDA + content-based recommender using scikit-learn.
Pipeline: cleaning → EDA → feature engineering → ML models (SVM, random forest).
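
A minimal sketch of what the content-based recommender step could look like, assuming TF-IDF text features and cosine similarity. The project's actual features and similarity measure are not stated here, so the `MOVIES` rows, the `recommend` helper, and the `genres`/`plot` fields are all hypothetical placeholders rather than the repository's code or data.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

# Hypothetical toy rows standing in for cleaned IMDb data (not real records).
MOVIES = [
    {"title": "Space Film A", "genres": "sci-fi adventure",
     "plot": "astronauts explore a distant planet"},
    {"title": "Space Film B", "genres": "sci-fi thriller",
     "plot": "astronauts stranded on a distant planet"},
    {"title": "Romance Film C", "genres": "romance comedy",
     "plot": "two strangers fall in love in paris"},
    {"title": "Crime Film D", "genres": "crime drama",
     "plot": "a detective hunts a killer in the city"},
]


def recommend(title, movies=MOVIES, top_n=2):
    """Return the top_n titles most similar to `title` by TF-IDF cosine similarity."""
    titles = [m["title"] for m in movies]
    idx = titles.index(title)  # raises ValueError if the title is unknown

    # Feature engineering (assumed): one text document per movie.
    docs = [m["genres"] + " " + m["plot"] for m in movies]
    tfidf = TfidfVectorizer(stop_words="english").fit_transform(docs)

    # Similarity of the query movie to every movie, as a flat array.
    sims = cosine_similarity(tfidf[idx], tfidf).ravel()

    # Rank from most to least similar, skipping the query movie itself.
    ranked = sims.argsort()[::-1]
    return [titles[i] for i in ranked if i != idx][:top_n]


if __name__ == "__main__":
    print(recommend("Space Film A"))
```

With real data, the per-movie document would be built from whichever scraped fields survive cleaning. The SVM and random forest models would be a separate supervised step, which this sketch does not cover.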