--- tags: - nlp - regression - tfidf - ridge - summaries - kaggle --- # 🧠 CommonLit Summary Scoring Model This model was trained using the **CommonLit Evaluate Student Summaries** dataset on Kaggle. It predicts two scores for student-written summaries: - `content` → Idea coverage quality - `wording` → Clarity and phrasing quality Built with: - TF-IDF vectorizer - Ridge Regression (scikit-learn) - MultiOutputRegressor wrapper Example usage: ```python from joblib import load model = load("ridge_model.pkl") tfidf = load("tfidf_vectorizer.pkl") summary = "This text discusses..." X = tfidf.transform([summary]) pred = model.predict(X)