This project focuses on analyzing historical student data from Jamboree Education, a leading institute for test prep and admissions consulting.
The goal is to build a Linear Regression model to understand how different student attributes influence admission chances and to guide data-driven decisions in marketing, outreach, and course design.
Using regression modeling, we quantify how key factors (such as test scores, CGPA, and research experience) affect admission probability and derive strategic insights that can help improve student conversion rates.
| Feature | Description |
|---|---|
| GRE_Score | GRE score of the student |
| TOEFL_Score | TOEFL score of the student |
| University_Rating | Rating of the university (1 to 5) |
| SOP | Strength of Statement of Purpose (1 to 5) |
| LOR | Strength of Letter of Recommendation (1 to 5) |
| CGPA | CGPA of the student |
| Research | Binary indicator if the student has research experience (0/1) |
| Chance_of_Admit | π― Target variable β Probability of admission (0 to 1) |
- Perform Exploratory Data Analysis (EDA) to understand student characteristics
- Build and interpret a Multiple Linear Regression model
- Identify which features most significantly impact the chance of admission
- Evaluate model performance using metrics like RΒ², RMSE, and residual plots
- Translate analytical results into actionable insights for admissions strategy
- CGPA has the highest positive correlation with admission probability.
- Students with CGPA above 8.5 have a significantly higher chance of admission.
- Candidates with research experience perform better even with moderate GRE/TOEFL scores.
- Indicates that universities value research exposure during evaluation.
- Both scores contribute positively but plateau after a certain threshold.
- Focused improvement helps, but returns diminish beyond a score range.
- SOP and LOR scores are key differentiators when academic metrics are average.
- Subjective elements can influence decisions in competitive cases.
- The Linear Regression model explains about 80% of the variance (RΒ² β 0.8).
- Residual analysis shows minimal heteroscedasticity β model assumptions hold true.
- π― Prioritize high-CGPA students for programs with strict academic cutoffs.
- π¬ Encourage research projects to enhance student profiles and admission chances.
- π§ Offer GRE/TOEFL booster programs for students with potential but low test scores.
- βοΈ Conduct SOP/LOR workshops to help borderline applicants strengthen their profiles.
- π Integrate model insights into a dashboard to assist counselors in personalized guidance.
By applying Linear Regression, Jamboree Education can predict admission probabilities and optimize strategies for recruitment, counseling, and student success.
This project empowers data-backed decision-making, ensuring higher conversion rates, better guidance, and improved outcomes for both the institute and its students.
Python, Pandas, NumPy, Matplotlib, Seaborn, scikit-learn, Jupyter Notebook
Ankit Verma
π Data Analyst | Machine Learning Enthusiast
π« Gmail.com | π LinkedIn
β If you found this project insightful, consider giving it a star on GitHub!