Deep Learning โข Education Analytics โข Data Visualization
Predicting student performance with TensorFlow neural networks and data-driven insights.
Visual summary of the project: predicting student performance using TensorFlow neural networks and educational analytics.
- ๐ Overview
- ๐ Objectives
- โก Key Features
- ๐ Repository Structure
- ๐งช Technologies Used
- ๐ง Model Architectures
- ๐ Results & Evaluation
- ๐ Installation
- ๐ฅ๏ธ Usage
- ๐ Visualizations
- ๐ฎ Future Improvements
- ๐ License
- ๐ Acknowledgments
- ๐ค Author
This project applies Deep Learning to predict student performance from real-world data.
It covers the full machine learning lifecycle โ data exploration, exploratory data analysis, preprocessing, modeling, and evaluation โ to extract actionable insights that enhance learning outcomes.
- Analyze and visualize student performance patterns
- Build predictive models using Artificial Neural Networks (ANNs)
- Evaluate results using key metrics for regression and classification
โ
Complete data science pipeline (EDA โ Modeling โ Evaluation)
โ
Built with TensorFlow/Keras for scalable deep learning
โ
Interactive Jupyter Notebook walkthrough
โ
Beautiful data visualizations for insights and interpretability
โ
Reproducible and modular โ ideal for research or education projects
| Path | Description |
|---|---|
.gitignore |
Git configuration to exclude unnecessary files |
LICENSE |
MIT license for open-source distribution |
README.md |
Project overview, methodology, and visualizations |
requirements.txt |
Python dependencies for running the project |
StudentPerformanceFactors.csv |
Dataset containing student performance features |
student_performance_dl_analysis.ipynb |
Main notebook with EDA, preprocessing, modeling, and evaluation |
assets/ |
Folder contains visual assets used in the README (plots, thumbnails, etc.) |
| Category | Tools / Libraries |
|---|---|
| Language | Python 3.10+ |
| Data Processing | NumPy, Pandas, Scikit-learn |
| Visualization | Matplotlib, Seaborn |
| Modeling | TensorFlow, Keras |
| Environment | Jupyter Notebook |
This project features two deep learning models built with TensorFlow/Keras: one for regression and one for classification. Both models share a clean, interpretable architecture and are optimized for educational data.
| Layer | Configuration |
|---|---|
| Input Layer | Receives preprocessed feature vector |
| Hidden Layer 1 | Dense(256), ReLU activation |
| Dropout Layer | Dropout(0.3) |
| Hidden Layer 2 | Dense(128), ReLU activation |
| Hidden Layer 3 | Dense(64), ReLU activation |
| Output Layer | Dense(1) |
- Loss Function: Mean Squared Error (MSE)
- Optimizer: Adam
- Evaluation Metric: Mean Absolute Error (MAE)
This model predicts continuous exam scores based on behavioral and academic features.
| Layer | Configuration |
|---|---|
| Input Layer | Receives preprocessed feature vector |
| Hidden Layer 1 | Dense(128), ReLU activation |
| Hidden Layer 2 | Dense(64), ReLU activation |
| Output Layer | Dense(3), Softmax activation |
- Loss Function: Categorical Crossentropy
- Optimizer: Adam
- Evaluation Metric: Accuracy
This model classifies students into three performance tiers: Low, Medium, and High
Model performance was evaluated using key metrics and visual diagnostics from the notebook.
- MSE: ~4.48
- MAE: ~0.89
- Rยฒ Score: ~0.69
Evaluation Visuals:
- Predicted vs. Actual grade scatter plot
- Training vs Validation Loss Curve
- Residual Distribution Plot
- Training Accuracy: 100%
- Validation Accuracy: ~98%
Evaluation Visuals:
- Training vs Validation Accuracy
The results indicate that both models generalize well, with stable learning curves and limited overfitting due to dropout and early stopping.
- Clone the repository
git clone https://github.com/ArianJr/student-performance-deep-learning.git
cd student-performance-deep-learning- Install dependencies
pip install -r requirements.txtOpen the Jupyter notebook to explore the analysis and models:
jupyter notebook student_performance_dl_analysis.ipynbFollow the notebook to:
- ๐ Explore data distributions
- ๐ง Build and train models
- ๐ Evaluate predictive performance
Visual insights play a key role in understanding student performance:
- Correlation Heatmap: Reveals relationships between features
- Class Distribution: Shows balance of target labels
- Model Metrics: Visualizes loss and accuracy trends
- ๐งฉ Feature Engineering: Explore polynomial or interaction features
- ๐ง Hyperparameter Tuning: Use
KerasTunerorOptunafor optimal architectures - ๐งพ Cross-Validation: Add k-fold validation to reduce variance
- ๐ Explainability: Incorporate SHAP or LIME for feature importance visualization
- โ๏ธ Deployment: Wrap models in a Flask API or Streamlit dashboard for real-time prediction
- ๐ Data Expansion: Include demographic and attendance trends for improved accuracy
This project is licensed under the MIT License. See the LICENSE file for details.
- Dataset Source: Kaggle - Student Performance
- Libraries: TensorFlow, Keras, Scikit-learn, Pandas, NumPy, Matplotlib, Seaborn
- Inspired by educational data mining techniques and performance analytics research
Arian Jr
๐ง Contact Me โข ๐ GitHub Profile
Made with โค๏ธ by ArianJr
โญ If you found this project useful, please consider giving it a star! It helps others discover it and supports my work.



