Skip to content

Interactive Streamlit-powered platform for fast, no-code exploratory data analysis with automated insights and rich visualizations.

Notifications You must be signed in to change notification settings

Tejas3545/Exploratory-Data-Analysis-EDA-

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

26 Commits
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

๐ŸŒŒ Exploratory Data Analysis (EDA) Web Platform

A seamless, intelligent, and interactive data-diagnostics experience โ€” built for analysts, students, researchers, and problem-solving minds.


โšก Overview

The Exploratory Data Analysis (EDA) Platform is a fully-interactive, browser-based analytics engine built with Streamlit, designed to simplify the most critical phase of any data project โ€” understanding your dataset.

It transforms raw data into actionable insights through:

  • Automated statistical summaries
  • Beautiful visualizations
  • Intelligent column analysis
  • Outlier detection
  • Correlation diagnostics
  • Categorical + numerical comparisons
  • Data quality reporting
  • Smooth UI with a frictionless workflow

This platform brings the discipline of classical statistics and the elegance of modern UI into one unified workspace.


๐ŸŽฏ Why This Project Exists

Traditional EDA is repetitive: You write the same code again and again โ€” describe(), info(), head(), heatmaps, histograms, null countsโ€ฆ

This app eliminates that cycle.

It empowers:

๐Ÿง‘โ€๐ŸŽ“ Students

To explore datasets visually and learn data patterns intuitively.

๐Ÿง‘โ€๐Ÿ’ผ Analysts

To rapidly inspect large CSV/Excel datasets without spinning up notebooks.

โš™๏ธ Engineers

To run sanity checks before pushing data pipelines to production.

๐Ÿงช Researchers

To validate hypotheses and visualize distributions in one place.

In a world obsessed with automation, this tool gives you a fast lane for intelligent discovery.


๐Ÿ› ๏ธ Key Features

๐Ÿ” 1. Automated Dataset Profiling

  • Column type inference
  • Missing-value mapping
  • Summary statistics (central tendency, spread, distribution shape)
  • Feature-type breakdown (numeric, categorical, datetime)

๐Ÿ“Š 2. Interactive Visualizations

Render complex charts instantly:

  • Histograms
  • Bar charts
  • Line graphs
  • Box plots
  • Scatter plots
  • KDE distributions
  • Correlation heatmaps
  • Pairwise feature relations

Animations for hover + transitions are powered by Streamlitโ€™s smooth rendering engine.


๐Ÿ“ˆ 3. Correlation Engine

Deep-dive into relationship patterns:

  • Pearson correlations
  • Spearman rank
  • Heatmaps
  • Highlight strongest signals
  • Feature influence margins

Perfect for ML model pre-analysis.


๐Ÿงน 4. Data Cleaning Assistance

(Full cleaning with but essential diagnostics)

  • Identify duplicate rows
  • Missing value density mapping
  • Unique value breakdown
  • Detection of constant columns
  • Detection of skewed / sparse features

๐Ÿงช 5. Smart Sampling & Filtering

Instantly:

  • Filter rows
  • Sort columns
  • Drop missing data
  • Sample N% of dataset
  • Preview dataset slices

๐Ÿงฐ 6. No-Code EDA Workflow

Everything is point-and-click. No installation headaches. 100% cloud-ready.


๐Ÿงฌ Architecture Overview

๐Ÿ“ฆ Exploratory-Data-Analysis-EDA-
โ”‚
โ”œโ”€โ”€ app.py                      # Main Streamlit app
โ”œโ”€โ”€ components/                 # Reusable UI blocks
โ”œโ”€โ”€ utils/                      # Helper functions for EDA
โ”œโ”€โ”€ requirements.txt / pyproject.toml
โ”œโ”€โ”€ runtime.txt (Python version)
โ””โ”€โ”€ assets/                     # Images, gifs, UI previews

๐ŸŒ Frontend

  • Streamlit's declarative component engine
  • Responsive UI containers & animations
  • Dynamic chart rendering

๐Ÿงฎ Backend

  • pandas
  • numpy
  • matplotlib / seaborn
  • pyarrow (if needed, Streamlit manages versions)

โšก Performance Mindset

This app is optimized for:

  • Fast file loading
  • Efficient memory handling
  • Real-time chart rendering
  • Smooth state transitions
  • Minimal user delay

Even large CSVs load gracefully thanks to careful batching.


๐Ÿ“ฆ Installation (Local Development)

# Clone the repository  
git clone https://github.com/Tejas3545/Exploratory-Data-Analysis-EDA-.git

cd Exploratory-Data-Analysis-EDA-

# Create virtual environment  
python -m venv venv
venv\Scripts\activate  # Windows
# or source venv/bin/activate  # Mac/Linux

# Install dependencies  
pip install -r requirements.txt

# Start the app  
streamlit run app.py

๐Ÿ” Deployment Notes (Streamlit Cloud)

To avoid deployment failures:

โœ”๏ธ Set Python version

Add runtime.txt:

python-3.10

โœ”๏ธ Do NOT manually pin pyarrow

Streamlit bundles correct versions; building from source breaks CI.

โœ”๏ธ Keep your pyproject.toml simple

Only include necessary dependencies.


๐Ÿค Contributing

Contributions are welcomed with gratitude. Before submitting a PR:

  1. Follow consistent formatting
  2. Add comments for complex logic
  3. Test your components
  4. Update README if you add/remove features

Workflow:

git checkout -b feature/your-feature-name
git commit -m "add: new chart component"
git push origin feature/your-feature-name

๐Ÿ›ฃ๏ธ Roadmap

  • Add automated PDF report generation
  • Add drag-and-drop interface enhancements
  • AI-powered anomaly detection
  • ML feature importance module
  • Add theme switcher (dark/light)
  • Add column transformation tools
  • Data cleaning automation

๐Ÿง‘โ€๐Ÿ’ผ Maintainer

Tejas J. Solanki ๐Ÿ“ India ๐Ÿ’ผ Engineering (B.Tech IT) ๐ŸŒ GitHub: https://github.com/Tejas3545

Dixita Balapuriya ๐Ÿ“ India ๐Ÿ’ผ Engineering (B.Tech CSE) ๐ŸŒ GitHub: https://github.com/dixitaBalapuriya19


๐ŸŒŸ Final Words

This platform is not just an EDA tool โ€” itโ€™s a gateway into structured thinking, a companion for aspiring analysts, a rapid-fire engine for insights, and a testament to disciplined engineering.

Releases

No releases published

Packages

No packages published

Contributors 3

  •  
  •  
  •