This project aims to perform semantic analysis on Turkish Grand National Assembly (TBMM) transcripts.
-
Install Dependencies
pip install -r requirements.txt
-
Setup Elasticsearch
- Follow the instructions in
docs/build_elasticsearch.md - Run the Elasticsearch Docker container
- Create the index using
scripts/create_elastic.py
- Follow the instructions in
-
Run Analysis
Step 1: Load Data into Elasticsearch
# Make sure Elasticsearch is running (Docker container) python src/aciklamalar_d25-d28.pyStep 2: Perform Topic Analysis
python src/analyze_speech_topics.py
Step 3: Generate Parliament Galaxy Visualization
python src/parliament_galaxy.py
- Data Scraping: Extract transcripts from TBMM website
- Elasticsearch Integration: Store and search parliamentary speeches
- Semantic Analysis: Analyze speech topics and patterns
- Data Visualization: Generate insights from parliamentary data
- Python
- Elasticsearch 8.6.1
- Docker (for Elasticsearch)
See individual script files in src/ directory for specific functionality.