I'm an MA Linguistics student specializing in computational linguistics, NLP pipelines, and quantitative text analysis. My work blends linguistic theory with data-driven modeling, focusing on corpus building, feature engineering, and reproducible R/Python workflows.
Tools: R, spaCy, PCA, ggplot2
A computational stylometry pipeline comparing minimalist vs. maximalist prose using POS-based features and dimensionality reduction.
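As a rough illustration of the POS-based feature step, here is a minimal Python sketch, not the project's actual code. It assumes texts are already POS-tagged (for example by spaCy) and turns each one into a vector of relative tag frequencies. The tag inventory, the `pos_profile` helper, and the toy samples are all hypothetical. A matrix of such vectors is the kind of input that PCA would then reduce.

```python
from collections import Counter

# Hypothetical tag inventory; a real pipeline would use the tagger's full tag set.
POS_TAGS = ["NOUN", "VERB", "ADJ", "ADV", "PRON"]


def pos_profile(tagged_tokens, tags=POS_TAGS):
    """Return the relative frequency of each tag in one (token, tag) sequence."""
    total = len(tagged_tokens)
    if total == 0:
        return [0.0] * len(tags)
    counts = Counter(tag for _, tag in tagged_tokens)
    return [counts[tag] / total for tag in tags]


# Toy, hand-tagged samples (illustrative only, not project data).
minimalist = [("rain", "NOUN"), ("fell", "VERB"), ("he", "PRON"), ("left", "VERB")]
maximalist = [
    ("vast", "ADJ"),
    ("glittering", "ADJ"),
    ("rain", "NOUN"),
    ("fell", "VERB"),
    ("endlessly", "ADV"),
]

# One row per text, one column per POS tag.
feature_matrix = [pos_profile(minimalist), pos_profile(maximalist)]
```

Using proportions rather than raw counts keeps texts of different lengths comparable before any dimensionality reduction is applied.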
Tools: R Markdown, visualization, statistical modeling
A complete workflow for cleaning, structuring, and analyzing sociolinguistic survey data, with a focus on accent perception and solidarity ratings.
Tools: Excel → R preprocessing, ggplot2
1,500+ manually annotated discourse-marker tokens across age groups, examined for variation, distribution, and functional categories.
- Quantitative corpus linguistics
- NLP pipelines for linguistic research
- Stylometry & author profiling
- Feature engineering (POS, dependency, lexical)
- Visualization-heavy analysis (ggplot2)
- R Markdown & reproducible workflows
📍 Freiburg, Germany