napalm-git/README.md

🌌 Hi, I'm Ersin 👋

Computational Linguist • NLP • Corpus Analysis • R • spaCy • PCA

I'm an MA Linguistics student specializing in computational linguistics, NLP pipelines, and quantitative text analysis. My work blends linguistic theory with data-driven modeling, focusing on corpus building, feature engineering, and reproducible R/Python workflows.


🛠️ Tech Stack

Languages & Frameworks

Methods


📌 Featured Projects

🔹 Hemingway vs Lovecraft Stylometry

Tools: R, spaCy, PCA, ggplot2

A computational stylometry pipeline comparing minimalist and maximalist prose using POS-based features and dimensionality reduction.
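
As a rough illustration of what "POS-based features" can look like, here is a minimal Python sketch that turns a sequence of part-of-speech tags into a relative-frequency profile. It is not the project's actual code: the tag inventory `POS_TAGS`, the helper `pos_profile`, and the two toy tag sequences are all hypothetical, and a real pipeline would get its tags from a tagger such as spaCy before passing the resulting feature matrix on to PCA.

```python
from collections import Counter

# Hypothetical tag inventory, chosen only for this illustration.
POS_TAGS = ["NOUN", "VERB", "ADJ", "ADV", "PRON"]


def pos_profile(tags):
    """Return the relative frequency of each tag in POS_TAGS for one text."""
    counts = Counter(tags)
    total = len(tags)
    if total == 0:
        # An empty text has no tags, so every proportion is zero.
        return [0.0] * len(POS_TAGS)
    return [counts[tag] / total for tag in POS_TAGS]


# Toy, made-up tag sequences (not real corpus data).
sparse_text = ["PRON", "VERB", "NOUN", "PRON", "VERB"]
dense_text = ["ADJ", "ADJ", "NOUN", "VERB", "ADV", "ADJ", "NOUN", "ADJ"]

# One row per text, one column per POS tag: the kind of matrix
# a dimensionality-reduction step would take as input.
features = [pos_profile(sparse_text), pos_profile(dense_text)]

for row in features:
    print(dict(zip(POS_TAGS, row)))
```

Using proportions rather than raw counts keeps texts of different lengths comparable, which matters when the rows are later projected into a shared low-dimensional space.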


🔹 Stella Survey Accent Perception Analysis

Tools: R Markdown, visualization, statistical modeling

A complete workflow for cleaning, structuring, and analyzing sociolinguistic survey data, with a focus on accent perception and solidarity ratings.


🔹 Discourse Marker Annotation

Tools: Excel → R preprocessing, ggplot2

1500+ manually annotated discourse-marker tokens across age groups. The analysis examines variation, distribution, and functional categories.


🔍 Interests

  • Quantitative corpus linguistics
  • NLP pipelines for linguistic research
  • Stylometry & author profiling
  • Feature engineering (POS, dependency, lexical)
  • Visualization-heavy analysis (ggplot2)
  • R Markdown & reproducible workflows

📫 Contact

📍 Freiburg, Germany

Pinned

  1. discourse-markers (Public)

    Visualization project analyzing how discourse markers differ across age groups using R (tidyverse/ggplot2). Created as part of my research on language and aging.

    R

  2. hemingway-vs-lovecraft-stylometry (Public)

    Computational stylometry using R, spaCy, POS-based MDA, and PCA

    R

  3. stella-survey-analysis (Public)

    Stella Survey Analysis is a data-cleaning and statistical-analysis project examining how English accents are perceived by non-native speakers.