Michelle Gelman

Home

❯

CRAFT NLP Model Evaluating Utterance Exposure and Imbalance Strategies on fine tuning performance for derailment prediction

CRAFT NLP Model - Evaluating Utterance Exposure and Imbalance Strategies on fine-tuning performance for derailment prediction

Nov 07, 20251 min read

  • PyTorch
  • NLP
  • explainability

CRAFT NLP Model: Evaluating Utterance Exposure and Imbalance Strategies on fine-tuning performance for derailment prediction

  • This project analyzes the model sensitivity to utterance variations and imbalance handling on 9 model variants. We determine whether CRAFT is a good predictor of conversation-level derailment on KODIS using Conversations Gone Awry Dataset (CGA-WIKI)as a baseline comparison. We compares variants using standard classification metrics (F1, AUC, Calibration Curves) at each model’s Youden-optimized threshold on the same Ground test set (CGA-WIKI), with additional diagnostics (horizons, distributions, frustration correlation, token analysis) to interpret differences.
  • Including the submit agreement utterance makes KODIS fine-tuned models more prone to forecasting similar derailment scores for all conversations leading to missed conversation dynamics

Presentation | Github

PyTorch NLP explainability


Graph View

Backlinks

  • About Me

Created with Quartz v4.5.2 © 2025

  • GitHub
  • Discord Community