CRAFT NLP Model - Evaluating Utterance Exposure and Imbalance Strategies on fine-tuning performance for derailment prediction

CRAFT NLP Model: Evaluating Utterance Exposure and Imbalance Strategies on fine-tuning performance for derailment prediction

This project analyzes the model sensitivity to utterance variations and imbalance handling on 9 model variants. We determine whether CRAFT is a good predictor of conversation-level derailment on KODIS using Conversations Gone Awry Dataset (CGA-WIKI)as a baseline comparison. We compares variants using standard classification metrics (F1, AUC, Calibration Curves) at each model’s Youden-optimized threshold on the same Ground test set (CGA-WIKI), with additional diagnostics (horizons, distributions, frustration correlation, token analysis) to interpret differences.

Including the submit agreement utterance makes KODIS fine-tuned models more prone to forecasting similar derailment scores for all conversations leading to missed conversation dynamics

Presentation | Github

PyTorch NLP explainability

Michelle Gelman

Explorer

CRAFT NLP Model - Evaluating Utterance Exposure and Imbalance Strategies on fine-tuning performance for derailment prediction

Presentation | Github

Graph View

Backlinks