Gained a 12% accuracy increase for my problem child category.
Data Cleaning, 800 lines of new training data and 3 additional training runs.
Next I’m going to try introducing custom stop words in an attempt to further the cleaning process.
Update:
Adding the cleaning process decreased the cumulative score of both models by 2 points. I’ll turn that off for now, but I think training a model from scratch, with the cleaner turned on, may actually be the better idea.