NLP Model: 12%

Gained a 12% accuracy increase for my problem child category.

Data Cleaning, 800 lines of new training data and 3 additional training runs.

Next I’m going to try introducing custom stop words in an attempt to further the cleaning process.

Update:

Adding the cleaning process decreased the cumulative score of both models by 2 points. I’ll turn that off for now, but I think training a model from scratch, with the cleaner turned on, may actually be the better idea.

David J Boronow EMAIL