I am trying to use scikit-learn to train a LogisticRegression classifier:
My training data has 2 columns: “text”, “category”.
I have a second dataset with 2 columns also: “text”, “category”.
This is what I want to accomplish:
While training the classifier, each time a “text” record from my training data is identical to a record in the second dataset, I want the classifier to assign the second dataset’s “category” to it.