sklearn bag of words classifier