Skip to content

Conversation

@makinada
Copy link

I used this code for multi-class classification of short Japanese text (which worked quite well, btw. thanks!) I made the following enhancements.

  • Support of multi-class classification in terms of data loading (MultiClassDataLoader)
  • Support of predefined training/cross validation (dev) data sets (MultiClassDataLoader)
  • Support of character-based embedding (CharDataProcessor)

While making these changes, I kept the original implementation as BinaryClassDataLoader and WordDataProcessor. The code is backward compatible by using the combo.

You can mix and swap the classes ((Binary|Multi)ClassDataLoader and (Word|Char)DataProcessor) in train.py and eval.py Any combo works.

I also added a new set of data files for multi-class classification as an example. They're just straight format conversion from the original RT data.

Mak Inada added 2 commits November 28, 2016 15:17
  - multi-classification
  - predefined training/cv data set
  - character-based embedding
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant