-
Notifications
You must be signed in to change notification settings - Fork 3
Labels
enhancementNew feature or requestNew feature or request
Description
-
Create a set of data cleaning methods
- Set to lowercase
- Change
á é í ó ú->aeiouandñ->gn - Remove Emojis
- Remove mentions
- Remove hashtags
- Remove links
- Remove punctuation:
.-:,? - Remove extra spaces
- Remove spaces before and after string content.
- Stemming ?
-
Create the
Cleaning class. The idea is that each method above belongs to the cleaning class. This can be part of thec4vnlp cleaning library.
Metadata
Metadata
Assignees
Labels
enhancementNew feature or requestNew feature or request