pytextprep.remove_punct
Module Contents
Functions
|
Remove all punctuation and special characters from the |
- pytextprep.remove_punct.remove_punct(tweets, skip=None)[source]
Remove all punctuation and special characters from the input tweets data
- Parameters
tweets (array_like) – List of tweets
skip (array_like or None, optional) – The set of characters that do not have to be removed. Default is None. If None, all characters except alphabets, numbers and space would be removed.
- Returns
list of tweets without special characters
- Return type
list
Examples
>>> tweets_list = [ "Make America Great Again! @DonaldTrump", "It's rocket-science tier investment~~ #LoveElonMusk" ]
>>> remove_punct(tweets_list) [ "Make America Great Again DonaldTrump", "Its rocketscience tier investment LoveElonMusk" ]
>>> remove_punct(tweets_list, skip=["'", "@", "#", '-']) [ "Make America Great Again @DonaldTrump", "It's rocket-science tier investment #LoveElonMusk" ]