Skip to content

Instantly share code, notes, and snippets.

@roshan-adusumilli
Created May 23, 2020 02:11
Show Gist options
  • Select an option

  • Save roshan-adusumilli/d216e8c1c7f0d177681585e7b55ac1fc to your computer and use it in GitHub Desktop.

Select an option

Save roshan-adusumilli/d216e8c1c7f0d177681585e7b55ac1fc to your computer and use it in GitHub Desktop.
import nltk
from nltk import TweetTokenizer
tweet_tokenizer = TweetTokenizer()
def tokenize_tweets(tweet):
tweet = tweet_tokenizer.tokenize(tweet)
return tweet
ca_df['tweet_text'] = ca_df['tweet_text'].apply(tokenize_tweets)
ny_df['tweet_text'] = ny_df['tweet_text'].apply(tokenize_tweets)
tx_df['tweet_text'] = tx_df['tweet_text'].apply(tokenize_tweets)
ca_df.head()
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment