Skip to content

Twitter sentiment analysis - bottleneck #260

Answered by DidierRLopes
Felixkruemel asked this question in Q&A
Discussion options

You must be logged in to vote

Fetching tweets from Twitter seems to go pretty quickly, but that is 10 tweets per 1 hour and 240 per day. Which isn't much when you think about it. When I tested like 60 tweets per hour it took ages! We'll look into https://github.com/twintproject/twint in the future as there's no API key required and is much faster retrieving data.

I think the bottleneck may be on parsing each tweet, where we do:

  • Cleaning tweet data to remove links, emojis, weird characters
  • Predict the sentiment using this distilbert model
  • Appending the sentiment score to a list

There should be easier ways to do this processing, but my goal at the beginning was to have something working and then we could focus on opti…

Replies: 2 comments 1 reply

Comment options

You must be logged in to vote
0 replies
Answer selected by aia
Comment options

You must be logged in to vote
1 reply
@DidierRLopes
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants