The full code is available on Github
Below are the functions in GetOldTweets3
This is the code for webscraping the coronavirus tweets
>The same process was done for us_east and then we combined the two csv files
Final CSVThis is what our final csv looks like. It has two columns, the Date and the Tweets
In conducting our sentiment analysis we used the sentiment analysis API provided byTextBlob The polarity score is a float within the range [-1.0, 1.0]. The subjectivity is a float within the range [0.0, 1.0] where 0.0 is very objective and 1.0 is very subjective.
In our data engineering process we calculated the subjectivity and polarity of our data. A high subjectivity rate means that the tweet is highly opinionated. Negative polarities point to tweets showing negative emotion while positive tweets refer to tweets showing positive emotion. After calculating these values, we added two new columns, subjectivity and polarity to our dataset.
This is how our final csv looks like. You can find this csv file HERE
Overall we were able to see increases of tweets when significant events occured