Master Thesis "An Empirical Study of Online Sentiment Analysis on Twitter Streams " offered by DIMA, TU Berlin
- Author: Huilin Wu
- Advisor: Dr. Shuhao Zhang
- Most existing studies regarding Sentiment Analysis are based on offline batch-based learning mechanisms. Meanwhile, many stream processing systems have been proposed, but they are not specifically designed for online learning tasks, such as online Sentiment Analysis. As a result, it still remains an open and challenging question of how to efficiently perform Sentiment Analysis for real-time streaming data, e.g., ongoing Twitter Streams.
- The goal of this thesis is to empirically evaluate various online algorithms for Sentiment Analysis on Twitter Streams by implementing them on DSPS (Data Stream Processing System) for practical application.
- Flink v1.12
- Scala v2.11
- Python 3.7
- Java 8
- Kafka 2.13
- Redis server v4.0.9
1.6 million labeled Tweets: Sentiment140
- Tweets_clean_demo: Demo for batch-based Tweets preprocessing for Sentiment Analysis on FLink
- Streaming_demo_Sentiment_Analysis: Demo for stream-based Tweets preprocessing & online Sentiment Analysis model on Flink
- Algorithms of incremental Sentiment Analysis
- Python file of various developing note