---
title: "US presidential election via Twitter using Apache NiFi, Spark, Hive and Zeppelin"
date: 2016-04-29
categories:
  - apache-nifi
tags:
  - apache-hive
  - apache-spark
  - apache-zeppelin
  - data
  - etl
  - flow
  - hdp
  - hortonworks
  - processing
  - streaming
  - twitter
coverImage: sentimentpresfeature.png
---

This article describes a frequency and sentiment analysis of real-time tweet streams about the four main candidates in the US presidential election.

The main objective was to deploy and test the available connector between Apache NiFi and Apache Spark, so I implemented a use case that also brings in Apache Hive and Apache Zeppelin.

In the end, I get real-time analytics such as:

  • frequency of tweets over time per candidate
  • percentage of negative, positive and neutral tweets per candidate
  • opinion trends over time for each candidate
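
To make these three metrics concrete, here is a minimal plain-Python sketch of how they could be computed from tweets that already carry a sentiment label. It is only an illustration, not the Spark and Hive code used in the full article: the sample data, the placeholder candidate labels `"A"` and `"B"`, the 5-minute window, and the function names are all assumptions made for the example, and "opinion trend" is approximated here as positive minus negative counts per window.

```python
from collections import Counter, defaultdict
from datetime import datetime

# Hypothetical sample: one (timestamp, candidate, sentiment) tuple per tweet.
tweets = [
    (datetime(2016, 4, 29, 10, 1), "A", "positive"),
    (datetime(2016, 4, 29, 10, 2), "A", "negative"),
    (datetime(2016, 4, 29, 10, 2), "B", "neutral"),
    (datetime(2016, 4, 29, 10, 7), "A", "positive"),
    (datetime(2016, 4, 29, 10, 8), "B", "negative"),
]


def bucket(ts, minutes=5):
    """Round a timestamp down to the start of its time window."""
    return ts.replace(minute=ts.minute - ts.minute % minutes,
                      second=0, microsecond=0)


def frequency_over_time(rows, minutes=5):
    """Count tweets per (time window, candidate)."""
    return Counter((bucket(ts, minutes), cand) for ts, cand, _ in rows)


def sentiment_percentages(rows):
    """Share of each sentiment label per candidate, in percent."""
    counts = defaultdict(Counter)
    for _, cand, label in rows:
        counts[cand][label] += 1
    return {
        cand: {label: 100.0 * n / sum(c.values()) for label, n in c.items()}
        for cand, c in counts.items()
    }


def sentiment_trend(rows, minutes=5):
    """Net opinion (positive minus negative) per (time window, candidate)."""
    score = {"positive": 1, "neutral": 0, "negative": -1}
    trend = defaultdict(int)
    for ts, cand, label in rows:
        trend[(bucket(ts, minutes), cand)] += score[label]
    return dict(trend)


if __name__ == "__main__":
    print(frequency_over_time(tweets))
    print(sentiment_percentages(tweets))
    print(sentiment_trend(tweets))
```

In a streaming setup the same grouping keys (time window and candidate) would typically drive windowed aggregations instead of in-memory dictionaries.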

The full article is available on the Hortonworks Community Connection website. As always, please feel free to comment and/or ask questions.