Three and a half Roses

August 23, 2012

Predicting elections with twitter: setting the stage

Introduction

I’ll now set the stage so that I can start working on the project. As said some times before already, I’ll keep things simple.

So, I apologize up-front if I cut some corners and don’t do everything the scientific way.

A vote

A positive tweet is considered a ‘vote’. A negative is considered a lost vote.

As I said before, this is not perfect, since we don’t tackle the skewed demographics issue (yet). But it is a start to do some BigData. And that, after all, is what we’re trying to do here.

Filtering

For this experiment I won’t be filtering bots/spammers manually, since this is not an easy task. See http://www.cs.ucsb.edu/~gianluca/spamdetector.html (now defunct it seems) and http://networkechoes.blogspot.be/2012/07/fake-followers-on-twitter-my-two-cents.html for more insight on the subject.

Test population

We’ll randomly process tweets that originate from belgian twitter users. This handles the part of the demographics, you know they’re skewed 😉

Content Analysis

We’ll do some basic -probably naive- content analysis on the tweets and then we’ll pass them through an content analysis engine and

check if they’re ‘political’ or not.
analyze what @party is mentioned
if any @party is recognized we’ll keep the tweet
We then also need to do sentiment analysis on the tweet

Calculations

Then we’ll make some calculations on the tweets and the annotations we’ve got. And see if we can get anything useful from that.

Next steps

OK, next step: let’s set up the services to do the analysis.

Big Data, Series

Posted by:

Patrice

As an experienced enterprise architect with a deep-rooted passion for cloud, AI, and architectural design, I’ve guided numerous companies through the management of their existing application landscapes and facilitated their transition to a future state. If you wish to contact me drop me a note at patrice at threeandahalfroses dot com. Or, via skype (patrizz) or twitter (patrizz).

Predicting elections with twitter: setting the stage

Introduction

A vote

Filtering

Test population

Content Analysis

Calculations

Next steps

Leave a ReplyCancel reply

About Me

Recent Posts

Predicting elections with twitter: setting the stage

Introduction

A vote

Filtering

Test population

Content Analysis

Calculations

Next steps

Share this:

Leave a ReplyCancel reply

About Me

Recent Posts

Discover more from Three and a half Roses