Three and a half Roses

August 22, 2012

Predict elections with Twitter: preliminary thoughts

Introduction

In this post I’ll be setting the stage for the next steps of the project.

Data

OK, let’s have a look at the main things we can get from Twitter:

Tweets (content)
user information
1. her “opinion” (vote?)
2. who gets re-tweeted a lot (influence?)
Conversations

To harvest this, we’ll -at least- need to:

“Read” the tweets
Process them linguistically to determine their sentiment and mentions of @party or @partymember
Annotate and store the tweets for later processing

Main challenge

A big question is as [Daniel Gayo-Avello] states correctly (See Flaw number 3) is: what constitutes a vote?

For instance, does the fact that user @voter speaks positively of @party mean that she will vote for @party?

Or, another example: do, say, 10 positive tweets about @party equal one vote for @party?

Or, the other way around: does one negative tweet about @party equal one less vote for @party?

Other Challenges

What constitutes the opinion of a user? Can we even determine it properly?
How to filter the babble, the bots, the spinners? Is this even possible?
How do we tackle the skewed demographics issue?

I probable missed a huge number of other challenges we’ll encounter along the way. But let’s not worry too much about that now. The idea is to discover how to handle BigData.

Next steps

In the next blog post we’ll start by defining what constitutes a vote in our case. Don’t expect a perfect, not even a correct definition.

Again, I mainly want to look at what we need to collect and query big data.

Big Data, Series

Posted by:

Patrice

As an experienced enterprise architect with a deep-rooted passion for cloud, AI, and architectural design, I’ve guided numerous companies through the management of their existing application landscapes and facilitated their transition to a future state. If you wish to contact me drop me a note at patrice at threeandahalfroses dot com. Or, via skype (patrizz) or twitter (patrizz).

Predict elections with Twitter: preliminary thoughts

Introduction

Data

Main challenge

Other Challenges

Next steps

Leave a ReplyCancel reply

About Me

Recent Posts

Predict elections with Twitter: preliminary thoughts

Introduction

Data

Main challenge

Other Challenges

Next steps

Share this:

Leave a ReplyCancel reply

About Me

Recent Posts

Discover more from Three and a half Roses