Finding the Needle in the Twitter Haystack

With millions of Twitter messages floating through the Twittersphere each day, you can use the search tool at  http://search.twitter.com to find references to ANY word that gets uttered there.

So a search for the word “dog” will find every tweet that contains the word dog, and so on.  You can even search for your own twittername and see any time your name is referenced online.  Many companies now use this search feature to find out whenever anyone mentions their products or services on Twitter.

The search tool for Twitter is really quite powerful, and can also be used to generate RSS feeds that can then be embedded into other pages and services.  There is some awesome potential there.

However, Twitter’s ability to search for words being mentioned out there becomes less useful when you search for a really common word, since the search results will invariably turn up lots of stuff you probably don’t want.

When you’re attending a conference for example, you could find every mention that people make about the event by searching for the conference name.  However, it wouldn’t be all that helpful just to do a search on the term “conference” since it would catch all the other possible mentions of the word “conference” from a bunch of other conferences you don’t want. Using the full name of the conference would probably work, but because Twitter limits you to only 140 characters, it would be silly to devote so many of them to including the conference name… there would be little room left for the actual message!

To get around this problem, Twitter users came up with the idea of using a hashtag.. by adding a # in front of a search term. it’s a way to trick Twitter Search into avoiding any results that might contain the keyword but don’t have the hash in front of them.

For conferences, there will generally be a designated hashtag containing a # symbol and an abbreviation for the event. People attending and Twittering from the event can include this short code at the end of each tweet, and then a search (and also an RSS feed) can be created to grab a feed of all the tweets that contain the hashtag, regardless of who they come from. This let’s people follow the conference Tweets in a single stream.

What if the conference has an unusual name already?  A search for a conference abbreviated to “educonf” would probably find most of the references to it fairly easily, since educonf is a kind of “made up” word already.  In this case, a search for the generic term “educonf” or the properly hashtagged “#educonf” would probably turn up pretty much the exact same results.

The real need for the hashtag arises when you have search terms based on regular English words that are ambiguous to the search.  The added # to the front of them makes them unique and helps them stand out from the generic non-hashed word and stops the generic words from getting caught up in the hashtagged feed.  It also carries the added bonus that many 3rd party Twitter clients such as Tweetdeck, Tweetie or Nambu can identify the hashtags and use them to create saved searches, making it much easier to follow the stream based on that tag.

Interestingly, the search feature was never a part of Twitter’s original functionality.  Twitter search was done with a third-party tool created by a company called Summize, but the huge potential (and possibilities for future monetization of Twitter) became immediately obvious and Summize was acquired by Twitter for about $15M almost a year ago.  Now the built-in search functionality is a key part of the Twitter experience, and hashtags play an important role in making that experience even more powerful.

CC Image: ‘Haystack Owl
www.flickr.com/photos/14829735@N00/360683898

The Twouble with Twitter

Sorry Twitter… I really like you and all, but this little video has quite a bit of truth to it. Funny too!

Did I mention that someone I know sends out tweets, on average, including sleep time, every 8 – 10 minutes? Needless to say, I don’t actually follow them.

Gone Phishing

There is a Twitter phishing scam going around at the moment that I’m unhappy to say I fell right into.  This quick post is just a warning to anyone who reads it to hopefully help them not do the same thing.

I’d been off the grid for a few days so I hadn’t heard the news about this scam, but it’s been floating through the Interwebs for 2 days apparently.  It did strike me as odd when i clicked on it that I had to relog in to Twitter, but I’d been playing with different browsers so the fact that it was asking me for a password didn’t seem all that unusual.  Of course, I should have taken more notice of the URL that was in the address bar, but I was too late.

I got a direct tweet from John Pearce that said…
“fixed it.. hehe here is that blog i wanted to show you http://twitterblogs.access-logins.com/login” 

I know John. I trust John. So I clicked it.  It asked for a password, which as I mentioned, was not unusual considering I was trying a new browser (so the password wouldn’t have been already saved in it)  I realised what I’d done almost immediately but by that stage it was already too late.  Bugger!

Since then I’ve had a steady stream of people informing me that my Twitter has been compromised and I’ve now updated the password.  Thanks for the heads up folks.  It’s all fixed now.  And judging by the talk on twitter, I’m not the only one to fall for this scam.

What I found interesting is how easily we can be tricked when there is an element of trust involved.  I’m normally pretty vigilant about suspicious files and links, but I didn’t really question the offending tweet, since I trust John Pearce.  It goes to show the sort of damage that can be inflicted when the troublemakers are able to bring phishing down to a really personal level.