Jan
11
2011

How Do You Visualize 100 GB of Google Text Data?

An anonymous reader writes “There is an amazing series of charts that visualizes trigrams and bigrams, portions of sentences that have been extracted from Google’s web data set. The graphs highlight word associations and the frequency with which we use them on web pages. Chris Harrison from Carnegie Mellon University found, for example, that the word ‘he’ is often tied to ‘argues,’ while ‘she’ is found often with ‘loves.’ There are also word-relation charts that highlight words used in combination with their opposites, such as good and bad, peace and war, and PC and Mac.” There are a lot of these things, and they’re really interesting to browse through.

Read more of this story at Slashdot.


See the original post here:
How Do You Visualize 100 GB of Google Text Data?

Written by Staff in: Slashdot | Tags: ,

No Comments »

RSS feed for comments on this post. TrackBack URL


Leave a Reply

You must be logged in to post a comment.

adsense

Cool-O-Rama: News for Geeks