Linux Today: Linux News On Internet Time.
Search Linux Today
Linux News Sections:  Blog -  Developer -  High Performance -  Infrastructure -  IT Management -  Security -  Storage -
Linux Today Navigation
LT Home
Preferences
Contribute
Link to Us
Search
Linux Jobs

Linux Today
Enterprise Linux Today
Apache Today
JustLinux.com
Linux Planet
PHPBuilder
All Linux Devices
Technology Jobs

JustTechJobs.com

LinuxToday Newsletters
Server Daily
IT Management Daily
Subscribe News
Subscribe PR
Subscribe Security

internet.com
Internet News
Small Business

Advertise
Newsletters
Tech Jobs
E-mail Offers

 






Current Newswire:

Tech Comics: "Groundhog Day"

Want a Job? Learn Linux

PC-BSD 9 review – to FreeBSD what Ubuntu is to Debian

Time to dispel open source myths, says Liam Maxwell

SECURITY: Nmap Inside and Out

Eight features Windows 8 'borrowed' from Linux

Malware devs embrace open-source

A tale of two distros: Ubuntu and Linux Mint

Raspberry Pi benchmarked against Beagleboard, low price is long term

20 popular Ubuntu Linux apps you may want to try



Applications Management Engineer Sr (NYC)
Next Step Systems
US-NY-New York

Justtechjobs.com Post A Job | Post A Resume
:Harvard's Berkman Center Seeds the MediaCloud
Harvard's Berkman Center Seeds the MediaCloud
Mar 13, 2009, 10 :33 UTC (0 Talkback[s]) (3634 reads)

(Other stories by Jennifer Zaino)

[ Thanks to Tom Dunlap for this link. ]

"From there, the story text goes into a full text search engines to retrieve specific terms or phrases, gets dumped into a database, and becomes source material for the three simple tools currently on the site to let people start playing with the service. Being able to throw text against Calais and get pretty high quantity entities and terms out of it, Zuckerman says, was a "big step forward."

"The open source and open data project runs off the Amazon cloud. The Berkman Center tried it on its own server first, but with terabyte file systems and hundreds of gigabytes of relational databases, it couldn't keep up. "It's pretty exciting that by signing up with Amazon we were able to scale massively and very quickly," Zuckerman says. The service hopes ultimately to scale to 15,000 RSS sources.

"What's currently live -- showing the top ten most mentioned terms for up to three media sources at a time, or the top ten most mentioned term for each media source that occurs in stories along with a term you specify, or a world map of each media source that indicates which countries get more coverage--is meant as just of a taste of what you can do with the data."

Complete Story

Related Stories:
Cloud computing versus Grid computing(Mar 04, 2009)
Ubuntu Makes Cloud Strategy a Big Joke(Feb 26, 2009)
Cory Doctorow--Linux Guru?(Feb 22, 2009)
Howto Create a custom Debian Amazon Machine Image ( AMI )(Dec 22, 2008)
Amazon to Sell OLPC's XO Laptop Starting Nov. 17(Nov 12, 2008)
DIY YouTube Uses Open Source Project Panda and Amazon EC2(Sep 23, 2008)
OLPC's Amazon Notebook Linux Only(Sep 08, 2008)
Cloud Computing With Amazon Web Services, Part 1: Introduction(Jul 31, 2008)



No talkbacks posted.
  Home | Search Talkbacks | Customize View    Top of Page  



Enter your comments below:

* Your Name:

* Your Email Address:

* Subject:

CC: [will also send this talkback to an E-Mail address]

* Comments:

Tags allowed:<I>,<B> and <U>. See our talkback-policy for more about talkback content.

Fields marked with * are required!

..............................




All times are recorded in UTC.
Linux is a trademark of Linus Torvalds.
Powered by Linux, Apache and PHP