O'Reilly Network: How the Wayback Machine Works | Linux Today

O’Reilly Network: How the Wayback Machine Works

Written By
Web Webster
Web Webster
Jan 24, 2002

[ Thanks to Jason
Greenwood
for this link. ]

“The Internet Archive made headlines back in November
with the release of the Wayback Machine, a Web interface to the
Archive’s five-year, 100-terabyte collection of Web pages. The
archive is the result of the efforts of its director, Brewster
Kahle, to capture the ephemeral pages of the Web and store them in
a publicly accessible library. In addition to the other millions of
web pages you can find in the Wayback Machine, it has direct
pointers to some of the pioneer sites from the early days of the
Web, including the NCSA What’s New page, The Trojan Room Coffee
Pot, and Feed magazine.

How big is 100 terabytes? Kahle, who serves as archive director
and president of Alexa Internet, a wholly-owned subsidiary of
Amazon.com, says it’s about five times as large as the Library of
Congress, with its 20 million books.

“What we have on the Web is phenomenal,” Kahle says. “There are
more than 10 million people’s voices evidenced on the Web. It’s the
people’s medium, the opportunity for people to publish about
anything — the great, the noble, the absolute picayune, and the
profane.””

Complete
Story

Web Webster

Web Webster

Web Webster has more than 20 years of writing and editorial experience in the tech sector. He’s written and edited news, demand generation, user-focused, and thought leadership content for business software solutions, consumer tech, and Linux Today, he edits and writes for a portfolio of tech industry news and analysis websites including webopedia.com, and DatabaseJournal.com.

Linux Today Logo

LinuxToday is a trusted, contributor-driven news resource supporting all types of Linux users. Our thriving international community engages with us through social media and frequent content contributions aimed at solving problems ranging from personal computing to enterprise-level IT operations. LinuxToday serves as a home for a community that struggles to find comparable information elsewhere on the web.

Property of TechnologyAdvice. © 2026 TechnologyAdvice. All Rights Reserved

Advertiser Disclosure: Some of the products that appear on this site are from companies from which TechnologyAdvice receives compensation. This compensation may impact how and where products appear on this site including, for example, the order in which they appear. TechnologyAdvice does not include all companies or all types of products available in the marketplace.