Internet Archive | Logo with old style building with tall column supporting a wide roof

Internet Archive

Part of a series on The Internet. [View Related Entries]
[View Related Sub-entries]

Updated Jul 30, 2020 at 05:33AM EDT by Y F.

Added Oct 29, 2012 at 07:36PM EDT by Brad.

PROTIP: Press 'i' to view the image gallery, 'v' to view the video gallery, or 'r' to view a random entry.


The Internet Archive is a digital library and archival site dedicated to permanent documentation of and free public access to a wide variety of digital artifacts, ranging rom websites and music to videos and nearly three million books registered under public domain.


The Internet Archive[1] was founded in 1996 by American computer engineer and Internet activist Brewster Kahle[2], who also co-founded the web crawling service Alexa Internet around the same time. It initially began as Kahle's personal project to archive the World Wide Web, Netnews Bulletin Board System and other publicly available software and webpages, but by late 1999, its scope had been expanded to include other worthy archive collections like the Prelinger Archives and the NASA Images Archive. Kahle's collection was largely kept private on digital tape throughout the 1990s, only allowing researchers and scientists to access the database on special occasions. Despite its lack of public access, the Internet Archives was met by press coverage from several U.S. news publications upon its launch, including the New York Times, Washington Post, Wired Magazine and National Public Radio (NPR).

ARCHI VE Building a digital library for the future IAINTERNET A R CHIVE a Our Mission Internet Archive is collecting and storingabout our philosophy an<d public materials from the objectives, please read Internet such as the World Wide Web, Netnews, and downloadable software For more information by the tVI Archive's founder, cknowled ster Kahle Board Members Finding Us which have been In the NewS donated by Alexa Webmaster Internet. Visit: The '96 US sidential Election Web Archive. This archive was created in affiliation with the Smithsonian Institution The Archive will provide historians, researchers, scholars and others access to this vast collection of data (reaching ten terabytes), and ensure the longevity of this information ใ‚ณ

The database eventually became available for public access on the fifth anniversary of the project in 2001 with the launch of The Wayback Machine[3], a digital time capsule that allows its users to browse multiple versions of web pages archived over time. According to Kahle[4], he was inspired to create the Machine after visting the offices of now-defunct search engine service Alta Vista and witnessing the company's ambitious plan to store and index everything that was on the web. Throughout the 2000s, the Internet Archive continued to expand its collection by merging pre-existing databases as well as building new ones.


SOPA / PIPA Protest

On January 18th, 2012, the Internet Archives blacked out its website for twelve hours in protest of the controversial U.S. Internet bills Stop Online Piracy Act and the PROTECT IP Act, similar to many other resource websites and databases like Wikipedia.

10 Petabytes Milestone

In mid-October 2012, the project reached an important milestone[5] of 10 petabytes (10,000,000,000,000,000 bytes) worth of digital materials in preservation, including millions of digital books, music, movies and webpages.

A R CHI VE 12 Hours Dark: the Internet Archive vs. Censorship The Internet Archive believes that it is critical to protest and raise awareness of pending legislation in the United States: House Bill 3261, The Stop Online Piracy Act (SOPA) and 3.968, the PROTECT IP Act (PIPA) We are going dark from 8am to 6pm PST on Wedneeday, January 18 (14:00 02:00 UTC) to drive a messago to Washington. We need your help to do this Legislation such as this directy affects libraries (pa such as the Intemet Archive, which collects, preserves, and of ers access to cultural materisis Furthermore, these laws can negatively affect the ecosystem of web publishing that led to the emergence of the Internet Archive For non-US Residents: Sorry for dragging you into his, and if you are wiling please sign a petition to the State Department to express your oonoem ใƒˆARCHIVE The Lucky Byte ่€Œ10,000,008000000,000็ˆช Bytes Archived S INTERNET ARCHIVE 25 OcTORER 2012

San Francisco Office Fire

On November 5th, 2013, Internet Archive's office in San Francisco, California, caught fire, destroying approximately $600,000 worth of digital scanning equipment and damaging an adjacent apartment complex (shown below). According to the official blog post[19], no injuries were reported from the scene and the scope of damage was mostly limited to equipments, albeit costly, with some losses of "physical materials" that were being digitized in the scanning room.

In the blog post, the Internet Archive also announced an emergency fund drive[20] to rebuild its scanning capabilities and called on digitization services to help the group continue its archiving process throughout the recovery.


The Internet Archives is mainly comprised of its free online services Wayback Machine and Archive-It, in addition to a number of specialized media collections that have been acquired over time, most notably the Prelinger Archives, NASA Image Archives, Open Library and Live Music Archive. [this section is currently being researched]

The Wayback Machine

The Wayback Machine is the Internet Archive's "three-dimensional index" service that allows its user to search, browse and access snapshots of the World Wide Web archived through its database over time. Since its launch in 2001, millions of websites and their associated data and media have been archived by the service, which can be used to learn more about what previous versions of certain websites used to look like, to grab source code that have disappeared from websites or to visit websites that no longer exist on the web. Often considered a crucial academic research tool in studying the history of the Internet, its popularity has also led to the synonymous usage of the terms "Wayback Machine" and "Internet Archive" in some online communities.

The Wayback Machine http: /1 Take Me Back Please email any issues to


Archive-It is a web archiving service that enables individuals and organizations to harvest, catalog and preserve specialized collectons of digital media content in the archive format. All of Archive-It partners' collectons are also made publicly available with full-text search and some of them may be periodically indexed into the Internet Archive's general archive. As of mid-2011, the service had reached more than 180 partner institutions in 44 U.S. states and 14 countries with over 2.7 billion URLs and 1,534 public collections.

Explore All Archives Items in the archive are listed below. Narrow your results at left, or enter a search query below to find a collecting organization, collection, site, specific URL or to search the text of archived webpages. Enter search terms here SearchClear

Open Library

The Open Library is a free, open-source software project which can be used to create a web-based database for every book ever published and archived. It holds at least 23 million catalog records of books and approximately 1.6 million fully-readable and downloadable books in the public domain.

Open Library is yours to borrow, read connect. Books to Read The World's classic literature at your fingertips. Over 1,000,000 free ebook titles available. Editorial Andres Bello Greek CODE CIVIL CONINGSBY: Carlo Colod LAS AVENTURAS DE PINOCHO OIL THE BIRDS E INFECTIOUS ARORTION y Doicy Fit ่‡ช็›ฎ ๆœ— ๆœ— ๆœ— ๆœ— ๆœ— Books to Borrow Here's a sample of recently returned books from the eBook lending library. he Fire Next Time lames No ROB THE YEARS OF GRACE SCIENCE OF LIFE CTON BRIDES

Software Museum

On April 13th, 2013, TextFiles founder and archivist Jason Scott announced the launch of the Internet Archive's Software Museum[9] on his blog.[8] According to Scott, the world's largest software repository will serve as the host of smaller collections including shareware CDs[10], emulators for a number of old-school gaming consoles and computers[11], classic PC games[12] a mirror of the now-defunct gaming site FilePlanet[13], and the Tucows Software Library[14], which boasts more than 33,000 files in its collection. Scott also noted that the Museum's work is not completed, as it still lacks the necessary metadata needed to easily browse through the archive. The launch of the Software Museum was subsequently picked up by a number of tech news blogs such as Engadget[15], Tuaw[16] and VentureBeat.[17]

Welcome to Software Archive 45,369 Items The Software Archive is designed to preserve and provide access to all kinds of rare or difficult to find, legally downloadable software titles and background information on those titles. The collection includes a broad range of software related materials including shareware, freeware, video news releases about software titles, speed runs of actual software game play, previews and promos for software games, high-score and skill replays of various game genres, and the art of filmmaking with real-time computer game engines. Software has been collected from over 1500 Shareware CD-ROMs, providing millions of shareware programs for download. The DEMU collection has gathered over 4000 classic PC-based games from a quarter century. on eee or 10 0 64 C rovn Check out the Internet Archive blog post about the USGS collection. If you would like to contribute to this Archive, please contact us at RSS

The Console Living Room

On December 26th, the Internet Archive launched The Console Living Room[21] archive for console games from several different consoles from the 1970s and 1980s, including the Atari 2600, ColecoVision, Magnavox Odyssey and Astrocade systems. In a blog post announcement,[22] archivist Jason Scott revealed that additional classic games would be added to the library in coming months.

Welcome to The Console Living Room 983 items The Internet Archive Console Living Room harkens back to the revolution of the change in the hearth of the home, when the fireplace and later television were transformed by gaming consoles into a center of videogame entertainment. Connected via strange adapters and relying on the television's speaker to put out beeps and boops, these games were resplendent with simple graphics and simpler rules The home console market is credited with slowly shifting attention from the arcade craze of the early 1980s and causing arcades to shrink in popularity, leaving a small percentage of what once were many Through use of the JSMESS emulator system, which allows direct access to these programs in your browser with no additional plugins or settings, these games can be enjoyed again. Simply click on the screenshot or "Emulate This" button for each individual cartridge, and on modern browsers the games will just start to run. As nostalgia, a teaching tool, or just plain fun, you'll find hundreds of the games that started a billion-dollar industry. These games are best enjoyed in an up to date version of a modern browser. Currently, there is no sound in the games, although that feature will be added soon. Please read carefully regarding key mappings of the games and programs, to use them in your browser Simply click on a system below to browse through available games and cartridges and try them out. Where possible, links to manuals and additional information are available for reference. The Atari 2600 is a video game console released in September 1977 by Atari, Inc. It is credited with popularizing the use of microprocessor-based hardware and ROM cartridges containing game code, a format first used with the Fairchild Channel F, instead of having non- microprocessor dedicated hardware with all games built in

On the following day, the BBC[23] published an article about the new console game library, which was subsequently posted to the /r/technology[24] subreddit. In the first six hours, the post gained over 6,000 up votes and 350 comments. In the coming days, several other news sites reported on the collection, including Mashable,[25] Engadget,[26] BoingBoing[27] and ArsTechnica.[28]


According to its FAQ page and Alexa, the Internet Archives receives approximately 2.5 million daily unique visits and currently stands at the 278th place in U.S. rank and 222nd place in Global Rank.

Unique IPs per day - all 3.0 M 2.0 M 1.0 M 0.0 Oct No DecJan Feb Mar Apr May Jun Jul Aug Sep Oct 2.89 M 2.34 M 3.34 M

External References

[1] Internet Archive โ€“ Digital Library of Free Books, Music, Movies and Wayback Machine

[2] Wikipedia โ€“ Internet Archive

[3] Internet Archive โ€“ Wayback Machine

[4] Wikipedia โ€“ Wayback Machine

[5] Internet Archive Blogs โ€“ 10,000,000,000,000,000 bytes archived!

[6] Internet Archive โ€“ Open Library

[7] Internet Archive โ€“ Archive It

[8] ASCII by Jason Scott โ€“ Change Computer History Forever: Well, Here We Are

[9] Internet Archive โ€“ Software Archive

[10] Internet Archive โ€“ The Shareware CD Archive

[11] Internet Archive โ€“ TOSEC: The Old School Emulation Center

[12] Internet Archive โ€“ Classic PC games

[13] Internet Archive via Wayback Machine โ€“ The Fireplanet: A Fileplanet Mirror

[14] Internet Archive โ€“ Tucows Software Library

[15] Engadget โ€“ Internet Archive expands software museum, invites you to dig in

[16] Tuaw via Wayback Machine โ€“ Internet Archive expands software collection, still needs more metadata

[17] VentureBeat โ€“ Internet Archive beefs up its free software museum

[18] Slate โ€“ Phew: Internet Not Lost in Fire at Internet Archive

[19] Internet Archive โ€“ Scanning Center Fire -- Please Help Rebuild

[20] Internet Archive โ€“ Donate to The Internet Archive

[21] Internet Archive โ€“ The Console Living Room

[22] Internet Archive โ€“ A Second Christmas Morning

[23] BBC โ€“ Internet Archive puts classic 70s and 80s games online

[24] Reddit โ€“ Internet Archive releases 70s and 80s games

[25] Mashable โ€“ Internet Archive Now Lets You Play Vintage

[26] Engadget โ€“ Internet Archive starts preserving classic game

[27] BoingBoing โ€“ Console Living Room

[28] Ars Technica โ€“ Internet Archive releases hundreds of classic game console ROMS

Recent Videos 1 total

Recent Images 19 total

Top Comments

+ Add a Comment

Comments (29)

Display Comments

Add a Comment

Hauu! You must login or signup first!