Wayback Machine
success
fail
Jun OCT Nov
Previous capture 16 Next capture
2016 2017 2018
5 captures
04 Apr 2016 - 16 Oct 2017
About this capture
COLLECTED BY
Organization: Internet Archive
The Internet Archive discovers and captures web pages through many different web crawls. At any given time several distinct crawls are running, some for months, and some every day or longer. View the web archive through the Wayback Machine.
Collection: Survey Crawl Number 6: Sep 11th, 2017 - running now

The seeds for this crawl came from:

251 million Domains that had at least one link from a different domain in the Wayback Machine, across all time

~ 300 million Domains that we had in the Wayback, across all time

55,945,067 Domains from https://archive.org/details/wide00016

This crawl was run with a Heritrix setting of "maxHops=0" (URLs including their embeds)

The WARC files associated with this crawl are not currently available to the general public.

TIMESTAMPS
loading