Get a list of
5,671,924 websites using Common Crawl Bot Disallow
which includes location information, hosting data and contact details. The list includes 5,190,420 live websites and 481,504 websites redirecting to those sites. 3,371,356 of these sites are in the United States.
We also know of 1,430,502 sites that have used Common Crawl Bot Disallow previously.