Code Snippet
Check if Website is Available
Performs a cURL-Request to check, if a website exists / is online
Technique #1
<?php
if (isDomainAvailible('http://www.css-tricks.com'))
{
echo "Up and running!";
}
else
{
echo "Woops, nothing found there.";
}
//returns true, if domain is availible, false if not
function isDomainAvailible($domain)
{
//check, if a valid url is provided
if(!filter_var($domain, FILTER_VALIDATE_URL))
{
return false;
}
//initialize curl
$curlInit = curl_init($domain);
curl_setopt($curlInit,CURLOPT_CONNECTTIMEOUT,10);
curl_setopt($curlInit,CURLOPT_HEADER,true);
curl_setopt($curlInit,CURLOPT_NOBODY,true);
curl_setopt($curlInit,CURLOPT_RETURNTRANSFER,true);
//get answer
$response = curl_exec($curlInit);
curl_close($curlInit);
if ($response) return true;
return false;
}
?>Technique #2
<?php
function Visit($url){
$agent = "Mozilla/4.0 (compatible; MSIE 5.01; Windows NT 5.0)";$ch=curl_init();
curl_setopt ($ch, CURLOPT_URL,$url );
curl_setopt($ch, CURLOPT_USERAGENT, $agent);
curl_setopt ($ch, CURLOPT_RETURNTRANSFER, 1);
curl_setopt ($ch,CURLOPT_VERBOSE,false);
curl_setopt($ch, CURLOPT_TIMEOUT, 5);
curl_setopt($ch,CURLOPT_SSL_VERIFYPEER, FALSE);
curl_setopt($ch,CURLOPT_SSLVERSION,3);
curl_setopt($ch,CURLOPT_SSL_VERIFYHOST, FALSE);
$page=curl_exec($ch);
//echo curl_error($ch);
$httpcode = curl_getinfo($ch, CURLINFO_HTTP_CODE);
curl_close($ch);
if($httpcode>=200 && $httpcode<300) return true;
else return false;
}
if (Visit("http://www.google.com"))
echo "Website OK"."n";
else
echo "Website DOWN";
?>Technique #3
<?php
ini_set("default_socket_timeout","05");
set_time_limit(5);
$f=fopen("http://www.css-tricks.com","r");
$r=fread($f,1000);
fclose($f);
if(strlen($r)>1) {
echo("<span class='online'>Online</span>");
}
else {
echo("<span class='offline'>Offline</span>");
}
?>
In technique #2 it should probably read
... && $httpcode<400) return true;instead of
... && $httpcode<300) return true;Otherwise a redirect would be regarded as “server down”.
how to use it ?????????
what is the method
Copy code from example #2 into a text file. Give it the extension .php. Move the file to your webserver and call the url that points to the file. If you have a PHP enabled server (most are) the code will check if Google is up (probably always).
You can take it from there, but you will need some basic PHP knowledge.
Which technique is better to use? advantages? disadvantages?
#1 checks if the url is valid, that’s a great plus, if the urls are entered by a user and don’t come out of a database or from a list with urls known to be valid.
#2 on the other hand submits a user agent string. Without it, some server will answer the request with a error 403 (forbidden) or 405 (method not allowed).
Conclusion: I suggest #2 but with the filter test of #1.
Actually, Chris should say something about this, because he is running aremysitesup.com, a service that probably uses this technique (see button in the footer of this page – great service, highly recommended).
We actually use Ruby and some methods that are far more complex =). But the principal is the same.
Thanks! I’m wondering if http://www.checksite.us uses the same method?
Also, what is the purpose of the agent?
Checksite.us is probably based on a similar mechanism, although it’s not clear what language is used.
The UserAgent is a string sent by any browser to identify itself. At least, that was the intention. Nowadays many browser lie about the details. However, as long as a UserAgent is sent, the web server usually sees a request as legitimate. Also robots, e.g. Google have UserAgent strings to identify them.
If no UserAgent string is sent, some web server don’t let you access the site for whatever reason (they probably don’t like the idea that some tool or other server accesses their pages directly).
You can see the UserAgent string of your browser (among other information) on this page:
http://browser.delucamarketing.ch/
none of these techniques really seem to work…everytime i check something like http://www.adfsfdasffsfasfsf.com it always comes back as 200…does anyone know of a way to find out if a website is unavailable like *cough* *cough* godaddy *cough* *cough*
It is a simple whois look up. You need to create a socket connection on port 43. It is just getting PHP to connect to the exact same whois server terminal uses when you do a whois lookup.
It is a simple whois look up. You need to create a socket connection on port 43. It is just getting PHP to connect to the exact same whois server terminal uses when you do a whois lookup.
Sorry for some reason it is not showing the code, you can find a good tut at http://www.phptoys.com/e107_plugins/content/content.php?content.38 which details how to build one.
There’s another solution that uses sockets:
http://neo22s.com/check-if-url-exists-and-is-online-php/
Hi Chris,
Any idea, how can I check if a bunch of URLS are valid. For example. I need to run a check on 1000 twitter profiles and I need to know if they are active or invalid URLS. I want to be able to paste the entire list of twitter urls and check if they are valid pages.
Nice,
But is there any API from who.is to check it? I think using web services will help the cause more effectively. Say as I did in getting the alexa ranking,
http://tutology.net/category/how-php/get-alexa-rank-php-and-alexa-api
DigWP
A book and blog co-authored by Jeff Starr and myself about the World's most popular publishing platform.
Quotes on Design
Design, like Art, can be an elusive word to define and an awfully fun thing to have opinions about.
HTML-Ipsum
One-click copy to clipboard access to Lorem Ipsum text that comes wrapped in a variety of HTML.
Bookshelf
Hey Chris, what books do you recommend? These, young fertile mind, these.