google 收录查询代码
本代码是一款站长常用工具了,用来查询google 收录代码,可以查询天
<option value="1">自定义天数</option>
<option value="d">查询昨日收录情况</option>
<option value="w">查询最近1星期收录情况</option>
<option value="m">查询最近1月收录情况</option>
<option value="m2">查询最近2月收录情况</option>
<option value="m3">查询最近3月收录情况</option>
<option value="m6">查询最近6月收录情况</option>
<option value="y">查询最近1年收录情况</option>
<option value="all">查询总的(所有日期)收录情况</option>
等信息。
<?php
function GetPage($url) {
$buf = parse_url($url);
if ($buf['scheme'] == "http") //如果是URL
{
$host = $buf['host'];
$page = $buf['path'];
if (trim($buf['query']) !== "") $page.= "?" . trim($buf['query']);
$myHeader = "GET $url HTTP/1.1rn";
$myHeader.= "Host: $hostrn";
$myHeader.= "Connection: closern";
$myHeader.= "Accept: text/xml,application/xml,application/xhtml+xml,text/html;q=0.9,text/plain;q=0.8,image/png,*/*;q=0.5rn";
$myHeader.= "Accept-Language: zh-cn,zh;q=0.5rn";
$myHeader.= "Accept-Charset: gb2312,utf-8;q=0.7,*;q=0.7rn";
$myHeader.= "User-Agent: Mozilla/5.0 (Windows; U; Windows NT 5.1; zh-CN; rv:1.7.6) Gecko/20050226 Firefox/1.0.1 Web-Sniffer/1.0.20rn";
$myHeader.= "Referer: http://$host/rnrn";
$server = $host;
$port = 80;
$res = "";
if (false !== ($fp = @fsockopen($server, $port, $errno, $errstr, 30))) {
@fputs($fp, $myHeader);
while (!@feof($fp)) $res.= @fgets($fp, 1024);
@fclose($fp);
} else return false;
if (strlen($res) == 0) return false;
return $res;
} else
//如果是本地文件
{
$fileName = $url;
if (false !== @file_exists($fileName)) {
if (false !== ($buf = @implode("", file($fileName))) && @strlen($buf) > 0) {
return $buf;
} else return false;
} else return false;
}
}
function GetContent($str, $x, $y) {
$tem = strstr($str, $x);
return substr($tem, 0, strpos($tem, $y));
}
对取得的google信息进行分析并且在本地。if (isset($q)) {
$TheUrl = "http://www.google.cn/search?q=+site:" . $q . "&num=20&complete=1&hl=zh-CN&lr=&newwindow=1&as_qdr=" . $t . "&start=" . $p . "&sa=N";
$TheThie = GetPage($TheUrl);
}
$TheBody = GetContent($TheThie, "<div id=res class=med>", "<div id=bsf style");
$TheBody = iconv("UTF-8", "gb2312//IGNORE", $TheBody);
$TheBody = str_replace("<!--z--><p><i>", "<!--z--><!--<p><i>", $TheBody);
$TheBody = str_replace('<br clear="all"/>', '<br clear="all"/>', $TheBody);
$TheBody = preg_replace('<a href="/search?num=20(.+?)amp;as_qdr=(.+?)&q=site:(.+?)&start=(.+?)&sa=N">', 'a href=?q=$3&t=$2&p=$4> <!--z--', $TheBody);
$TheBody = str_replace("该网站可能含有恶意软件,有可能会危害您的电脑。", "", $TheBody);
$TheBody = str_replace("/interstitial?url=", "", $TheBody);
$TheBody = str_replace("- <nobr>", "", $TheBody);
$TheBody = str_replace("类似网页", "", $TheBody);
$total = GetContent($TheThie, "</b></div><p>", "</b> - <b>");
$total = iconv("UTF-8", "gb2312//IGNORE", $total);
$total = GetContent($total, "有 <b>", "</b> 项");
$total = str_replace("有 <b>", "", $total);
$total = str_replace(",", "", $total);
$total = intval($total);教程链接:http://www.phprm.com/code/9639a5fd21b9f0b21e9c69bed29a4fdc.html
随意转载~但请保留教程地址★