[Grub-dev] notes on grub arc vs heritrix arc

Yousef Ourabi yourabi at zero-analog.com
Mon Jan 14 19:08:01 UTC 2008


notes inline see # comments (only two)

# OFFICIAL, HERITRIX, SLASHDOT INDEX.HTML, WIRED URL RECORD, CNN URL RECORD,
CNN DOCUMENT.
</div>
</body>
</html>
dns:blog.wired.com 68.87.76.178 20080114022153 text/dns 97
20080114022153
a1523.b.akamai.net.    20    IN    A    12.190.48.58
a1523.b.akamai.net.    20    IN    A    12.190.48.82

http://www.cnn.com/ 64.236.29.120 20080114022153 text/html 89349
HTTP/1.1 200 OK
Date: Mon, 14 Jan 2008 02:21:53 GMT
Server: Apache
Accept-Ranges: bytes
Cache-Control: max-age=60, private
Expires: Mon, 14 Jan 2008 02:22:53 GMT
Vary: Accept-Encoding,User-Agent
Content-Type: text/html
X-Pad: avoid browser bug
Connection: close

<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN""
http://www.w3.org/TR/html4/loose.dtd"><html lang="en"><head><title>CNN.com -
Breaking News, U.S., World, Weather, Entertainment & Video News</title>
<meta http-equiv="refresh" content="1800;url=?refresh=1">
<meta name="Description" content="CNN.com delivers the latest breaking news
and information on the latest top stories, weather, business, entertainment,
politics, and more. For in-depth coverage, CNN.com provides special reports,
video, audio, photo galleries, and interactive guides.">
<meta name="Keywords" content="CNN, CNN news, CNN.com, CNN TV, news, news
online, breaking news, U.S. news, world news, weather, business, CNN Money,
sports, politics, law, technology, entertainment, education, travel, health,
special reports, autos, developing story, news video, CNN Intl">

<link rel="alternate" type="application/rss+xml" title="CNN - Top Stories
[RSS]" href="http://rss.cnn.com/rss/cnn_topstories.rss">
<link rel="alternate" type="application/rss+xml" title="CNN - Recent Stories
[RSS]" href="http://rss.cnn.com/rss/cnn_latest.rss">


# BABY GRUB, DOCUMENT, URL RECORD, DOCUMENT
</table>


</body>
</html>
http://www.mofa.go.jp/mofaj/area/moldova/ 210.163.22.165:80 19691231175959
text/html 6505
HTTP/1.1 200 OK
Connection: close
Date: Mon, 14 Jan 2008 17:34:50 GMT
Accept-Ranges: bytes
ETag: "181c-475369aa"
Server: Sun-ONE-Web-Server/6.1
Content-Length: 6172
Content-Type: text/html
Last-Modified: Mon, 03 Dec 2007 02:27:54 GMT
Client-Date: Mon, 14 Jan 2008 17:35:01 GMT
Client-Peer: 210.163.22.165:80
Client-Response-Num: 1

<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "
http://www.w3.org/TR/html4/loose.dtd">
<html lang="ja">
<head><script src="/__utm.js" type="text/javascript"></script>
    <meta http-equiv="Content-Type" content="text/html; charset=Shift_JIS">
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://lists.wikia.com/pipermail/grub-dev/attachments/20080114/a386617c/attachment.html 


More information about the Grub-dev mailing list