<?xml version="1.0" encoding="UTF-8"?><!-- generator="wordpress/2.0.4" -->
<rss version="2.0" 
	xmlns:content="http://purl.org/rss/1.0/modules/content/">
<channel>
	<title>Comments on: XML data file of online, valid phishes from PhishTank</title>
	<link>http://www.phishtank.com/blog/2006/10/17/xml-data-file-of-online-valid-phishes-from-phishtank/</link>
	<description>A blog about and from PhishTank, a collaborative clearinghouse for data about phishing.</description>
	<pubDate>Thu, 21 Aug 2008 06:37:08 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.0.4</generator>

	<item>
		<title>by: John Nagle</title>
		<link>http://www.phishtank.com/blog/2006/10/17/xml-data-file-of-online-valid-phishes-from-phishtank/#comment-47819</link>
		<pubDate>Sun, 14 Oct 2007 04:54:53 +0000</pubDate>
		<guid>http://www.phishtank.com/blog/2006/10/17/xml-data-file-of-online-valid-phishes-from-phishtank/#comment-47819</guid>
					<description>The XML file started containing useful data again on Friday, October 12th.  Thanks.

Incidentally, it would help if the file was updated as an atomic operation.  Occasionally, we see a partially written file, if we happen to read it while it's being rewritten.  We have to read the file twice at 30 second intervals and compare, rereading until we get the same contents twice in a row.  It would be better to write a new file on each update, then move or link it to the name of the distributed file.</description>
		<content:encoded><![CDATA[<p>The XML file started containing useful data again on Friday, October 12th.  Thanks.</p>
<p>Incidentally, it would help if the file was updated as an atomic operation.  Occasionally, we see a partially written file, if we happen to read it while it&#8217;s being rewritten.  We have to read the file twice at 30 second intervals and compare, rereading until we get the same contents twice in a row.  It would be better to write a new file on each update, then move or link it to the name of the distributed file.
</p>
]]></content:encoded>
				</item>
	<item>
		<title>by: John Nagle</title>
		<link>http://www.phishtank.com/blog/2006/10/17/xml-data-file-of-online-valid-phishes-from-phishtank/#comment-47487</link>
		<pubDate>Fri, 12 Oct 2007 04:56:35 +0000</pubDate>
		<guid>http://www.phishtank.com/blog/2006/10/17/xml-data-file-of-online-valid-phishes-from-phishtank/#comment-47487</guid>
					<description>Something has gone very wrong with the XML file of PhishTank data at "http://data.phishtank.com/data/online-valid". Today, it reads:


− 
2007-10-12T04:30:01+00:00
0





That's the entire file.  Valid XML, no entries. Something is very broken.</description>
		<content:encoded><![CDATA[<p>Something has gone very wrong with the XML file of PhishTank data at &#8220;http://data.phishtank.com/data/online-valid&#8221;. Today, it reads:</p>
<p>−<br />
2007-10-12T04:30:01+00:00<br />
0</p>
<p>That&#8217;s the entire file.  Valid XML, no entries. Something is very broken.
</p>
]]></content:encoded>
				</item>
	<item>
		<title>by: Tom</title>
		<link>http://www.phishtank.com/blog/2006/10/17/xml-data-file-of-online-valid-phishes-from-phishtank/#comment-47408</link>
		<pubDate>Thu, 11 Oct 2007 10:58:15 +0000</pubDate>
		<guid>http://www.phishtank.com/blog/2006/10/17/xml-data-file-of-online-valid-phishes-from-phishtank/#comment-47408</guid>
					<description>http://data.phishtank.com/data/online-valid/index.xml has been returning 0 entries since yesterday some time.

Please advise</description>
		<content:encoded><![CDATA[<p><a href='http://data.phishtank.com/data/online-valid/index.xml' rel='nofollow'>http://data.phishtank.com/data/online-valid/index.xml</a> has been returning 0 entries since yesterday some time.</p>
<p>Please advise
</p>
]]></content:encoded>
				</item>
	<item>
		<title>by: Steve Garvey</title>
		<link>http://www.phishtank.com/blog/2006/10/17/xml-data-file-of-online-valid-phishes-from-phishtank/#comment-42530</link>
		<pubDate>Fri, 14 Sep 2007 21:13:25 +0000</pubDate>
		<guid>http://www.phishtank.com/blog/2006/10/17/xml-data-file-of-online-valid-phishes-from-phishtank/#comment-42530</guid>
					<description>The file is NOT sent compressed to my gzip capable browser (Firefox)

Me: Accept-Encoding: gzip, deflate

Your server: Content-Length: 6584660

What if someone writes a script to retrieve it without an "Accept-Encoding" header? As stated previously, you should be serving a gzip'd file.</description>
		<content:encoded><![CDATA[<p>The file is NOT sent compressed to my gzip capable browser (Firefox)</p>
<p>Me: Accept-Encoding: gzip, deflate</p>
<p>Your server: Content-Length: 6584660</p>
<p>What if someone writes a script to retrieve it without an &#8220;Accept-Encoding&#8221; header? As stated previously, you should be serving a gzip&#8217;d file.
</p>
]]></content:encoded>
				</item>
	<item>
		<title>by: John Roberts</title>
		<link>http://www.phishtank.com/blog/2006/10/17/xml-data-file-of-online-valid-phishes-from-phishtank/#comment-9216</link>
		<pubDate>Mon, 05 Mar 2007 18:09:26 +0000</pubDate>
		<guid>http://www.phishtank.com/blog/2006/10/17/xml-data-file-of-online-valid-phishes-from-phishtank/#comment-9216</guid>
					<description>Andres, the file is transparently gzipped across the wire as long as the client requesting the file support gzip.</description>
		<content:encoded><![CDATA[<p>Andres, the file is transparently gzipped across the wire as long as the client requesting the file support gzip.
</p>
]]></content:encoded>
				</item>
	<item>
		<title>by: Andres Riancho</title>
		<link>http://www.phishtank.com/blog/2006/10/17/xml-data-file-of-online-valid-phishes-from-phishtank/#comment-9069</link>
		<pubDate>Sun, 04 Mar 2007 15:39:24 +0000</pubDate>
		<guid>http://www.phishtank.com/blog/2006/10/17/xml-data-file-of-online-valid-phishes-from-phishtank/#comment-9069</guid>
					<description>Great work, the database is really amazing! Only one simple question, why isnt the xml file compressed ?

dz0@sock3t:/tmp$ du -sh index.xml 
6.4M    index.xml
dz0@sock3t:/tmp$ bzip2 index.xml 
dz0@sock3t:/tmp$ du -sh index.xml.bz2 
292K    index.xml.bz2
dz0@sock3t:/tmp$ 

The size reduction is amazing, you would save a lot of bandwidth ! 
Maybe the compression algorithm should be "zip" instead of bzip2 to make it easy to decompress in all programming languages and operating systems.

--
Andres Riancho</description>
		<content:encoded><![CDATA[<p>Great work, the database is really amazing! Only one simple question, why isnt the xml file compressed ?</p>
<p><a href="mailto:dz0@sock3t:/tmp$">dz0@sock3t:/tmp$</a> du -sh index.xml<br />
6.4M    index.xml<br />
<a href="mailto:dz0@sock3t:/tmp$">dz0@sock3t:/tmp$</a> bzip2 index.xml<br />
<a href="mailto:dz0@sock3t:/tmp$">dz0@sock3t:/tmp$</a> du -sh index.xml.bz2<br />
292K    index.xml.bz2<br />
<a href="mailto:dz0@sock3t:/tmp$">dz0@sock3t:/tmp$</a> </p>
<p>The size reduction is amazing, you would save a lot of bandwidth !<br />
Maybe the compression algorithm should be &#8220;zip&#8221; instead of bzip2 to make it easy to decompress in all programming languages and operating systems.</p>
<p>&#8211;<br />
Andres Riancho
</p>
]]></content:encoded>
				</item>
	<item>
		<title>by: phishthis</title>
		<link>http://www.phishtank.com/blog/2006/10/17/xml-data-file-of-online-valid-phishes-from-phishtank/#comment-399</link>
		<pubDate>Tue, 05 Dec 2006 19:52:35 +0000</pubDate>
		<guid>http://www.phishtank.com/blog/2006/10/17/xml-data-file-of-online-valid-phishes-from-phishtank/#comment-399</guid>
					<description>Thank You OpenDNS for having this data. Hopefully, as one who finds phishing and pharming as totally repulsive, this will help the yet-unwashed of the internet to learn what phishing is, and perhaps our efforts will keep them safer and spare them the time in repairing their good name(s).

Mark
Founder, President, and "Bottle Warsher" of the
London Antiphishing Society, 
near Arkansas Nuclear One.</description>
		<content:encoded><![CDATA[<p>Thank You OpenDNS for having this data. Hopefully, as one who finds phishing and pharming as totally repulsive, this will help the yet-unwashed of the internet to learn what phishing is, and perhaps our efforts will keep them safer and spare them the time in repairing their good name(s).</p>
<p>Mark<br />
Founder, President, and &#8220;Bottle Warsher&#8221; of the<br />
London Antiphishing Society,<br />
near Arkansas Nuclear One.
</p>
]]></content:encoded>
				</item>
	<item>
		<title>by: Ian</title>
		<link>http://www.phishtank.com/blog/2006/10/17/xml-data-file-of-online-valid-phishes-from-phishtank/#comment-225</link>
		<pubDate>Wed, 15 Nov 2006 09:42:52 +0000</pubDate>
		<guid>http://www.phishtank.com/blog/2006/10/17/xml-data-file-of-online-valid-phishes-from-phishtank/#comment-225</guid>
					<description>Perhaps you ought to make the server serve the RSS feed as static data, and kick out a 304 header if it is not modified? That should help bandwidth-wise.</description>
		<content:encoded><![CDATA[<p>Perhaps you ought to make the server serve the RSS feed as static data, and kick out a 304 header if it is not modified? That should help bandwidth-wise.
</p>
]]></content:encoded>
				</item>
	<item>
		<title>by: MASA</title>
		<link>http://www.phishtank.com/blog/2006/10/17/xml-data-file-of-online-valid-phishes-from-phishtank/#comment-223</link>
		<pubDate>Wed, 15 Nov 2006 06:04:33 +0000</pubDate>
		<guid>http://www.phishtank.com/blog/2006/10/17/xml-data-file-of-online-valid-phishes-from-phishtank/#comment-223</guid>
					<description>It's out,

http://phishtank.com/sitechecker (redirects to the extension's homepage)</description>
		<content:encoded><![CDATA[<p>It&#8217;s out,</p>
<p><a href='http://phishtank.com/sitechecker' rel='nofollow'>http://phishtank.com/sitechecker</a> (redirects to the extension&#8217;s homepage)
</p>
]]></content:encoded>
				</item>
	<item>
		<title>by: MASA</title>
		<link>http://www.phishtank.com/blog/2006/10/17/xml-data-file-of-online-valid-phishes-from-phishtank/#comment-109</link>
		<pubDate>Fri, 20 Oct 2006 02:52:47 +0000</pubDate>
		<guid>http://www.phishtank.com/blog/2006/10/17/xml-data-file-of-online-valid-phishes-from-phishtank/#comment-109</guid>
					<description>I am using this data for a firefox extension that protects the user from phishpages based on data from phishtank.

The extension is in the translation stage right now.</description>
		<content:encoded><![CDATA[<p>I am using this data for a firefox extension that protects the user from phishpages based on data from phishtank.</p>
<p>The extension is in the translation stage right now.
</p>
]]></content:encoded>
				</item>
</channel>
</rss>
