<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
		>
<channel>
	<title>Comments on: Want to help improve LibriVox?</title>
	<atom:link href="http://blog.jonudell.net/2007/10/16/want-to-help-improve-librivox/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.jonudell.net/2007/10/16/want-to-help-improve-librivox/</link>
	<description>Strategies for Internet citizens</description>
	<lastBuildDate>Sun, 12 Feb 2012 18:22:41 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
	<item>
		<title>By: Jon Udell</title>
		<link>http://blog.jonudell.net/2007/10/16/want-to-help-improve-librivox/#comment-70643</link>
		<dc:creator><![CDATA[Jon Udell]]></dc:creator>
		<pubDate>Fri, 19 Oct 2007 12:34:54 +0000</pubDate>
		<guid isPermaLink="false">http://blog.jonudell.net/2007/10/16/want-to-help-improve-librivox/#comment-70643</guid>
		<description><![CDATA[&quot;title &amp; artist info for each track can be acquired by screen scraping alone&quot;

That is true. However, I would recommend that LibriVox publish this metadata as a distinct XML fragment for each work, not only for the feed generator but also for other aggregators that will want to get hold of what are, in effect, bibliographic records.]]></description>
		<content:encoded><![CDATA[<p>&#8220;title &amp; artist info for each track can be acquired by screen scraping alone&#8221;</p>
<p>That is true. However, I would recommend that LibriVox publish this metadata as a distinct XML fragment for each work, not only for the feed generator but also for other aggregators that will want to get hold of what are, in effect, bibliographic records.</p>
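<p>A minimal sketch of what such a per-work fragment might look like, built with Python's standard xml.etree.ElementTree. The element names (work, title, author, tracks, track, reader, url), the function name work_fragment, and the sample values are all hypothetical; nothing here reflects an actual LibriVox schema.</p>

```python
import xml.etree.ElementTree as ET


def work_fragment(title, author, tracks):
    """Build a small per-work metadata fragment.

    The element names are hypothetical, not a LibriVox schema.
    tracks is a list of (track_title, reader, url) tuples.
    """
    work = ET.Element("work")
    ET.SubElement(work, "title").text = title
    ET.SubElement(work, "author").text = author
    tracks_el = ET.SubElement(work, "tracks")
    for number, (track_title, reader, url) in enumerate(tracks, start=1):
        track = ET.SubElement(tracks_el, "track", number=str(number))
        ET.SubElement(track, "title").text = track_title
        ET.SubElement(track, "reader").text = reader
        ET.SubElement(track, "url").text = url
    # encoding="unicode" returns a str rather than bytes
    return ET.tostring(work, encoding="unicode")


# Placeholder values, not real catalog data.
fragment = work_fragment(
    "Example Work",
    "Example Author",
    [("Chapter 1", "Example Reader", "http://example.org/ch01.mp3")],
)
print(fragment)
```

<p>An aggregator could parse a fragment like this directly instead of scraping the catalog page, which is the point of publishing it separately.</p>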
]]></content:encoded>
	</item>
	<item>
		<title>By: Minh</title>
		<link>http://blog.jonudell.net/2007/10/16/want-to-help-improve-librivox/#comment-70531</link>
		<dc:creator><![CDATA[Minh]]></dc:creator>
		<pubDate>Fri, 19 Oct 2007 04:03:48 +0000</pubDate>
		<guid isPermaLink="false">http://blog.jonudell.net/2007/10/16/want-to-help-improve-librivox/#comment-70531</guid>
		<description><![CDATA[Come to think of it, the title &amp; artist info for each track can be acquired by screen scraping alone. No need to go to the MP3s themselves.]]></description>
		<content:encoded><![CDATA[<p>Come to think of it, the title &amp; artist info for each track can be acquired by screen scraping alone. No need to go to the MP3s themselves.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jeremy Dunck</title>
		<link>http://blog.jonudell.net/2007/10/16/want-to-help-improve-librivox/#comment-70486</link>
		<dc:creator><![CDATA[Jeremy Dunck]]></dc:creator>
		<pubDate>Fri, 19 Oct 2007 00:47:59 +0000</pubDate>
		<guid isPermaLink="false">http://blog.jonudell.net/2007/10/16/want-to-help-improve-librivox/#comment-70486</guid>
		<description><![CDATA[Correction: *httplib2* is better than *httplib*.  

:)]]></description>
		<content:encoded><![CDATA[<p>Correction: *httplib2* is better than *httplib*.  </p>
<p>:)</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jeremy Dunck</title>
		<link>http://blog.jonudell.net/2007/10/16/want-to-help-improve-librivox/#comment-70483</link>
		<dc:creator><![CDATA[Jeremy Dunck]]></dc:creator>
		<pubDate>Fri, 19 Oct 2007 00:47:23 +0000</pubDate>
		<guid isPermaLink="false">http://blog.jonudell.net/2007/10/16/want-to-help-improve-librivox/#comment-70483</guid>
		<description><![CDATA[Minh:
  The HTTP spec defines byte ranges.  Some web servers don&#039;t support them, but archive.org does.

  
  http://www.ietf.org/rfc/rfc2616.txt
  Section 14.35.1 is what you want.
  
  httplib2 is better than httplib2 for this kind of thing: http://code.google.com/p/httplib2/
  docs: http://bitworking.org/projects/httplib2/ref/http-objects.html
  
   
  Example of using httplib against archive.org to get part of the file:
  http://dpaste.com/22843/]]></description>
		<content:encoded><![CDATA[<p>Minh:<br />
  The HTTP spec defines byte ranges.  Some web servers don&#8217;t support them, but archive.org does.</p>
<p>  <a href="http://www.ietf.org/rfc/rfc2616.txt" rel="nofollow">http://www.ietf.org/rfc/rfc2616.txt</a><br />
  Section 14.35.1 is what you want.</p>
<p>  httplib2 is better than httplib2 for this kind of thing: <a href="http://code.google.com/p/httplib2/" rel="nofollow">http://code.google.com/p/httplib2/</a><br />
  docs: <a href="http://bitworking.org/projects/httplib2/ref/http-objects.html" rel="nofollow">http://bitworking.org/projects/httplib2/ref/http-objects.html</a></p>
<p>  Example of using httplib against archive.org to get part of the file:<br />
  <a href="http://dpaste.com/22843/" rel="nofollow">http://dpaste.com/22843/</a></p>
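<p>For readers who cannot reach the paste, here is a rough sketch of a byte-range request using only Python's standard urllib.request (not httplib2, and not necessarily what the linked example does). The URL and the helper names range_request and parse_content_range are assumptions made for illustration. The request is built but not sent, so the sketch runs offline.</p>

```python
import urllib.request


def range_request(url, start, end):
    """Build (but do not send) a GET request for bytes start..end inclusive."""
    if start < 0 or end < start:
        raise ValueError("need 0 <= start <= end")
    return urllib.request.Request(
        url, headers={"Range": "bytes=%d-%d" % (start, end)}
    )


def parse_content_range(value):
    """Parse a 'bytes first-last/total' Content-Range value.

    Returns (first, last, total); total is None when the server sends '*'.
    """
    unit, _, rest = value.partition(" ")
    if unit != "bytes":
        raise ValueError("unsupported range unit: %r" % unit)
    span, _, total = rest.partition("/")
    first, _, last = span.partition("-")
    return int(first), int(last), (None if total == "*" else int(total))


# Hypothetical URL; ask for the first 6K of the file.
req = range_request("http://example.org/audio/chapter01.mp3", 0, 6143)
print(req.get_header("Range"))
```

<p>Passing req to urllib.request.urlopen would send it. A server that honors the range answers 206 Partial Content with a Content-Range header; one that ignores it answers 200 with the whole file, so it is worth checking the status before trusting the length of the body.</p>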
]]></content:encoded>
	</item>
	<item>
		<title>By: hugh</title>
		<link>http://blog.jonudell.net/2007/10/16/want-to-help-improve-librivox/#comment-70357</link>
		<dc:creator><![CDATA[hugh]]></dc:creator>
		<pubDate>Thu, 18 Oct 2007 16:43:21 +0000</pubDate>
		<guid isPermaLink="false">http://blog.jonudell.net/2007/10/16/want-to-help-improve-librivox/#comment-70357</guid>
		<description><![CDATA[our database holds the id3tags, i believe, so we could publish those too i think.]]></description>
		<content:encoded><![CDATA[<p>our database holds the id3tags, i believe, so we could publish those too i think.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jon Udell</title>
		<link>http://blog.jonudell.net/2007/10/16/want-to-help-improve-librivox/#comment-70295</link>
		<dc:creator><![CDATA[Jon Udell]]></dc:creator>
		<pubDate>Thu, 18 Oct 2007 12:37:00 +0000</pubDate>
		<guid isPermaLink="false">http://blog.jonudell.net/2007/10/16/want-to-help-improve-librivox/#comment-70295</guid>
		<description><![CDATA[&quot;request for the mp3 from archive.org returns a 302 - redirect, which shouldn’t be hard to deal with&quot;

Originally I had the script follow that redirect, but the LibriVox folks found it was better to let the RSS reader do that at feed fetch time.

&quot;I believe that ID3 tags are at the end of the MP3 file&quot;

Of course all the metadata comes from the LibriVox database. It could be scraped from the page, or perhaps LibriVox can publish it in a more tractable form.]]></description>
		<content:encoded><![CDATA[<p>&#8220;request for the mp3 from archive.org returns a 302 &#8211; redirect, which shouldn’t be hard to deal with&#8221;</p>
<p>Originally I had the script follow that redirect, but the LibriVox folks found it was better to let the RSS reader do that at feed fetch time.</p>
<p>&#8220;I believe that ID3 tags are at the end of the MP3 file&#8221;</p>
<p>Of course all the metadata comes from the LibriVox database. It could be scraped from the page, or perhaps LibriVox can publish it in a more tractable form.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Minh</title>
		<link>http://blog.jonudell.net/2007/10/16/want-to-help-improve-librivox/#comment-70216</link>
		<dc:creator><![CDATA[Minh]]></dc:creator>
		<pubDate>Thu, 18 Oct 2007 03:00:20 +0000</pubDate>
		<guid isPermaLink="false">http://blog.jonudell.net/2007/10/16/want-to-help-improve-librivox/#comment-70216</guid>
		<description><![CDATA[Hey Jon,

Making steady progress, I think.

Python is pretty strange. This statement freaked me out:

   return x, y, z

and of course:

   (x, y, z) = func(p)

But there are currently a couple of issues:

1) request for the mp3 from archive.org returns a 302 - redirect, which shouldn&#039;t be hard to deal with,

2) how is it that reading a 6K chunk in the middle of the file gives you minutes &amp; seconds? Pretty cool. But I believe that ID3 tags are at the end of the MP3 file, so currently, I can retrieve title &amp; artist only when I pull down the entire file .... which is significantly larger than 6K.

Unless I can somehow just pull down the portion of the file that contains the ID3 tags.

We&#039;ll see...]]></description>
		<content:encoded><![CDATA[<p>Hey Jon,</p>
<p>Making steady progress, I think.</p>
<p>Python is pretty strange. This statement freaked me out:</p>
<p>   return x, y, z</p>
<p>and of course:</p>
<p>   (x, y, z) = func(p)</p>
<p>But there are currently a couple of issues:</p>
<p>1) request for the mp3 from archive.org returns a 302 &#8211; redirect, which shouldn&#8217;t be hard to deal with,</p>
<p>2) how is it that reading a 6K chunk in the middle of the file gives you minutes &amp; seconds? Pretty cool. But I believe that ID3 tags are at the end of the MP3 file, so currently, I can retrieve title &amp; artist only when I pull down the entire file &#8230;. which is significantly larger than 6K.</p>
<p>Unless I can somehow just pull down the portion of the file that contains the ID3 tags.</p>
<p>We&#8217;ll see&#8230;</p>
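<p>A hedged sketch of the idea, assuming the files carry ID3v1 tags: ID3v1 is a fixed 128-byte block at the very end of the file, so a suffix byte range (Range: bytes=-128) would be enough to fetch it. ID3v2 tags normally sit at the start of the file instead, so this does not cover them. The function parse_id3v1 and the in-memory sample tag are made up for illustration, and the function also shows the tuple return and unpacking mentioned above.</p>

```python
ID3V1_SIZE = 128  # an ID3v1 tag is a fixed 128-byte block at the end of the file


def _text(raw):
    """Decode a fixed-width ID3v1 field, dropping NUL and space padding."""
    return raw.split(b"\x00", 1)[0].decode("latin-1").strip()


def parse_id3v1(tail):
    """Return (title, artist, album) from the last 128 bytes of an MP3.

    Returns None if the bytes do not start with the 'TAG' marker.
    """
    if len(tail) != ID3V1_SIZE or tail[:3] != b"TAG":
        return None
    # Layout: 'TAG' + title(30) + artist(30) + album(30) + year(4) + comment(30) + genre(1)
    title, artist, album = tail[3:33], tail[33:63], tail[63:93]
    return _text(title), _text(artist), _text(album)  # one tuple, three values


# Hypothetical tag built in memory so the sketch runs without a network fetch.
fake_tail = (
    b"TAG"
    + b"Chapter 1".ljust(30, b"\x00")
    + b"Example Reader".ljust(30, b"\x00")
    + b"Example Work".ljust(30, b"\x00")
    + b"2007"
    + b"\x00" * 30
    + b"\xff"
)
(title, artist, album) = parse_id3v1(fake_tail)
print(title, artist, album)
```

<p>In real use the 128 bytes would come from the response body of the range request, after confirming the server answered 206 rather than sending the whole file.</p>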
]]></content:encoded>
	</item>
	<item>
		<title>By: Jon Udell</title>
		<link>http://blog.jonudell.net/2007/10/16/want-to-help-improve-librivox/#comment-70120</link>
		<dc:creator><![CDATA[Jon Udell]]></dc:creator>
		<pubDate>Wed, 17 Oct 2007 17:26:41 +0000</pubDate>
		<guid isPermaLink="false">http://blog.jonudell.net/2007/10/16/want-to-help-improve-librivox/#comment-70120</guid>
		<description><![CDATA[&quot;What language is this script in?&quot;

Python. 

http://jonudell.net/librivox.py
http://jonudell.net/mp3info.py]]></description>
		<content:encoded><![CDATA[<p>&#8220;What language is this script in?&#8221;</p>
<p>Python. </p>
<p><a href="http://jonudell.net/librivox.py" rel="nofollow">http://jonudell.net/librivox.py</a><br />
<a href="http://jonudell.net/mp3info.py" rel="nofollow">http://jonudell.net/mp3info.py</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Minh</title>
		<link>http://blog.jonudell.net/2007/10/16/want-to-help-improve-librivox/#comment-70060</link>
		<dc:creator><![CDATA[Minh]]></dc:creator>
		<pubDate>Wed, 17 Oct 2007 13:01:07 +0000</pubDate>
		<guid isPermaLink="false">http://blog.jonudell.net/2007/10/16/want-to-help-improve-librivox/#comment-70060</guid>
		<description><![CDATA[Jon,
What language is this script in? I&#039;m doing some stuff w/ MP3 tags in .NET &amp; might be able to help.]]></description>
		<content:encoded><![CDATA[<p>Jon,<br />
What language is this script in? I&#8217;m doing some stuff w/ MP3 tags in .NET &amp; might be able to help.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Kara Shallenberg</title>
		<link>http://blog.jonudell.net/2007/10/16/want-to-help-improve-librivox/#comment-69898</link>
		<dc:creator><![CDATA[Kara Shallenberg]]></dc:creator>
		<pubDate>Wed, 17 Oct 2007 03:16:59 +0000</pubDate>
		<guid isPermaLink="false">http://blog.jonudell.net/2007/10/16/want-to-help-improve-librivox/#comment-69898</guid>
		<description><![CDATA[Yup, that&#039;d sure be nice.  A lot of books have lovely descriptive chapter titles, too.  It would be great for those to get into the feed somehow.]]></description>
		<content:encoded><![CDATA[<p>Yup, that&#8217;d sure be nice.  A lot of books have lovely descriptive chapter titles, too.  It would be great for those to get into the feed somehow.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: hugh</title>
		<link>http://blog.jonudell.net/2007/10/16/want-to-help-improve-librivox/#comment-69895</link>
		<dc:creator><![CDATA[hugh]]></dc:creator>
		<pubDate>Wed, 17 Oct 2007 03:06:14 +0000</pubDate>
		<guid isPermaLink="false">http://blog.jonudell.net/2007/10/16/want-to-help-improve-librivox/#comment-69895</guid>
		<description><![CDATA[yeah, on the list of things to do... generate proper metadata in the rss feeds... help would be appreciated. all the data is there in standard format, just not rss-ized.]]></description>
		<content:encoded><![CDATA[<p>yeah, on the list of things to do&#8230; generate proper metadata in the rss feeds&#8230; help would be appreciated. all the data is there in standard format, just not rss-ized.</p>
]]></content:encoded>
	</item>
</channel>
</rss>

