<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
		>
<channel>
	<title>Comments on: Facts and friction</title>
	<atom:link href="http://blog.jonudell.net/2009/04/03/facts-and-friction/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.jonudell.net/2009/04/03/facts-and-friction/</link>
	<description>Strategies for Internet citizens</description>
	<lastBuildDate>Sat, 11 Feb 2012 19:45:11 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
	<item>
		<title>By: Martin Millbrand</title>
		<link>http://blog.jonudell.net/2009/04/03/facts-and-friction/#comment-132456</link>
		<dc:creator><![CDATA[Martin Millbrand]]></dc:creator>
		<pubDate>Tue, 09 Mar 2010 13:58:06 +0000</pubDate>
		<guid isPermaLink="false">http://blog.jonudell.net/?p=1336#comment-132456</guid>
		<description><![CDATA[I believe you actually, I do believe! Should it be workable for you to get your webblog translated into Chinese? English is my own 2nd language.]]></description>
		<content:encoded><![CDATA[<p>I believe you actually, I do believe! Should it be workable for you to get your webblog translated into Chinese? English is my own 2nd language.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Three New Search Services: Wolfram&#124;Alpha, Microsoft Bing, Google Squared &#124; Disruptive Library Technology Jester</title>
		<link>http://blog.jonudell.net/2009/04/03/facts-and-friction/#comment-128316</link>
		<dc:creator><![CDATA[Three New Search Services: Wolfram&#124;Alpha, Microsoft Bing, Google Squared &#124; Disruptive Library Technology Jester]]></dc:creator>
		<pubDate>Tue, 16 Jun 2009 15:13:18 +0000</pubDate>
		<guid isPermaLink="false">http://blog.jonudell.net/?p=1336#comment-128316</guid>
		<description><![CDATA[[...] of Wolfram&#124;Alpha and what it might mean for those of us in the business of answering questions. Jon Udell talks about being able to &#8220;compute with facts in a more frictionless way.&#8221; There is [...]]]></description>
		<content:encoded><![CDATA[<p>[...] of Wolfram|Alpha and what it might mean for those of us in the business of answering questions. Jon Udell talks about being able to &#8220;compute with facts in a more frictionless way.&#8221; There is [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: jodischneider.com/blog &#187; Wolfram&#124;Alpha Roundup</title>
		<link>http://blog.jonudell.net/2009/04/03/facts-and-friction/#comment-127631</link>
		<dc:creator><![CDATA[jodischneider.com/blog &#187; Wolfram&#124;Alpha Roundup]]></dc:creator>
		<pubDate>Wed, 20 May 2009 04:58:15 +0000</pubDate>
		<guid isPermaLink="false">http://blog.jonudell.net/?p=1336#comment-127631</guid>
		<description><![CDATA[[...] Jon Udell is hoping &#8220;to be able to compute with facts in a more frictionless way.&#8221; [...]]]></description>
		<content:encoded><![CDATA[<p>[...] Jon Udell is hoping &#8220;to be able to compute with facts in a more frictionless way.&#8221; [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Michael Nielsen &#187; Biweekly links for 05/01/2009</title>
		<link>http://blog.jonudell.net/2009/04/03/facts-and-friction/#comment-127450</link>
		<dc:creator><![CDATA[Michael Nielsen &#187; Biweekly links for 05/01/2009]]></dc:creator>
		<pubDate>Fri, 01 May 2009 10:53:47 +0000</pubDate>
		<guid isPermaLink="false">http://blog.jonudell.net/?p=1336#comment-127450</guid>
		<description><![CDATA[[...] Facts and friction « Jon Udell [...]]]></description>
		<content:encoded><![CDATA[<p>[...] Facts and friction « Jon Udell [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Book Review of - Simple Spreadsheets For Hard Decisions &#124; The Money Saving Fifty (50)</title>
		<link>http://blog.jonudell.net/2009/04/03/facts-and-friction/#comment-127250</link>
		<dc:creator><![CDATA[Book Review of - Simple Spreadsheets For Hard Decisions &#124; The Money Saving Fifty (50)]]></dc:creator>
		<pubDate>Thu, 09 Apr 2009 12:03:59 +0000</pubDate>
		<guid isPermaLink="false">http://blog.jonudell.net/?p=1336#comment-127250</guid>
		<description><![CDATA[[...] Facts and friction « Jon Udell [...]]]></description>
		<content:encoded><![CDATA[<p>[...] Facts and friction « Jon Udell [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Nick Lothian</title>
		<link>http://blog.jonudell.net/2009/04/03/facts-and-friction/#comment-127227</link>
		<dc:creator><![CDATA[Nick Lothian]]></dc:creator>
		<pubDate>Mon, 06 Apr 2009 23:34:44 +0000</pubDate>
		<guid isPermaLink="false">http://blog.jonudell.net/?p=1336#comment-127227</guid>
		<description><![CDATA[Re: GoogleLookup - did you try that? Here&#039;s the results I get:

Providence,RI, population:	183,500
Boston, population:	4,467,000

Demo spreadsheet: http://spreadsheets.google.com/ccc?key=pq1fjlmNTLRo9yuyQAWZ_eg

You can choose what answer you want anyway - see http://googlesystem.blogspot.com/2009/03/alternative-answers-for-googlelookup.html]]></description>
		<content:encoded><![CDATA[<p>Re: GoogleLookup &#8211; did you try that? Here&#8217;s the results I get:</p>
<p>Providence,RI, population:	183,500<br />
Boston, population:	4,467,000</p>
<p>Demo spreadsheet: <a href="http://spreadsheets.google.com/ccc?key=pq1fjlmNTLRo9yuyQAWZ_eg" rel="nofollow">http://spreadsheets.google.com/ccc?key=pq1fjlmNTLRo9yuyQAWZ_eg</a></p>
<p>You can choose what answer you want anyway &#8211; see <a href="http://googlesystem.blogspot.com/2009/03/alternative-answers-for-googlelookup.html" rel="nofollow">http://googlesystem.blogspot.com/2009/03/alternative-answers-for-googlelookup.html</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Bernie Hogan</title>
		<link>http://blog.jonudell.net/2009/04/03/facts-and-friction/#comment-127218</link>
		<dc:creator><![CDATA[Bernie Hogan]]></dc:creator>
		<pubDate>Mon, 06 Apr 2009 14:04:33 +0000</pubDate>
		<guid isPermaLink="false">http://blog.jonudell.net/?p=1336#comment-127218</guid>
		<description><![CDATA[@Joe, yeah - I think we&#039;re almost on the same page. I have a caveat, though. 

These queries still require exogenous knowledge that relates not to the data but to the way it is accessed. Python libraries are exogenous knowledge; they are simply there as a means to an end. The goal should be to absolutely minimize the need for additional such knowledge. Better syntax gets us part of the way there, but it still is incomplete. You still need to know which library and function. Python makes it almost as simple as possible, but it is as simple as possible within the domain of having code that is executed rather than question that is asked.

Computer literacy is not just about learning the limitations of the data, but also the seemingly arbitrary conventions of accessing it. We certainly need to learn better how to understand the limitations of various data. However, I don&#039;t think the end game is teaching everyone basic scripting. It is a problem to me if everyone needs to know how to code in order to ask computers for things would not require code if you asked a knowledgeable human being.

That is different than asking computers for things that only computer are good at, such as lists of answers, sorted and filtered. But between sites like Yahoo pipes and open.dapper.com we are getting closer to data feed nirvana.]]></description>
		<content:encoded><![CDATA[<p>@Joe, yeah &#8211; I think we&#8217;re almost on the same page. I have a caveat, though. </p>
<p>These queries still require exogenous knowledge that relates not to the data but to the way it is accessed. Python libraries are exogenous knowledge; they are simply there as a means to an end. The goal should be to absolutely minimize the need for additional such knowledge. Better syntax gets us part of the way there, but it still is incomplete. You still need to know which library and function. Python makes it almost as simple as possible, but it is as simple as possible within the domain of having code that is executed rather than question that is asked.</p>
<p>Computer literacy is not just about learning the limitations of the data, but also the seemingly arbitrary conventions of accessing it. We certainly need to learn better how to understand the limitations of various data. However, I don&#8217;t think the end game is teaching everyone basic scripting. It is a problem to me if everyone needs to know how to code in order to ask computers for things would not require code if you asked a knowledgeable human being.</p>
<p>That is different than asking computers for things that only computer are good at, such as lists of answers, sorted and filtered. But between sites like Yahoo pipes and open.dapper.com we are getting closer to data feed nirvana.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jon Udell</title>
		<link>http://blog.jonudell.net/2009/04/03/facts-and-friction/#comment-127216</link>
		<dc:creator><![CDATA[Jon Udell]]></dc:creator>
		<pubDate>Mon, 06 Apr 2009 11:37:54 +0000</pubDate>
		<guid isPermaLink="false">http://blog.jonudell.net/?p=1336#comment-127216</guid>
		<description><![CDATA[&gt; You know about the GoogleLookup function
&gt; in Google Spreadsheets, right?

Yes. In this case the answer for these two queries:

=GoogleLookup(&quot;Providence,RI&quot;,&quot;population&quot;)
=GoogleLookup(&quot;Boston&quot;,&quot;population&quot;)

is the same value: 2000.

The source is pages like:

http://quickfacts.census.gov/qfd/states/25/2507000.html

where GoogleLookup finds lines of text like:

Population, 2000]]></description>
		<content:encoded><![CDATA[<p>&gt; You know about the GoogleLookup function<br />
&gt; in Google Spreadsheets, right?</p>
<p>Yes. In this case the answer for these two queries:</p>
<p>=GoogleLookup(&#8220;Providence,RI&#8221;,&#8221;population&#8221;)<br />
=GoogleLookup(&#8220;Boston&#8221;,&#8221;population&#8221;)</p>
<p>is the same value: 2000.</p>
<p>The source is pages like:</p>
<p><a href="http://quickfacts.census.gov/qfd/states/25/2507000.html" rel="nofollow">http://quickfacts.census.gov/qfd/states/25/2507000.html</a></p>
<p>where GoogleLookup finds lines of text like:</p>
<p>Population, 2000</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Nick Lothian</title>
		<link>http://blog.jonudell.net/2009/04/03/facts-and-friction/#comment-127213</link>
		<dc:creator><![CDATA[Nick Lothian]]></dc:creator>
		<pubDate>Sun, 05 Apr 2009 23:11:46 +0000</pubDate>
		<guid isPermaLink="false">http://blog.jonudell.net/?p=1336#comment-127213</guid>
		<description><![CDATA[You know about the GoogleLookup function in Google Spreadsheets, right?

Some examples:

To insert the number of Internet users in Paraguay:
=GoogleLookup(&quot;Paraguay&quot;; &quot;internet users&quot;)
To insert the Earned Run Average of Roger Clemens:
=GoogleLookup(&quot;Roger Clemens&quot;; &quot;earned run average&quot;)

See http://docs.google.com/support/bin/answer.py?answer=54199]]></description>
		<content:encoded><![CDATA[<p>You know about the GoogleLookup function in Google Spreadsheets, right?</p>
<p>Some examples:</p>
<p>To insert the number of Internet users in Paraguay:<br />
=GoogleLookup(&#8220;Paraguay&#8221;; &#8220;internet users&#8221;)<br />
To insert the Earned Run Average of Roger Clemens:<br />
=GoogleLookup(&#8220;Roger Clemens&#8221;; &#8220;earned run average&#8221;)</p>
<p>See <a href="http://docs.google.com/support/bin/answer.py?answer=54199" rel="nofollow">http://docs.google.com/support/bin/answer.py?answer=54199</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Andrew Walkingshaw</title>
		<link>http://blog.jonudell.net/2009/04/03/facts-and-friction/#comment-127210</link>
		<dc:creator><![CDATA[Andrew Walkingshaw]]></dc:creator>
		<pubDate>Sat, 04 Apr 2009 20:20:36 +0000</pubDate>
		<guid isPermaLink="false">http://blog.jonudell.net/?p=1336#comment-127210</guid>
		<description><![CDATA[Jon wrote:
&gt; At what granularity? For example, is 
&gt; this URL...

Our URLs each point to a single time-series - they represent the value of a given quantity varying across time (plus any associated metadata). One of those pieces of metadata is the source (provenance) of the data - which for the case you highlighted is a spreadsheet from the European Central Bank containing the history of thirty-odd exchange rates.

So what we&#039;re doing is taking a document and transforming it into several addressable resources, putting a documented API onto the resources thus created, and building tools around that. 

Joe writes:
&gt; I hope that those tools expose their
&gt; innards so that if they turn out to not
&gt; have captured the right standardizations,
&gt; we can adapt them.
 
Our APIs are public and the data&#039;s downloadable, so although we by necessity impose a particular form on the data in Timetric, we&#039;d hope that ultimately it&#039;s no more siloed than it was up on the European Central Bank&#039;s website. Which form is more useful will more than likely depend on what one&#039;s trying to do.

Jon wrote:
&gt; Yes but time is a scarce resource, so a 
&gt; lot of questions go unasked/unanswered 
&gt; even by those who could do the fiddling.

Hopefully, the things services like ours can do - primarily through the API, in this context - help out in writing programs which give you the answers you&#039;re after, and provide facilities which let non-programmers get results they couldn&#039;t easily get otherwise. And as Joe writes, the boundaries between programmer and non-programmer are fuzzy; I used to be a computational physicist, and that&#039;s a field full of people who write programs but mostly wouldn&#039;t think of themselves as programmers. It&#039;s something they (sometimes) do, rather than something they are.]]></description>
		<content:encoded><![CDATA[<p>Jon wrote:<br />
&gt; At what granularity? For example, is<br />
&gt; this URL&#8230;</p>
<p>Our URLs each point to a single time-series &#8211; they represent the value of a given quantity varying across time (plus any associated metadata). One of those pieces of metadata is the source (provenance) of the data &#8211; which for the case you highlighted is a spreadsheet from the European Central Bank containing the history of thirty-odd exchange rates.</p>
<p>So what we&#8217;re doing is taking a document and transforming it into several addressable resources, putting a documented API onto the resources thus created, and building tools around that. </p>
<p>Joe writes:<br />
&gt; I hope that those tools expose their<br />
&gt; innards so that if they turn out to not<br />
&gt; have captured the right standardizations,<br />
&gt; we can adapt them.</p>
<p>Our APIs are public and the data&#8217;s downloadable, so although we by necessity impose a particular form on the data in Timetric, we&#8217;d hope that ultimately it&#8217;s no more siloed than it was up on the European Central Bank&#8217;s website. Which form is more useful will more than likely depend on what one&#8217;s trying to do.</p>
<p>Jon wrote:<br />
&gt; Yes but time is a scarce resource, so a<br />
&gt; lot of questions go unasked/unanswered<br />
&gt; even by those who could do the fiddling.</p>
<p>Hopefully, the things services like ours can do &#8211; primarily through the API, in this context &#8211; help out in writing programs which give you the answers you&#8217;re after, and provide facilities which let non-programmers get results they couldn&#8217;t easily get otherwise. And as Joe writes, the boundaries between programmer and non-programmer are fuzzy; I used to be a computational physicist, and that&#8217;s a field full of people who write programs but mostly wouldn&#8217;t think of themselves as programmers. It&#8217;s something they (sometimes) do, rather than something they are.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jon Udell</title>
		<link>http://blog.jonudell.net/2009/04/03/facts-and-friction/#comment-127209</link>
		<dc:creator><![CDATA[Jon Udell]]></dc:creator>
		<pubDate>Sat, 04 Apr 2009 20:03:02 +0000</pubDate>
		<guid isPermaLink="false">http://blog.jonudell.net/?p=1336#comment-127209</guid>
		<description><![CDATA[&gt; I boggle at the kinds of things 
&gt; “non-programmers” have achieved using 
&gt; Excel and cell formulas, and I have to
&gt;  believe that more people would be able to
&gt; learn to do those things if they could use
&gt; a more humane language to do them.

This is, by the way, the very motivation and rationale that&#039;s driving Oslo. It takes a hybrid approach to domain-specific languages, positing that people could actually create these for themselves, and then use them textually or graphically.]]></description>
		<content:encoded><![CDATA[<p>&gt; I boggle at the kinds of things<br />
&gt; “non-programmers” have achieved using<br />
&gt; Excel and cell formulas, and I have to<br />
&gt;  believe that more people would be able to<br />
&gt; learn to do those things if they could use<br />
&gt; a more humane language to do them.</p>
<p>This is, by the way, the very motivation and rationale that&#8217;s driving Oslo. It takes a hybrid approach to domain-specific languages, positing that people could actually create these for themselves, and then use them textually or graphically.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Joe Germuska</title>
		<link>http://blog.jonudell.net/2009/04/03/facts-and-friction/#comment-127208</link>
		<dc:creator><![CDATA[Joe Germuska]]></dc:creator>
		<pubDate>Sat, 04 Apr 2009 19:14:14 +0000</pubDate>
		<guid isPermaLink="false">http://blog.jonudell.net/?p=1336#comment-127208</guid>
		<description><![CDATA[Responding to both Bernie and to Andrew, I don&#039;t think we disagree all that much, and of course I haven&#039;t seen Wolfram Alpha...  but I think there has to be some kind of &quot;meeting in the middle.&quot;  People and our tools evolve in synergy.

I&#039;m basically talking about what Jon (and Jeannette Wing) have been framing around the idea of &quot;computational literacy.&quot;  One of the reasons I&#039;m enthusiastic about Python is that I think it is within reach of the &quot;computationally literate&quot; even if they don&#039;t think of themselves as programmers.  

I definitely laud the idea of building tools that standardize and guide people into doing things the right way -- but I hope that those tools expose their innards so that if they turn out to not have captured the right standardizations, we can adapt them.  I boggle at the kinds of things &quot;non-programmers&quot; have achieved using MS Excel and cell formulas, and I have to believe that more people would be able to learn to do those things if they could use a more humane language to do them.


Jon wrote:
&gt; it does make me very curious to see what will happen when 
&gt; a substantial core of curated data winds up in the Wolfram 
&gt; silo with a very robust set of tools and services 
&gt; wrapped around it.

Good point. Perhaps Wolfram will end up as the Adobe of data tools and de facto standards...]]></description>
		<content:encoded><![CDATA[<p>Responding to both Bernie and to Andrew, I don&#8217;t think we disagree all that much, and of course I haven&#8217;t seen Wolfram Alpha&#8230;  but I think there has to be some kind of &#8220;meeting in the middle.&#8221;  People and our tools evolve in synergy.</p>
<p>I&#8217;m basically talking about what Jon (and Jeannette Wing) have been framing around the idea of &#8220;computational literacy.&#8221;  One of the reasons I&#8217;m enthusiastic about Python is that I think it is within reach of the &#8220;computationally literate&#8221; even if they don&#8217;t think of themselves as programmers.  </p>
<p>I definitely laud the idea of building tools that standardize and guide people into doing things the right way &#8212; but I hope that those tools expose their innards so that if they turn out to not have captured the right standardizations, we can adapt them.  I boggle at the kinds of things &#8220;non-programmers&#8221; have achieved using MS Excel and cell formulas, and I have to believe that more people would be able to learn to do those things if they could use a more humane language to do them.</p>
<p>Jon wrote:<br />
&gt; it does make me very curious to see what will happen when<br />
&gt; a substantial core of curated data winds up in the Wolfram<br />
&gt; silo with a very robust set of tools and services<br />
&gt; wrapped around it.</p>
<p>Good point. Perhaps Wolfram will end up as the Adobe of data tools and de facto standards&#8230;</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jon Udell</title>
		<link>http://blog.jonudell.net/2009/04/03/facts-and-friction/#comment-127207</link>
		<dc:creator><![CDATA[Jon Udell]]></dc:creator>
		<pubDate>Sat, 04 Apr 2009 15:46:40 +0000</pubDate>
		<guid isPermaLink="false">http://blog.jonudell.net/?p=1336#comment-127207</guid>
		<description><![CDATA[&gt; how would you add your own facts?

We can only wait and see what the API will eventually enable. But initially, accepting input is a non-goal. Very counter-web-2.0, to be sure.

&gt; Timetric

I had read about that here: http://seanmcgrath.blogspot.com/2009/03/take-look-at-timetric.html

My first thought was: Hmm. GeoCommons wants me to upload upload data so we can understand its spatial dimension. Timetric wants me to upload data so we can work with the temporal dimension. In many cases it&#039;ll be the same data, but now analysis and curation will be happening in silos. 

There&#039;s no easy answer to this. But it does make me very curious to see what will happen when a substantial core of curated data winds up in the Wolfram silo with a very robust set of tools and services wrapped around it.

&gt; permanent URLs for given bits of data

At what granularity? For example, is this URL:

http://timetric.com/series/K63mz40KTzy3TjGdyXEoVA/

your alias for the source:

http://www.ecb.int/stats/eurofxref/eurofxref-hist.zip

Or for your transformation of the source? 

What is the atomic/versionable unit of reference? The datum? The row or column? The matrix? The collection?

&gt; as programmers, we can solve most of
&gt; the problems we think about, even if
&gt; it’s annoying and fiddly

Yes but time is a scarce resource, so a lot of questions go unasked/unanswered even by those who could do the fiddling.]]></description>
		<content:encoded><![CDATA[<p>&gt; how would you add your own facts?</p>
<p>We can only wait and see what the API will eventually enable. But initially, accepting input is a non-goal. Very counter-web-2.0, to be sure.</p>
<p>&gt; Timetric</p>
<p>I had read about that here: <a href="http://seanmcgrath.blogspot.com/2009/03/take-look-at-timetric.html" rel="nofollow">http://seanmcgrath.blogspot.com/2009/03/take-look-at-timetric.html</a></p>
<p>My first thought was: Hmm. GeoCommons wants me to upload upload data so we can understand its spatial dimension. Timetric wants me to upload data so we can work with the temporal dimension. In many cases it&#8217;ll be the same data, but now analysis and curation will be happening in silos. </p>
<p>There&#8217;s no easy answer to this. But it does make me very curious to see what will happen when a substantial core of curated data winds up in the Wolfram silo with a very robust set of tools and services wrapped around it.</p>
<p>&gt; permanent URLs for given bits of data</p>
<p>At what granularity? For example, is this URL:</p>
<p><a href="http://timetric.com/series/K63mz40KTzy3TjGdyXEoVA/" rel="nofollow">http://timetric.com/series/K63mz40KTzy3TjGdyXEoVA/</a></p>
<p>your alias for the source:</p>
<p><a href="http://www.ecb.int/stats/eurofxref/eurofxref-hist.zip" rel="nofollow">http://www.ecb.int/stats/eurofxref/eurofxref-hist.zip</a></p>
<p>Or for your transformation of the source? </p>
<p>What is the atomic/versionable unit of reference? The datum? The row or column? The matrix? The collection?</p>
<p>&gt; as programmers, we can solve most of<br />
&gt; the problems we think about, even if<br />
&gt; it’s annoying and fiddly</p>
<p>Yes but time is a scarce resource, so a lot of questions go unasked/unanswered even by those who could do the fiddling.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Andrew Walkingshaw</title>
		<link>http://blog.jonudell.net/2009/04/03/facts-and-friction/#comment-127206</link>
		<dc:creator><![CDATA[Andrew Walkingshaw]]></dc:creator>
		<pubDate>Sat, 04 Apr 2009 15:08:36 +0000</pubDate>
		<guid isPermaLink="false">http://blog.jonudell.net/?p=1336#comment-127206</guid>
		<description><![CDATA[.... argh. I must have hit submit at the wrong time. Sorry!

anyway, what I was saying: focussing on access neglects the possibility of making the data usful to as many people as possible once it&#039;s there. That, too, needs work in a couple of different directions, I reckon: in the systems we use to help people find the data, and in the systems we use to let people transform and analyse it. 

So I&#039;m intrigued by Wolfram Alpha because it merges both these things into a natural-language-like interface: &quot;find these facts, then do this bit of computation on them, and tell me the answer&quot;. It&#039;s the world&#039;s weirdest DSL, if you want to look at it that way. But the downside is that it looks like it needs a lot of the knowledge &quot;baked in&quot; - how would you add your own facts?

The startup I&#039;m working on, http://timetric.com/, takes a bit of a different approach. On the face of it, it&#039;s less ambitious than what Wolfram and co. are aiming to do, because we&#039;ve restricted the domain of facts we deal with we do numbers and their variation over time - time series, though you can always express constants as values which never change - and built an infrastructure with three important bits: permanent URLs for given bits of data, tools for finding and viewing data and uploading new series, and a calculation engine for deriving new time series from data already there.

One important thing we share with Wolfram Alpha, though, and which maybe speaks to your idea of data friction, is the idea that it&#039;s about expressing the computations people want to do in a way they find immediately familiar. Achieving that means leveraging a language people already know. Wolfram Alpha&#039;s using natural language; we&#039;re using a formula language which looks like the one people know from spreadsheets. Both of these are accessible to non-programmers - any answer which starts with &quot;learn Python&quot; has to be the wrong one somehow. Take the geopy example above: as programmers, we can solve most of the problems we think about, even if it&#039;s annoying and fiddly and involves writing altogether too many little parsers. But there are many more people who don&#039;t program at all, and making data much more useful is going to mean writing tools for them.]]></description>
		<content:encoded><![CDATA[<p>&#8230;. argh. I must have hit submit at the wrong time. Sorry!</p>
<p>anyway, what I was saying: focussing on access neglects the possibility of making the data usful to as many people as possible once it&#8217;s there. That, too, needs work in a couple of different directions, I reckon: in the systems we use to help people find the data, and in the systems we use to let people transform and analyse it. </p>
<p>So I&#8217;m intrigued by Wolfram Alpha because it merges both these things into a natural-language-like interface: &#8220;find these facts, then do this bit of computation on them, and tell me the answer&#8221;. It&#8217;s the world&#8217;s weirdest DSL, if you want to look at it that way. But the downside is that it looks like it needs a lot of the knowledge &#8220;baked in&#8221; &#8211; how would you add your own facts?</p>
<p>The startup I&#8217;m working on, <a href="http://timetric.com/" rel="nofollow">http://timetric.com/</a>, takes a bit of a different approach. On the face of it, it&#8217;s less ambitious than what Wolfram and co. are aiming to do, because we&#8217;ve restricted the domain of facts we deal with we do numbers and their variation over time &#8211; time series, though you can always express constants as values which never change &#8211; and built an infrastructure with three important bits: permanent URLs for given bits of data, tools for finding and viewing data and uploading new series, and a calculation engine for deriving new time series from data already there.</p>
<p>One important thing we share with Wolfram Alpha, though, and which maybe speaks to your idea of data friction, is the idea that it&#8217;s about expressing the computations people want to do in a way they find immediately familiar. Achieving that means leveraging a language people already know. Wolfram Alpha&#8217;s using natural language; we&#8217;re using a formula language which looks like the one people know from spreadsheets. Both of these are accessible to non-programmers &#8211; any answer which starts with &#8220;learn Python&#8221; has to be the wrong one somehow. Take the geopy example above: as programmers, we can solve most of the problems we think about, even if it&#8217;s annoying and fiddly and involves writing altogether too many little parsers. But there are many more people who don&#8217;t program at all, and making data much more useful is going to mean writing tools for them.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Andrew Walkingshaw</title>
		<link>http://blog.jonudell.net/2009/04/03/facts-and-friction/#comment-127205</link>
		<dc:creator><![CDATA[Andrew Walkingshaw]]></dc:creator>
		<pubDate>Sat, 04 Apr 2009 14:54:19 +0000</pubDate>
		<guid isPermaLink="false">http://blog.jonudell.net/?p=1336#comment-127205</guid>
		<description><![CDATA[The key thing here, to my mind at least, is that data is only any use if you can do something with it.

What most people miss, though, is that there are two variables there, and one of them is who &quot;you&quot; is.
The implication of that is that there are two ways to make data more useful. Most of us have focussed on access - making the data more available, which is all well and good, but]]></description>
		<content:encoded><![CDATA[<p>The key thing here, to my mind at least, is that data is only any use if you can do something with it.</p>
<p>What most people miss, though, is that there are two variables there, and one of them is who &#8220;you&#8221; is.<br />
The implication of that is that there are two ways to make data more useful. Most of us have focussed on access &#8211; making the data more available, which is all well and good, but</p>
]]></content:encoded>
	</item>
</channel>
</rss>

