<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Tim Howgego &#187; Google</title>
	<atom:link href="http://timhowgego.com/category/google/feed" rel="self" type="application/rss+xml" />
	<link>http://timhowgego.com</link>
	<description>Thoughts, Ideas, Analysis</description>
	<lastBuildDate>Tue, 24 Aug 2010 08:58:54 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Iterative Video Development</title>
		<link>http://timhowgego.com/iterative-video-development.html</link>
		<comments>http://timhowgego.com/iterative-video-development.html#comments</comments>
		<pubDate>Thu, 12 Nov 2009 23:10:35 +0000</pubDate>
		<dc:creator>Tim Howgego</dc:creator>
				<category><![CDATA[Analysis]]></category>
		<category><![CDATA[Design]]></category>
		<category><![CDATA[El]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Innovation]]></category>
		<category><![CDATA[Machinima]]></category>
		<category><![CDATA[Video]]></category>

		<guid isPermaLink="false">http://timhowgego.com/?p=144</guid>
		<description><![CDATA[The internet allows products and services to be rapidly improved based on user feedback. So rapid, that iterative design should become the primary method of designing internet-based services. Not just as an Agile-like method of working, but as a method of specifying the product itself.
Partly it isn&#8217;t because creators haven&#8217;t adjusted their methods to match [...]]]></description>
			<content:encoded><![CDATA[<p>The internet allows products and services to be rapidly improved based on user feedback. So rapid, that iterative design <em>should</em> become the primary method of designing internet-based services. Not just as an <a href="http://agilemanifesto.org/" title="External link: Manifesto for Agile Software Development.">Agile</a>-like method of working, but as a method of specifying the product itself.</p>
<p>Partly it isn&#8217;t because creators haven&#8217;t adjusted their methods to match the new technology &#8211; we&#8217;re still wedded to a single start-to-finish process, with one outcome at the end. Partly it isn&#8217;t because feedback can be hard to gather and digest, and even hard to act upon.</p>
<p>An iterative method has become one of the defining characteristics of how I like to write, organise, and present text on the internet. At least, beyond this domain. But until now, I&#8217;ve struggled to apply it to internet-based video.</p>
<p>This article introduces internet-based iterative design, and uses YouTube&#8217;s &#8220;Hot Spot&#8221; analysis to show how we can start to apply an iterative approach to video and movie-making. <span id="more-144"></span></p>
<h3>Iterative Product Development</h3>
<p>The author of a published paper book generally gets one shot. One chance to have their works committed to paper. To be read by milllions. Or tens. A huge amount of effort goes into &#8220;getting it right&#8221;: Construction of text and story, editing, proof-checking. And in spite of this, book publishing remains a high-risk activity: For every top-selling author, there are others whose work ends up as pulp.</p>
<p>In contrast, the cost of making corrections or changes on the internet can be minor. At the extreme, the author simply types some new words. An update that might have taken a book publisher months or years, can be committed in minutes.</p>
<p>The ability to make rapid changes in response to rapidly gathered feedback makes the internet interesting: The most basic server access logs can be analysed to reveal that chapter 2 is generated much more interest than chapter 1, yet chapter 4 is hardly getting read. With enough readers, those patterns can be seen in days, or even hours. So perhaps the content in chapter 2 should be expanded, and we should re-write chapter 3 to better maintain interest?</p>
<p>Now we&#8217;ve expanded chapter 2, and noticed it has become even more popular. Obviously there&#8217;s a greater demand for the writing or information in chapter 2 than the author thought there was. Gradually the content evolves and gravitates towards (in the language of entrepreneurs) the nearest unserved market. Iterative product development isn&#8217;t just about &#8220;making it better&#8221;. It&#8217;s a way of finding an audience, customers, earnings.</p>
<p>The written word is an easy example to understand, but maybe all good design iterates in response to user feedback?</p>
<p>Probably always has. Stone wheels? Computers <a href="http://en.wikipedia.org/wiki/Honeywell_316" title="External link: Wikipedia - Honeywell 316.">sold as recipe books</a>? Especially where <em>a cool technology is looking for a problem</em>: An inventor that doesn&#8217;t start by trying to address a problem, but merely discovers a method of doing something. <a href="http://en.wikipedia.org/wiki/Post-it_note" title="External link: Wikipedia - Post-It note.">Post-It notes</a> are a popular example, but this pattern is common from the Victorian era onward. For example, it took <a href="http://www.capsu.org/history/" title="Capsule Pipelines - History.">half a century</a> of different people trying to use pressurized air for land-based transport propulsion, before a market niche was established.</p>
<p>The internet allows this process to happen a lot faster, but only if the presence of the internet becomes integral to the design process.</p>
<p>Personally, this methodology has turned a few highly technical pages on the mechanics of fishing, into a <a href="http://www.elsanglin.com/" title="El's Extreme Anglin' - World of Warcraft Fishing Guide.">fishing guide</a> read by millions (see <a href="#practice" title="Jump to section: Appendix: Iterative Writing in Practice.">Appendix: Iterative Writing in Practice</a> at the bottom).</p>
<h3>Limitations</h3>
<p>Iterative product development isn&#8217;t a panacea. Or a free ride to perfection and untold riches:</p>
<ul class="spacedlist">
<li>While internet-based products and services are comparatively (to manufactured goods) cheap and easy to change, constantly making changes can become very time-consuming. Designing with the expectation of change, helps. But ultimately you will reach a point where further changes don&#8217;t generate enough extra audience interest to (financially) justify the time spent making the changes. This is when to stop.</li>
<li>There is no guarantee that your product will find the <em>absolute</em> biggest unserved market, merely the biggest such market near to the topic/interest area you started with. If you started developing an idea in an obscure niche, it&#8217;s not realistic to expect to grow outside of that niche.</li>
<li>Iterative development is not an excuse to design garbage. Quality remains important: The first attempt has to be sufficiently &#8220;good&#8221; for enough people to use/read it to generate feedback.</li>
<li>The process of analysing feedback, and developing new content, requires 2 distinct skill-sets. Someone that is good at the second, may be unable to do the first. An instinctively good designer might still produce a better product, although (I would argue) their method leaves more to chance.</li>
<li>It&#8217;s not clear that this method could be applied to an entirely physical product &#8211; <a href="http://news.bbc.co.uk/1/hi/business/3086669.stm" title="External link: BBC - Store Wars: Fast Fashion.">Zara&#8217;s version of fast fashion</a> is a good example, yet customer feedback still takes <em>weeks</em> to filter through into new clothing.</li>
</ul>
<h3>Video</h3>
<p>Most internet-based text content is easy. Changes can be made and distributed in seconds. Reasonably good feedback is available using tools like <a href="http://www.google.com/analytics/" title="External link: Google Analytics.">Google Analytics</a>.</p>
<p>Video content poses a few problems:</p>
<ul>
<li>Minor changes require re-rendering and uploading, which (for anyone without Hollywood-scale production facilities) can take several hours, even for just a few minutes of video footage.</li>
<li>Major changes mean re-filming, editing, sound design, and similar alterations that could take days. May not even be possible, if showing specific events or people.</li>
<li>Detailed feedback is hard to get. At best you&#8217;ll get a reaction to the whole video &#8211; typically a number of people that watched, and the rating or comments of a tiny proportion of them.</li>
</ul>
<p>The first 2 problems aren&#8217;t going away anytime soon. The best defense is to save <em>all</em> the footage you shot, including materials that didn&#8217;t make the final edit. Recording at full 1280&#215;720 pixel resolution, 30 frames per second, I find that for each minute of the final movie:</p>
<ul>
<li>I shot about 20<abbr title="GigaByte">GB</abbr> of footage,</li>
<li>take about 10<abbr title="GigaByte">GB</abbr> to the editing stage, and</li>
<li>use about 5<abbr title="GigaByte">GB</abbr> in the final version.</li>
</ul>
<p>So I&#8217;ve already created a lot of redundancy &#8211; until I run out of hard disk space. That redundancy helps make minor changes, such as altering the length of a scene, but it won&#8217;t let you re-write the story or change the location.</p>
<p>However, small edit tweaks can make the difference between &#8220;good and great&#8221;, so some iteration is possible within video. In theory. The problem is that without detailed user feedback, how do we know what to improve?</p>
<p>A friend who worked in &#8220;new media&#8221; when it was new (in the mid-1990s), used say, &#8220;the skill was to know when to stop&#8221;. To misquote Damien Hirst, &#8220;a painting is finished after a long period of looking at it, during which nothing is added&#8221;. Personally, video editing involves a lot the later: Constantly replaying a rough version and making changes, until I start making adjustments that seem to make it worse again.</p>
<p>Unfortunately, the creator is not the audience. It&#8217;s easy for them to produce things that simply don&#8217;t appeal to anyone apart from themselves, don&#8217;t solve whatever problem their audience were having, or don&#8217;t appeal to viewers&#8217; emotions.</p>
<h3>Hot Spots</h3>
<p>Which is why I find <a href="http://googleblog.blogspot.com/2008/09/your-youtube-video-hot-or-not.html" title="External link: Google - Your YouTube video: Hot or Not?">YouTube&#8217;s Hot Spots</a> fascinating.</p>
<p>As often, it started by accident. I couldn&#8217;t upload <a href="http://www.youtube.com/watch?v=2e5HwJx3fyg" title="External link: You-Tube - Kalu'ak Fishing Derby.">the video below</a> to the host I normally use for embedded video. So the YouTube version of the video became the primary version. Almost all of the video&#8217;s 10,000 daily views were hosted on YouTube. This meant that within a day, YouTube&#8217;s &#8220;Insight&#8221; analytics were displaying some representative data about how users were viewing the movie.</p>
<p><object width="640" height="385"><param name="movie" value="http://www.youtube.com/v/2e5HwJx3fyg&amp;hl=en&amp;fs=1&amp;hd=1"></param><param name="allowFullScreen" value="true"></param><param name="allowscriptaccess" value="always"></param><embed src="http://www.youtube.com/v/2e5HwJx3fyg&amp;hl=en&amp;fs=1&amp;hd=1" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="640" height="385"></embed></object></p>
<p>The video is primarily a tutorial, intended to introduce <a href="http://www.elsanglin.com/kaluak_fishing_derby.html" title="Kalu'ak Fishing Derby.">the new World of Warcraft fishing contest</a>. All the footage is captured in-game &#8211; some of it &#8220;live&#8221; during the contest, some recorded afterwards. The whole movie was conceived, scripted, filmed, edited, and rendered over the course of 2 days.</p>
<p>Fishing is a good test, because it isn&#8217;t a terribly interesting thing to watch. It&#8217;s hard to make a &#8220;good&#8221; fishing video, especially for an audience that aren&#8217;t all <em>hardcore</em> anglers.</p>
<p>The video has been favourably rated, comments are complementary and (critically) not (yet) asking questions that the video was intended to answer. Plus a few other sites have embedded it. Good start, but could it be better?</p>
<h3>Frame-Based Feedback</h3>
<p>Below is the &#8220;Hot Spot&#8221; graph generated from the first 20,000 views. YouTube&#8217;s explanation of the measures:</p>
<blockquote><p>The ups-and-downs of viewership at each moment in your video, compared to videos of similar length. The higher the graph, the hotter your video: fewer viewers are leaving your video and they may also be rewinding to watch that point in the video again. Audience attention is an overall measure of your video&#8217;s ability to retain its audience.</p></blockquote>
<p>The base (x) axis shows the time the video has been running &#8211; it lasts 2 minutes and 2 seconds.</p>
<p><img src="http://timhowgego.com/files/kaluak_youtube_hot_spots.jpg" width="370" height="318" alt="Kalu'ak Fishing Derby YouTube Hot Spot Graph." title="Kalu'ak Fishing Derby YouTube Hot Spot Graph - read on for explanation." /></p>
<p>Let&#8217;s try and analyse what the graph shows. Numbers refer to points marked on the graph:</p>
<ol class="numberlist">
<li id="point_1">I added 12 seconds of introduction and title, primarily to give the viewer time focus, adjust the volume or screen size, or let any navigation/control widgets fade away. There are no credits &#8211; this is a 2 minute tutorial, not a feature film. Unfortunately there are several ways to read the initial downward decline:
<ul>
<li>Viewers think they missed something at the very start, so are restarting the video (the first sound is triggered while the screen is still black).</li>
<li>Viewers have observed the water, net, and title, and didn&#8217;t want to still be observing it 5 seconds later &#8211; they&#8217;re getting bored and leaving.</li>
<li>Some viewers started the video by accident, and never intended to watch, however good or otherwise.</li>
</ul>
<p> I compared this video to the graph for <a href="http://www.youtube.com/watch?v=QFqDPtrbWzY" title="External link: You-Tube - Dalaran Fishing.">an earlier YouTube-hosted video</a>. It has the same structure of introduction, but a longer, less focused (a wide, familiar city-scape) initial image. That second video was much <em>colder</em> over the first 10 seconds. So perhaps introductory shots need to be shorter, and move to &#8220;the action&#8221; faster?</li>
<li id="point_2">The titles are gone, and the video moves to a slow-paced tutorial, with gentle text and rather repetitive scenes. If you watch, the video, you&#8217;ll notice a lot of views of the same Walrus-like character, with a lot of gnomes (the pink-haired creature) casting a line or catching a fish. The graph says that&#8217;s &#8220;ok&#8221;, but remains far from hot. In contrast, the comparison video performs better at this stage. The main difference is that the comparision video moves between topics faster, with less repetition of similar-looking scenes. There are a few reasons for repetition: The gradual building of momentum (ever faster scene changes &#8211; see next point) was intended to create the sense of excitement that these &#8220;first player wins&#8221; competitions create, but I don&#8217;t think it works. I also wanted to show that the shark (the aim of the contest) could be caught in lots of different places. Past videos have lead players to conclude that only the one precise place shown in the video was valid. Overall, this stage should drag a lot less than it does, and if possible, be made less repetitive.</li>
<li id="point_3">The video gradually builds momentum, until by point 3, the scenes are changing at the rate of around 2 per second (slightly more by the end of the sequence). Audience engagement warms. Possibly this is &#8220;exciting&#8221;. Possibly too exciting, forcing some viewers to re-wind because they cannot digest the scenes fast enough?</li>
<li id="point_4">The top of this second rise in temperature is marked by the 3 bangs and flashes, cutting to blurred, greyscale, slow-motion sequences. If those don&#8217;t make you look, nothing will! In the video&#8217;s narrative this is the first time <em>something happens</em>: The gnome caught the shark, and is now running home, desperate to get back first. It&#8217;s one of those heart-stopping moments (and in the original storyboard, was intended to use heartbeats). Cool. That is, hot. But worth noting that special effects alone can provide a negative distraction. For example, the comparison video&#8217;s coldest moment is when a sequence of quotes and images of their who said them, are merged together into beautiful blue water. Looks great artistically, but doesn&#8217;t engage the audience.</li>
<li id="point_5">The heat is maintained while the first prize is displayed. This may be because the tempo of the video doesn&#8217;t slow down enough to let viewers digest everything (I had to win quickly, so making sure I had enough footage wasn&#8217;t a priority&#8230;). It may also reflect greater interest in one of the prizes (the ring). Either way, this section should have been longer.</li>
<li id="point_6">Contest won, interest is dropping. The 6th point occurs when the runner-up prize is displayed. Sadly, the high-point of the story is &#8220;the win&#8221;, yet the tutorial aspect of the video has to cover <em>not</em> winning. And chronologically, not winning happens after someone has won!</li>
<li id="point_7">We&#8217;re ending on a low, which can&#8217;t be good. I suspect this is because there isn&#8217;t much interest in &#8220;the boots&#8221; among many players. Perhaps I shouldn&#8217;t have dedicated 10 seconds to showing them being used, when the main aim of the video (how to win the contest) was clearly complete? The final giggle was an attempt to liven this section up a little, but comes too late.</li>
</ol>
<p>The Hot Spots graph doesn&#8217;t show everything. Doesn&#8217;t reflect any variation between different people viewing it. It may not even be desirable to keep a movie &#8220;hot&#8221; throughout. There are almost certainly other ways of analyzing viewer behavior.</p>
<p>But areas for improvement emerged that were not seen when making the video. Even if I don&#8217;t re-make this particular video, some of improvements will hopefully filter down into new videos.</p>
<h3>Yes, But</h3>
<p>It&#8217;s easy to dismiss this as an expensive training exercise: Wouldn&#8217;t it be better just to ensure movie-makers were experienced before they started? If all of them turned out <a href="http://www.imdb.com/title/tt0034583/" title="External link: IMDB - Casablanca (1942).">Casablanca</a>, I&#8217;d agree. In reality, expertise does not mean infallibly. While YouTube is almost infamous for showing how apparently (to my eyes) terrible videos can be <a href="http://www.youtube.com/videos" title="External link: YouTube - Most Viewed.">highly popular</a>. </p>
<p>It would be great to think that Saturday night&#8217;s cinema audience might see a slightly better version of a film than Friday night&#8217;s audience, based on what the first audience enjoyed most. But not terribly practical. Similarly, television news might be history before the second iteration.</p>
<p>But down at &#8220;YouTube level&#8221; iterative production methods start to become more viable. Still tricky, but something that only took 2 days to initially create, can probably be remade daily, if required.</p>
<p>This isn&#8217;t just about rapid re-production. More important is the ability to start to read the minds of an audience the creator can never see. Try to assess what aspects of the video should be expanded. What the audience want, but are only partly getting. And to do that analytically, without the movie-maker ever meeting their audience.</p>
<p>At the extreme, it&#8217;s the introduction of almost scientific methodology into an artistic process, traditionally based around the artist&#8217;s opinion of their own work, and their experiences to date.</p>
<p>Most intriguing is that &#8220;the next&#8221; Steven Spielberg (or similar) probably isn&#8217;t learning their art with <a href="http://en.wikipedia.org/wiki/Steven_Spielberg" title="External link: Wiwkipedia - Steven Spielberg.">an amateur 8mm camera</a>. They&#8217;ll be uploading camera-phone videos, animating Lego, or &#8220;<abbr title="Creating Machinima.">machinimating</abbr>&#8221; goblins. And there&#8217;s a chance they will start learning to use the analytical feedback available to them, in a way older generations never could&#8230;</p>
<h3>Learn More</h3>
<ul>
<li><a href="http://eu.techcrunch.com/2009/08/30/the-long-lost-formula-for-start-up-success-no-really/" title="External link: Techcrunch - The long lost formula for start-up success. No, really.">The long lost formula for start-up success. No, really</a> &#8211; Nigel Eccles (an Edinburgh acquaintance, although I don&#8217;t think we&#8217;ve ever discussed this) wrote something similar about iterative development. That we&#8217;re thinking alike isn&#8217;t entirely unexpected, since we&#8217;re both rather analytically-minded.</li>
<li><a href="http://alistair.cockburn.us/Incremental+versus+iterative+development" title="External link: Alistair Cockburn - Incremental versus iterative development.">Incremental versus iterative development</a> &#8211; Useful clarification of 2 often-confused terms, by Alistair Cockburn.</li>
<li><a href="http://en.wikipedia.org/wiki/Iterative_and_incremental_development" title="External link: Wikipedia - Iterative and incremental development.">Wikipedia</a> &#8211; Introduces various similar software-orientated methods.</li>
</ul>
<h3 id="practice">Appendix: Iterative Writing in Practice</h3>
<p>How did <a href="http://www.elsanglin.com/" title="El's Extreme Anglin' - World of Warcraft Fishing Guide.">one small gnome</a> attract so many readers, in spite of initially writing about <em>the wrong thing</em>? A healthy chunk of curiosity helps: Search and you may find things that work even better. But since you asked, consider this:</p>
<ul class="spacedlist">
<li>Split text into separate pages (not like this article): It is far easier to trace and monitor page views, than to work out where on a page a reader is reading.</li>
<li>Write around the edges of your topic: Both broader introductions and more specific detail than your core starting material. If the introductory material becomes more popular than the core, expand that introduction, and so on.</li>
<li>Understand who is trying to read: Specifically their education, age, time-pressure &#8211; and write to a style and length that this audience can read.</li>
<li>Watch what users do: Extensive forum discussions or comments are subtle indicators of what you need to offer. See what users do in the absence of anything you&#8217;ve written. A 200-post forum discussion about something you thought was trivial, clearly isn&#8217;t.</li>
<li>Personalise it: The internet is a scary place, and you don&#8217;t help ease that fear by presenting words as a robot.</li>
</ul>
<p>That isn&#8217;t the whole story. And there are many techniques within that. Remember that all the <a href="http://www.useit.com/" title="External link: Useit.com.">basic design guidelines</a> on things like the structuring of text still apply.</p>
<p>In my opinion, it&#8217;s an on-going exercise in the discovery of the fact that most people aren&#8217;t like you, and have different problems and needs. Logical, really: If you write <em>for yourself</em>, you&#8217;ve optimized the text for people like you. Yet people like you write&#8230; and so have the least need of someone else&#8217;s writing!</p>
]]></content:encoded>
			<wfw:commentRss>http://timhowgego.com/iterative-video-development.html/feed</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Social Reconstruction of Public Transportation Information</title>
		<link>http://timhowgego.com/social-reconstruction-of-public-transportation-information.html</link>
		<comments>http://timhowgego.com/social-reconstruction-of-public-transportation-information.html#comments</comments>
		<pubDate>Wed, 09 Jul 2008 00:31:50 +0000</pubDate>
		<dc:creator>Tim Howgego</dc:creator>
				<category><![CDATA[Analysis]]></category>
		<category><![CDATA[Bus]]></category>
		<category><![CDATA[Collaboration]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Information Management]]></category>
		<category><![CDATA[Journey Planner]]></category>
		<category><![CDATA[Public Transport]]></category>
		<category><![CDATA[Rail]]></category>
		<category><![CDATA[Web 2.0]]></category>
		<category><![CDATA[government]]></category>
		<category><![CDATA[micro-blogging]]></category>

		<guid isPermaLink="false">http://timhowgego.com/?p=56</guid>
		<description><![CDATA[The UK&#8217;s local public transport data is effectively a closed dataset. The situation in the US seems similar: In spite of the benefits only a handful of agencies have released raw data freely (such as BART and TriMet on the west coast of America).
That hasn&#8217;t stopped &#8220;screen-scraping&#8221; of data or simply typing in paper timetables [...]]]></description>
			<content:encoded><![CDATA[<p>The <abbr title="United Kingdom">UK</abbr>&#8217;s <a href="http://timhowgego.com/introduction-to-uk-local-public-transport-data.html" title="Introduction to UK Local Public Transport Data">local public transport data</a> is effectively a closed dataset. The situation in the <abbr title="United States">US</abbr> seems similar: In spite of <a href="http://headwayblog.com/2008/02/21/trimet-data-sharing-talk/" title="External link: Headway: Landmark talk on data sharing by TriMet's Tim McHugh.">the benefits</a> only a <a href="http://code.google.com/p/googletransitdatafeed/wiki/PublicFeeds" title="External link: Google Transit - PublicFeeds.">handful of agencies</a> have released raw data freely (such as <a href="http://bart.gov/schedules/developers/" title="External link: BART for Developers.">BART</a> and <a href="http://developer.trimet.org/" title="External link: TriMet Developer Resources.">TriMet</a> on the west coast of America).</p>
<p>That hasn&#8217;t stopped &#8220;screen-scraping&#8221; of data or simply typing in paper timetables (from <a href="http://urbanmapping.com/" title="External link: Urban Mapping.">Urban Mapping</a> to <a href="http://headwayblog.com/wiki/index.php?title=Category:Third-Party_Sites" title="External link: Headway Wiki - Third-Party Sites.">many listed here</a>). Unfortunately, the legal basis for scraping is complex, which creates significant risks for anyone building a business. For example, earlier this year, airline <a href="http://www.ryanair.com/" title="External link: Ryanair.">Ryanair</a> requested the removal of all their data from <a href="http://www.skyscanner.net/" title="External link: Skyscanner.">Skyscanner</a>, a flight price comparison site that gathers data by scraping airlines&#8217; websites. How many airlines would need to object to <em>their</em> data being scraped before a &#8220;price comparison&#8221; service becomes unusable?</p>
<p>User-generated mapping content is evolving, often to circumvent <a href="http://www.freeourdata.org.uk/" title="External link: Free Our Data.">restrictive distribution of national mapping</a>. Services include <a href="http://www.openstreetmap.org/" title="External link: OpenStreetMap.">OpenStreetMap</a> and the <a href="http://google-latlong.blogspot.com/2008/06/making-your-mark-on-world.html" title="External link: Lat Long Blog - Making your mark on the world.">recently announced</a> Google <a href="http://www.google.com/mapmaker" title="External link: Google Map Maker.">Map Maker</a>.</p>
<p>Micro-blogging, primarily through <a href="http://twitter.com/" title="External link: Twitter.">Twitter</a>, has started to show the potential of individual travellers to report information about their journeys: <a href="http://open.zenjiweb.com/" title="External link: Zenji Open Projects.">Ron Whitman</a>&#8217;s <a href="http://www.commuterfeed.com/" title="External link: Commuter Feed.">Commuter Feed</a> is a good example. <a href="http://tommorris.org/" title="External link: Tom Morris.">Tom Morris</a> has also experimented with London Twitter feeds.</p>
<p>This article outlines why the &#8220;social web&#8221;/tech-entrepreneur sector may wish to stop trying to use official sources of data, and instead apply the <em>technology</em> it understands best: People. <span id="more-56"></span></p>
<h3>The Big Picture</h3>
<p>I will use the example of UK local bus data to summarise the strategic issues for data providers. I can only presume the issues are similar elsewhere (comments welcome).</p>
<p>Explaining exactly who the data providers are is one of the many problems of trying to extract and use the data. I would provide more detail, but the topic is somewhat sensitive. The most critical point in the chain that constructs and distributes the data are local authorities &#8211; sub-regional public bodies, typically those responsible for large cities, conurbations or counties. They process the data, but are not under any statutory requirement to do so (no national government legislation requires it).</p>
<p>There are a number of issues for the existing data providers:</p>
<ol class="numberlist">
<li><strong>Mindset of centralised control</strong>: Most operators, public authorities, and other agencies, still have a mindset of centralised control of information, delivered to users via the method the agency believes is appropriate. This is heavily driven by the belief that only the agency can be accountable or impartial, and that incorrect information supplied by an uncontrolled third party is likely to damage the image of local transport service and generally reflect badly on the agency.</li>
<li><strong>Mindset of local</strong>: Most agencies are locally focused, locally orientated. It seems logical for them to commission a fully-functioning website or piece of information delivery software that is specific to their city, because their target market is local. There&#8217;s a lack of global perspective: An agency will typically commission a system that is specific to their city, even when 95% of the features would work for any city, and 90% are already in existing global products.</li>
<li><strong>Not appreciating trends in delivery channels</strong>: There is still an attitude of &#8220;we&#8217;ll provide a website&#8221;, without a comprehension that the number of channels for delivery of information is exploding far faster than any one agency can hope to construct bespoke user interfaces to cater for. Mobile devices, integration into social software. There would probably a market for a &#8220;WiFi-enabled&#8221; alarm clock that would ring later if your morning train had been delayed: We simply can&#8217;t define the limits for how this information might be used.</li>
<li><strong>Not appreciating trends in cost</strong>: Even large, well-funded agencies are starting to fall behind the technology. The cost of systems (many millions of dollars invested year on year in some cases) is starting to hurt. Logically the global system should win out, because one city is very much like another: There is considerable scope for sharing systems costs.</li>
</ol>
<h3>What It Means</h3>
<p>Long term we are heading for global providers of information, that pool data from local sources. That will be forced by the cost of technology. This can be seen in technology costs driving things like agglomeration in the groceries sector (such as Walmart) over the last 30 years. Also in the move from customised mainframe computing, to shared operating systems and platforms (such as Windows). This will be worse, because the number of systems will be simultaneously exploding alongside the complexity of those systems.</p>
<p>As these issues become progressively better understood, data will become more centralised. Even in agencies where (in my opinion) uniqueness and absolute control are culturally in-breed, such as London Transport/<abbr title="Transport for London">TfL</abbr>, cost will eventually win the argument.</p>
<p>However, centralised data handling does not automatically make the data open. Quite the opposite.</p>
<h3>Contracted Provision</h3>
<p>Currently, effective control of data is with local government. Many individuals within local government will naturally attempt to block any change that might leverage power away from them and their organisation. &#8220;Job protection&#8221; is an over-simplification, but helps explain the underlying position. But by contracting data handling and presentation to a third-party contractor, local government would gain the technological &#8220;economies of scale&#8221; (assuming the contractor won many contracts from different authorities) and notionally maintain control.</p>
<p>Use of third-party contractors is already common within the local government sector, particularly for Information Technology.</p>
<p>An example can be seen in <a href="http://edinburgh.cdmf.info/cdmf-webserver/edinburghMap.jsp" title="External link: Edinburgh Traffic Map.">Edinburgh City Council&#8217;s Traffic Map</a>. In spite of how it appears, the information isn&#8217;t powered directly by Edinburgh City Council or Google. Instead it is part of <a href="http://www.mottmac.com/" title="External link: Mott MacDonald.">Mott MacDonald</a>&#8217;s Common Data Management Facility, providing services under contract to many different local authorities.</p>
<p>In the UK public transport arena, <a href="http://www.trapezesoftware.com/" title="External link: Trapeze.">Trapeze</a> is a good example of the gradual agglomeration of data handling within a few large businesses, where historically many small software providers could be found.</p>
<p>The example above provides key driver information, and is somewhat useful, but is it the best outcome? I suspect not. Contracts tend to be priced highly, because local government clients are high risk: Their political control means that they can change their strategic direction and requirements unexpectedly. At best, customer feedback loops through local authorities are slow and politicised. At worst the design of the system will reflect the arbitrary views of a self-proclaimed expert (such as myself). Even if <em>you</em> think it is perfect, there is no scope for choice or creativity. <a href="http://timhowgego.com/implications-of-google-transit-in-the-uk.html#choice" title="Implications of Google Transit in the UK: Choice.">Choice is good and need not be expensive</a>.</p>
<h3>Social Provision</h3>
<p>Instead of using official data, why not let users reconstruct it? User-generated content is cheaper to create than information from professionally staffed sources: Since very many contributors do so little <em>work</em>, no individual expects payment. User-generated content can be just as accurate too, although this is not automatic: For example, a strong community will subject everything to peer review, weeding out poor information and contributors.</p>
<p>This is not an entirely theoretical position. There is a largely untapped human resource, just waiting to help.</p>
<p>The transport enthusiasts (transit fans, &#8220;spotters&#8221;) already collate and produce some extremely high quality information about certain technical aspects of operations and services. For example, sites such as <a href="http://www.londonbusroutes.net/" title="External link: LondonBusRoutes.net.">LondonBusRoutes.net</a> contain detail on the bus route timing and vehicle allocation (type and number of buses), which transpires to be difficult to extract from official sources. While it may be argued that these sites simply repackage official information, their very existence is a testament to the strength of underlying community.</p>
<p>Casual observation of people delayed on trains or in traffic suggests they derive some comfort from picking up their mobile (cell) phone and telling someone about it. Something they can <em>do</em>, in a scenario they otherwise have no control over. Their desire to communicate the same information to drivers or users 10 miles behind them (who might be able to re-plan their route, should they know) is untested. But the potential is intriguing.</p>
<p>Nobody has entirely worked out how to use these people; yet.</p>
<h3>Battle Lines</h3>
<p>If the social web/tech-entrepreneur sector chooses to <em>fight</em> the &#8220;status quo&#8221; head on, it does so against large multi-national <abbr title="Information Technology">IT</abbr> providers who support clients with historically entrenched positions. Not a contest that favours the underdog.</p>
<p>If the tech&#8217; &#8220;upstarts&#8221; can find a way to use this human resource effectively, they will ultimately provide a more cost-effective solution than the traditional &#8220;government <abbr title="Information technology">IT</abbr>&#8221; sector can offer. Integrate that user-generated information into the wider consumer internet, and the machinery of government simply won&#8217;t be able to justify its historic position of pouring millions into systems it controls. The &#8220;social web&#8221;/tech-entrepreneur sector <em>wins</em>.</p>
<p>The upstarts do not need perfect source data, if the implementation of results is considered to be better by users. The early <a href="http://www.internet.xephos.com/">Xephos</a> vs <a href="http://www.transportdirect.info/" title="External link: TransportDirect.">TransportDirect</a> comparisons provide some evidence. The success or failure of the social web/tech-entrepreneur sector is ultimately dependant on whether they can provide better information than official sources, using the resources and skills they have available to them.</p>
<p><em>Disclaimer: The contents of this article reflect my own personal analysis of the situation. This does not directly reflect advice to, or views of, government or anyone else involved in the handling and provision of public transportation data.</em></p>
]]></content:encoded>
			<wfw:commentRss>http://timhowgego.com/social-reconstruction-of-public-transportation-information.html/feed</wfw:commentRss>
		<slash:comments>9</slash:comments>
		</item>
	</channel>
</rss>
