<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
		>
<channel>
	<title>Comments on: Super fast geospatial analysis</title>
	<atom:link href="http://spatiallyadjusted.com/2008/02/28/super-fast-geospatial-analysis/feed/" rel="self" type="application/rss+xml" />
	<link>http://spatiallyadjusted.com/2008/02/28/super-fast-geospatial-analysis/</link>
	<description>Geospatial Technology, Web Mapping and Spatial Services</description>
	<lastBuildDate>Fri, 10 Feb 2012 19:09:18 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
	<item>
		<title>By: Bandovang.com</title>
		<link>http://spatiallyadjusted.com/2008/02/28/super-fast-geospatial-analysis/#comment-8644</link>
		<dc:creator><![CDATA[Bandovang.com]]></dc:creator>
		<pubDate>Mon, 05 Jan 2009 03:33:07 +0000</pubDate>
		<guid isPermaLink="false">http://www.spatiallyadjusted.com/2008/02/28/super-fast-geospatial-analysis/#comment-8644</guid>
		<description><![CDATA[Maybe Map Dot Net UX will help you about shortest route (http://www.mapdotnet.com/Pages3.0/products-services/mapdotnet-ux.aspx)]]></description>
		<content:encoded><![CDATA[<p>Maybe Map Dot Net UX will help you about shortest route (<a href="http://www.mapdotnet.com/Pages3.0/products-services/mapdotnet-ux.aspx" rel="nofollow">http://www.mapdotnet.com/Pages3.0/products-services/mapdotnet-ux.aspx</a>)</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Ho Nguyen</title>
		<link>http://spatiallyadjusted.com/2008/02/28/super-fast-geospatial-analysis/#comment-8643</link>
		<dc:creator><![CDATA[Ho Nguyen]]></dc:creator>
		<pubDate>Tue, 09 Sep 2008 03:51:18 +0000</pubDate>
		<guid isPermaLink="false">http://www.spatiallyadjusted.com/2008/02/28/super-fast-geospatial-analysis/#comment-8643</guid>
		<description><![CDATA[Hi all,

I&#039;m interresting in way to find the shortest route on map.

I use MapDotNet and SQL server 2008.

Do you guys have any ebook or article talk about this?

Thanks,
Ho Nguyen]]></description>
		<content:encoded><![CDATA[<p>Hi all,</p>
<p>I&#8217;m interresting in way to find the shortest route on map.</p>
<p>I use MapDotNet and SQL server 2008.</p>
<p>Do you guys have any ebook or article talk about this?</p>
<p>Thanks,<br />
Ho Nguyen</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Mars Sjoden</title>
		<link>http://spatiallyadjusted.com/2008/02/28/super-fast-geospatial-analysis/#comment-8642</link>
		<dc:creator><![CDATA[Mars Sjoden]]></dc:creator>
		<pubDate>Fri, 23 May 2008 14:38:21 +0000</pubDate>
		<guid isPermaLink="false">http://www.spatiallyadjusted.com/2008/02/28/super-fast-geospatial-analysis/#comment-8642</guid>
		<description><![CDATA[Oh lord please!

I may actually have a life if I could run my queries, identities, analysis, complex statistic computations.

I&#039;m working on an Ecosystem Based Management plan ( very large areas ) in norther BC and I sure could use another 10, 20... 100 cpu&#039;s running for me.

*sigh*... whoops, an identity just finished, I better run the next one...

Yah, I could easily do with some extra horsepower.]]></description>
		<content:encoded><![CDATA[<p>Oh lord please!</p>
<p>I may actually have a life if I could run my queries, identities, analysis, complex statistic computations.</p>
<p>I&#8217;m working on an Ecosystem Based Management plan ( very large areas ) in norther BC and I sure could use another 10, 20&#8230; 100 cpu&#8217;s running for me.</p>
<p>*sigh*&#8230; whoops, an identity just finished, I better run the next one&#8230;</p>
<p>Yah, I could easily do with some extra horsepower.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: MTBMaven</title>
		<link>http://spatiallyadjusted.com/2008/02/28/super-fast-geospatial-analysis/#comment-8641</link>
		<dc:creator><![CDATA[MTBMaven]]></dc:creator>
		<pubDate>Wed, 05 Mar 2008 19:01:48 +0000</pubDate>
		<guid isPermaLink="false">http://www.spatiallyadjusted.com/2008/02/28/super-fast-geospatial-analysis/#comment-8641</guid>
		<description><![CDATA[I am a mere grasshopper compared to the experts contributing to this very interesting topic.  My graduate research revolves around the validation of ESRI&#039;s viewshed algorithm when compared to field derived viewsheds using LiDAR derived DEMs in an urban core.  

In the process of my research I have developed a conceptual algorithm for the calculation of viewsheds on TIN surfaces.  To my knowledge no commercial GIS can compute a viewshed on a TIN or return the results as a TIN, yet the literature suggests line of sight calculations (a core function of viewshed calculations) are more accurate when computed on TIN surfaces.  My algorithm utilizes brute force line of sight calculations to determine visibility.  When conducted on very large TIN surfaces, this would require large amounts of processing power.

Is this the type of computations you are interested in?]]></description>
		<content:encoded><![CDATA[<p>I am a mere grasshopper compared to the experts contributing to this very interesting topic.  My graduate research revolves around the validation of ESRI&#8217;s viewshed algorithm when compared to field derived viewsheds using LiDAR derived DEMs in an urban core.  </p>
<p>In the process of my research I have developed a conceptual algorithm for the calculation of viewsheds on TIN surfaces.  To my knowledge no commercial GIS can compute a viewshed on a TIN or return the results as a TIN, yet the literature suggests line of sight calculations (a core function of viewshed calculations) are more accurate when computed on TIN surfaces.  My algorithm utilizes brute force line of sight calculations to determine visibility.  When conducted on very large TIN surfaces, this would require large amounts of processing power.</p>
<p>Is this the type of computations you are interested in?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Dan S.</title>
		<link>http://spatiallyadjusted.com/2008/02/28/super-fast-geospatial-analysis/#comment-8640</link>
		<dc:creator><![CDATA[Dan S.]]></dc:creator>
		<pubDate>Mon, 03 Mar 2008 22:17:30 +0000</pubDate>
		<guid isPermaLink="false">http://www.spatiallyadjusted.com/2008/02/28/super-fast-geospatial-analysis/#comment-8640</guid>
		<description><![CDATA[A belated message to Paul: I hate to be a nit-picker, but a 1000x speed increase, or even 10,000x, is mostly useless for truly solving NP-hard problems. They wouldn&#039;t really be &#039;hard&#039; then, would they? 

((The classic example is the traveling salesperson problem, where nobody has figured a substantially better way to find the shortest route that visits each city in a list than checking all possible routes between them. The time it takes will grow with the number of possible routes, which is the factorial of the number of cities. Thus if you have 1000 cities, adding a 1001th city will slow the search for the optimal solution by a factor of 1001. It&#039;s clear that constant-multiplier speed improvements, even big ones, are mostly helpless in the face of this sort of performance characteristic.))
There are many heuristic and approximate ways of tackling these when you don&#039;t need the absolute perfect answer, and faster geoprocessing could be very helpful there...

Since you&#039;re interested in redistricting, you might be interested in a similar problem I have some experience in: automated conservation reserve design.  You can google up MARXAN for a tool which is widely used in an attempt to identify areas that preserve the most biodiversity while still doing so efficiently. Not very different from trying to capture voting-behavior-similar blocs. (Note: I&#039;m the author of a very similar and now rather out-of-date tool called SPOT.)

I&#039;m certain that similar methods are used to optimize forestry yield/profits, and probably ditto for other agricultural fields. Actually, similar methods are used all over the place by all kinds of industries.

As a practical matter, most of the time this sort of thing is tackled by first using geoprocessing and database crunching to boil things down to a Big Fat Matrix which is then fed into custom-written tools.]]></description>
		<content:encoded><![CDATA[<p>A belated message to Paul: I hate to be a nit-picker, but a 1000x speed increase, or even 10,000x, is mostly useless for truly solving NP-hard problems. They wouldn&#8217;t really be &#8216;hard&#8217; then, would they? </p>
<p>((The classic example is the traveling salesperson problem, where nobody has figured a substantially better way to find the shortest route that visits each city in a list than checking all possible routes between them. The time it takes will grow with the number of possible routes, which is the factorial of the number of cities. Thus if you have 1000 cities, adding a 1001th city will slow the search for the optimal solution by a factor of 1001. It&#8217;s clear that constant-multiplier speed improvements, even big ones, are mostly helpless in the face of this sort of performance characteristic.))<br />
There are many heuristic and approximate ways of tackling these when you don&#8217;t need the absolute perfect answer, and faster geoprocessing could be very helpful there&#8230;</p>
<p>Since you&#8217;re interested in redistricting, you might be interested in a similar problem I have some experience in: automated conservation reserve design.  You can google up MARXAN for a tool which is widely used in an attempt to identify areas that preserve the most biodiversity while still doing so efficiently. Not very different from trying to capture voting-behavior-similar blocs. (Note: I&#8217;m the author of a very similar and now rather out-of-date tool called SPOT.)</p>
<p>I&#8217;m certain that similar methods are used to optimize forestry yield/profits, and probably ditto for other agricultural fields. Actually, similar methods are used all over the place by all kinds of industries.</p>
<p>As a practical matter, most of the time this sort of thing is tackled by first using geoprocessing and database crunching to boil things down to a Big Fat Matrix which is then fed into custom-written tools.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Peter Batty</title>
		<link>http://spatiallyadjusted.com/2008/02/28/super-fast-geospatial-analysis/#comment-8639</link>
		<dc:creator><![CDATA[Peter Batty]]></dc:creator>
		<pubDate>Mon, 03 Mar 2008 19:44:32 +0000</pubDate>
		<guid isPermaLink="false">http://www.spatiallyadjusted.com/2008/02/28/super-fast-geospatial-analysis/#comment-8639</guid>
		<description><![CDATA[Dmitri, I was merely pointing out that you made it sound like &quot;time sharing&quot; was a concept from a historical age, and it isn&#039;t, as we both agree it seems. And I agree with you that of course there are pros and cons to that approach. To reiterate what I said before, our main focus is the high end of the market where people would have their own dedicated system, but I also think that a service offering could be an interesting option for making some of these capabilities available to those who can&#039;t afford their own dedicated system.

When I said that we are looking at different scales of problem I was simply going by your comment above where you said &quot;I grant you that getting into the multiple terabyte range requires much more thought about hardware and overall architecture than working in the hundreds of gigabyte range. But itâ€™s getting very close and it could well be this summer ...&quot;. In a later comment you specified a system with 80TB of disk storage, but that is a completely different question from how you can effectively run complex spatial analysis across that much data. I agree with you again when you said that this &quot;requires much more thought about hardware and overall architecture&quot;. So overall I think we&#039;re agreeing about a lot of things :).]]></description>
		<content:encoded><![CDATA[<p>Dmitri, I was merely pointing out that you made it sound like &#8220;time sharing&#8221; was a concept from a historical age, and it isn&#8217;t, as we both agree it seems. And I agree with you that of course there are pros and cons to that approach. To reiterate what I said before, our main focus is the high end of the market where people would have their own dedicated system, but I also think that a service offering could be an interesting option for making some of these capabilities available to those who can&#8217;t afford their own dedicated system.</p>
<p>When I said that we are looking at different scales of problem I was simply going by your comment above where you said &#8220;I grant you that getting into the multiple terabyte range requires much more thought about hardware and overall architecture than working in the hundreds of gigabyte range. But itâ€™s getting very close and it could well be this summer &#8230;&#8221;. In a later comment you specified a system with 80TB of disk storage, but that is a completely different question from how you can effectively run complex spatial analysis across that much data. I agree with you again when you said that this &#8220;requires much more thought about hardware and overall architecture&#8221;. So overall I think we&#8217;re agreeing about a lot of things <img src='http://s0.wp.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> .</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Dimitri</title>
		<link>http://spatiallyadjusted.com/2008/02/28/super-fast-geospatial-analysis/#comment-8638</link>
		<dc:creator><![CDATA[Dimitri]]></dc:creator>
		<pubDate>Mon, 03 Mar 2008 17:03:46 +0000</pubDate>
		<guid isPermaLink="false">http://www.spatiallyadjusted.com/2008/02/28/super-fast-geospatial-analysis/#comment-8638</guid>
		<description><![CDATA[Peter,

&lt;blockquote&gt;I smiled at your comment about â€œthe old days when no one could afford a supercomputer so people developed centralized architectures to time-share the few supercomputers that funds allowed to be builtâ€. Have you used Google recently? I donâ€™t think you could afford their hardware, but that doesnâ€™t mean you canâ€™t take advantage of their capabilities. &lt;/blockquote&gt;

That&#039;s an analogy that cuts both ways.  Sure,I agree that Google is a perfect example of a centralized architecture that time-shares a very expensive resource that otherwise no one could afford to use.   But even Google agrees that is not right for everyone and for all applications.

For example  even Google understands that time sharing the service it provides is not right all the time, so Google sells their search appliances directly to those users who want to and who can afford to run their own search infrastructure.  These are typically large organizations and especially large organizations dealing with sensitive data they would prefer not be accessible, not in any way, to an outsider, not even to a &quot;trusted&quot; outsider.

In general, the more proprietary the data and algorithms the more an organization wants to be sure that no outsider can get their hands either on the data or on the algorithms.

Historically, that&#039; s been one of the reason for the shift from time sharing of supercomputers to organizations running their own supercomputers.  If you are a drug company with billions of dollars invested in your portfolio of molecules, or an investment house invested  into computational finance or a political party investing a hundred million into computational electioneering you don&#039;t want to risk anyone else finding out what data you think is worth exploring or how you analyze that data or what your results may be.

Something else to consider is that as a technical matter Google is not very representative of, well, computing.    Google does a handful of computationally very simple things on a massive scale.  It is not remotely as complex as deep, custom analysis typical of, say, supercomputing or spatial analytics.

So, sure, if you want a very algorithmically lightweight scan of a snapshot of enormously large data,  yeah, Google is your thing.   If you want to program a heavyweight algorithm like the election studies example given or, say,  a protein folding and interaction algorithm making pairwise comparisons between a few million molecules, then Google is a terrible idea.

Those latter cases show other examples of why people often prefer to have supercomputing under their own control instead of time sharing: it is not just privacy, it is the ability to attain much stronger computational capacity.

In a sense, Google is about big disks and small brains.  If I understand your target market, you see both big disks and significant brains as well.  Historically, people who wanted massive computing intelligence, especially with very fast data access have increased their chances the more that they are able to keep such resources closer to them.

In the present case, I get the feeling that perhaps technology is moving faster than expectations, for instance:

&lt;blockquote&gt;we probably are talking about different scales of problem right now. The solution I am talking about is working with multiple terabytes today and that is where it is especially strong&lt;/blockquote&gt;

Well, the $100,000 cluster I advocated earlier was 80 terabytes with 5120   processors so it is true that perhaps I overspecified if you are aiming for smaller tasks, in the multiple terabytes instead of the many tens of terabytes. I apologize if I misunderstood and overshot the requirement.

If all you need do is multiple terabytes, well, then you could get away with a much smaller cluster, say one or two machines for a total of, say, 8 terabytes and 1024  processors.  That would keep it well under $20,000, a much more affordable solution.

But if the complaint that what I&#039;m suggesting is so expensive that it must be time shared, well, I&#039;d offer two observations:

First, the sorts of customers you mention have plenty of money and can easily afford $20,000 or $100,000.     They won&#039;t hesitate to spend that even if a time-shared solution is less expensive, because no way, no how will they give up the power of having 1024 to 5120 and more processors all for themselves.  

Second, although I grant you that spending $100,000 to get 5120  processors and 80 terabyte capacity sounds like a lot, that&#039;s only at today&#039;s prices.  A year from now it will be $60,000 and the year after that $30,000.   

Large customers tend to have slow procurements so by the time a lot of these folks get going it will probably be down to $25,000 with storage and computing capacities even higher than today.  Heck, in two or three years we could be talking 300  terabytes and 15,000  processors for that money! :-)

So no, I don&#039;t knock the idea of time-sharing an expensive resource to make it available to more people.   But I am saying that both security and performance are strong reasons for not time sharing and that the costs of direct, distributed technology are getting so low that it is already probably too inexpensive (as has become the case with supercomputing) to bother time sharing anyway.

Could you comment whether your plans are for specialized hardware, or is it software not tied to a specific hardware platform?]]></description>
		<content:encoded><![CDATA[<p>Peter,</p>
<blockquote><p>I smiled at your comment about â€œthe old days when no one could afford a supercomputer so people developed centralized architectures to time-share the few supercomputers that funds allowed to be builtâ€. Have you used Google recently? I donâ€™t think you could afford their hardware, but that doesnâ€™t mean you canâ€™t take advantage of their capabilities. </p></blockquote>
<p>That&#8217;s an analogy that cuts both ways.  Sure,I agree that Google is a perfect example of a centralized architecture that time-shares a very expensive resource that otherwise no one could afford to use.   But even Google agrees that is not right for everyone and for all applications.</p>
<p>For example  even Google understands that time sharing the service it provides is not right all the time, so Google sells their search appliances directly to those users who want to and who can afford to run their own search infrastructure.  These are typically large organizations and especially large organizations dealing with sensitive data they would prefer not be accessible, not in any way, to an outsider, not even to a &#8220;trusted&#8221; outsider.</p>
<p>In general, the more proprietary the data and algorithms the more an organization wants to be sure that no outsider can get their hands either on the data or on the algorithms.</p>
<p>Historically, that&#8217; s been one of the reason for the shift from time sharing of supercomputers to organizations running their own supercomputers.  If you are a drug company with billions of dollars invested in your portfolio of molecules, or an investment house invested  into computational finance or a political party investing a hundred million into computational electioneering you don&#8217;t want to risk anyone else finding out what data you think is worth exploring or how you analyze that data or what your results may be.</p>
<p>Something else to consider is that as a technical matter Google is not very representative of, well, computing.    Google does a handful of computationally very simple things on a massive scale.  It is not remotely as complex as deep, custom analysis typical of, say, supercomputing or spatial analytics.</p>
<p>So, sure, if you want a very algorithmically lightweight scan of a snapshot of enormously large data,  yeah, Google is your thing.   If you want to program a heavyweight algorithm like the election studies example given or, say,  a protein folding and interaction algorithm making pairwise comparisons between a few million molecules, then Google is a terrible idea.</p>
<p>Those latter cases show other examples of why people often prefer to have supercomputing under their own control instead of time sharing: it is not just privacy, it is the ability to attain much stronger computational capacity.</p>
<p>In a sense, Google is about big disks and small brains.  If I understand your target market, you see both big disks and significant brains as well.  Historically, people who wanted massive computing intelligence, especially with very fast data access have increased their chances the more that they are able to keep such resources closer to them.</p>
<p>In the present case, I get the feeling that perhaps technology is moving faster than expectations, for instance:</p>
<blockquote><p>we probably are talking about different scales of problem right now. The solution I am talking about is working with multiple terabytes today and that is where it is especially strong</p></blockquote>
<p>Well, the $100,000 cluster I advocated earlier was 80 terabytes with 5120   processors so it is true that perhaps I overspecified if you are aiming for smaller tasks, in the multiple terabytes instead of the many tens of terabytes. I apologize if I misunderstood and overshot the requirement.</p>
<p>If all you need do is multiple terabytes, well, then you could get away with a much smaller cluster, say one or two machines for a total of, say, 8 terabytes and 1024  processors.  That would keep it well under $20,000, a much more affordable solution.</p>
<p>But if the complaint that what I&#8217;m suggesting is so expensive that it must be time shared, well, I&#8217;d offer two observations:</p>
<p>First, the sorts of customers you mention have plenty of money and can easily afford $20,000 or $100,000.     They won&#8217;t hesitate to spend that even if a time-shared solution is less expensive, because no way, no how will they give up the power of having 1024 to 5120 and more processors all for themselves.  </p>
<p>Second, although I grant you that spending $100,000 to get 5120  processors and 80 terabyte capacity sounds like a lot, that&#8217;s only at today&#8217;s prices.  A year from now it will be $60,000 and the year after that $30,000.   </p>
<p>Large customers tend to have slow procurements so by the time a lot of these folks get going it will probably be down to $25,000 with storage and computing capacities even higher than today.  Heck, in two or three years we could be talking 300  terabytes and 15,000  processors for that money! <img src='http://s0.wp.com/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> </p>
<p>So no, I don&#8217;t knock the idea of time-sharing an expensive resource to make it available to more people.   But I am saying that both security and performance are strong reasons for not time sharing and that the costs of direct, distributed technology are getting so low that it is already probably too inexpensive (as has become the case with supercomputing) to bother time sharing anyway.</p>
<p>Could you comment whether your plans are for specialized hardware, or is it software not tied to a specific hardware platform?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Paul</title>
		<link>http://spatiallyadjusted.com/2008/02/28/super-fast-geospatial-analysis/#comment-8637</link>
		<dc:creator><![CDATA[Paul]]></dc:creator>
		<pubDate>Mon, 03 Mar 2008 01:46:58 +0000</pubDate>
		<guid isPermaLink="false">http://www.spatiallyadjusted.com/2008/02/28/super-fast-geospatial-analysis/#comment-8637</guid>
		<description><![CDATA[@ Ralphie -- 

Actually, the US Supreme court says congressional redistricting can occur whenever the state assembly wants it to (&lt;a href=&quot;http://www.washingtonpost.com/wp-dyn/content/article/2006/06/28/AR2006062800660.html&quot; rel=&quot;nofollow&quot;&gt;here&lt;/a&gt;).

As far as &quot;unknowns&quot; ---I would kindly offer that  you vastly underestimate retail market data.  Do you own a GM with OnStar or use MasterCard / Visa?  Supermarket club card?   There are no unknowns - only the un-calculated (apologies to Webster&#039;s and my 7th grade English teacher).]]></description>
		<content:encoded><![CDATA[<p>@ Ralphie &#8212; </p>
<p>Actually, the US Supreme court says congressional redistricting can occur whenever the state assembly wants it to (<a href="http://www.washingtonpost.com/wp-dyn/content/article/2006/06/28/AR2006062800660.html" rel="nofollow">here</a>).</p>
<p>As far as &#8220;unknowns&#8221; &#8212;I would kindly offer that  you vastly underestimate retail market data.  Do you own a GM with OnStar or use MasterCard / Visa?  Supermarket club card?   There are no unknowns &#8211; only the un-calculated (apologies to Webster&#8217;s and my 7th grade English teacher).</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Ralphie</title>
		<link>http://spatiallyadjusted.com/2008/02/28/super-fast-geospatial-analysis/#comment-8636</link>
		<dc:creator><![CDATA[Ralphie]]></dc:creator>
		<pubDate>Mon, 03 Mar 2008 01:00:36 +0000</pubDate>
		<guid isPermaLink="false">http://www.spatiallyadjusted.com/2008/02/28/super-fast-geospatial-analysis/#comment-8636</guid>
		<description><![CDATA[Paul: good example, two equations and six unknowns.

Bad deadline, however.  Redistricting won&#039;t take place until 2012.]]></description>
		<content:encoded><![CDATA[<p>Paul: good example, two equations and six unknowns.</p>
<p>Bad deadline, however.  Redistricting won&#8217;t take place until 2012.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Paul</title>
		<link>http://spatiallyadjusted.com/2008/02/28/super-fast-geospatial-analysis/#comment-8635</link>
		<dc:creator><![CDATA[Paul]]></dc:creator>
		<pubDate>Sun, 02 Mar 2008 16:29:48 +0000</pubDate>
		<guid isPermaLink="false">http://www.spatiallyadjusted.com/2008/02/28/super-fast-geospatial-analysis/#comment-8635</guid>
		<description><![CDATA[Two words: Congressional Redistricting.

Objective: optimized area allocation into &#039;n&#039; voter districts for either competitive or non-competitive election cycles (depending on your view of the democratic process).
Secondary by-product of above: model trend impacts based on media buy cost to optimize campaign spending and general predictive spatial consumer behavior based on &quot;injected&quot; catalysts.

Input 1: partisan voting history by precinct areas (discrete partisan votes totals with precinct x precinct ratios for turnout, total registered voters, total eligible to register, and total population)

Input 2: population migration (historical, current estimated, and predicted) by census blockgroup or even census block if you have the computational horses.

Input 3: aggregated consumer behavior records by multiple dimensions - usually aggregated by zipcode or block group (not a fan of zipcode area analysis - but that&#039;s usually where the biz data is scaled to)

Input 4: discrete and aggregated campaign contributions from the FEC database

Input 5: other spatial market segmentation data to control for on-demand measures (crime, education, vehicle type, etc.).

Input 6: political subdivision boundary data to constrain allocation model behavior and enforce current county and municipal communities of interest.

The exponential nature of allocation modeling at the state level for sub-county areas has generally been categorized as &quot;computationally &quot;NP hard&quot;&quot; ....  ergo the value of Mr. Batty&#039;s &quot;machine&quot;.

If this working &quot;on-demand&quot; model were available it would be a gold-mine for political interest groups and consumer marketing firms.

Of course there are models out there, but they frequently abstract or overly generalize the spatial dimension --- this model would be spatially accurate to the neighborhood level at each given point in time loaded into the model.  And the real value is in the predictive ability by location based on a combination of &quot;injects&quot; into the consumer environment (incl. gas prices, severe weather, approval ratings, etc.).

 I imagine it like a giant climate or weather model only the gradients represent a given spatial behavior across time instead of temperature, precipitation, etc.

If you could put a simple GUI on the front end and have it ready by July or August (in time to have fun before the Nov. election) - that would be great! :)]]></description>
		<content:encoded><![CDATA[<p>Two words: Congressional Redistricting.</p>
<p>Objective: optimized area allocation into &#8216;n&#8217; voter districts for either competitive or non-competitive election cycles (depending on your view of the democratic process).<br />
Secondary by-product of above: model trend impacts based on media buy cost to optimize campaign spending and general predictive spatial consumer behavior based on &#8220;injected&#8221; catalysts.</p>
<p>Input 1: partisan voting history by precinct areas (discrete partisan votes totals with precinct x precinct ratios for turnout, total registered voters, total eligible to register, and total population)</p>
<p>Input 2: population migration (historical, current estimated, and predicted) by census blockgroup or even census block if you have the computational horses.</p>
<p>Input 3: aggregated consumer behavior records by multiple dimensions &#8211; usually aggregated by zipcode or block group (not a fan of zipcode area analysis &#8211; but that&#8217;s usually where the biz data is scaled to)</p>
<p>Input 4: discrete and aggregated campaign contributions from the FEC database</p>
<p>Input 5: other spatial market segmentation data to control for on-demand measures (crime, education, vehicle type, etc.).</p>
<p>Input 6: political subdivision boundary data to constrain allocation model behavior and enforce current county and municipal communities of interest.</p>
<p>The exponential nature of allocation modeling at the state level for sub-county areas has generally been categorized as &#8220;computationally &#8220;NP hard&#8221;" &#8230;.  ergo the value of Mr. Batty&#8217;s &#8220;machine&#8221;.</p>
<p>If this working &#8220;on-demand&#8221; model were available it would be a gold-mine for political interest groups and consumer marketing firms.</p>
<p>Of course there are models out there, but they frequently abstract or overly generalize the spatial dimension &#8212; this model would be spatially accurate to the neighborhood level at each given point in time loaded into the model.  And the real value is in the predictive ability by location based on a combination of &#8220;injects&#8221; into the consumer environment (incl. gas prices, severe weather, approval ratings, etc.).</p>
<p> I imagine it like a giant climate or weather model only the gradients represent a given spatial behavior across time instead of temperature, precipitation, etc.</p>
<p>If you could put a simple GUI on the front end and have it ready by July or August (in time to have fun before the Nov. election) &#8211; that would be great! <img src='http://s0.wp.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
]]></content:encoded>
	</item>
</channel>
</rss>

