Natural Language Processing is All Talk

I’ve talked about Natural Language Processing (NLP) before and how it is beginning to change the BIM/GIS space. But NLP is just part of the whole solution to change how analysis is run. I look at this as three parts:

  1. Natural Language Processing
  2. Curated Datasets
  3. Dynamic Computation

NLP is understanding ontologies more than anything else. When I ask how “big” something is, what do I mean by this. Let’s abstract this away a bit.

How big is Jupiter?

One could look at this a couple ways. What is the mass of Jupiter? What is the diameter of Jupiter? What is the volume of Jupiter? Being able to figure out intent of the question is critical to having everything else work. We all remember Siri and Alexa when they first started. They were pretty good at figuring out the weather but once you got out of those canned queries all bets were off. It is the same with using NLP with BIM or GIS. How long is something? Easy! Show me all mixed-use commercial zoned space near my project? Hard. Do we know what mixed-use commercial zoning is? Do we know where my project is? That because we need to know more about the ontology of our domain. How do we do this, learn about our domain? We need lots of data to teach the NLP and then run it through a Machine Learning (ML) tool such as Amazon Comprehend to figure out the context of the data and structure it in a way the NLP can understand out intents.

As discussed above, curated data to figure out ontology is important but it’s also important to help users run analysis without understanding what they need. Imagine using Siri, but you needed to provide your own weather service to find out the current temperature? While I have many friends who would love to do this, most people just don’t care. Keep it simple and tell me how warm it is. Same with this knowledge engine we’re talking about. I want to know zoning for New York City? It should be available and ready to use. Not only that, curated so it is normalized across geographies. Asking a question in New York or Boston (while there are unique rules in every city) should’t be difficult. Having this data isn’t as sexy as the NLP, but it sure as heck makes that NLP so much better and smarter. Plus, who wants to worry about do they have the latest zoning for a city, it should always be available and on demand.

Lastly once we understand the context of the natural language query and have data to analysis, we need to run the algorithms on the question. This is what we typically think of as GIS. Rather than manually running that buffer and identity, we use AI/ML to figure out the intent of the user using the ontology and grab the data for the analysis from the curated data repository. This used to be something very special, you needed to use some monolithic tool such as ArcGIS or MapInfo to accomplish the dynamic computation. But today these algorithms are open and available to anyone. Natural language lets us figure out what the user is asking and then run the correct analysis, even if they call it something different from what a GIS person might.
The “Alexa-like” natural language demos where the computer talks to users is fun, but much like the AR examples we see these days, not really useful in the context of real world use. Who wants their computer talking to them in an open office environment? But giving users who don’t know anything about structured GIS analysis the ability to perform complex GIS analysis is the game changer. It isn’t about how many seats of some GIS program are on everyones desk but how easy these NLP/AI/ML systems can be integrated into the existing workflows or websites. That’s where I see 2019 going, GIS everywhere.

GIS and the Keyboard

I think you can usually tell when a GIS Professional learned GIS by how they use their keyboard. Those who learned either on UNIX command line programs such as ArcInfo or GDAL seem to go out of their way to type commands either through keystrokes or scripting while those who learned in the GUI era, either ArcView 3.x or ArcGIS Desktop prefer to use a mouse. Now generalizing is always dangerous but it highlights things about how GIS analysis is done.

GUI GIS

I almost feel like Yakov Smirnoff saying “What a country!” when you realize that most of the complicated scripting commands of the 90s are completed almost perfectly by dropping a couple GIS layers on a wizard and keep clicking next. Esri should be commended for making these tools drop-dead simple to use. But it brings up the issue of does anyone under stand what is going on with these tools when they run them? Let’s take a simple example for Intersect.

Esri Intersect Tool

So simple right? You just take your input features, choose where the output feature goes and hit OK. Done. But what about those optional items below. How many people actually ever set those? Not many of course and many times you don’t need to set them but not understanding why they are options makes it dangerous that you might not perform your analysis correctly. I’ll say you don’t understand how to run a GIS command unless you understand not only what the command does but all the options.
You don’t have to learn Python to be a GIS Analyst, running Model Builder or just the tools from ArcCatalog is good enough. But if you find yourself not even seeing these options on the bottom, let alone understand what they are and why they are used, you aren’t anything more than a button pusher. And button pushers are easily replaced. The Esri Intersect Tool has many options and using it like below will only give you minimum power and understanding of how GIS works.

Esri Intersect Tool with blinders on.

In the old days of keyboards, you have to type commands out and know what each one did. In fact many commands wouldn’t run unless you put an option in. Part of it is when you type the words “fuzzy_tollerance” enough times you want to know what they heck it is. I think keyboard GIS connected users to the commands and concepts of GIS more than wizards do. Much like working with your hands connects people to woodworking, working with your keyboard connects people to GIS.

mapz on Common Student-Encountered GIS Problems

Link – Top 10 Most Common Student-Encountered GIS Problems

I have been working with and teaching undergrad and grad students GIS for 4+ years now and have compiled a list of the 10 most frequent problems that they encounter. In my current position, I spend about 15 hours per week holding office hours in the main GIS Lab on campus, where students, staff, and faculty can visit for GIS assistance, and rarely do I have a free moment. (Well, perhaps during intercession.) I often find myself explaining the same concepts and pointing out the same resources over and over again, so I wanted to pull together this list. ESRI ArcGIS is the main software application used on campus and so many of the examples below refer to this application.

I’ve noticed all these while using slave labor (sometimes called interns) on projects. The inability manage project work just kills them as well as making assumptions. I always tell them, just ask as many questions as you like because we’ve got many years of GIS experience here in our little shop and no sense reinventing the wheel.

GIS Has Arrived When…

Adobe blogs about it.

Link – Making inroads

Modern mapmaking now starts with GIS data from state or local government that includes way more information than you really want. The task nowadays is to remove the unwanted data from the map to reduce clutter and focus on the desired information. One of the tasks these Illustartor users have is to create linked networks of nodes to create the streets and highways we see on a map.

Dave Bouwman on the GIS Long Tail

Link – The GIS Longtail – Google, MSN, Yahoo and ESRI

The idea that Google/MSN/Yahoo is bad news for ESRI is based solely on the whiz-bang-flash of the new mass awakening to the fact that things can be put on a map. Anyone who thinks that Google is going to extend Google Earth to the point of enabling a city to manage it’s parcel base is delirious. Apart from the fact that it’s very difficult, there is no benefit to them. While Google does have a staff of geniuses, this does not mean they can simply whip up a full fledged professional GIS system. As for ESRI – I think they can only benefit from the increased attention paid to mapping in general, and GIS in particular. Once the public really starts to “get” maps, ESRI will be well positioned to facilitate “doing” something with the map – besides just plotting a point location.

Bouwman longtail

Dave Bouwman has just written a great article on the relationship between Google/Microsoft, ESRI and GIS as a whole. Dave hits it right on the head with ArcGIS vs Google Earth (or similar “consumer GIS” programs). Some have said that ArcGIS is the world’s largest software application built with Microsoft’s COM and while that may or may not be true, the plain fact is that ArcGIS has so many tools at the ready and these tools have decades of development behind them, that Google/Microsoft would be very hard pressed to compete. Now at the consumer end, that is a different story and it may be that GE and MapPoint eventually close the gap toward being a low end GIS tool, but even then you have to wonder about the quality of analysis that these tools may give the user given the lack of experience with GIS.

Time will tell, but as Dave points out so well in his post, Google Earth and ArcGIS are aimed at two very different markets and there is almost no overlap between them.

Jeff Thurston thinks we are all too focused on the GIS viewer

Link – Directions in Spatial Information

It is interesting to read and learn about all the things that Google is up to with respect to maps. Like I said a while ago, here; it has opened the door to increasing mapping awareness, especially to, although not limited to younger folks ‘ move over MTV.

Jeff pretty much says what I’ve been saying and what ESRI’s focus should be on. Tools to create GIS data are extremely important. Google isn’t a creator of this information, they are a consumer. Our GIS workflows are integrated into ESRI’s tools so for us to be successful, we need ESRI to continue to push the envelope. Our biggest problem though has been on the reader side of things. Jeff says ArcGlobe is ESRI’s “Google Earth”, but that requires you to have at least an ArcView license. Jeff does say why Google Earth has been successful and it is plainly the simplicity of the application. If you’ve ever used ArcGlobe you know you have total control over just about everything, but zooming and panning can get out of control (well maybe it is just I can’t work in a total 3d world).

The key point of Jeff’s post is that we can’t get to the future without improved spatial tools and he is right on. But the problem today is consumers are beginning to want to consume GIS services and the tools to do so are limited and into that void falls Google Earth.

Jeff Thurston thinks we are all too focused on the GIS viewer

Link – Directions in Spatial Information

It is interesting to read and learn about all the things that Google is up to with respect to maps. Like I said a while ago, here; it has opened the door to increasing mapping awareness, especially to, although not limited to younger folks ‘ move over MTV.

Jeff pretty much says what I’ve been saying and what ESRI’s focus should be on. Tools to create GIS data are extremely important. Google isn’t a creator of this information, they are a consumer. Our GIS workflows are integrated into ESRI’s tools so for us to be successful, we need ESRI to continue to push the envelope. Our biggest problem though has been on the reader side of things. Jeff says ArcGlobe is ESRI’s “Google Earth”, but that requires you to have at least an ArcView license. Jeff does say why Google Earth has been successful and it is plainly the simplicity of the application. If you’ve ever used ArcGlobe you know you have total control over just about everything, but zooming and panning can get out of control (well maybe it is just I can’t work in a total 3d world).

The key point of Jeff’s post is that we can’t get to the future without improved spatial tools and he is right on. But the problem today is consumers are beginning to want to consume GIS services and the tools to do so are limited and into that void falls Google Earth.