{"id":274,"date":"2014-01-24T19:47:37","date_gmt":"2014-01-24T19:47:37","guid":{"rendered":"http:\/\/opentextbc.ca\/natureofgeographicinformation\/?post_type=chapter&#038;p=274"},"modified":"2015-05-27T15:43:48","modified_gmt":"2015-05-27T15:43:48","slug":"1-overview-3","status":"publish","type":"chapter","link":"https:\/\/opentextbc.ca\/natureofgeographicinformation\/chapter\/1-overview-3\/","title":{"raw":"TIGER, Topology and Geocoding","rendered":"TIGER, Topology and Geocoding"},"content":{"raw":"<h2>4.1. Overview<\/h2>\r\nIn the Chapter 3 we studied the population data produced by the U.S. Census Bureau, and some of the ways those data can be visualized with thematic maps.\r\n\r\nIn addition to producing data about the U.S. population and economy, the Census Bureau is a leading producer of digital map data. The Census Bureau's Geography Division created its \"Topologically Integrated Geographic Encoding and Referencing\" (TIGER) spatial database with help from the U.S. Geological Survey. In preparation for the 2010 census, the Bureau conducted a database redesign project that combined TIGER with a Master Address File (MAF) database. <strong>MAF\/TIGER<\/strong> enables the Bureau to associate census data, which it collects by household address, with the right census areas and voting districts. This is an example of a process called address-matching or <strong>geocoding<\/strong>.\r\n\r\nThe MAF\/TIGER database embodies the vector approach to spatial representation. It uses point, line, and polygon features to represent streets, water bodies, railroads, administrative boundaries, and select landmarks. In addition to the \"absolute\" locations of these features, which are encoded with latitude and longitude coordinates, MAF\/TIGER encodes their \"relative\" locations--a property called <strong>topology<\/strong>.\r\n\r\nMAF\/TIGER also includes attributes of these vector features including names, administrative codes, and, for many streets, address ranges and ZIP Codes. Vector feature sets are extracted from the MAF\/TIGER database to produce reference maps for census takers and thematic maps for census data users. Such extracts are called <strong>TIGER\/Line Shapefiles<\/strong>.\r\n\r\nCharacteristics of TIGER\/Line Shapefiles that make them useful to the Census Bureau also make them valuable to other government agencies and businesses. Because they are not protected by copyright, TIGER\/Line data have been widely adapted for many commercial uses. TIGER has been described as \"the first truly useful nationwide general-purpose spatial data set\" (Cooke 1997, p. 47). Some say that it jump-started a now-thriving geospatial data industry in the U.S.\r\n<h3>Objectives<\/h3>\r\nThe objective of this chapter is to familiarize you with MAF\/TIGER and two important concepts it exemplifies: topology and geocoding. Specifically, students who successfully complete Chapter 4 should be able to:\r\n<ol>\r\n\t<li>Explain how geographic entities are represented within MAF\/TIGER;<\/li>\r\n\t<li>Explain how geometric primitives in MAF\/TIGER are represented in TIGER\/Line Shapefile extracts;<\/li>\r\n\t<li>Define topology and explain why and how it is encoded in TIGER;<\/li>\r\n\t<li>Perform address geocoding; and<\/li>\r\n\t<li>Describe how TIGER\/Line files and similar products can be used for other applications, including routing and allocation.<\/li>\r\n<\/ol>\r\n<h3>Comments and Questions<\/h3>\r\nRegistered students are welcome to post comments, questions, and replies to questions about the text. Particularly welcome are anecdotes that relate the chapter text to your personal or professional experience. In addition, there are discussion forums available in the ANGEL course management system for comments and questions about topics that you may not wish to share with the whole world.\r\n\r\nTo post a comment, scroll down to the text box under \"Post new comment\" and begin typing in the text box, or you can choose to reply to an existing thread. When you are finished typing, click on either the \"Preview\" or \"Save\" button (Save will actually submit your comment). Once your comment is posted, you will be able to edit or delete it as needed. In addition, you will be able to reply to other posts at any time.\r\n\r\nNote: the first few words of each comment become its \"title\" in the thread.\r\n<h3><strong>Concept Map<\/strong><\/h3>\r\nYou may be interested in seeing the <a href=\"https:\/\/www.e-education.psu.edu\/files\/natureofgeoinfo\/file\/ch3-4_conceptmap(2).pdf\">concept map<\/a> used to guide development of Chapters 3 and 4.\r\n<h2>4.2. Checklist<\/h2>\r\n&nbsp;\r\n\r\nThe following checklist is for Penn State students who are registered for classes in which this text, and associated quizzes and projects in the ANGEL course management system, have been assigned. You may find it useful to print this page out first so that you can follow along with the directions.\r\n<table summary=\"Tasks to be compleated for the chapter\"><caption>Chapter 4 Checklist (for registered students only)<\/caption>\r\n<thead>\r\n<tr>\r\n<th>Step<\/th>\r\n<th>Activity<\/th>\r\n<th>Access\/Directions<\/th>\r\n<\/tr>\r\n<\/thead>\r\n<tbody>\r\n<tr>\r\n<th>1<\/th>\r\n<td><strong>Read<\/strong>\u00a0Chapter 4<\/td>\r\n<td>This is the second page of the Chapter. Click on the links at the bottom of the page to continue or to return to the previous page, or to go to the top of the chapter. You can also navigate the text via the links in the GEOG 482 menu on the left.<\/td>\r\n<\/tr>\r\n<tr>\r\n<th>2<\/th>\r\n<td>Submit\u00a0<strong>four practice quizzes<\/strong>including:\r\n<ul>\r\n\t<li>MAF and TIGER<\/li>\r\n\t<li>Shapefiles<\/li>\r\n\t<li>Topology<\/li>\r\n\t<li>Geocoding<\/li>\r\n<\/ul>\r\nPractice quizzes are not graded and may be submitted more than once.<\/td>\r\n<td>Go to ANGEL &gt; [your course section] &gt; Lessons tab &gt; Chapter 4 folder &gt; [quiz]<\/td>\r\n<\/tr>\r\n<tr>\r\n<th>3<\/th>\r\n<td>Perform\u00a0<strong>\u201cTry this\u201d activities<\/strong>including:\r\n<ul>\r\n\t<li>Explore availability of TIGER\/Line Shapefile geographies and features<\/li>\r\n\t<li>Download and view a TIGER\/Line Shapefile<\/li>\r\n\t<li>Geocode your address using a TIGER\/Line Shapefile<\/li>\r\n\t<li>Compare the geocoding performance of online routing services<\/li>\r\n\t<li>Explore resources about the Traveling Salesman Problem<\/li>\r\n<\/ul>\r\n\u201cTry this\u201d activities are not graded.<\/td>\r\n<td>Instructions are provided for each activity.<\/td>\r\n<\/tr>\r\n<tr>\r\n<th>4<\/th>\r\n<td>Submit the<strong>Chapter 4 Graded Quiz<\/strong><\/td>\r\n<td>ANGEL &gt; [your course section] &gt; Lessons tab &gt; Chapter 4 folder &gt; Chapter 4 Graded Quiz. See the Calendar tab in ANGEL for due dates.<\/td>\r\n<\/tr>\r\n<tr>\r\n<th>5<\/th>\r\n<td>\u00a0Read<strong>comments and questions<\/strong>posted by fellow students. Add comments and questions of your own, if any.<\/td>\r\n<td>\u00a0Comments and questions may be posted on any page of the text, or in a Chapter-specific discussion forum in ANGEL.<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<h2>4.3. MAF\/TIGER<\/h2>\r\n<strong>MAF\/TIGER is the Census Bureau\u2019s geographic database system<\/strong>. Several factors prompted the U.S. Census Bureau to create MAF\/TIGER: the need to conduct the census by mail, the need to produce wayfinding aids for census field workers, and its mission to produce map and data products for census data users.\r\n<h3>CONDUCTING THE CENSUS BY MAIL<\/h3>\r\nAs the population of the U.S. increased it became impractical to have census takers visit every household in person. Since 1970, the Census Bureau has mailed questionnaires to most households with instructions that completed forms should be returned by mail. Most but certainly not all of these questionnaires are dutifully mailed\u2014about 72 percent of all questionnaires in 2010. At that rate the Census Bureau estimates that some $1.6 billion was saved by reducing the need for field workers to visit non-responding households.\r\n\r\n<img alt=\"Census 2010 questionnaire\" src=\"http:\/\/opentextbc.ca\/natureofgeographicinformation\/wp-content\/uploads\/sites\/19\/2014\/01\/ch4_census2010.png\" \/>\r\n\r\n2010 Census questionnaire.\u00a0<a href=\"http:\/\/www.census.gov\/2010census\/about\/interactive-form.php\">For a question-by-question tour, go here<\/a>.\r\n\r\nTo manage its mail delivery and return operations, the Census Bureau relies upon a\u00a0<strong>Master Address File (MAF)<\/strong>. MAF is a complete inventory of housing units and many business locations in the U.S., Puerto Rico, and associated island areas. MAF was originally built from the U.S. Postal Service\u2019s Delivery Sequence File of all residential addresses. The MAF is updated through both corrections from field operations and a Local Update of Census Address (LUCA) program by which tribal, state, and local government liaisons review and suggest updates to local address records.\u00a0<strong>\u201cMAF\/TIGER\u201d refers to the coupling of the Master Address File with the TIGER spatial database<\/strong>, which together enable the Census Bureau to efficiently associate address-referenced census and survey data received by mail with geographic locations on the ground and tabulation areas of concern to Congress and many governmental agencies and businesses.\r\n\r\nIt\u2019s not as simple as it sounds. Postal addresses do not specify geographic locations precisely enough to fulfill the Census Bureau\u2019s constitutional mandate. An address is not a position in a grid coordinate system\u2013it is only one in a series of ill-defined positions along a route. The location of an address is often ambiguous because street names are not unique, numbering schemes are inconsistent, and because routes have two sides, left and right. Location matters, as you recall, because\u00a0<strong>census data must be accurately georeferenced to be useful for reapportionment, redistricting, and allocation of federal funds.<\/strong>\u00a0Thus the Census Bureau had to find a way to assign address referenced data automatically to particular census blocks, block groups, tracts, voting districts, and so on. That\u2019s what the \u201cGeographic Encoding and Referencing\u201d in the TIGER acronym refers to.\r\n<h3>MAPS FOR CENSUS FIELD WORKERS<\/h3>\r\nA second motivation that led to MAF\/TIGER was the need to help census takers find their way around. Millions of households fail to return questionnaires by mail, after all. Census takers (called \u201cenumerators\u201d at the Bureau) visit non-responding households in person.\u00a0<strong>Census enumerators need maps showing streets and select landmarks to help locate households.<\/strong>\u00a0Census supervisors need maps to assign census takers to particular territories. Field notes collected by field workers are an important source of updates and corrections to the MAF\/TIGER database.\r\n\r\nPrior to 1990, the Bureau relied on local sources for its maps. For example, 137 maps of different scales, quality, and age were used to cover the 30-square-mile St. Louis area during the 1960 census. The need for maps of consistent scale and quality forced the Bureau to become a map maker as well as a map user. Using the MAF\/TIGER system, Census Bureau geographers created over 17 million maps for a variety of purposes in preparation for the 2010 Census.\r\n<h3>DATA PRODUCTS<\/h3>\r\nThe Census Bureau\u2019s mission is not only to collect data, but also to make data products available to its constituents. In addition to the attribute data considered in Chapter 3, the Bureau disseminates a variety of geographic data products, including wall maps, atlases, and one of the earliest on-line mapping services, the TIGER Mapping Service. You can explore the<a href=\"http:\/\/www.census.gov\/geo\/maps-data\/index.html\">Bureau\u2019s maps and cartographic data products here<\/a>.\r\n\r\n<img alt=\"Screenshot of the TIGER Map Server Browser\" src=\"http:\/\/opentextbc.ca\/natureofgeographicinformation\/wp-content\/uploads\/sites\/19\/2014\/01\/tms.gif\" \/>\r\n\r\nLaunched in 1995, the TIGER Mapping Service was one of the earliest Internet map services. Registered students will use its successor, American Factfinder, in Project 2.\r\n<h3>MAF\/TIGER DATABASE REDESIGN<\/h3>\r\nThe Census Bureau conducted a major redesign of the MAF\/TIGER database in the years leading up to the 2010 decennial census. What were separate, homegrown database systems (MAF and TIGER) are now unified in the industry-standard Oracle relational database management system. Benefits of this \u201ccommercial off-the-shelf\u201d (COTS) database software include concurrent multi-user access, greater user familiarity, and better integration with Web development tools. As Galdi (2005) explains in his white paper \u201cSpatial Data Storage and Topology in the Redesigned MAF\/TIGER System,\u201d the redesign \u201cmirrors a common trend in the Information Technology (IT) and Geographic Information System (GIS) industries: the integration of spatial and non-spatial data into a single enterprise data set\u201d (p. 2).\r\n\r\nConcurrent with the MAF\/TIGER redesign, the Census Bureau also updated the distribution format of its TIGER\/Line map data extracts. Consistent with the Bureau\u2019s COTS strategy, it adopted the defacto standard Esri \u201cShapefile\u201d format. The following pages consider characteristics of the spatial data stored in MAF\/TIGER and in TIGER\/Line Shapefile extracts.\r\n<h3><strong>PODCAST<\/strong><\/h3>\r\nHear more about\u00a0<a href=\"http:\/\/www.directionsmag.com\/images\/podcasts\/Census1.mp3\">how the Census Bureau\u2019s Geography Division uses MAF\/TIGER and related tools to create maps for the 2010 Census<\/a>.\r\n<h2>4.4. Vector Extracts from MAF\/TIGER<\/h2>\r\nThe Census Bureau began to develop a digital geographic database of 144 metropolitan areas in the 1960s. By 1990, the early efforts had evolved into\u00a0<strong>TIGER<\/strong>: a seamless digital geographic database that covered the whole of the United States and its territories. As discussed in the previous page, MAF\/TIGER succeeded TIGER in the lead-up to the 2010 Census.\r\n\r\n<strong>TIGER\/Line Shapefiles<\/strong>\u00a0are digital map data products extracted from the MAF\/TIGER database. They are freely available from the Census Bureau, and are suitable for use by individuals, businesses and other agencies that don\u2019t have direct access to MAF\/TIGER.\r\n\r\nThis section outlines the geographic entities represented in the MAF\/TIGER database, describes how a particular implementation of the vector data model is used to represent those entities, and considers the accuracy of digital features in relation to their counterparts on the ground. The following page considers characteristics of the \u201cShapefile\u201d data format used to distribute digital extracts from MAF\/TIGER.\r\n<h3>GEOGRAPHIES REPRESENTED IN TIGER AND SHAPEFILE EXTRACTS<\/h3>\r\nThe MAF\/TIGER database is selective. Only those geographic entities needed to fulfill the Census Bureau\u2019s operational mission are included. Entities that don\u2019t help the Census Bureau conduct its operations by mail, or help field workers navigate a neighborhood, are omitted. Terrain elevation data, for instance, are not included in MAF\/TIGER. A comprehensive list of the \u201cfeature classes\u201d and \u201csuperclasses\u201d included in MAF\/TIGER and Shapefiles can be found in Appendix F of the<a href=\"http:\/\/www.census.gov\/geo\/maps-data\/data\/pdfs\/tiger\/tgrshp2012\/TGRSHP2012_TechDoc.pdf\">TIGER\/Line Shapefiles Technical Documentation<\/a>.\u00a0<strong>Examples of superclasses include<\/strong>:\r\n<ul>\r\n\t<li>Potential living quarters (e.g., sites of shelters, retirement homes, prisons, dormitories)<\/li>\r\n\t<li>Road\/path features (e.g., primary roads, secondary roads, local neighborhood roads)<\/li>\r\n\t<li>Hydrographic features (e.g., stream\/river, lake\/pond, ocean\/sea)<\/li>\r\n\t<li>Miscellaneous linear features (e.g., pipeline, powerline, fence line)<\/li>\r\n\t<li>Tabulation areas (e.g., county or equivalent, tract, block group, block<\/li>\r\n<\/ul>\r\n<table><caption>Excerpt from TIGER\/Line Technical Documentation<\/caption>\r\n<thead>\r\n<tr>\r\n<th>MTFCC<\/th>\r\n<th>FEATURE CLASS<\/th>\r\n<th>SUPERCLASS<\/th>\r\n<th>POINT<\/th>\r\n<th>LINEAR<\/th>\r\n<th>AREAL<\/th>\r\n<th>FEATURE CLASS DESCRIPTION<\/th>\r\n<\/tr>\r\n<\/thead>\r\n<tbody>\r\n<tr>\r\n<th>$1400<\/th>\r\n<td>Local Neighborhood Road, Rural Road, City Street<\/td>\r\n<td>Road\/Path Features<\/td>\r\n<td>N<\/td>\r\n<td>Y<\/td>\r\n<td>N<\/td>\r\n<td>Generally a paved non-arterial street, road, or byway that usually has a single lane of traffic in each direction. Roads in this feature class may be privately or publicly maintained. Scenic park roads would be included in this feature class, as would (depending on the region of the country) some unpaved roads.<\/td>\r\n<\/tr>\r\n<tr>\r\n<th>$1500<\/th>\r\n<td>Vehicular Trail (4WD)<\/td>\r\n<td>Road\/Path Features<\/td>\r\n<td>N<\/td>\r\n<td>Y<\/td>\r\n<td>N<\/td>\r\n<td>An unpaved dirt trail where a four-wheel drive vehicle is required. These vehicular trails are found almost exclusively in very rural areas. Minor, unpaved roads usable by ordinary cars and trucks belong in the $1400 category.<\/td>\r\n<\/tr>\r\n<tr>\r\n<th>$1630<\/th>\r\n<td>Ramp<\/td>\r\n<td>Road\/Path Features<\/td>\r\n<td>N<\/td>\r\n<td>Y<\/td>\r\n<td>N<\/td>\r\n<td>A road that allows controlled access from adjacent roads onto a limited access highway, often in the form of a cloverleaf interchange. These roads are unaddressable.<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\nExcerpt from TIGER\/Line Technical Documentation (Census Bureau 2012) showing some of the feature classes included in the \u201cRoad\/Path Features\u201d superclass.\r\n\r\nNote also that<strong>\u00a0neither the MAF\/TIGER database nor TIGER\/Line Shapefiles include the population data collected through questionnaires and by census takers.<\/strong>\u00a0MAF\/TIGER merely provides the geographic framework within which address-referenced census data are tabulated.\r\n<h3><strong>TRY THIS!<\/strong><\/h3>\r\n<h3>EXPLORING AVAILABLE TIGER\/LINE SHAPEFILES<\/h3>\r\nIn this Try This (One of 3 dealing with TIGER\/Line Shapefiles) you are going to explore which TIGER\/Line Shapefiles are available for download at various geographies and what information those files contain. We will be exploring the 2009 and 2010 versions of the TIGER\/Line Shapefile data sets. Versions from other years are available. Feel free to investigate those, too.\r\n<ul>\r\n\t<li>Follow\u00a0<a href=\"http:\/\/www.census.gov\/geo\/maps-data\/data\/tiger.html\">this link to get to the TIGER Products page<\/a>\u00a0of the Census Bureau web site, then follow the\u00a0<strong>TIGER\/Line Shapefiles<\/strong>\u00a0link found under\u00a0<strong>Which product should I use?<\/strong>\u00a0to get to the Geography page.<\/li>\r\n\t<li>Link to the 2010 TIGER\/Line Shapefiles via the\u00a0<strong>2010<\/strong>\u00a0tab link.<\/li>\r\n\t<li>Select\u00a0<strong>Download<\/strong>, and then from the expanded list choose\u00a0<strong>Web Interface<\/strong>.<\/li>\r\n\t<li>Expand the pick list under\u00a0<strong>Select a layer type<\/strong>. Spend some time choosing different entries from the layer pick list and then using the<strong>Submit\u00a0<\/strong>button to navigate through the sub layers taking note of when you are offered access to a Download button. Take note of a couple of things. (1) Some of the pick lists make a selection available that allows you to download a shapefile dataset for the entire country. (2) For some of the choices you must navigate to the County level before the Download button is available<\/li>\r\n<\/ul>\r\nAs stated above we want you to get a sense of the sorts of data that are available for the various geographies \u2014 from the county to the national level. Perusing the various layers as I had you doing above makes it difficult to make an overall assessment of what data there is at a given geographic scale. Fortunately for our purposes the Census has provided a convenient table to help us in this regard.\r\n<ul>\r\n\t<li>You should still be on the 2010 TIGER\/Line Shapefiles | Select a layer type page.\r\nClick on the\u00a0<strong>Documentation\u00a0<\/strong>link in the upper right portion of the page. This will take you back to the Geography page.<\/li>\r\n\t<li>Select the\u00a0<strong>2010\u00a0<\/strong>tab again.<\/li>\r\n\t<li>Select\u00a0<strong>File Availability<\/strong>.\r\nStudy the table that appears.<\/li>\r\n\t<li>Note that there are columns titled\u00a0<em>State- and County-based Files,<\/em><em>Nation-based\u00a0<\/em><em>Files<\/em>, and\u00a0<em>American Indian Area-based\u00a0<\/em><em>Files<\/em>.<\/li>\r\n\t<li>Compare which geographies (the\u00a0<em>Layer\u00a0<\/em>column) are available in the<em>Nation-Based Files<\/em>\u00a0category to those available in the\u00a0<em>State-Based Files<\/em>\u00a0category.\r\nWhat files are available for a state that are not available for the whole nation?\u00a0 Can you think of reasons why these are not available as a single national file? Post a comment below to discuss with your fellow students.<\/li>\r\n\t<li>Now, compare the\u00a0<em>State-Based Files<\/em>\u00a0category to the\u00a0<em>County-Based Files<\/em>\u00a0category. What files available at the state level are also available at the county-level?\u00a0 Once again, share your thoughts with your peers.<\/li>\r\n<\/ul>\r\n<h3>GEOMETRIC PRIMITIVES<\/h3>\r\nLike other implementations of the vector data model, MAF\/TIGER represents geographic entities using geometric primitives including nodes (point features), edges (linear features), and faces (area features). These are defined and illustrated below.\r\n<ul>\r\n\t<li><strong>Nodes<\/strong>\u00a0(labeled \u201cN\u201d in the illustration below) are \u201c0-dimensional,\u201d consisting only of a single pair of latitude and longitude coordinates.\r\n<ul>\r\n\t<li>Nodes N21-23 are\u00a0<strong>isolated nodes<\/strong>. That is, they are not end points of edges.<\/li>\r\n<\/ul>\r\n<\/li>\r\n\t<li><strong>Edges<\/strong>\u00a0(labeled \u201cE\u201d in the illustration below) are 1-dimensional linear primitives used to represent streets, railroads, pipelines, and rivers.\r\n<ul>\r\n\t<li>The end points of an edge are called\u00a0<strong>connecting nodes<\/strong>.<\/li>\r\n\t<li>Each edge is assigned a direction, denoted by the arrowheads. The directionality of the edge allows the designation of a\u00a0<strong>Start Node<\/strong>\u00a0and an\u00a0<strong>End Node<\/strong>. The Start Node of edge E12 below is N9, and the End Node is N6.<\/li>\r\n\t<li>An edge may have intermediate points called\u00a0<strong>vertices<\/strong>\u00a0that define its shape.<\/li>\r\n<\/ul>\r\n<\/li>\r\n\t<li><strong>Faces<\/strong>\u00a0(labeled \u201cF\u201d in the illustration below) are the 2-dimensional geometric primitives used to represent entities like blocks, counties, and voting districts. A face is a polygon bounded by edges.\r\n<ul>\r\n\t<li>The directionality of an edge also allows\u00a0<strong>left and right faces<\/strong>\u00a0to be designated. Face F1 is on the left of edge E12 and face F2 is to the right.<\/li>\r\n<\/ul>\r\n<\/li>\r\n<\/ul>\r\n<img alt=\"Geometric primitives and topology used in the MAF\/TIGER database\" src=\"http:\/\/opentextbc.ca\/natureofgeographicinformation\/wp-content\/uploads\/sites\/19\/2014\/01\/ch4_4_TIGER_primitives1.png\" \/>\r\n\r\n&nbsp;\r\n\r\nGeometric primitives of the Topologically Integrated Geographic Encoding and Referencing (TIGER) database. The figure shows what might be two adjacent Census blocks, with the bottom block bounded on the south by a river. The remaining edges might correspond to streets, and the isolated nodes might be landmarks such as a school, a church and a zoo.\r\n\r\n&nbsp;\r\n<h3>GEOMETRIC ACCURACY<\/h3>\r\nUntil recently the geometric accuracy of the vector features encoded in TIGER were notoriously poor (see illustration below). How poor?<strong>Through 2003<\/strong>, the\u00a0<a href=\"http:\/\/www.census.gov\/geo\/www\/tlmetadata\/metadata.html\">TIGER\/Line metadata<\/a>\u00a0stated that\r\n<blockquote>\r\n<div>Coordinates in the TIGER\/Line files have six implied decimal places, but the positional accuracy of these coordinates is not as great as the six decimal places suggest. The positional accuracy varies with the source materials used, but generally the information is no better than the established National Map Accuracy standards for 1:100,000-scale maps from the U.S. Geological Survey (Census Bureau 2003)\r\n<h3><strong>TRY THIS!<\/strong><\/h3>\r\nHaving performed scale calculations in Chapter 2 you should be able to calculate the magnitude of error (ground distance) associated with 1:100,000-scale topographic maps.\u00a0 Recall that the allowed error for USGS topographic maps at scales of 1:20,000 or smaller is 1\/50 inch (see the\u00a0<a href=\"http:\/\/nationalmap.gov\/standards\/pdf\/NMAS647.PDF\">nationalmap standards pdf<\/a>)\r\n\r\n<\/div><\/blockquote>\r\n<img alt=\"Image of mismatch between TIGER street data and aerial image\" src=\"http:\/\/opentextbc.ca\/natureofgeographicinformation\/wp-content\/uploads\/sites\/19\/2014\/01\/TIGER_inaccuracy.png\" \/>\r\n\r\nDiscrepancy between pre-modernization TIGER\/Line file streets (red) and actual geometry of street network shown in an orthorectified aerial image (U.S. Census Bureau n.d).\r\n<h3>ACCURACY IMPROVEMENT<\/h3>\r\nStarting in 2002, in preparation for the 2010 census, the Census Bureau commissioned a six-year, $200 million MAF\/TIGER Accuracy Improvement Project (MTAIP).\u00a0<strong>One objective of the effort was to use GPS to capture accurate geographic coordinates for every household in the MAF<\/strong>.\u00a0<strong>Another objective was to improve the accuracy of TIGER\u2019s road\/path features.<\/strong>\u00a0The project aimed to adjust the geometry of street networks to align within 7.6 meters of street intersections observed in orthoimages or measured using GPS. The corrected streets are necessary not just for mapping, but for accurate geocoding. Because streets often form the boundaries of census areas, it is essential that accurate household locations are associated with accurate street networks.\r\n\r\nMTAIP integrated over 2,000 source files submitted by state, tribal, county, and local governments. Contractors used survey-grade GPS to evaluate the accuracy of a random sample of street centerline intersections of the integrated source files. The evaluation confirmed that most but not all features in the spatial database equal or exceed the 7.6 meter target. Uniform accuracy wasn\u2019t possible due to the diversity of local source materials used, though this accuracy is the standard in the \u201cAll Lines\u201d Shapefile extracts. The geometric accuracy of particular feature classes included in particular shapefiles are documented in the metadata associated with that shapefile extract.\r\n\r\nMTAIP was completed in 2008. In conjunction with the continuous American Community Survey and other census operations, corrections and updates are now ongoing. TIGER\/Line Shapefile updates are now released annually.\r\n<h3><strong>PRACTICE QUIZ<\/strong><\/h3>\r\nRegistered Penn State students should return now to the Chapter 4 folder in ANGEL (via the Resources menu to the left) to take a self-assessment quiz about MAF and TIGER.\r\n\r\nYou may take practice quizzes as many times as you wish. They are not scored and do not affect your grade in any way.\r\n<h2>4.5. Shapefiles<\/h2>\r\nSince 2007, TIGER\/Line extracts from the MAF\/TIGER database have been distributed in shapefile format. Esri introduced shapefiles in the early 1990s as the native digital vector data format of its ArcView software product. The shapefile format is proprietary, but open; its\u00a0<a href=\"http:\/\/www.esri.com\/library\/whitepapers\/pdfs\/shapefile.pdf\">technical specifications are published<\/a>\u00a0and can be implemented and used freely. Largely as a result of ArcView\u2019s popularity, shapefile has become a de facto standard for creation and interchange of vector geospatial data. The Census Bureau\u2019s adoption of Shapefile as a distribution format is therefore consistent with its overall strategy of conformance with mainstream information technology practices.\r\n<h3>ELEMENTS OF A SHAPEFILE DATA SET<\/h3>\r\nThe first thing GIS pros need to know about shapefiles is that\u00a0<strong>every shapefile data set includes a minimum of three files<\/strong>. One of the three required files stores the geometry of the digital features as sets of vector coordinates. A second required file holds an index that, much like the index in a book, allows quicker access to the spatial features and therefore speeds processing of a given operation involving a subset of features. The third required file stores attribute data in dBASE\u00a9 format, one of the earliest and most widely-used digital database management system formats. All of the files that make up a Shapefile data set have the same root or prefix name, followed by a three-letter suffix or file extension. The list below shows the names of the three required files making up a shapefile data set named \u201ccounties.\u201d Take note of the file extensions.\r\n<ul>\r\n\t<li>counties.shp: The main shape file, containing vector coordinate data<\/li>\r\n\t<li>counties.shx: The index file<\/li>\r\n\t<li>counties.dbf: The dBASE table<\/li>\r\n<\/ul>\r\nEsri lists twelve additional optional files, and practitioners are able to include still others. Two of the most important optional files are the \u201c.prj\u201d file, which includes the coordinate system definition, and \u201c.xml\u201d, which stores metadata. (Why do you suppose that something as essential as a coordinate system definition is considered \u201coptional\u201d?)\r\n<h3><strong>TRY THIS!<\/strong><\/h3>\r\n<h3>DOWNLOADING AND VIEWING A TIGER\/LINE SHAPEFILE<\/h3>\r\nIn this\u00a0<em>Try This!<\/em>\u00a0(the second of 3 dealing with TIGER\/Line Shapefiles), you will download a TIGER\/Line Shapefile dataset, investigate the file structure of a typical Esri shapefile, and view it in GIS software.\r\n\r\nYou can use a free software application called\u00a0<strong>Global Mapper\u00a0<\/strong>(originally known as\u00a0<strong>dlgv32 Pro<\/strong>) to investigate TIGER\/Line shapefiles. Originally developed by the staff of the USGS Mapping Division at Rolla, Missouri as a data viewer for USGS data, Global Mapper has since been commercialized, but is available in a free trial version. The instructions below will guide you through the process of installing the software and opening the TIGER\/Line data.\r\n<ol>\r\n\t<li><strong>Downloading TIGER\/Line Shapefiles:\u00a0<\/strong>You are going to use the 2010 TIGER\/Line Shapefiles.\r\n<ul>\r\n\t<li>Return to the\u00a0<a href=\"http:\/\/www.census.gov\/cgi-bin\/geo\/shapefiles2010\/main\">2010 TIGER\/Line Shapefiles download page<\/a>.<\/li>\r\n\t<li>From the\u00a0<em>Select a layer type<\/em>\u00a0pick list, under\u00a0<em>Features<\/em>, choose\u00a0<strong>All Lines<\/strong>\u00a0and click\u00a0<strong>submit<\/strong>. (You are welcome to download and investigate any TIGER\/Line Shapefile(s), but we will use an\u00a0<em>All Lines<\/em>dataset in the geocoding Try This later in the chapter, so your downloading one here will make you more familiar with the content.)<\/li>\r\n\t<li>From the All Lines pick list select a state or territory and click<strong>Submit<\/strong>.<\/li>\r\n\t<li>Select a County from the next pick list that appears and click<strong>Download<\/strong>.<\/li>\r\n\t<li>Save the file to your computer.\r\nThe file you download should have a name like<em>tl_2010_42027_edges.zip<\/em>. The root name of this file,<em>tl_2010_<\/em><em>42027<\/em><em>_edges<\/em>\u00a0in this example, will also be the name of the shapefile dataset. The\u00a0<em>42027\u00a0<\/em>is a federal code that represents Pennsylvania (state 42) and Centre County (county 027). The five-digit code in your file name will depend on which state and county you selected.<\/li>\r\n\t<li>The data are compressed in a .zip archive. Extract the data to a new named folder in a known location. (Within the file hierarchy that is extracted there may be a second .zip file that needs to be uncompressed.)<\/li>\r\n<\/ul>\r\n<\/li>\r\n\t<li><strong>Investigating the shapefile data set:<\/strong>\r\n<ul>\r\n\t<li>Navigate to\u00a0<em>within<\/em>\u00a0the folder in which you stored your uncompressed TIGER\/Line Shapefile dataset.<\/li>\r\n\t<li>Notice the multiple files which make up the shapefile dataset, including:\r\n<ul>\r\n\t<li>tl_2010_42027_edges.shp, containing the vector coordinate data<\/li>\r\n\t<li>tl_2010_42027_edges.shp.xml, containing metadata<\/li>\r\n\t<li>tl_2010_42027_edges.shx, the index file<\/li>\r\n\t<li>tl_2010_42027_edges.dbf, the dBASE file<\/li>\r\n\t<li>tl_2010_42027_edges.prj, containing the projection\/spatial reference<\/li>\r\n<\/ul>\r\n<\/li>\r\n\t<li>All of the files work in concert to store the necessary components of the Esri\u00a0<em>shapefile data set<\/em>. You may be familiar with some of the individual files types. The contents of three of them can be easily viewed. Let\u2019s open those three. You can double click on the file and then select \u201cfrom a list of installed programs,\u201d or you may need to run the suggested application and open the file from within it. Let me know if you need help, or help each other in the ANGEL Chapter 4 Discussion Forum or in the Comments area below.\r\n<ul>\r\n\t<li>Open the\u00a0<strong>.dbf<\/strong>\u00a0file using Microsoft Excel.\r\nNote the typical row-column structure of a flat-file database. Can you find the four columns, or fields, that hold the address range information? Look for LFROMADD, etc. The field name LFROMADD is shorthand for Left From Address. The 10-character length of the field name points up one of the constraints of the dBASE format\u2013field names are limited to 10 characters.<\/li>\r\n\t<li>Open the<strong>\u00a0.xml<\/strong>\u00a0file using your web browser.\r\nYou should see the metadata information bracketed by\u00a0<em>tags<\/em>contained within directional brackets &lt; &gt;. XML stands for Extensible Markup Language, and is a common set of rules for encoding documents. Can you locate the portion of the document having to do with horizontal spatial accuracy?\u00a0 (Spatial accuracy metadata is available when you\u2019ve chosen the\u00a0<em>All Lines\u00a0<\/em>file as your candidate shapefile.)<\/li>\r\n\t<li>Open the<strong>\u00a0.prj<\/strong>\u00a0file using Notepad, or any vanilla text editor.\r\nThere are five pieces of information in this file, separated by commas. What are they? They should reinforce some of what you learned in Chapter 2 regarding what defines a geographic coordinate system.<\/li>\r\n\t<li>The\u00a0<strong>.shp<\/strong>\u00a0and\u00a0<strong>.shx<\/strong>\u00a0files are proprietary and specific to the functionality of the shapefile data set.<\/li>\r\n<\/ul>\r\n<\/li>\r\n\t<li>Discuss what you find with your classmates in comments below.<\/li>\r\n\t<li>Note that one should not alter the contents of any of these files with any application other than a GIS program that is designed for that task.<\/li>\r\n<\/ul>\r\n<\/li>\r\n\t<li><strong>Viewing the shapefile dataset in Global Mapper:<\/strong>\r\n<ul>\r\n\t<li>Download and install the Global Mapper software:\r\n<ol>\r\n\t<li>Navigate to the\u00a0<a href=\"http:\/\/www.bluemarblegeo.com\/products\/global-mapper.php\">Blue Marble Global Mapper site<\/a>.<\/li>\r\n\t<li>Download the trial version of the software<\/li>\r\n\t<li>Double-click on the setup file you downloaded to install the program<\/li>\r\n\t<li>Launch the Global Mapper program<\/li>\r\n<\/ol>\r\n<\/li>\r\n\t<li>After opening the Global Mapper software, choose\u00a0<em>Open Data File(s)..<\/em>. under the\u00a0<em>File\u00a0<\/em>menu, or click the \u201cOpen Your Own Data Files\u201d button in the center of the window.\u00a0 Navigate to the extracted shapefile dataset you downloaded above and open it. (Remember, your complete shapefile data set will have a name similar to<em>tl_2010_42027_edges<\/em>. It will show up in the\u00a0<em>Open\u00a0<\/em>dialog with a .shp extension.)<\/li>\r\n\t<li>You should be able to see all of the line features (the\u00a0<em>edges<\/em>, from the MAF\/TIGER database) contained in your county. If you are using the newest version of Global Mapper you should be able to discern roads from rivers\/streams from administrative boundaries, etc. In older versions of the application the default view showed all line features in a single color and line weight, so the user needed to use the symbolization tools to make the different classes of features distinguishable.\r\nWhat do you think has to be understood by the mapping application to allow it to automatically symbolize features differently? Post your thoughts below.<\/li>\r\n<\/ul>\r\n<\/li>\r\n<\/ol>\r\n<h3>SHAPEFILE PRIMITIVES<\/h3>\r\nA single shapefile data set can contain one of three types of spatial data primitives, or features \u2013 points, lines or polygons (areas). The technical specification defines these as follows:\r\n<ul>\r\n\t<li><strong>Points<\/strong>: A point consists of a pair of double-precision coordinates in the order X,Y.<\/li>\r\n\t<li><strong>Lines<\/strong>: More specifically a polyline, is an ordered set of points, or vertices, that consists of one or more\u00a0<strong>parts<\/strong>. A part is a connected sequence of two or more points. Parts may or may not be connected to one another. Parts may or may not intersect one another.<\/li>\r\n\t<li><strong>Polygons<\/strong>: A polygon consists of one or more\u00a0<strong>rings<\/strong>. A ring is a connected sequence of four or more points, or vertices, that form a closed, non-self-intersecting loop.<\/li>\r\n\t<li><strong>Other<\/strong>: M (measured; route data) and Z (3D; vertical datum) versions of point, polyline and polygon Shapefile data sets can be created, but are not included in the TIGER\/Line Shapefile extracts.<\/li>\r\n<\/ul>\r\n<img alt=\"Diagram illustrating geometric primitives of the Shapefile format\" src=\"http:\/\/opentextbc.ca\/natureofgeographicinformation\/wp-content\/uploads\/sites\/19\/2014\/01\/ch4_5_Shapefile_primitives.png\" \/>\r\n\r\n&nbsp;\r\n\r\nThree Shapefile data sets that could be extracted from the MAF\/TIGER data depicted on the preceding page\r\n\r\nAt left in the illustration above, a\u00a0<strong>polygon Shapefile data set<\/strong>\u00a0holds the Census blocks in which the edges from the MAF\/TIGER database have been combined to form two distinct polygons, P1 and P2. The diagram shows the two polygons separated to emphasize the fact that what is the single E12 edge in the MAF\/TIGER database (see the diagram on page 4)\u00a0 is now present in each of the Census block polygon features.\r\n\r\nIn the middle of the illustration, a\u00a0<strong>polyline Shapefile data set<\/strong>\u00a0holds seven line features (L1-7) that correspond to the seven edges in the MAF\/TIGER database. The directionality of the line features that represent streets corresponds to address range attributes in the associated dBASE\u00a9 table. Vertices define the shape of a polygon or a line, and the Start and End Nodes from the MAF\/TIGER database are now First and Last Vertices.\r\n\r\nFinally, at right in the illustration above, a\u00a0<strong>point Shapefile data set<\/strong>\u00a0holds the three isolated nodes from the MAF\/TIGER database.\r\n<h3><strong>PRACTICE QUIZ<\/strong><\/h3>\r\nRegistered Penn State students should return now to the Chapter 4 folder in ANGEL (via the Resources menu to the left) to take a self-assessment quiz about Shapefiles.\r\n\r\nYou may take practice quizzes as many times as you wish. They are not scored and do not affect your grade in any way.\r\n<h2>4.6. Topology<\/h2>\r\nTopology is different than topography. (You\u2019d be surprised how often these terms get mixed up.) In Chapter 2 you read about the various ways that\u00a0<em>absolute<\/em>\u00a0positions of features can be specified in a coordinate system, and how those coordinates can be projected or otherwise transformed. Topology refers to the\u00a0<em>relative<\/em>\u00a0positions of spatial features.\u00a0<strong>Topological relations among features<\/strong>\u2014such as containment, connectivity, and adjacency\u2014<strong>don\u2019t change when a dataset is transformed<\/strong>. For example, if an isolated node (representing a household) is located inside a face (representing a congressional district) in the MAF\/TIGER database, you can count on it remaining inside that face no matter how you might project, rubber-sheet, or otherwise transform the data. Topology is vitally important to the Census Bureau, whose constitutional mandate is to accurately associate population counts and characteristics with political districts and other geographic areas.\r\n\r\nAs David Galdi (2005) explains in his white paper \u201cSpatial Data Storage and Topology in the Redesigned MAF\/TIGER System,\u201d the \u201cTI\u201d in TIGER stands for \u201cTopologically Integrated.\u201d This means that\u00a0<strong>the various features represented in the MAF\/TIGER database<\/strong>\u2014such as streets, waterways, boundaries, and landmarks (but not elevation!)\u2014<strong>are not encoded on separate \u201clayers.\u201d<\/strong>\u00a0Instead, features are made up of a small set of geometric primitives\u2014including 0-dimensional nodes and vertices, 1-dimensional edges, and 2-dimensional faces\u2014without redundancy. That means that where a waterway coincides with a boundary, for instance, MAF\/TIGER represents them both with one set of edges, nodes and vertices. The attributes associated with the geometric primitives allow database operators to retrieve feature sets efficiently with simple spatial queries. The separate feature-specific TIGER\/Line Shapefiles published at the county level (such as point landmarks, hydrography, Census block boundaries, and, the \u201cAll Lines\u201d file you are using in the multi-part \u201cTry This\u201d) were extracted from the MAF\/TIGER database in that way. Notice, however, that when you examine a hydrography shapefile and a boundary shapefile, you will see redundant line segments where the features coincide. That fact confirms that\u00a0<strong>TIGER\/Line Shapefiles<\/strong>, unlike the MAF\/TIGER database itself,\u00a0<strong>are not topologically integrated<\/strong>. Desktop computers are now powerful enough to calculate\u00a0<strong>topology \u201con the fly\u201d<\/strong>from shapefiles or other non-topological data sets.\u00a0 However, the large batch processes performed by the Census Bureau still benefit from the MAF\/TIGER database\u2019s\u00a0<strong>persistent topology<\/strong>.\r\n\r\nMAF\/TIGER\u2019s topological data structure also benefits the Census Bureau by allowing it to automate error-checking processes. By definition, features in the TIGER\/Line files conform to a set of topological rules (Galdi 2005):\r\n<ol>\r\n\t<li>Every edge must be bounded by two nodes (start and end nodes)..<\/li>\r\n\t<li>Every edge has a left and right face.<\/li>\r\n\t<li>Every face has a closed boundary consisting of an alternating sequence of nodes and edges.<\/li>\r\n\t<li>There is an alternating closed sequence of edges and faces around every node.<\/li>\r\n\t<li>Edges do not intersect each other, except at nodes.<\/li>\r\n<\/ol>\r\nCompliance with these topological rules is an aspect of data quality called<strong>logical consistency<\/strong>.\u00a0 In addition, the boundaries of geographic areas that are related hierarchically\u2014such as blocks, block groups, tracts, and counties\u2014are represented with common, non-redundant edges. Features that do not conform to the topological rules can be identified automatically, and corrected by the Census geographers who edit the database. Given that the MAF\/TIGER database covers the entire U.S. and its territories, and includes many millions of primitives, the ability to identify errors in the database efficiently is crucial.\r\n\r\nSo how does topology help the Census Bureau assure the accuracy of population data needed for reapportionment and redistricting? To do so, the Bureau must aggregate counts and characteristics to various geographic areas, including blocks, tracts, and voting districts. This involves a process called \u201caddress matching\u201d or \u201caddress geocoding\u201d in which data collected by household is assigned a topologically-correct geographic location. The following pages explain how that works.\r\n<h3><strong>PRACTICE QUIZ<\/strong><\/h3>\r\nRegistered Penn State students should return now to the Chapter 4 folder in ANGEL (via the Resources menu to the left) to take a self-assessment quiz about Topology.\r\n\r\nYou may take practice quizzes as many times as you wish. They are not scored and do not affect your grade in any way.\r\n<h2>4.7. Geocoding<\/h2>\r\n<strong>Geocoding is the process used to convert location codes, such as street addresses or postal codes, into geographic (or other) coordinates.<\/strong>\u00a0The terms \u201caddress geocoding\u201d and \u201caddress mapping\u201d refer to the same process. Geocoding address-referenced population data is one of the Census Bureau\u2019s key responsibilities.\u00a0 However, as you know, it\u2019s also a very popular capability of online mapping and routing services. In addition, geocoding is an essential element of a suite of techniques that are becoming known as \u201cbusiness intelligence.\u201d We\u2019ll look at applications like these later in this chapter, but first let\u2019s consider how the Census Bureau performs address geocoding.\r\n<h3>ADDRESS GEOCODING AT THE U.S. CENSUS<\/h3>\r\nPrior to the MAF\/TIGER modernization project that led up to the decennial census of 2010, the TIGER database did not include a complete set of point locations for U.S. households. Lacking point locations, TIGER was designed to support address geocoding by approximation. As illustrated below, the pre-modernization TIGER database included<strong>address range attributes<\/strong>\u00a0for the edges that represent streets. Address range attributes were also included in the TIGER\/Line files extracted from TIGER. Coupled with the Start and End nodes bounding each edge, address ranges enable users to estimate locations of household addresses.\r\n\r\n<img alt=\"Diagram showing neighborhood map with addresses (top) and the adress data being recorded in program window (bottom)\" src=\"http:\/\/opentextbc.ca\/natureofgeographicinformation\/wp-content\/uploads\/sites\/19\/2014\/01\/address_ranges.gif\" \/>\r\n\r\nHow address range attributes were encoded in TIGER\/Line files (U.S. Census Bureau 1997). Address ranges in contemporary TIGER\/Line Shapefiles are similar, except that \u201cFrom\u201d (FR) and \u201cTo\u201d nodes are now called \u201cStart\u201d and \u201cEnd\u201d. Also, changes have been made to field (column) names in the attribute tables. Compare the names of the address range fields that you looked at in the second Try This exercise to those above.\r\n\r\nHere\u2019s how it works. The diagram above highlights an edge that represents a one-block segment of Oak Avenue. The edge is bounded by two nodes, labeled \u201cStart\u201d and \u201cEnd.\u201d A corresponding record in an attribute table includes the unique ID number (0007654320) that identifies the edge, along with starting and ending addresses for the left (FRADDL, TOADDL) and right (FRADDR, TOADDR) sides of Oak Avenue. Note also that the address ranges include potential addresses, not just existing ones. This is to make sure that the ranges will remain valid as new buildings are constructed along the street.\r\n\r\n<strong>A common geocoding error occurs when Start and End designations are assigned to the wrong connecting nodes<\/strong>. You may have read in Galdi\u2019s (2005) white paper \u201cSpatial Data Storage and Topology in the Redesigned MAF\/TIGER System,\u201d that in MAF\/TIGER, \u201can arbitrary direction is assigned to each edge, allowing designation of one of the nodes as the Start Node, and the other as the End Node\u201d (p. 3). If an edge\u2019s \u201cdirection\u201d happens not to correspond with its associated address ranges, a household location may be placed on the wrong side of a street.\r\n\r\nAlthough many local governments in the U.S. have developed their own GIS \u201cland bases\u201d with greater geometric accuracy than pre-modernization TIGER\/Line files, similar address geocoding errors still occur. Kathryn Robertson, a GIS Technician with the City of Independence, Missouri (and a student in the Fall 2000 offering of this course) pointed out how important it is that Start (or \u201cFrom\u201d) nodes and End (or \u201cTo\u201d) nodes correspond with the low and high addresses in address ranges. \u201cI learned this the hard way,\u201d she wrote, \u201cgeocoding all 5,768 segments for the city of Independence and getting some segments backward. When address matching was done, the locations were not correct. Therefore, I had to go back and look at the direction of my segments. I had a rule of thumb, all east-west streets were to start from west and go east; all north-south streets were to start from the south and go north\u201d (personal communication).\r\n\r\nAlthough this may have been a sensible strategy for the City of Independence, can you imagine a situation in which Kathryn\u2019s rule-of-thumb might not work for another municipality? If so, and if you\u2019re a registered student, please add a comment to this page.\r\n<h3>AFTER MAF\/TIGER MODERNIZATION<\/h3>\r\nIf TIGER had included accurate coordinate locations for every household, and correspondingly accurate streets and administrative boundaries, geocoding census data would be simple and less error-prone. Many local governments digitize locations of individual housing units when they build GIS land bases for property tax assessment, E-911 dispatch and other purposes. The MAF\/TIGER modernization project begun in 2002 aimed to accomplish this for the entire nationwide TIGER database in time for the 2010 census. The illustration below shows the intended result of the modernization project, including properly aligned streets, shorelines, and individual household locations, shown here in relation to an orthorectified aerial image.\r\n\r\n<img alt=\"Image showing modernized TIGER household locations and aligned streets\" src=\"http:\/\/opentextbc.ca\/natureofgeographicinformation\/wp-content\/uploads\/sites\/19\/2014\/01\/TIGER_goal.png\" \/>\r\n\r\nIntended accuracy and completeness of modernized TIGER data in relation to the real world. TIGER streets (yellow), shorelines (blue), and housing unit locations (red) are superimposed over an orthorectified aerial image. (U.S. Census Bureau n.d.). National coverage of housing unit locations and geometrically-accurate streets and other features were not available in 2000 or before.\r\n\r\nThe modernized MAF\/TIGER database described by Galdi (2005) is now in use, including precise geographic locations of over 100 million household units. However, because\u00a0<strong>household locations are considered confidential<\/strong>, users of TIGER\/Line Shapefiles extracted from the MAF\/TIGER database still must rely upon address geocoding using address ranges.\r\n<h3>LEVERAGING TIGER\/LINE DATA FOR PRIVATE ENTERPRISE<\/h3>\r\nLaunched in 1996, MapQuest was one of the earliest online mapping, geocoding and routing services. MapQuest combined the capabilities of two companies: a cartographic design firm with long experience in producing road atlases, \u201cTripTiks\u201d for the American Automobile Association, and other map products, and a start-up company that specialized in custom geocoding applications for business.\u00a0 Initially, MapQuest relied in part on TIGER\/Line street data extracted from the pre-modernization TIGER database. MapQuest and other commercial firms were able to build their businesses on TIGER data because of the U.S. government\u2019s wise decision not to restrict its reuse. It\u2019s been said that this decision triggered the rapid growth of the U.S. geospatial industry.\r\n\r\nLater on in this chapter we\u2019ll visit MapQuest and some of its more recent competitors. Next, however, you\u2019ll have a chance to see how geocoding is performed using a TIGER\/Line data in a GIS.\r\n<h2>4.8. Geocoding with TIGER\/Line Shapefiles<\/h2>\r\n<h3><strong>TRY THIS!<\/strong><\/h3>\r\n<h3>GEOCODING IN A GIS<\/h3>\r\nPart 3 of 3 in the TIGER\/Line Shapefile\u00a0<em>Try This!<\/em>\u00a0series is not interactive but instead illustrates how the address ranges encoded in TIGER\/Line Shapefiles can be used to pinpoint (more or less!) the geographic locations of street addresses in the U.S.\r\n\r\nThe process of geocoding a location within a GIS begins with a line dataset (shapefile) with the necessary address range attributes.\u00a0 The following image is an example of the attribute table of a TIGER\/Line shapefile.\r\n\r\n<img alt=\"Screenshot of Attribute Table\" src=\"http:\/\/opentextbc.ca\/natureofgeographicinformation\/wp-content\/uploads\/sites\/19\/2014\/01\/482Attributes.jpg\" \/>\r\n\r\nVisible in this image are just a few rows, which represent a handful of road segments and their corresponding address ranges.\u00a0 This shapefile contains over 29,000 road segments in total.\u00a0 Note the names of some of the attributes:\r\n<ul>\r\n\t<li>FULLNAME \u2013 The street name of the road segment<\/li>\r\n\t<li>LFROMADD \u2013 The address number at the beginning of the road segment on the left side of the street<\/li>\r\n\t<li>LTOADD \u2013 The address number at the end of the road segment on the left side of the street<\/li>\r\n\t<li>RFROMADD \u2013 The address number at the beginning of the road segment on the right side of the street<\/li>\r\n\t<li>RTOADD \u2013 The address number at the end of the road segment on the right side of the street<\/li>\r\n\t<li>ZIPL \u2013 The zip code area that is present to the left side of the road segment<\/li>\r\n\t<li>ZIPR \u2013 The zip code area that is present to the right side of the street<\/li>\r\n<\/ul>\r\nNext, the GIS software needs to know which of these attributes contains each piece of the necessary address range information.\u00a0 Some shapefiles use different names for their attributes, so the GIS can\u2019t always know which attribute contains the Right-Side-From-Address information, for example.\u00a0 In ArcGIS, for example, something called a Locator is configured that maps the attributes in the shapefile to the corresponding piece of necessary address information.\u00a0 The image below illustrates what this mapping looks like:\r\n\r\n<img alt=\"Screenshot of ArcGIS Locator\" src=\"http:\/\/opentextbc.ca\/natureofgeographicinformation\/wp-content\/uploads\/sites\/19\/2014\/01\/482AddressLocator.jpg\" \/>\r\n\r\nNote the items with an asterisk (*).\u00a0 These are the minimum required attributes that need to be present in the shapfile for the geocoding to work.\u00a0 The items in the \u201cAlias Name\u201d column correspond to attributes in the shapefile.\r\n\r\nWe are now ready to find a location by searching for a street address!\u00a0 Let\u2019s geocode the location for \u201c1971 Fairwood Lane, 16803\u2033.\r\n\r\nWhen an address is specified, the GIS queries the attribute table to find rows with a matching street name in the correct zipcode.\u00a0 Also, the particular segment of the street that contains the address number is identified.\u00a0 The below image shows the corresponding selection in the attribute table:\r\n\r\n<img alt=\"Screenshot of Highlighted Attribute\" src=\"http:\/\/opentextbc.ca\/natureofgeographicinformation\/wp-content\/uploads\/sites\/19\/2014\/01\/482HighlightedAttribute.jpg\" \/>\r\n\r\nThe image below shows the corresponding road segment highlighted on a map.\u00a0 The To and From address values for the road segment have been added so you can see the range of addresses.\r\n\r\n<img alt=\"Screenshot of Road Segment\" src=\"http:\/\/opentextbc.ca\/natureofgeographicinformation\/wp-content\/uploads\/sites\/19\/2014\/01\/482RoadSegment.jpg\" \/>\r\n\r\nFinally, the GIS interpolates where along the road segment the value of 1971 occurs and places it on the appropriate side of the street based on the even\/odd values indicated in the attribute table.\u00a0 The image below shows the final result of the geocoding process:\r\n\r\n<img alt=\"Screenshot of Final Result\" src=\"http:\/\/opentextbc.ca\/natureofgeographicinformation\/wp-content\/uploads\/sites\/19\/2014\/01\/482PointOnMap.jpg\" \/>\r\n\r\nThe accuracy of a geocoded location is dependent on a number of factors, including the quality of the line work in a shapefile, the accuracy of the address range attributes of each road segment, and the interpolation performed by the software.\u00a0 As you may see in the following section, different geocoding services may provide different location results due to the particular data and procedures used.\r\n<h2>4.9. Geocoding Online<\/h2>\r\nNo doubt you\u2019re familiar with one or more popular online mapping services. How well do they do at geocoding the location of a postal address? You can try it out for yourself at several Web-based mapping services, including\u00a0<a href=\"http:\/\/www.mapquest.com\/\">MapQuest.com<\/a>,\u00a0<a href=\"http:\/\/www.bing.com\/maps\/\">Microsoft\u2019s Bing Maps<\/a>, and\u00a0<a href=\"http:\/\/www.geocode.com\/\">Tele Atlas\/TomTom\u2019s Geocode.com<\/a>. Tele Atlas, for example, is a leading manufacturer of digital street data for vehicle navigation systems. To accommodate the routing tasks that navigation systems are called upon to serve, the streets are encoded as vector features whose attributes include address ranges. (In order to submit an address for geocoding at Geocode.com you have to set up a trial account through their EZ-Locate Interactive web tool or download the EZ-Locate software).\r\n\r\n<img alt=\"Screenshot of the Tele Atlas Geocode.com adress submission window\" src=\"http:\/\/opentextbc.ca\/natureofgeographicinformation\/wp-content\/uploads\/sites\/19\/2014\/01\/ch4p9TeleAtlasTomTom_EZ-Locate_input_jan2013.jpg\" \/>\r\n\r\nSubmitting an address to Tele Atlas\u2019 Geocode.com service for geocoding. \u00a9 2013 TomTom North America, Inc. All rights reserved.\r\n\r\nShown above is the form by which you can geocode an address to a location in a Tele Atlas street database. The result is shown below.\r\n\r\n<img alt=\"Screenshot of Tele Atlas geocoding results window\" src=\"http:\/\/opentextbc.ca\/natureofgeographicinformation\/wp-content\/uploads\/sites\/19\/2014\/01\/ch4p9TeleAtlasTomTom_EZ-Locate_result_jan2013.jpg\" \/>\r\n\r\nTele Atlas\u2019 Geocode.com service estimates the location of the address relative to the address range attributes encoded in its database. \u00a9 2013 TomTom North America, Inc. All rights reserved.\r\n\r\nLet\u2019s compare the geocoding capabilities of MapQuest.com to locate the address on an actual map.\r\n\r\n<img alt=\"Screenshot of Mapquest Address Locator 2013\" src=\"http:\/\/opentextbc.ca\/natureofgeographicinformation\/wp-content\/uploads\/sites\/19\/2014\/01\/ch4p9Mapquest_jan2013.jpg\" \/>\r\n\r\nAddress geocoded by MapQuest.com. \u00a9 2013 MapQuest.com, Inc. All rights reserved.\r\n\r\nThe MapQuest.com map from 2013 estimates the address is close to its actual location. Below is a similar MapQuest product created back in 1998, when this course was first being developed. On the older map the same address is plotted on the opposite side of the street. What do you suppose is wrong with the address range attribute in that case?\r\n\r\nOn the map from 1998, also note the shapes of the streets. The street shapes in the 2011 map have been improved.\u00a0 The 1998 product seems to have been generated from the 1990 version of the TIGER\/Line files, which may have been all that was available for this relatively remote part of the country.\u00a0 Now MapQuest licenses street data from a business partner called\u00a0<a href=\"http:\/\/www.navteq.com\/\">NAVTEQ<\/a>.\r\n\r\n<img alt=\"Screenshot of MapQuest 1998\" src=\"http:\/\/opentextbc.ca\/natureofgeographicinformation\/wp-content\/uploads\/sites\/19\/2014\/01\/geocoding_mapquest.gif\" \/>\r\n\r\nSame address geocoded by MapQuest.com in 1998. \u00a9 1998 MapQuest.com, Inc. (formerly GeoSystems Global Corp.) All rights reserved.\r\n\r\nThe point of this section is to show that geocoding with address ranges involves a process of estimation. The Census Bureau\u2019s TIGER\/Line Shapefiles, like the commercial street databases produced by Tele Atlas, Navigation Technologies, and other private firms, represent streets as vector line segments. The vector segments are associated with address range attributes, one for the left side of the street, one for the right side. The geocoding process takes a street address as input, finds the line segment that represents the specified street, checks the address ranges to determine the correct side of the street, then estimates a location at the appropriate point between the minimum and maximum address for that segment and assignes an estimated latitude\/longitude coordinate to that location. For example, if the minimum address is 401, and the maximum is 421, a geocoding algorithm would locate address 411 at the midpoint of the street segment.\r\n<h3><strong>TRY THIS!<\/strong><\/h3>\r\nTry one of these geocoding services for your address. Then compare the experience, and the result, with\u00a0<a href=\"http:\/\/maps.google.com\/\">Google Maps<\/a>, launched in 2005. Apply what we\u2019ve discussed in this chapter to try to explain inaccuracies in your results, if any. Registered students can log in and post comments directly to this page.\r\n<h3><strong>PRACTICE QUIZ<\/strong><\/h3>\r\nRegistered Penn State students should return now to the Chapter 4 folder in ANGEL (via the Resources menu to the left) to take a self-assessment quiz about Geocoding.\r\n\r\nYou may take practice quizzes as many times as you wish. They are not scored and do not affect your grade in any way.\r\n<h2>4.10. Applications beyond the Census Bureau<\/h2>\r\nTwo characteristics of MAF\/TIGER data, address range attributes and explicit topology, make them, and derivative products, valuable in many contexts. Consequently, firms like\u00a0<a href=\"http:\/\/www.navteq.com\/\">NAVTEQ<\/a>\u00a0and\u00a0<a href=\"http:\/\/www.teleatlas.com\/\">Tele Atlas<\/a>\u00a0(now owned by TomTom) have emerged to provide data with similar characteristics as MAF\/TIGER, but which are more up-to-date, more detailed and include additional feature classes. The purpose of the next section is to sketch some of the applications of data similar to MAF\/TIGER data beyond the Census Bureau.\r\n<h3><strong>TRY THIS!<\/strong><\/h3>\r\nA\u00a0<a href=\"http:\/\/money.cnn.com\/2006\/02\/24\/Autos\/modern_mapmakers\/index.htm\">February 2006 article by Peter Valdes-Dapena in CNNMoney.com<\/a>describes the work of two NAVTEQ employees. See the link above or search on \u201cwhere those driving directions really come from\u201d\r\n<h2>4.11. Geocoding Your Customers<\/h2>\r\nGeocoded addresses allow governments and businesses to map where their constituents and customers live and work. Federal, state, and local government agencies know where their constituents live by virtue of censuses, as well as applications for licenses and registrations. Banks, credit card companies, and telecommunications firms are also rich in address-referenced customer data, including purchasing behaviors. Private businesses and services must be more resourceful.\r\n\r\nSome retail operations, for example, request addresses or ZIP Codes from customers, or capture address data from checks. Discount and purchasing club cards allow retailers to directly match purchasing behaviors with addresses. Customer addresses can also be harvested from automobile license plates. Business owners pay to record license plate numbers of cars parked in their parking lots or in their competitors. Addresses of registered owners can be purchased from organizations that acquire motor vehicle records from state departments of transportation.\r\n\r\nBusinesses with access to address-referenced customer data, vector street data attributed with address ranges, and GIS software and expertise, can define and analyze the\u00a0<strong>trade areas<\/strong>\u00a0within which most of their customers live and work. Companies can also focus direct mail advertising campaigns on their own trade areas, or their competitors\u2019. Furthermore, GIS can be used to analyze the socio-economic characteristics of the population within trade areas, enabling businesses to make sure that the products and services they offer meet the needs and preferences of target populations.\r\n\r\nPoliticians use the same tools to target appearances and campaign promotions.\r\n<h3><strong>TRY THIS!<\/strong><\/h3>\r\nCheck out the\u00a0<a href=\"http:\/\/www.ffiec.gov\/Geocode\/default.aspx\">geocoding system maintained by the Federal Financial Institution\u2019s Examination Council<\/a>. The FFIEC Geocoding system lets users enter a street address and get a census demographic report or a street map (Using Tele Atlas data). The system is intended for use by financial institutions that are covered by the Home Mortgage Disclosure Act (HMDA) and Community Reinvestment Act (CRA) to meet their reporting obligation.\r\n<h2>4.12. Delivering Products and Services<\/h2>\r\nOperations such as mail and package delivery, food and beverage distribution, and emergency medical services need to know not only where their customers are located, but how to deliver products and services to those locations as efficiently as possible. Geographic data products like TIGER\/Line Shapefiles are valuable to analysts responsible for prescribing the most efficient delivery routes. The larger and more complex the service areas of such organizations, the more incentive they have to automate their routing procedures.\r\n\r\nIn its simplest form,\u00a0<strong>routing<\/strong>\u00a0involves finding the shortest path through a network from an origin to a destination. Although shortest path algorithms were originally implemented in raster frameworks, transportation networks are now typically represented with vector feature data, like TIGER\/Line Shapefiles. Street segments are represented as digital line segments each formed by two points, a \u201cstart\u201d node and an \u201cend\u201d node. If the nodes are specified within geographic or plane coordinate systems, the distance between them can be calculated readily. Routing procedures sum the lengths of every plausible sequence of line segments that begins and ends at the specified locations. The sequence of segments associated with the smallest sum represents the shortest route.\r\n\r\nTo compare various possible sequences of segments, the data must indicate which line segment follows immediately after another line segment. In other words, the procedure needs to know about the connectivity of features. As discussed earlier, connectivity is an example of a topological relationship. If topology is not encoded in the data product, it can be calculated by the GIS software in which the procedure is coded.\r\n\r\n<img alt=\"Screenshot of MapQuest 1998\" src=\"http:\/\/opentextbc.ca\/natureofgeographicinformation\/wp-content\/uploads\/sites\/19\/2014\/01\/routing_form.gif\" \/>\r\n\r\nInput form for an early version of the\u00a0<a href=\"http:\/\/www.mapquest.com\/\">MapQuest<\/a>\u00a0routing utility. \u00a9 1998 MapQuest.com, Inc. All rights reserved.\r\n\r\nSeveral online travel planning services, including MapQuest.com and Google Maps, provide routing capabilities. Both take origin and destination addresses as input, and produce optimal routes as output. These services are based on vector feature databases in which street segments are attributed with address ranges, as well as with other data that describe the type and conditions of the roads they represent.\r\n\r\n<img alt=\"Screenshot of MapQuest options window\" src=\"http:\/\/opentextbc.ca\/natureofgeographicinformation\/wp-content\/uploads\/sites\/19\/2014\/01\/routing_options.gif\" \/>\r\n\r\nAn early interface to MapQuest\u2019s routing options. Different algorithms are required to calculate shortest and fastest routes. Specific attributes must be encoded in the database to provide the options to avoid limited access highways, toll roads, and ferry lanes. \u00a9 1998 MapQuest.com, Inc. All rights reserved.\r\n\r\nThe shortest route is not always the best. In the context of emergency medical services, for example, the fastest route is preferred, even if it entails longer distances than others. To determine fastest routes, additional attribute data must be encoded, such as speed limits, traffic volumes, one way streets, and other characteristics.\r\n\r\n<img alt=\"Screenshot of MapQuest maps\" src=\"http:\/\/opentextbc.ca\/natureofgeographicinformation\/wp-content\/uploads\/sites\/19\/2014\/01\/routing_map.gif\" \/>\r\n\r\nMapQuest routing solution. \u00a9 1998 MapQuest.com, Inc. All rights reserved.\r\n\r\nThen there are routing problems that involve multiple destinations\u2013a complex special case of routing called the\u00a0<strong>traveling salesman problem<\/strong>. School bus dispatchers, mail and package delivery service managers, and food and beverage distributors all seek to minimize the transportation costs involved in servicing multiple, dispersed destinations. As the number of destinations and the costs of travel increase, the high cost of purchasing up-to-date, properly attributed network data becomes easier to justify.\r\n<h3><strong>TRY THIS<\/strong><\/h3>\r\nThe Georgia Institute of Technology publishes an\u00a0<a href=\"http:\/\/www.tsp.gatech.edu\/\">extensive collection of resources about the Traveling Salesman Problem<\/a>.\r\n<h2>4.13. Delineating Service Areas<\/h2>\r\nThe need to redraw voting district boundaries every ten years was one of the motivations that led the Census Bureau to create its MAF\/TIGER database. Like voting districts, many other kinds of service area boundaries need to be revised periodically. School districts are a good example. The state of Massachusetts, for instance, has adopted school districting laws that are similar in effect to the constitutional criteria used to guide congressional redistricting. The Framingham (Massachusetts) School District\u2019s Racial Balance Policy once stated that \u201ceach elementary and middle school shall enroll a student body that is racially balanced. \u2026 each student body shall include a percentage of minority student, which reflects the system-wide percentage of minority students, plus or minus ten percent. \u2026 The racial balance required by this policy shall be established by redrawing school enrollment areas\u201d (Framingham Public Schools 1998). And bus routes must be redrawn as enrollment area boundaries change.\r\n\r\nThe\u00a0<a href=\"http:\/\/www.cms.k12.nc.us\/\">Charlotte-Mecklenberg (North Carolina) public school district<\/a>\u00a0also used racial balance as a districting criterion (although its policy was subsequently challenged in court). Charlotte-Mecklenberg consists of 133 schools, attended by over 100,000 students, about one third of whom ride a bus to school every day. District managers are responsible for routing 3,600 bus routes, traveling a total of 82,000 daily miles. A staff of eight routinely uses GIS to manage these tasks. GIS could not be used unless up-to-date, appropriately attributed, and topologically encoded data were available.\r\n\r\nAnother example of service area analysis is provided by the City of Beaverton, Oregon. In 1997, Beaverton officials realized that 25 percent of the volume of solid waste that was hauled away to land fills consisted of yard waste, such as grass clippings and leaves. Beaverton decided to establish a yard waste recycling program, but it knew that the program would not be successful if residents found it inconvenient to participate. A GIS procedure called\u00a0<strong>allocation<\/strong>\u00a0was used to partition Beaverton\u2019s street network into service areas that minimized the drive time from residents\u2019 homes to recycling facilities. Allocation procedures require vector-format data that includes the features, attributes, and topology necessary to calculate travel times from all residences to the nearest facility.\r\n\r\n<img alt=\"Screenshot of downtown Seattle GeoMap\" src=\"http:\/\/opentextbc.ca\/natureofgeographicinformation\/wp-content\/uploads\/sites\/19\/2014\/01\/drivetime_small.gif\" \/>\r\n\r\nTrade areas defined by 3 miles travel distance (blue) and 8 minutes travel time (yellow). (Francica n.d.). Used by permission.\r\n\r\nNaturally, private businesses concerned with delivering products and services are keenly interested in service area delineation. The screen capture above shows two\u00a0<strong>trade areas<\/strong>\u00a0surrounding a retail store location (\u201cSeattle Downtown\u201d) in a network database.\r\n\r\nFormer student Saskia Cohick (Winter 2006), who was then GIS Director for Tioga County, Pennsylvania, contributed another service area problem: \u201cThis is a topic that local governments are starting to deal with \u2026 To become Phase 2 wireless capable (that is, capable of finding a cell phone location from a 911 call center within 200 feet of the actual location), county call centers must have a layer called ESZs (Emergency Service Zones). This layer will tell the dispatcher who to send to the emergency (police, fire, medical, etc). The larger problem is to reach agreement between four fire companies (for example) as to where they do or do not respond.\u201d\r\n<h2>4.14. Summary<\/h2>\r\nTo fulfill its mission of being the preeminent producer of attribute data about the population and economy of the United States, the U.S. Census Bureau also became an innovative producer of digital geographic data. The Bureau designed its MAF\/TIGER database to support automatic geocoding of address-referenced census data, as well as automatic data quality control procedures. The key characteristics of TIGER\/Line Shapefiles, including use of vector features to represent geographic entities, and address range attributes to enable address geocoding, are now common features of proprietary geographic databases used for trade area analysis, districting, routing, and allocation.\r\n<h3><strong>QUIZ<\/strong><\/h3>\r\nRegistered Penn State students should return now to the Chapter 4 folder in ANGEL (via the Resources menu to the left) to access the graded quiz for this chapter. This one counts.\u00a0<strong>You may take graded quizzes only once.<\/strong>\r\n\r\nThe purpose of the quiz is to ensure that you have studied the text closely, that you have mastered the practice activities, and that you have fulfilled the chapter\u2019s learning objectives. You are free to review the chapter during the quiz. Once you\u2019ve submitted the quiz you will have completed Chapter 4.\r\n<h3>COMMENTS AND QUESTIONS<\/h3>\r\nRegistered students are welcome to post comments, questions, and replies to questions about the text. Particularly welcome are anecdotes that relate the chapter text to your personal or professional experience. In addition, there are discussion forums available in the ANGEL course management system for comments and questions about topics that you may not wish to share with the whole world.\r\n\r\nTo post a comment, scroll down to the text box under \u201cPost new comment\u201d and begin typing in the text box, or you can choose to reply to an existing thread. When you are finished typing, click on either the \u201cPreview\u201d or \u201cSave\u201d button (Save will actually submit your comment). Once your comment is posted, you will be able to edit or delete it as needed. In addition, you will be able to reply to other posts at any time.\r\n\r\nNote: the first few words of each comment become its \u201ctitle\u201d in the thread.\r\n<h2>4.15. Bibliography<\/h2>\r\nCharlotte-Mecklenberg Public Schools (n. d.). Retrieved July 19, 1999 from\u00a0<a href=\"http:\/\/www.cms.k12.nc.us\/\">http:\/\/www.cms.k12.nc.us<\/a>\r\n\r\nCooke, D. F. (1997). Topology and TIGER: The Census Bureau\u2019s Contribution. In T. W. Foresman (Ed.),\u00a0<em>The history of geographic information systems: Perspectives from the pioneers<\/em>. (pp. 47 \u2013 57). Upper Saddle River, NJ: Prentice Hall.\r\n\r\nDangermond, J. (1982). A Classification of Software Components Commonly Used in Geographic Information Systems. In\u00a0<em>Proceedings of the U.S.\u2014Australia Workshop on the Design and Implementation of Computer-Based Geographic Information Systems<\/em>, Honolulu, HI, pp. 0-91. In Demers, M.N. (1997)\u00a0<em>Fundamentals of Geographic Information Systems.<\/em>\u00a0John Wiley &amp; Sons, Inc.\r\n\r\nDiscreet Research (n.d.). Retrieved July 19, 1999 from<a href=\"http:\/\/www.dresearch.com\/\">http:\/\/www.dresearch.com<\/a>\r\n\r\nESRI (1998) Shapefile Technical Description, An ESRI White paper. Environmental Systems Research Institute, Inc. Retrieved October 4, 2010, from\u00a0<a href=\"http:\/\/www.esri.com\/library\/whitepapers\/pdfs\/shapefile.pdf\">http:\/\/www.esri.com\/library\/whitepapers\/pdfs\/shapefile.pdf<\/a>\r\n\r\nFederal Geographic Data Committee (April 2006). Retrieved July 19, 1999 from\u00a0<a href=\"http:\/\/www.fgdc.gov\/\">http:\/\/www.fgdc.gov<\/a>\r\n\r\nFramingham Public Schools (1998).\u00a0<em>Racial balance policy: Assignment of students to schools<\/em>. Retrieved July 19, 1999 from<a title=\"www.framingham.k12.ma.us\/update\/0198rbp.html\" href=\"http:\/\/www.framingham.k12.ma.us\/update\/0198rbp.html\">www.framingham.k12.ma.us\/update\/0198rbp.html<\/a>\u00a0(since retired).\r\n\r\nFrancica, J. (n.d.).\u00a0<em>Geodezix Consulting<\/em>. Retrieved July 19, 1999 from<a title=\"www.geodezix.com\" href=\"http:\/\/www.geodezix.com\/\">www.geodezix.com<\/a>\u00a0(since retired).\r\n\r\nGaldi, D. (2005). Spatial Data Storage and Topology in the Redesigned MAF\/TIGER System. Retrieved 19 October 2010 from<a title=\"http:\/\/www.census.gov\/geo\/mtep_obj2\/topo_and_data_stor.html\" href=\"http:\/\/www.census.gov\/geo\/mtep_obj2\/topo_and_data_stor.html\">http:\/\/www.census.gov\/geo\/mtep_obj2\/topo_and_data_stor.html<\/a>\u00a0(since retired).\r\n\r\nMapQuest (n.d. a). Retrieved July 19, 1998 from<a href=\"http:\/\/www.mapquest.com\/\">http:\/\/www.mapquest.com<\/a>\r\n\r\nMapQuest (n.d. b). Retrieved January 15, 2013 from<a href=\"http:\/\/www.mapquest.com\/\">http:\/\/www.mapquest.com<\/a>\r\n\r\nMarx, R. M. (Ed.). (1990). The Census Bureau\u2019s TIGER system. [Special issue].\u00a0<em>Cartography and Geographic Information Systems<\/em>\u00a017:1.\r\n\r\nNavigation Technologies Inc. (2006).\u00a0<em>Welcome to NavTech<\/em>. Retrieved July 19, 1999 from\u00a0<a href=\"http:\/\/www.navtech.com\/\">http:\/\/www.navtech.com<\/a>\r\n\r\nRammage, S. and P. Woodsford (2002). The Benefits of Topoplogy in the Database. Retrieved October 6, 2010 from<a href=\"http:\/\/spatialnews.geocomm.com\/features\/laserscan2\/\">http:\/\/spatialnews.geocomm.com\/features\/laserscan2\/<\/a>\r\n\r\nTeleAtlas (2006).\u00a0<em>Welcome to TeleAtlas<\/em>. Retrieved May 3, 2006 from<a href=\"http:\/\/www.teleatlas.com\/Pub\/Home\">http:\/\/www.teleatlas.com\/Pub\/Home<\/a>\u00a0(since retired).\r\n\r\nTheobald, D. M. (2001). Understanding Topology and Shapefiles.<em>ArcUser<\/em>\u00a0April-June 2001. Retrieved October 5, 2010 from<a href=\"http:\/\/www.esri.com\/news\/arcuser\/0401\/topo.html\">http:\/\/www.esri.com\/news\/arcuser\/0401\/topo.html<\/a>\r\n\r\nU.S. Census Bureau (1997).\u00a0<em>TIGER\/Line Files (1997 Technical Documentation)<\/em>. Retrieved January 2, 1999 from<a title=\"http:\/\/www.census.gov\/geo\/tiger\/TIGER97C.pdf\" href=\"http:\/\/www.census.gov\/geo\/tiger\/TIGER97C.pdf\">http:\/\/www.census.gov\/geo\/tiger\/TIGER97C.pdf<\/a>\u00a0(since retired).\r\n\r\nU.S. Census Bureau (2003). TIGER\/Line Files, 2003 (metadata). Retrieved February 3, 2008 from<a href=\"http:\/\/www.census.gov\/geo\/www\/tlmetadata\/tl2003meta.txt\">http:\/\/www.census.gov\/geo\/www\/tlmetadata\/tl2003meta.txt<\/a>\r\n\r\nU.S. Census Bureau (n. d.). 21st Century MAF\/TIGER Enhancements. Retrieved February 3, 2008 from<a title=\"http:\/\/www.census.gov\/geo\/mod\/overview.pdf\" href=\"http:\/\/www.census.gov\/geo\/mod\/overview.pdf\">http:\/\/www.census.gov\/geo\/mod\/overview.pdf<\/a>\u00a0(since retired).\r\n\r\nU.S. Census Bureau (2004). MAF\/TIGER Redesign Project Overview. Retrieved October 19, 2010 from<a title=\"http:\/\/www.census.gov\/geo\/mtep_obj2\/obj2_issuepaper12_2004.pdf\" href=\"http:\/\/www.census.gov\/geo\/mtep_obj2\/obj2_issuepaper12_2004.pdf\">http:\/\/www.census.gov\/geo\/mtep_obj2\/obj2_issuepaper12_2004.pdf<\/a>(since retired).\r\n\r\nU.S. Census Bureau (2005).\u00a0<em>Geography division map gallery.<\/em>\u00a0Retrieved July 19, 1999 from\u00a0<a href=\"http:\/\/www.census.gov\/geo\/www\/mapGallery\/\">http:\/\/www.census.gov\/geo\/www\/mapGallery\/<\/a>\r\n\r\nU.S. Census Bureau (2012). TIGER\/Line Shapefiles Technical Documentation. Retrieved June, 2013 from of the<a href=\"http:\/\/www.census.gov\/geo\/maps-data\/data\/pdfs\/tiger\/tgrshp2012\/TGRSHP2012_TechDoc.pdf\">http:\/\/www.census.gov\/geo\/maps-data\/data\/pdfs\/tiger\/tgrshp2012\/TGRSHP2012_TechDoc.pdf<\/a>","rendered":"<h2>4.1. Overview<\/h2>\n<p>In the Chapter 3 we studied the population data produced by the U.S. Census Bureau, and some of the ways those data can be visualized with thematic maps.<\/p>\n<p>In addition to producing data about the U.S. population and economy, the Census Bureau is a leading producer of digital map data. The Census Bureau&#8217;s Geography Division created its &#8220;Topologically Integrated Geographic Encoding and Referencing&#8221; (TIGER) spatial database with help from the U.S. Geological Survey. In preparation for the 2010 census, the Bureau conducted a database redesign project that combined TIGER with a Master Address File (MAF) database. <strong>MAF\/TIGER<\/strong> enables the Bureau to associate census data, which it collects by household address, with the right census areas and voting districts. This is an example of a process called address-matching or <strong>geocoding<\/strong>.<\/p>\n<p>The MAF\/TIGER database embodies the vector approach to spatial representation. It uses point, line, and polygon features to represent streets, water bodies, railroads, administrative boundaries, and select landmarks. In addition to the &#8220;absolute&#8221; locations of these features, which are encoded with latitude and longitude coordinates, MAF\/TIGER encodes their &#8220;relative&#8221; locations&#8211;a property called <strong>topology<\/strong>.<\/p>\n<p>MAF\/TIGER also includes attributes of these vector features including names, administrative codes, and, for many streets, address ranges and ZIP Codes. Vector feature sets are extracted from the MAF\/TIGER database to produce reference maps for census takers and thematic maps for census data users. Such extracts are called <strong>TIGER\/Line Shapefiles<\/strong>.<\/p>\n<p>Characteristics of TIGER\/Line Shapefiles that make them useful to the Census Bureau also make them valuable to other government agencies and businesses. Because they are not protected by copyright, TIGER\/Line data have been widely adapted for many commercial uses. TIGER has been described as &#8220;the first truly useful nationwide general-purpose spatial data set&#8221; (Cooke 1997, p. 47). Some say that it jump-started a now-thriving geospatial data industry in the U.S.<\/p>\n<h3>Objectives<\/h3>\n<p>The objective of this chapter is to familiarize you with MAF\/TIGER and two important concepts it exemplifies: topology and geocoding. Specifically, students who successfully complete Chapter 4 should be able to:<\/p>\n<ol>\n<li>Explain how geographic entities are represented within MAF\/TIGER;<\/li>\n<li>Explain how geometric primitives in MAF\/TIGER are represented in TIGER\/Line Shapefile extracts;<\/li>\n<li>Define topology and explain why and how it is encoded in TIGER;<\/li>\n<li>Perform address geocoding; and<\/li>\n<li>Describe how TIGER\/Line files and similar products can be used for other applications, including routing and allocation.<\/li>\n<\/ol>\n<h3>Comments and Questions<\/h3>\n<p>Registered students are welcome to post comments, questions, and replies to questions about the text. Particularly welcome are anecdotes that relate the chapter text to your personal or professional experience. In addition, there are discussion forums available in the ANGEL course management system for comments and questions about topics that you may not wish to share with the whole world.<\/p>\n<p>To post a comment, scroll down to the text box under &#8220;Post new comment&#8221; and begin typing in the text box, or you can choose to reply to an existing thread. When you are finished typing, click on either the &#8220;Preview&#8221; or &#8220;Save&#8221; button (Save will actually submit your comment). Once your comment is posted, you will be able to edit or delete it as needed. In addition, you will be able to reply to other posts at any time.<\/p>\n<p>Note: the first few words of each comment become its &#8220;title&#8221; in the thread.<\/p>\n<h3><strong>Concept Map<\/strong><\/h3>\n<p>You may be interested in seeing the <a href=\"https:\/\/www.e-education.psu.edu\/files\/natureofgeoinfo\/file\/ch3-4_conceptmap(2).pdf\">concept map<\/a> used to guide development of Chapters 3 and 4.<\/p>\n<h2>4.2. Checklist<\/h2>\n<p>&nbsp;<\/p>\n<p>The following checklist is for Penn State students who are registered for classes in which this text, and associated quizzes and projects in the ANGEL course management system, have been assigned. You may find it useful to print this page out first so that you can follow along with the directions.<\/p>\n<table summary=\"Tasks to be compleated for the chapter\">\n<caption>Chapter 4 Checklist (for registered students only)<\/caption>\n<thead>\n<tr>\n<th>Step<\/th>\n<th>Activity<\/th>\n<th>Access\/Directions<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<th>1<\/th>\n<td><strong>Read<\/strong>\u00a0Chapter 4<\/td>\n<td>This is the second page of the Chapter. Click on the links at the bottom of the page to continue or to return to the previous page, or to go to the top of the chapter. You can also navigate the text via the links in the GEOG 482 menu on the left.<\/td>\n<\/tr>\n<tr>\n<th>2<\/th>\n<td>Submit\u00a0<strong>four practice quizzes<\/strong>including:<\/p>\n<ul>\n<li>MAF and TIGER<\/li>\n<li>Shapefiles<\/li>\n<li>Topology<\/li>\n<li>Geocoding<\/li>\n<\/ul>\n<p>Practice quizzes are not graded and may be submitted more than once.<\/td>\n<td>Go to ANGEL &gt; [your course section] &gt; Lessons tab &gt; Chapter 4 folder &gt; [quiz]<\/td>\n<\/tr>\n<tr>\n<th>3<\/th>\n<td>Perform\u00a0<strong>\u201cTry this\u201d activities<\/strong>including:<\/p>\n<ul>\n<li>Explore availability of TIGER\/Line Shapefile geographies and features<\/li>\n<li>Download and view a TIGER\/Line Shapefile<\/li>\n<li>Geocode your address using a TIGER\/Line Shapefile<\/li>\n<li>Compare the geocoding performance of online routing services<\/li>\n<li>Explore resources about the Traveling Salesman Problem<\/li>\n<\/ul>\n<p>\u201cTry this\u201d activities are not graded.<\/td>\n<td>Instructions are provided for each activity.<\/td>\n<\/tr>\n<tr>\n<th>4<\/th>\n<td>Submit the<strong>Chapter 4 Graded Quiz<\/strong><\/td>\n<td>ANGEL &gt; [your course section] &gt; Lessons tab &gt; Chapter 4 folder &gt; Chapter 4 Graded Quiz. See the Calendar tab in ANGEL for due dates.<\/td>\n<\/tr>\n<tr>\n<th>5<\/th>\n<td>\u00a0Read<strong>comments and questions<\/strong>posted by fellow students. Add comments and questions of your own, if any.<\/td>\n<td>\u00a0Comments and questions may be posted on any page of the text, or in a Chapter-specific discussion forum in ANGEL.<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<h2>4.3. MAF\/TIGER<\/h2>\n<p><strong>MAF\/TIGER is the Census Bureau\u2019s geographic database system<\/strong>. Several factors prompted the U.S. Census Bureau to create MAF\/TIGER: the need to conduct the census by mail, the need to produce wayfinding aids for census field workers, and its mission to produce map and data products for census data users.<\/p>\n<h3>CONDUCTING THE CENSUS BY MAIL<\/h3>\n<p>As the population of the U.S. increased it became impractical to have census takers visit every household in person. Since 1970, the Census Bureau has mailed questionnaires to most households with instructions that completed forms should be returned by mail. Most but certainly not all of these questionnaires are dutifully mailed\u2014about 72 percent of all questionnaires in 2010. At that rate the Census Bureau estimates that some $1.6 billion was saved by reducing the need for field workers to visit non-responding households.<\/p>\n<p><img decoding=\"async\" alt=\"Census 2010 questionnaire\" src=\"http:\/\/opentextbc.ca\/natureofgeographicinformation\/wp-content\/uploads\/sites\/19\/2014\/01\/ch4_census2010.png\" \/><\/p>\n<p>2010 Census questionnaire.\u00a0<a href=\"http:\/\/www.census.gov\/2010census\/about\/interactive-form.php\">For a question-by-question tour, go here<\/a>.<\/p>\n<p>To manage its mail delivery and return operations, the Census Bureau relies upon a\u00a0<strong>Master Address File (MAF)<\/strong>. MAF is a complete inventory of housing units and many business locations in the U.S., Puerto Rico, and associated island areas. MAF was originally built from the U.S. Postal Service\u2019s Delivery Sequence File of all residential addresses. The MAF is updated through both corrections from field operations and a Local Update of Census Address (LUCA) program by which tribal, state, and local government liaisons review and suggest updates to local address records.\u00a0<strong>\u201cMAF\/TIGER\u201d refers to the coupling of the Master Address File with the TIGER spatial database<\/strong>, which together enable the Census Bureau to efficiently associate address-referenced census and survey data received by mail with geographic locations on the ground and tabulation areas of concern to Congress and many governmental agencies and businesses.<\/p>\n<p>It\u2019s not as simple as it sounds. Postal addresses do not specify geographic locations precisely enough to fulfill the Census Bureau\u2019s constitutional mandate. An address is not a position in a grid coordinate system\u2013it is only one in a series of ill-defined positions along a route. The location of an address is often ambiguous because street names are not unique, numbering schemes are inconsistent, and because routes have two sides, left and right. Location matters, as you recall, because\u00a0<strong>census data must be accurately georeferenced to be useful for reapportionment, redistricting, and allocation of federal funds.<\/strong>\u00a0Thus the Census Bureau had to find a way to assign address referenced data automatically to particular census blocks, block groups, tracts, voting districts, and so on. That\u2019s what the \u201cGeographic Encoding and Referencing\u201d in the TIGER acronym refers to.<\/p>\n<h3>MAPS FOR CENSUS FIELD WORKERS<\/h3>\n<p>A second motivation that led to MAF\/TIGER was the need to help census takers find their way around. Millions of households fail to return questionnaires by mail, after all. Census takers (called \u201cenumerators\u201d at the Bureau) visit non-responding households in person.\u00a0<strong>Census enumerators need maps showing streets and select landmarks to help locate households.<\/strong>\u00a0Census supervisors need maps to assign census takers to particular territories. Field notes collected by field workers are an important source of updates and corrections to the MAF\/TIGER database.<\/p>\n<p>Prior to 1990, the Bureau relied on local sources for its maps. For example, 137 maps of different scales, quality, and age were used to cover the 30-square-mile St. Louis area during the 1960 census. The need for maps of consistent scale and quality forced the Bureau to become a map maker as well as a map user. Using the MAF\/TIGER system, Census Bureau geographers created over 17 million maps for a variety of purposes in preparation for the 2010 Census.<\/p>\n<h3>DATA PRODUCTS<\/h3>\n<p>The Census Bureau\u2019s mission is not only to collect data, but also to make data products available to its constituents. In addition to the attribute data considered in Chapter 3, the Bureau disseminates a variety of geographic data products, including wall maps, atlases, and one of the earliest on-line mapping services, the TIGER Mapping Service. You can explore the<a href=\"http:\/\/www.census.gov\/geo\/maps-data\/index.html\">Bureau\u2019s maps and cartographic data products here<\/a>.<\/p>\n<p><img decoding=\"async\" alt=\"Screenshot of the TIGER Map Server Browser\" src=\"http:\/\/opentextbc.ca\/natureofgeographicinformation\/wp-content\/uploads\/sites\/19\/2014\/01\/tms.gif\" \/><\/p>\n<p>Launched in 1995, the TIGER Mapping Service was one of the earliest Internet map services. Registered students will use its successor, American Factfinder, in Project 2.<\/p>\n<h3>MAF\/TIGER DATABASE REDESIGN<\/h3>\n<p>The Census Bureau conducted a major redesign of the MAF\/TIGER database in the years leading up to the 2010 decennial census. What were separate, homegrown database systems (MAF and TIGER) are now unified in the industry-standard Oracle relational database management system. Benefits of this \u201ccommercial off-the-shelf\u201d (COTS) database software include concurrent multi-user access, greater user familiarity, and better integration with Web development tools. As Galdi (2005) explains in his white paper \u201cSpatial Data Storage and Topology in the Redesigned MAF\/TIGER System,\u201d the redesign \u201cmirrors a common trend in the Information Technology (IT) and Geographic Information System (GIS) industries: the integration of spatial and non-spatial data into a single enterprise data set\u201d (p. 2).<\/p>\n<p>Concurrent with the MAF\/TIGER redesign, the Census Bureau also updated the distribution format of its TIGER\/Line map data extracts. Consistent with the Bureau\u2019s COTS strategy, it adopted the defacto standard Esri \u201cShapefile\u201d format. The following pages consider characteristics of the spatial data stored in MAF\/TIGER and in TIGER\/Line Shapefile extracts.<\/p>\n<h3><strong>PODCAST<\/strong><\/h3>\n<p>Hear more about\u00a0<a href=\"http:\/\/www.directionsmag.com\/images\/podcasts\/Census1.mp3\">how the Census Bureau\u2019s Geography Division uses MAF\/TIGER and related tools to create maps for the 2010 Census<\/a>.<\/p>\n<h2>4.4. Vector Extracts from MAF\/TIGER<\/h2>\n<p>The Census Bureau began to develop a digital geographic database of 144 metropolitan areas in the 1960s. By 1990, the early efforts had evolved into\u00a0<strong>TIGER<\/strong>: a seamless digital geographic database that covered the whole of the United States and its territories. As discussed in the previous page, MAF\/TIGER succeeded TIGER in the lead-up to the 2010 Census.<\/p>\n<p><strong>TIGER\/Line Shapefiles<\/strong>\u00a0are digital map data products extracted from the MAF\/TIGER database. They are freely available from the Census Bureau, and are suitable for use by individuals, businesses and other agencies that don\u2019t have direct access to MAF\/TIGER.<\/p>\n<p>This section outlines the geographic entities represented in the MAF\/TIGER database, describes how a particular implementation of the vector data model is used to represent those entities, and considers the accuracy of digital features in relation to their counterparts on the ground. The following page considers characteristics of the \u201cShapefile\u201d data format used to distribute digital extracts from MAF\/TIGER.<\/p>\n<h3>GEOGRAPHIES REPRESENTED IN TIGER AND SHAPEFILE EXTRACTS<\/h3>\n<p>The MAF\/TIGER database is selective. Only those geographic entities needed to fulfill the Census Bureau\u2019s operational mission are included. Entities that don\u2019t help the Census Bureau conduct its operations by mail, or help field workers navigate a neighborhood, are omitted. Terrain elevation data, for instance, are not included in MAF\/TIGER. A comprehensive list of the \u201cfeature classes\u201d and \u201csuperclasses\u201d included in MAF\/TIGER and Shapefiles can be found in Appendix F of the<a href=\"http:\/\/www.census.gov\/geo\/maps-data\/data\/pdfs\/tiger\/tgrshp2012\/TGRSHP2012_TechDoc.pdf\">TIGER\/Line Shapefiles Technical Documentation<\/a>.\u00a0<strong>Examples of superclasses include<\/strong>:<\/p>\n<ul>\n<li>Potential living quarters (e.g., sites of shelters, retirement homes, prisons, dormitories)<\/li>\n<li>Road\/path features (e.g., primary roads, secondary roads, local neighborhood roads)<\/li>\n<li>Hydrographic features (e.g., stream\/river, lake\/pond, ocean\/sea)<\/li>\n<li>Miscellaneous linear features (e.g., pipeline, powerline, fence line)<\/li>\n<li>Tabulation areas (e.g., county or equivalent, tract, block group, block<\/li>\n<\/ul>\n<table>\n<caption>Excerpt from TIGER\/Line Technical Documentation<\/caption>\n<thead>\n<tr>\n<th>MTFCC<\/th>\n<th>FEATURE CLASS<\/th>\n<th>SUPERCLASS<\/th>\n<th>POINT<\/th>\n<th>LINEAR<\/th>\n<th>AREAL<\/th>\n<th>FEATURE CLASS DESCRIPTION<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<th>$1400<\/th>\n<td>Local Neighborhood Road, Rural Road, City Street<\/td>\n<td>Road\/Path Features<\/td>\n<td>N<\/td>\n<td>Y<\/td>\n<td>N<\/td>\n<td>Generally a paved non-arterial street, road, or byway that usually has a single lane of traffic in each direction. Roads in this feature class may be privately or publicly maintained. Scenic park roads would be included in this feature class, as would (depending on the region of the country) some unpaved roads.<\/td>\n<\/tr>\n<tr>\n<th>$1500<\/th>\n<td>Vehicular Trail (4WD)<\/td>\n<td>Road\/Path Features<\/td>\n<td>N<\/td>\n<td>Y<\/td>\n<td>N<\/td>\n<td>An unpaved dirt trail where a four-wheel drive vehicle is required. These vehicular trails are found almost exclusively in very rural areas. Minor, unpaved roads usable by ordinary cars and trucks belong in the $1400 category.<\/td>\n<\/tr>\n<tr>\n<th>$1630<\/th>\n<td>Ramp<\/td>\n<td>Road\/Path Features<\/td>\n<td>N<\/td>\n<td>Y<\/td>\n<td>N<\/td>\n<td>A road that allows controlled access from adjacent roads onto a limited access highway, often in the form of a cloverleaf interchange. These roads are unaddressable.<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>Excerpt from TIGER\/Line Technical Documentation (Census Bureau 2012) showing some of the feature classes included in the \u201cRoad\/Path Features\u201d superclass.<\/p>\n<p>Note also that<strong>\u00a0neither the MAF\/TIGER database nor TIGER\/Line Shapefiles include the population data collected through questionnaires and by census takers.<\/strong>\u00a0MAF\/TIGER merely provides the geographic framework within which address-referenced census data are tabulated.<\/p>\n<h3><strong>TRY THIS!<\/strong><\/h3>\n<h3>EXPLORING AVAILABLE TIGER\/LINE SHAPEFILES<\/h3>\n<p>In this Try This (One of 3 dealing with TIGER\/Line Shapefiles) you are going to explore which TIGER\/Line Shapefiles are available for download at various geographies and what information those files contain. We will be exploring the 2009 and 2010 versions of the TIGER\/Line Shapefile data sets. Versions from other years are available. Feel free to investigate those, too.<\/p>\n<ul>\n<li>Follow\u00a0<a href=\"http:\/\/www.census.gov\/geo\/maps-data\/data\/tiger.html\">this link to get to the TIGER Products page<\/a>\u00a0of the Census Bureau web site, then follow the\u00a0<strong>TIGER\/Line Shapefiles<\/strong>\u00a0link found under\u00a0<strong>Which product should I use?<\/strong>\u00a0to get to the Geography page.<\/li>\n<li>Link to the 2010 TIGER\/Line Shapefiles via the\u00a0<strong>2010<\/strong>\u00a0tab link.<\/li>\n<li>Select\u00a0<strong>Download<\/strong>, and then from the expanded list choose\u00a0<strong>Web Interface<\/strong>.<\/li>\n<li>Expand the pick list under\u00a0<strong>Select a layer type<\/strong>. Spend some time choosing different entries from the layer pick list and then using the<strong>Submit\u00a0<\/strong>button to navigate through the sub layers taking note of when you are offered access to a Download button. Take note of a couple of things. (1) Some of the pick lists make a selection available that allows you to download a shapefile dataset for the entire country. (2) For some of the choices you must navigate to the County level before the Download button is available<\/li>\n<\/ul>\n<p>As stated above we want you to get a sense of the sorts of data that are available for the various geographies \u2014 from the county to the national level. Perusing the various layers as I had you doing above makes it difficult to make an overall assessment of what data there is at a given geographic scale. Fortunately for our purposes the Census has provided a convenient table to help us in this regard.<\/p>\n<ul>\n<li>You should still be on the 2010 TIGER\/Line Shapefiles | Select a layer type page.<br \/>\nClick on the\u00a0<strong>Documentation\u00a0<\/strong>link in the upper right portion of the page. This will take you back to the Geography page.<\/li>\n<li>Select the\u00a0<strong>2010\u00a0<\/strong>tab again.<\/li>\n<li>Select\u00a0<strong>File Availability<\/strong>.<br \/>\nStudy the table that appears.<\/li>\n<li>Note that there are columns titled\u00a0<em>State- and County-based Files,<\/em><em>Nation-based\u00a0<\/em><em>Files<\/em>, and\u00a0<em>American Indian Area-based\u00a0<\/em><em>Files<\/em>.<\/li>\n<li>Compare which geographies (the\u00a0<em>Layer\u00a0<\/em>column) are available in the<em>Nation-Based Files<\/em>\u00a0category to those available in the\u00a0<em>State-Based Files<\/em>\u00a0category.<br \/>\nWhat files are available for a state that are not available for the whole nation?\u00a0 Can you think of reasons why these are not available as a single national file? Post a comment below to discuss with your fellow students.<\/li>\n<li>Now, compare the\u00a0<em>State-Based Files<\/em>\u00a0category to the\u00a0<em>County-Based Files<\/em>\u00a0category. What files available at the state level are also available at the county-level?\u00a0 Once again, share your thoughts with your peers.<\/li>\n<\/ul>\n<h3>GEOMETRIC PRIMITIVES<\/h3>\n<p>Like other implementations of the vector data model, MAF\/TIGER represents geographic entities using geometric primitives including nodes (point features), edges (linear features), and faces (area features). These are defined and illustrated below.<\/p>\n<ul>\n<li><strong>Nodes<\/strong>\u00a0(labeled \u201cN\u201d in the illustration below) are \u201c0-dimensional,\u201d consisting only of a single pair of latitude and longitude coordinates.\n<ul>\n<li>Nodes N21-23 are\u00a0<strong>isolated nodes<\/strong>. That is, they are not end points of edges.<\/li>\n<\/ul>\n<\/li>\n<li><strong>Edges<\/strong>\u00a0(labeled \u201cE\u201d in the illustration below) are 1-dimensional linear primitives used to represent streets, railroads, pipelines, and rivers.\n<ul>\n<li>The end points of an edge are called\u00a0<strong>connecting nodes<\/strong>.<\/li>\n<li>Each edge is assigned a direction, denoted by the arrowheads. The directionality of the edge allows the designation of a\u00a0<strong>Start Node<\/strong>\u00a0and an\u00a0<strong>End Node<\/strong>. The Start Node of edge E12 below is N9, and the End Node is N6.<\/li>\n<li>An edge may have intermediate points called\u00a0<strong>vertices<\/strong>\u00a0that define its shape.<\/li>\n<\/ul>\n<\/li>\n<li><strong>Faces<\/strong>\u00a0(labeled \u201cF\u201d in the illustration below) are the 2-dimensional geometric primitives used to represent entities like blocks, counties, and voting districts. A face is a polygon bounded by edges.\n<ul>\n<li>The directionality of an edge also allows\u00a0<strong>left and right faces<\/strong>\u00a0to be designated. Face F1 is on the left of edge E12 and face F2 is to the right.<\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<p><img decoding=\"async\" alt=\"Geometric primitives and topology used in the MAF\/TIGER database\" src=\"http:\/\/opentextbc.ca\/natureofgeographicinformation\/wp-content\/uploads\/sites\/19\/2014\/01\/ch4_4_TIGER_primitives1.png\" \/><\/p>\n<p>&nbsp;<\/p>\n<p>Geometric primitives of the Topologically Integrated Geographic Encoding and Referencing (TIGER) database. The figure shows what might be two adjacent Census blocks, with the bottom block bounded on the south by a river. The remaining edges might correspond to streets, and the isolated nodes might be landmarks such as a school, a church and a zoo.<\/p>\n<p>&nbsp;<\/p>\n<h3>GEOMETRIC ACCURACY<\/h3>\n<p>Until recently the geometric accuracy of the vector features encoded in TIGER were notoriously poor (see illustration below). How poor?<strong>Through 2003<\/strong>, the\u00a0<a href=\"http:\/\/www.census.gov\/geo\/www\/tlmetadata\/metadata.html\">TIGER\/Line metadata<\/a>\u00a0stated that<\/p>\n<blockquote>\n<div>Coordinates in the TIGER\/Line files have six implied decimal places, but the positional accuracy of these coordinates is not as great as the six decimal places suggest. The positional accuracy varies with the source materials used, but generally the information is no better than the established National Map Accuracy standards for 1:100,000-scale maps from the U.S. Geological Survey (Census Bureau 2003)<\/p>\n<h3><strong>TRY THIS!<\/strong><\/h3>\n<p>Having performed scale calculations in Chapter 2 you should be able to calculate the magnitude of error (ground distance) associated with 1:100,000-scale topographic maps.\u00a0 Recall that the allowed error for USGS topographic maps at scales of 1:20,000 or smaller is 1\/50 inch (see the\u00a0<a href=\"http:\/\/nationalmap.gov\/standards\/pdf\/NMAS647.PDF\">nationalmap standards pdf<\/a>)<\/p>\n<\/div>\n<\/blockquote>\n<p><img decoding=\"async\" alt=\"Image of mismatch between TIGER street data and aerial image\" src=\"http:\/\/opentextbc.ca\/natureofgeographicinformation\/wp-content\/uploads\/sites\/19\/2014\/01\/TIGER_inaccuracy.png\" \/><\/p>\n<p>Discrepancy between pre-modernization TIGER\/Line file streets (red) and actual geometry of street network shown in an orthorectified aerial image (U.S. Census Bureau n.d).<\/p>\n<h3>ACCURACY IMPROVEMENT<\/h3>\n<p>Starting in 2002, in preparation for the 2010 census, the Census Bureau commissioned a six-year, $200 million MAF\/TIGER Accuracy Improvement Project (MTAIP).\u00a0<strong>One objective of the effort was to use GPS to capture accurate geographic coordinates for every household in the MAF<\/strong>.\u00a0<strong>Another objective was to improve the accuracy of TIGER\u2019s road\/path features.<\/strong>\u00a0The project aimed to adjust the geometry of street networks to align within 7.6 meters of street intersections observed in orthoimages or measured using GPS. The corrected streets are necessary not just for mapping, but for accurate geocoding. Because streets often form the boundaries of census areas, it is essential that accurate household locations are associated with accurate street networks.<\/p>\n<p>MTAIP integrated over 2,000 source files submitted by state, tribal, county, and local governments. Contractors used survey-grade GPS to evaluate the accuracy of a random sample of street centerline intersections of the integrated source files. The evaluation confirmed that most but not all features in the spatial database equal or exceed the 7.6 meter target. Uniform accuracy wasn\u2019t possible due to the diversity of local source materials used, though this accuracy is the standard in the \u201cAll Lines\u201d Shapefile extracts. The geometric accuracy of particular feature classes included in particular shapefiles are documented in the metadata associated with that shapefile extract.<\/p>\n<p>MTAIP was completed in 2008. In conjunction with the continuous American Community Survey and other census operations, corrections and updates are now ongoing. TIGER\/Line Shapefile updates are now released annually.<\/p>\n<h3><strong>PRACTICE QUIZ<\/strong><\/h3>\n<p>Registered Penn State students should return now to the Chapter 4 folder in ANGEL (via the Resources menu to the left) to take a self-assessment quiz about MAF and TIGER.<\/p>\n<p>You may take practice quizzes as many times as you wish. They are not scored and do not affect your grade in any way.<\/p>\n<h2>4.5. Shapefiles<\/h2>\n<p>Since 2007, TIGER\/Line extracts from the MAF\/TIGER database have been distributed in shapefile format. Esri introduced shapefiles in the early 1990s as the native digital vector data format of its ArcView software product. The shapefile format is proprietary, but open; its\u00a0<a href=\"http:\/\/www.esri.com\/library\/whitepapers\/pdfs\/shapefile.pdf\">technical specifications are published<\/a>\u00a0and can be implemented and used freely. Largely as a result of ArcView\u2019s popularity, shapefile has become a de facto standard for creation and interchange of vector geospatial data. The Census Bureau\u2019s adoption of Shapefile as a distribution format is therefore consistent with its overall strategy of conformance with mainstream information technology practices.<\/p>\n<h3>ELEMENTS OF A SHAPEFILE DATA SET<\/h3>\n<p>The first thing GIS pros need to know about shapefiles is that\u00a0<strong>every shapefile data set includes a minimum of three files<\/strong>. One of the three required files stores the geometry of the digital features as sets of vector coordinates. A second required file holds an index that, much like the index in a book, allows quicker access to the spatial features and therefore speeds processing of a given operation involving a subset of features. The third required file stores attribute data in dBASE\u00a9 format, one of the earliest and most widely-used digital database management system formats. All of the files that make up a Shapefile data set have the same root or prefix name, followed by a three-letter suffix or file extension. The list below shows the names of the three required files making up a shapefile data set named \u201ccounties.\u201d Take note of the file extensions.<\/p>\n<ul>\n<li>counties.shp: The main shape file, containing vector coordinate data<\/li>\n<li>counties.shx: The index file<\/li>\n<li>counties.dbf: The dBASE table<\/li>\n<\/ul>\n<p>Esri lists twelve additional optional files, and practitioners are able to include still others. Two of the most important optional files are the \u201c.prj\u201d file, which includes the coordinate system definition, and \u201c.xml\u201d, which stores metadata. (Why do you suppose that something as essential as a coordinate system definition is considered \u201coptional\u201d?)<\/p>\n<h3><strong>TRY THIS!<\/strong><\/h3>\n<h3>DOWNLOADING AND VIEWING A TIGER\/LINE SHAPEFILE<\/h3>\n<p>In this\u00a0<em>Try This!<\/em>\u00a0(the second of 3 dealing with TIGER\/Line Shapefiles), you will download a TIGER\/Line Shapefile dataset, investigate the file structure of a typical Esri shapefile, and view it in GIS software.<\/p>\n<p>You can use a free software application called\u00a0<strong>Global Mapper\u00a0<\/strong>(originally known as\u00a0<strong>dlgv32 Pro<\/strong>) to investigate TIGER\/Line shapefiles. Originally developed by the staff of the USGS Mapping Division at Rolla, Missouri as a data viewer for USGS data, Global Mapper has since been commercialized, but is available in a free trial version. The instructions below will guide you through the process of installing the software and opening the TIGER\/Line data.<\/p>\n<ol>\n<li><strong>Downloading TIGER\/Line Shapefiles:\u00a0<\/strong>You are going to use the 2010 TIGER\/Line Shapefiles.\n<ul>\n<li>Return to the\u00a0<a href=\"http:\/\/www.census.gov\/cgi-bin\/geo\/shapefiles2010\/main\">2010 TIGER\/Line Shapefiles download page<\/a>.<\/li>\n<li>From the\u00a0<em>Select a layer type<\/em>\u00a0pick list, under\u00a0<em>Features<\/em>, choose\u00a0<strong>All Lines<\/strong>\u00a0and click\u00a0<strong>submit<\/strong>. (You are welcome to download and investigate any TIGER\/Line Shapefile(s), but we will use an\u00a0<em>All Lines<\/em>dataset in the geocoding Try This later in the chapter, so your downloading one here will make you more familiar with the content.)<\/li>\n<li>From the All Lines pick list select a state or territory and click<strong>Submit<\/strong>.<\/li>\n<li>Select a County from the next pick list that appears and click<strong>Download<\/strong>.<\/li>\n<li>Save the file to your computer.<br \/>\nThe file you download should have a name like<em>tl_2010_42027_edges.zip<\/em>. The root name of this file,<em>tl_2010_<\/em><em>42027<\/em><em>_edges<\/em>\u00a0in this example, will also be the name of the shapefile dataset. The\u00a0<em>42027\u00a0<\/em>is a federal code that represents Pennsylvania (state 42) and Centre County (county 027). The five-digit code in your file name will depend on which state and county you selected.<\/li>\n<li>The data are compressed in a .zip archive. Extract the data to a new named folder in a known location. (Within the file hierarchy that is extracted there may be a second .zip file that needs to be uncompressed.)<\/li>\n<\/ul>\n<\/li>\n<li><strong>Investigating the shapefile data set:<\/strong>\n<ul>\n<li>Navigate to\u00a0<em>within<\/em>\u00a0the folder in which you stored your uncompressed TIGER\/Line Shapefile dataset.<\/li>\n<li>Notice the multiple files which make up the shapefile dataset, including:\n<ul>\n<li>tl_2010_42027_edges.shp, containing the vector coordinate data<\/li>\n<li>tl_2010_42027_edges.shp.xml, containing metadata<\/li>\n<li>tl_2010_42027_edges.shx, the index file<\/li>\n<li>tl_2010_42027_edges.dbf, the dBASE file<\/li>\n<li>tl_2010_42027_edges.prj, containing the projection\/spatial reference<\/li>\n<\/ul>\n<\/li>\n<li>All of the files work in concert to store the necessary components of the Esri\u00a0<em>shapefile data set<\/em>. You may be familiar with some of the individual files types. The contents of three of them can be easily viewed. Let\u2019s open those three. You can double click on the file and then select \u201cfrom a list of installed programs,\u201d or you may need to run the suggested application and open the file from within it. Let me know if you need help, or help each other in the ANGEL Chapter 4 Discussion Forum or in the Comments area below.\n<ul>\n<li>Open the\u00a0<strong>.dbf<\/strong>\u00a0file using Microsoft Excel.<br \/>\nNote the typical row-column structure of a flat-file database. Can you find the four columns, or fields, that hold the address range information? Look for LFROMADD, etc. The field name LFROMADD is shorthand for Left From Address. The 10-character length of the field name points up one of the constraints of the dBASE format\u2013field names are limited to 10 characters.<\/li>\n<li>Open the<strong>\u00a0.xml<\/strong>\u00a0file using your web browser.<br \/>\nYou should see the metadata information bracketed by\u00a0<em>tags<\/em>contained within directional brackets &lt; &gt;. XML stands for Extensible Markup Language, and is a common set of rules for encoding documents. Can you locate the portion of the document having to do with horizontal spatial accuracy?\u00a0 (Spatial accuracy metadata is available when you\u2019ve chosen the\u00a0<em>All Lines\u00a0<\/em>file as your candidate shapefile.)<\/li>\n<li>Open the<strong>\u00a0.prj<\/strong>\u00a0file using Notepad, or any vanilla text editor.<br \/>\nThere are five pieces of information in this file, separated by commas. What are they? They should reinforce some of what you learned in Chapter 2 regarding what defines a geographic coordinate system.<\/li>\n<li>The\u00a0<strong>.shp<\/strong>\u00a0and\u00a0<strong>.shx<\/strong>\u00a0files are proprietary and specific to the functionality of the shapefile data set.<\/li>\n<\/ul>\n<\/li>\n<li>Discuss what you find with your classmates in comments below.<\/li>\n<li>Note that one should not alter the contents of any of these files with any application other than a GIS program that is designed for that task.<\/li>\n<\/ul>\n<\/li>\n<li><strong>Viewing the shapefile dataset in Global Mapper:<\/strong>\n<ul>\n<li>Download and install the Global Mapper software:\n<ol>\n<li>Navigate to the\u00a0<a href=\"http:\/\/www.bluemarblegeo.com\/products\/global-mapper.php\">Blue Marble Global Mapper site<\/a>.<\/li>\n<li>Download the trial version of the software<\/li>\n<li>Double-click on the setup file you downloaded to install the program<\/li>\n<li>Launch the Global Mapper program<\/li>\n<\/ol>\n<\/li>\n<li>After opening the Global Mapper software, choose\u00a0<em>Open Data File(s)..<\/em>. under the\u00a0<em>File\u00a0<\/em>menu, or click the \u201cOpen Your Own Data Files\u201d button in the center of the window.\u00a0 Navigate to the extracted shapefile dataset you downloaded above and open it. (Remember, your complete shapefile data set will have a name similar to<em>tl_2010_42027_edges<\/em>. It will show up in the\u00a0<em>Open\u00a0<\/em>dialog with a .shp extension.)<\/li>\n<li>You should be able to see all of the line features (the\u00a0<em>edges<\/em>, from the MAF\/TIGER database) contained in your county. If you are using the newest version of Global Mapper you should be able to discern roads from rivers\/streams from administrative boundaries, etc. In older versions of the application the default view showed all line features in a single color and line weight, so the user needed to use the symbolization tools to make the different classes of features distinguishable.<br \/>\nWhat do you think has to be understood by the mapping application to allow it to automatically symbolize features differently? Post your thoughts below.<\/li>\n<\/ul>\n<\/li>\n<\/ol>\n<h3>SHAPEFILE PRIMITIVES<\/h3>\n<p>A single shapefile data set can contain one of three types of spatial data primitives, or features \u2013 points, lines or polygons (areas). The technical specification defines these as follows:<\/p>\n<ul>\n<li><strong>Points<\/strong>: A point consists of a pair of double-precision coordinates in the order X,Y.<\/li>\n<li><strong>Lines<\/strong>: More specifically a polyline, is an ordered set of points, or vertices, that consists of one or more\u00a0<strong>parts<\/strong>. A part is a connected sequence of two or more points. Parts may or may not be connected to one another. Parts may or may not intersect one another.<\/li>\n<li><strong>Polygons<\/strong>: A polygon consists of one or more\u00a0<strong>rings<\/strong>. A ring is a connected sequence of four or more points, or vertices, that form a closed, non-self-intersecting loop.<\/li>\n<li><strong>Other<\/strong>: M (measured; route data) and Z (3D; vertical datum) versions of point, polyline and polygon Shapefile data sets can be created, but are not included in the TIGER\/Line Shapefile extracts.<\/li>\n<\/ul>\n<p><img decoding=\"async\" alt=\"Diagram illustrating geometric primitives of the Shapefile format\" src=\"http:\/\/opentextbc.ca\/natureofgeographicinformation\/wp-content\/uploads\/sites\/19\/2014\/01\/ch4_5_Shapefile_primitives.png\" \/><\/p>\n<p>&nbsp;<\/p>\n<p>Three Shapefile data sets that could be extracted from the MAF\/TIGER data depicted on the preceding page<\/p>\n<p>At left in the illustration above, a\u00a0<strong>polygon Shapefile data set<\/strong>\u00a0holds the Census blocks in which the edges from the MAF\/TIGER database have been combined to form two distinct polygons, P1 and P2. The diagram shows the two polygons separated to emphasize the fact that what is the single E12 edge in the MAF\/TIGER database (see the diagram on page 4)\u00a0 is now present in each of the Census block polygon features.<\/p>\n<p>In the middle of the illustration, a\u00a0<strong>polyline Shapefile data set<\/strong>\u00a0holds seven line features (L1-7) that correspond to the seven edges in the MAF\/TIGER database. The directionality of the line features that represent streets corresponds to address range attributes in the associated dBASE\u00a9 table. Vertices define the shape of a polygon or a line, and the Start and End Nodes from the MAF\/TIGER database are now First and Last Vertices.<\/p>\n<p>Finally, at right in the illustration above, a\u00a0<strong>point Shapefile data set<\/strong>\u00a0holds the three isolated nodes from the MAF\/TIGER database.<\/p>\n<h3><strong>PRACTICE QUIZ<\/strong><\/h3>\n<p>Registered Penn State students should return now to the Chapter 4 folder in ANGEL (via the Resources menu to the left) to take a self-assessment quiz about Shapefiles.<\/p>\n<p>You may take practice quizzes as many times as you wish. They are not scored and do not affect your grade in any way.<\/p>\n<h2>4.6. Topology<\/h2>\n<p>Topology is different than topography. (You\u2019d be surprised how often these terms get mixed up.) In Chapter 2 you read about the various ways that\u00a0<em>absolute<\/em>\u00a0positions of features can be specified in a coordinate system, and how those coordinates can be projected or otherwise transformed. Topology refers to the\u00a0<em>relative<\/em>\u00a0positions of spatial features.\u00a0<strong>Topological relations among features<\/strong>\u2014such as containment, connectivity, and adjacency\u2014<strong>don\u2019t change when a dataset is transformed<\/strong>. For example, if an isolated node (representing a household) is located inside a face (representing a congressional district) in the MAF\/TIGER database, you can count on it remaining inside that face no matter how you might project, rubber-sheet, or otherwise transform the data. Topology is vitally important to the Census Bureau, whose constitutional mandate is to accurately associate population counts and characteristics with political districts and other geographic areas.<\/p>\n<p>As David Galdi (2005) explains in his white paper \u201cSpatial Data Storage and Topology in the Redesigned MAF\/TIGER System,\u201d the \u201cTI\u201d in TIGER stands for \u201cTopologically Integrated.\u201d This means that\u00a0<strong>the various features represented in the MAF\/TIGER database<\/strong>\u2014such as streets, waterways, boundaries, and landmarks (but not elevation!)\u2014<strong>are not encoded on separate \u201clayers.\u201d<\/strong>\u00a0Instead, features are made up of a small set of geometric primitives\u2014including 0-dimensional nodes and vertices, 1-dimensional edges, and 2-dimensional faces\u2014without redundancy. That means that where a waterway coincides with a boundary, for instance, MAF\/TIGER represents them both with one set of edges, nodes and vertices. The attributes associated with the geometric primitives allow database operators to retrieve feature sets efficiently with simple spatial queries. The separate feature-specific TIGER\/Line Shapefiles published at the county level (such as point landmarks, hydrography, Census block boundaries, and, the \u201cAll Lines\u201d file you are using in the multi-part \u201cTry This\u201d) were extracted from the MAF\/TIGER database in that way. Notice, however, that when you examine a hydrography shapefile and a boundary shapefile, you will see redundant line segments where the features coincide. That fact confirms that\u00a0<strong>TIGER\/Line Shapefiles<\/strong>, unlike the MAF\/TIGER database itself,\u00a0<strong>are not topologically integrated<\/strong>. Desktop computers are now powerful enough to calculate\u00a0<strong>topology \u201con the fly\u201d<\/strong>from shapefiles or other non-topological data sets.\u00a0 However, the large batch processes performed by the Census Bureau still benefit from the MAF\/TIGER database\u2019s\u00a0<strong>persistent topology<\/strong>.<\/p>\n<p>MAF\/TIGER\u2019s topological data structure also benefits the Census Bureau by allowing it to automate error-checking processes. By definition, features in the TIGER\/Line files conform to a set of topological rules (Galdi 2005):<\/p>\n<ol>\n<li>Every edge must be bounded by two nodes (start and end nodes)..<\/li>\n<li>Every edge has a left and right face.<\/li>\n<li>Every face has a closed boundary consisting of an alternating sequence of nodes and edges.<\/li>\n<li>There is an alternating closed sequence of edges and faces around every node.<\/li>\n<li>Edges do not intersect each other, except at nodes.<\/li>\n<\/ol>\n<p>Compliance with these topological rules is an aspect of data quality called<strong>logical consistency<\/strong>.\u00a0 In addition, the boundaries of geographic areas that are related hierarchically\u2014such as blocks, block groups, tracts, and counties\u2014are represented with common, non-redundant edges. Features that do not conform to the topological rules can be identified automatically, and corrected by the Census geographers who edit the database. Given that the MAF\/TIGER database covers the entire U.S. and its territories, and includes many millions of primitives, the ability to identify errors in the database efficiently is crucial.<\/p>\n<p>So how does topology help the Census Bureau assure the accuracy of population data needed for reapportionment and redistricting? To do so, the Bureau must aggregate counts and characteristics to various geographic areas, including blocks, tracts, and voting districts. This involves a process called \u201caddress matching\u201d or \u201caddress geocoding\u201d in which data collected by household is assigned a topologically-correct geographic location. The following pages explain how that works.<\/p>\n<h3><strong>PRACTICE QUIZ<\/strong><\/h3>\n<p>Registered Penn State students should return now to the Chapter 4 folder in ANGEL (via the Resources menu to the left) to take a self-assessment quiz about Topology.<\/p>\n<p>You may take practice quizzes as many times as you wish. They are not scored and do not affect your grade in any way.<\/p>\n<h2>4.7. Geocoding<\/h2>\n<p><strong>Geocoding is the process used to convert location codes, such as street addresses or postal codes, into geographic (or other) coordinates.<\/strong>\u00a0The terms \u201caddress geocoding\u201d and \u201caddress mapping\u201d refer to the same process. Geocoding address-referenced population data is one of the Census Bureau\u2019s key responsibilities.\u00a0 However, as you know, it\u2019s also a very popular capability of online mapping and routing services. In addition, geocoding is an essential element of a suite of techniques that are becoming known as \u201cbusiness intelligence.\u201d We\u2019ll look at applications like these later in this chapter, but first let\u2019s consider how the Census Bureau performs address geocoding.<\/p>\n<h3>ADDRESS GEOCODING AT THE U.S. CENSUS<\/h3>\n<p>Prior to the MAF\/TIGER modernization project that led up to the decennial census of 2010, the TIGER database did not include a complete set of point locations for U.S. households. Lacking point locations, TIGER was designed to support address geocoding by approximation. As illustrated below, the pre-modernization TIGER database included<strong>address range attributes<\/strong>\u00a0for the edges that represent streets. Address range attributes were also included in the TIGER\/Line files extracted from TIGER. Coupled with the Start and End nodes bounding each edge, address ranges enable users to estimate locations of household addresses.<\/p>\n<p><img decoding=\"async\" alt=\"Diagram showing neighborhood map with addresses (top) and the adress data being recorded in program window (bottom)\" src=\"http:\/\/opentextbc.ca\/natureofgeographicinformation\/wp-content\/uploads\/sites\/19\/2014\/01\/address_ranges.gif\" \/><\/p>\n<p>How address range attributes were encoded in TIGER\/Line files (U.S. Census Bureau 1997). Address ranges in contemporary TIGER\/Line Shapefiles are similar, except that \u201cFrom\u201d (FR) and \u201cTo\u201d nodes are now called \u201cStart\u201d and \u201cEnd\u201d. Also, changes have been made to field (column) names in the attribute tables. Compare the names of the address range fields that you looked at in the second Try This exercise to those above.<\/p>\n<p>Here\u2019s how it works. The diagram above highlights an edge that represents a one-block segment of Oak Avenue. The edge is bounded by two nodes, labeled \u201cStart\u201d and \u201cEnd.\u201d A corresponding record in an attribute table includes the unique ID number (0007654320) that identifies the edge, along with starting and ending addresses for the left (FRADDL, TOADDL) and right (FRADDR, TOADDR) sides of Oak Avenue. Note also that the address ranges include potential addresses, not just existing ones. This is to make sure that the ranges will remain valid as new buildings are constructed along the street.<\/p>\n<p><strong>A common geocoding error occurs when Start and End designations are assigned to the wrong connecting nodes<\/strong>. You may have read in Galdi\u2019s (2005) white paper \u201cSpatial Data Storage and Topology in the Redesigned MAF\/TIGER System,\u201d that in MAF\/TIGER, \u201can arbitrary direction is assigned to each edge, allowing designation of one of the nodes as the Start Node, and the other as the End Node\u201d (p. 3). If an edge\u2019s \u201cdirection\u201d happens not to correspond with its associated address ranges, a household location may be placed on the wrong side of a street.<\/p>\n<p>Although many local governments in the U.S. have developed their own GIS \u201cland bases\u201d with greater geometric accuracy than pre-modernization TIGER\/Line files, similar address geocoding errors still occur. Kathryn Robertson, a GIS Technician with the City of Independence, Missouri (and a student in the Fall 2000 offering of this course) pointed out how important it is that Start (or \u201cFrom\u201d) nodes and End (or \u201cTo\u201d) nodes correspond with the low and high addresses in address ranges. \u201cI learned this the hard way,\u201d she wrote, \u201cgeocoding all 5,768 segments for the city of Independence and getting some segments backward. When address matching was done, the locations were not correct. Therefore, I had to go back and look at the direction of my segments. I had a rule of thumb, all east-west streets were to start from west and go east; all north-south streets were to start from the south and go north\u201d (personal communication).<\/p>\n<p>Although this may have been a sensible strategy for the City of Independence, can you imagine a situation in which Kathryn\u2019s rule-of-thumb might not work for another municipality? If so, and if you\u2019re a registered student, please add a comment to this page.<\/p>\n<h3>AFTER MAF\/TIGER MODERNIZATION<\/h3>\n<p>If TIGER had included accurate coordinate locations for every household, and correspondingly accurate streets and administrative boundaries, geocoding census data would be simple and less error-prone. Many local governments digitize locations of individual housing units when they build GIS land bases for property tax assessment, E-911 dispatch and other purposes. The MAF\/TIGER modernization project begun in 2002 aimed to accomplish this for the entire nationwide TIGER database in time for the 2010 census. The illustration below shows the intended result of the modernization project, including properly aligned streets, shorelines, and individual household locations, shown here in relation to an orthorectified aerial image.<\/p>\n<p><img decoding=\"async\" alt=\"Image showing modernized TIGER household locations and aligned streets\" src=\"http:\/\/opentextbc.ca\/natureofgeographicinformation\/wp-content\/uploads\/sites\/19\/2014\/01\/TIGER_goal.png\" \/><\/p>\n<p>Intended accuracy and completeness of modernized TIGER data in relation to the real world. TIGER streets (yellow), shorelines (blue), and housing unit locations (red) are superimposed over an orthorectified aerial image. (U.S. Census Bureau n.d.). National coverage of housing unit locations and geometrically-accurate streets and other features were not available in 2000 or before.<\/p>\n<p>The modernized MAF\/TIGER database described by Galdi (2005) is now in use, including precise geographic locations of over 100 million household units. However, because\u00a0<strong>household locations are considered confidential<\/strong>, users of TIGER\/Line Shapefiles extracted from the MAF\/TIGER database still must rely upon address geocoding using address ranges.<\/p>\n<h3>LEVERAGING TIGER\/LINE DATA FOR PRIVATE ENTERPRISE<\/h3>\n<p>Launched in 1996, MapQuest was one of the earliest online mapping, geocoding and routing services. MapQuest combined the capabilities of two companies: a cartographic design firm with long experience in producing road atlases, \u201cTripTiks\u201d for the American Automobile Association, and other map products, and a start-up company that specialized in custom geocoding applications for business.\u00a0 Initially, MapQuest relied in part on TIGER\/Line street data extracted from the pre-modernization TIGER database. MapQuest and other commercial firms were able to build their businesses on TIGER data because of the U.S. government\u2019s wise decision not to restrict its reuse. It\u2019s been said that this decision triggered the rapid growth of the U.S. geospatial industry.<\/p>\n<p>Later on in this chapter we\u2019ll visit MapQuest and some of its more recent competitors. Next, however, you\u2019ll have a chance to see how geocoding is performed using a TIGER\/Line data in a GIS.<\/p>\n<h2>4.8. Geocoding with TIGER\/Line Shapefiles<\/h2>\n<h3><strong>TRY THIS!<\/strong><\/h3>\n<h3>GEOCODING IN A GIS<\/h3>\n<p>Part 3 of 3 in the TIGER\/Line Shapefile\u00a0<em>Try This!<\/em>\u00a0series is not interactive but instead illustrates how the address ranges encoded in TIGER\/Line Shapefiles can be used to pinpoint (more or less!) the geographic locations of street addresses in the U.S.<\/p>\n<p>The process of geocoding a location within a GIS begins with a line dataset (shapefile) with the necessary address range attributes.\u00a0 The following image is an example of the attribute table of a TIGER\/Line shapefile.<\/p>\n<p><img decoding=\"async\" alt=\"Screenshot of Attribute Table\" src=\"http:\/\/opentextbc.ca\/natureofgeographicinformation\/wp-content\/uploads\/sites\/19\/2014\/01\/482Attributes.jpg\" \/><\/p>\n<p>Visible in this image are just a few rows, which represent a handful of road segments and their corresponding address ranges.\u00a0 This shapefile contains over 29,000 road segments in total.\u00a0 Note the names of some of the attributes:<\/p>\n<ul>\n<li>FULLNAME \u2013 The street name of the road segment<\/li>\n<li>LFROMADD \u2013 The address number at the beginning of the road segment on the left side of the street<\/li>\n<li>LTOADD \u2013 The address number at the end of the road segment on the left side of the street<\/li>\n<li>RFROMADD \u2013 The address number at the beginning of the road segment on the right side of the street<\/li>\n<li>RTOADD \u2013 The address number at the end of the road segment on the right side of the street<\/li>\n<li>ZIPL \u2013 The zip code area that is present to the left side of the road segment<\/li>\n<li>ZIPR \u2013 The zip code area that is present to the right side of the street<\/li>\n<\/ul>\n<p>Next, the GIS software needs to know which of these attributes contains each piece of the necessary address range information.\u00a0 Some shapefiles use different names for their attributes, so the GIS can\u2019t always know which attribute contains the Right-Side-From-Address information, for example.\u00a0 In ArcGIS, for example, something called a Locator is configured that maps the attributes in the shapefile to the corresponding piece of necessary address information.\u00a0 The image below illustrates what this mapping looks like:<\/p>\n<p><img decoding=\"async\" alt=\"Screenshot of ArcGIS Locator\" src=\"http:\/\/opentextbc.ca\/natureofgeographicinformation\/wp-content\/uploads\/sites\/19\/2014\/01\/482AddressLocator.jpg\" \/><\/p>\n<p>Note the items with an asterisk (*).\u00a0 These are the minimum required attributes that need to be present in the shapfile for the geocoding to work.\u00a0 The items in the \u201cAlias Name\u201d column correspond to attributes in the shapefile.<\/p>\n<p>We are now ready to find a location by searching for a street address!\u00a0 Let\u2019s geocode the location for \u201c1971 Fairwood Lane, 16803\u2033.<\/p>\n<p>When an address is specified, the GIS queries the attribute table to find rows with a matching street name in the correct zipcode.\u00a0 Also, the particular segment of the street that contains the address number is identified.\u00a0 The below image shows the corresponding selection in the attribute table:<\/p>\n<p><img decoding=\"async\" alt=\"Screenshot of Highlighted Attribute\" src=\"http:\/\/opentextbc.ca\/natureofgeographicinformation\/wp-content\/uploads\/sites\/19\/2014\/01\/482HighlightedAttribute.jpg\" \/><\/p>\n<p>The image below shows the corresponding road segment highlighted on a map.\u00a0 The To and From address values for the road segment have been added so you can see the range of addresses.<\/p>\n<p><img decoding=\"async\" alt=\"Screenshot of Road Segment\" src=\"http:\/\/opentextbc.ca\/natureofgeographicinformation\/wp-content\/uploads\/sites\/19\/2014\/01\/482RoadSegment.jpg\" \/><\/p>\n<p>Finally, the GIS interpolates where along the road segment the value of 1971 occurs and places it on the appropriate side of the street based on the even\/odd values indicated in the attribute table.\u00a0 The image below shows the final result of the geocoding process:<\/p>\n<p><img decoding=\"async\" alt=\"Screenshot of Final Result\" src=\"http:\/\/opentextbc.ca\/natureofgeographicinformation\/wp-content\/uploads\/sites\/19\/2014\/01\/482PointOnMap.jpg\" \/><\/p>\n<p>The accuracy of a geocoded location is dependent on a number of factors, including the quality of the line work in a shapefile, the accuracy of the address range attributes of each road segment, and the interpolation performed by the software.\u00a0 As you may see in the following section, different geocoding services may provide different location results due to the particular data and procedures used.<\/p>\n<h2>4.9. Geocoding Online<\/h2>\n<p>No doubt you\u2019re familiar with one or more popular online mapping services. How well do they do at geocoding the location of a postal address? You can try it out for yourself at several Web-based mapping services, including\u00a0<a href=\"http:\/\/www.mapquest.com\/\">MapQuest.com<\/a>,\u00a0<a href=\"http:\/\/www.bing.com\/maps\/\">Microsoft\u2019s Bing Maps<\/a>, and\u00a0<a href=\"http:\/\/www.geocode.com\/\">Tele Atlas\/TomTom\u2019s Geocode.com<\/a>. Tele Atlas, for example, is a leading manufacturer of digital street data for vehicle navigation systems. To accommodate the routing tasks that navigation systems are called upon to serve, the streets are encoded as vector features whose attributes include address ranges. (In order to submit an address for geocoding at Geocode.com you have to set up a trial account through their EZ-Locate Interactive web tool or download the EZ-Locate software).<\/p>\n<p><img decoding=\"async\" alt=\"Screenshot of the Tele Atlas Geocode.com adress submission window\" src=\"http:\/\/opentextbc.ca\/natureofgeographicinformation\/wp-content\/uploads\/sites\/19\/2014\/01\/ch4p9TeleAtlasTomTom_EZ-Locate_input_jan2013.jpg\" \/><\/p>\n<p>Submitting an address to Tele Atlas\u2019 Geocode.com service for geocoding. \u00a9 2013 TomTom North America, Inc. All rights reserved.<\/p>\n<p>Shown above is the form by which you can geocode an address to a location in a Tele Atlas street database. The result is shown below.<\/p>\n<p><img decoding=\"async\" alt=\"Screenshot of Tele Atlas geocoding results window\" src=\"http:\/\/opentextbc.ca\/natureofgeographicinformation\/wp-content\/uploads\/sites\/19\/2014\/01\/ch4p9TeleAtlasTomTom_EZ-Locate_result_jan2013.jpg\" \/><\/p>\n<p>Tele Atlas\u2019 Geocode.com service estimates the location of the address relative to the address range attributes encoded in its database. \u00a9 2013 TomTom North America, Inc. All rights reserved.<\/p>\n<p>Let\u2019s compare the geocoding capabilities of MapQuest.com to locate the address on an actual map.<\/p>\n<p><img decoding=\"async\" alt=\"Screenshot of Mapquest Address Locator 2013\" src=\"http:\/\/opentextbc.ca\/natureofgeographicinformation\/wp-content\/uploads\/sites\/19\/2014\/01\/ch4p9Mapquest_jan2013.jpg\" \/><\/p>\n<p>Address geocoded by MapQuest.com. \u00a9 2013 MapQuest.com, Inc. All rights reserved.<\/p>\n<p>The MapQuest.com map from 2013 estimates the address is close to its actual location. Below is a similar MapQuest product created back in 1998, when this course was first being developed. On the older map the same address is plotted on the opposite side of the street. What do you suppose is wrong with the address range attribute in that case?<\/p>\n<p>On the map from 1998, also note the shapes of the streets. The street shapes in the 2011 map have been improved.\u00a0 The 1998 product seems to have been generated from the 1990 version of the TIGER\/Line files, which may have been all that was available for this relatively remote part of the country.\u00a0 Now MapQuest licenses street data from a business partner called\u00a0<a href=\"http:\/\/www.navteq.com\/\">NAVTEQ<\/a>.<\/p>\n<p><img decoding=\"async\" alt=\"Screenshot of MapQuest 1998\" src=\"http:\/\/opentextbc.ca\/natureofgeographicinformation\/wp-content\/uploads\/sites\/19\/2014\/01\/geocoding_mapquest.gif\" \/><\/p>\n<p>Same address geocoded by MapQuest.com in 1998. \u00a9 1998 MapQuest.com, Inc. (formerly GeoSystems Global Corp.) All rights reserved.<\/p>\n<p>The point of this section is to show that geocoding with address ranges involves a process of estimation. The Census Bureau\u2019s TIGER\/Line Shapefiles, like the commercial street databases produced by Tele Atlas, Navigation Technologies, and other private firms, represent streets as vector line segments. The vector segments are associated with address range attributes, one for the left side of the street, one for the right side. The geocoding process takes a street address as input, finds the line segment that represents the specified street, checks the address ranges to determine the correct side of the street, then estimates a location at the appropriate point between the minimum and maximum address for that segment and assignes an estimated latitude\/longitude coordinate to that location. For example, if the minimum address is 401, and the maximum is 421, a geocoding algorithm would locate address 411 at the midpoint of the street segment.<\/p>\n<h3><strong>TRY THIS!<\/strong><\/h3>\n<p>Try one of these geocoding services for your address. Then compare the experience, and the result, with\u00a0<a href=\"http:\/\/maps.google.com\/\">Google Maps<\/a>, launched in 2005. Apply what we\u2019ve discussed in this chapter to try to explain inaccuracies in your results, if any. Registered students can log in and post comments directly to this page.<\/p>\n<h3><strong>PRACTICE QUIZ<\/strong><\/h3>\n<p>Registered Penn State students should return now to the Chapter 4 folder in ANGEL (via the Resources menu to the left) to take a self-assessment quiz about Geocoding.<\/p>\n<p>You may take practice quizzes as many times as you wish. They are not scored and do not affect your grade in any way.<\/p>\n<h2>4.10. Applications beyond the Census Bureau<\/h2>\n<p>Two characteristics of MAF\/TIGER data, address range attributes and explicit topology, make them, and derivative products, valuable in many contexts. Consequently, firms like\u00a0<a href=\"http:\/\/www.navteq.com\/\">NAVTEQ<\/a>\u00a0and\u00a0<a href=\"http:\/\/www.teleatlas.com\/\">Tele Atlas<\/a>\u00a0(now owned by TomTom) have emerged to provide data with similar characteristics as MAF\/TIGER, but which are more up-to-date, more detailed and include additional feature classes. The purpose of the next section is to sketch some of the applications of data similar to MAF\/TIGER data beyond the Census Bureau.<\/p>\n<h3><strong>TRY THIS!<\/strong><\/h3>\n<p>A\u00a0<a href=\"http:\/\/money.cnn.com\/2006\/02\/24\/Autos\/modern_mapmakers\/index.htm\">February 2006 article by Peter Valdes-Dapena in CNNMoney.com<\/a>describes the work of two NAVTEQ employees. See the link above or search on \u201cwhere those driving directions really come from\u201d<\/p>\n<h2>4.11. Geocoding Your Customers<\/h2>\n<p>Geocoded addresses allow governments and businesses to map where their constituents and customers live and work. Federal, state, and local government agencies know where their constituents live by virtue of censuses, as well as applications for licenses and registrations. Banks, credit card companies, and telecommunications firms are also rich in address-referenced customer data, including purchasing behaviors. Private businesses and services must be more resourceful.<\/p>\n<p>Some retail operations, for example, request addresses or ZIP Codes from customers, or capture address data from checks. Discount and purchasing club cards allow retailers to directly match purchasing behaviors with addresses. Customer addresses can also be harvested from automobile license plates. Business owners pay to record license plate numbers of cars parked in their parking lots or in their competitors. Addresses of registered owners can be purchased from organizations that acquire motor vehicle records from state departments of transportation.<\/p>\n<p>Businesses with access to address-referenced customer data, vector street data attributed with address ranges, and GIS software and expertise, can define and analyze the\u00a0<strong>trade areas<\/strong>\u00a0within which most of their customers live and work. Companies can also focus direct mail advertising campaigns on their own trade areas, or their competitors\u2019. Furthermore, GIS can be used to analyze the socio-economic characteristics of the population within trade areas, enabling businesses to make sure that the products and services they offer meet the needs and preferences of target populations.<\/p>\n<p>Politicians use the same tools to target appearances and campaign promotions.<\/p>\n<h3><strong>TRY THIS!<\/strong><\/h3>\n<p>Check out the\u00a0<a href=\"http:\/\/www.ffiec.gov\/Geocode\/default.aspx\">geocoding system maintained by the Federal Financial Institution\u2019s Examination Council<\/a>. The FFIEC Geocoding system lets users enter a street address and get a census demographic report or a street map (Using Tele Atlas data). The system is intended for use by financial institutions that are covered by the Home Mortgage Disclosure Act (HMDA) and Community Reinvestment Act (CRA) to meet their reporting obligation.<\/p>\n<h2>4.12. Delivering Products and Services<\/h2>\n<p>Operations such as mail and package delivery, food and beverage distribution, and emergency medical services need to know not only where their customers are located, but how to deliver products and services to those locations as efficiently as possible. Geographic data products like TIGER\/Line Shapefiles are valuable to analysts responsible for prescribing the most efficient delivery routes. The larger and more complex the service areas of such organizations, the more incentive they have to automate their routing procedures.<\/p>\n<p>In its simplest form,\u00a0<strong>routing<\/strong>\u00a0involves finding the shortest path through a network from an origin to a destination. Although shortest path algorithms were originally implemented in raster frameworks, transportation networks are now typically represented with vector feature data, like TIGER\/Line Shapefiles. Street segments are represented as digital line segments each formed by two points, a \u201cstart\u201d node and an \u201cend\u201d node. If the nodes are specified within geographic or plane coordinate systems, the distance between them can be calculated readily. Routing procedures sum the lengths of every plausible sequence of line segments that begins and ends at the specified locations. The sequence of segments associated with the smallest sum represents the shortest route.<\/p>\n<p>To compare various possible sequences of segments, the data must indicate which line segment follows immediately after another line segment. In other words, the procedure needs to know about the connectivity of features. As discussed earlier, connectivity is an example of a topological relationship. If topology is not encoded in the data product, it can be calculated by the GIS software in which the procedure is coded.<\/p>\n<p><img decoding=\"async\" alt=\"Screenshot of MapQuest 1998\" src=\"http:\/\/opentextbc.ca\/natureofgeographicinformation\/wp-content\/uploads\/sites\/19\/2014\/01\/routing_form.gif\" \/><\/p>\n<p>Input form for an early version of the\u00a0<a href=\"http:\/\/www.mapquest.com\/\">MapQuest<\/a>\u00a0routing utility. \u00a9 1998 MapQuest.com, Inc. All rights reserved.<\/p>\n<p>Several online travel planning services, including MapQuest.com and Google Maps, provide routing capabilities. Both take origin and destination addresses as input, and produce optimal routes as output. These services are based on vector feature databases in which street segments are attributed with address ranges, as well as with other data that describe the type and conditions of the roads they represent.<\/p>\n<p><img decoding=\"async\" alt=\"Screenshot of MapQuest options window\" src=\"http:\/\/opentextbc.ca\/natureofgeographicinformation\/wp-content\/uploads\/sites\/19\/2014\/01\/routing_options.gif\" \/><\/p>\n<p>An early interface to MapQuest\u2019s routing options. Different algorithms are required to calculate shortest and fastest routes. Specific attributes must be encoded in the database to provide the options to avoid limited access highways, toll roads, and ferry lanes. \u00a9 1998 MapQuest.com, Inc. All rights reserved.<\/p>\n<p>The shortest route is not always the best. In the context of emergency medical services, for example, the fastest route is preferred, even if it entails longer distances than others. To determine fastest routes, additional attribute data must be encoded, such as speed limits, traffic volumes, one way streets, and other characteristics.<\/p>\n<p><img decoding=\"async\" alt=\"Screenshot of MapQuest maps\" src=\"http:\/\/opentextbc.ca\/natureofgeographicinformation\/wp-content\/uploads\/sites\/19\/2014\/01\/routing_map.gif\" \/><\/p>\n<p>MapQuest routing solution. \u00a9 1998 MapQuest.com, Inc. All rights reserved.<\/p>\n<p>Then there are routing problems that involve multiple destinations\u2013a complex special case of routing called the\u00a0<strong>traveling salesman problem<\/strong>. School bus dispatchers, mail and package delivery service managers, and food and beverage distributors all seek to minimize the transportation costs involved in servicing multiple, dispersed destinations. As the number of destinations and the costs of travel increase, the high cost of purchasing up-to-date, properly attributed network data becomes easier to justify.<\/p>\n<h3><strong>TRY THIS<\/strong><\/h3>\n<p>The Georgia Institute of Technology publishes an\u00a0<a href=\"http:\/\/www.tsp.gatech.edu\/\">extensive collection of resources about the Traveling Salesman Problem<\/a>.<\/p>\n<h2>4.13. Delineating Service Areas<\/h2>\n<p>The need to redraw voting district boundaries every ten years was one of the motivations that led the Census Bureau to create its MAF\/TIGER database. Like voting districts, many other kinds of service area boundaries need to be revised periodically. School districts are a good example. The state of Massachusetts, for instance, has adopted school districting laws that are similar in effect to the constitutional criteria used to guide congressional redistricting. The Framingham (Massachusetts) School District\u2019s Racial Balance Policy once stated that \u201ceach elementary and middle school shall enroll a student body that is racially balanced. \u2026 each student body shall include a percentage of minority student, which reflects the system-wide percentage of minority students, plus or minus ten percent. \u2026 The racial balance required by this policy shall be established by redrawing school enrollment areas\u201d (Framingham Public Schools 1998). And bus routes must be redrawn as enrollment area boundaries change.<\/p>\n<p>The\u00a0<a href=\"http:\/\/www.cms.k12.nc.us\/\">Charlotte-Mecklenberg (North Carolina) public school district<\/a>\u00a0also used racial balance as a districting criterion (although its policy was subsequently challenged in court). Charlotte-Mecklenberg consists of 133 schools, attended by over 100,000 students, about one third of whom ride a bus to school every day. District managers are responsible for routing 3,600 bus routes, traveling a total of 82,000 daily miles. A staff of eight routinely uses GIS to manage these tasks. GIS could not be used unless up-to-date, appropriately attributed, and topologically encoded data were available.<\/p>\n<p>Another example of service area analysis is provided by the City of Beaverton, Oregon. In 1997, Beaverton officials realized that 25 percent of the volume of solid waste that was hauled away to land fills consisted of yard waste, such as grass clippings and leaves. Beaverton decided to establish a yard waste recycling program, but it knew that the program would not be successful if residents found it inconvenient to participate. A GIS procedure called\u00a0<strong>allocation<\/strong>\u00a0was used to partition Beaverton\u2019s street network into service areas that minimized the drive time from residents\u2019 homes to recycling facilities. Allocation procedures require vector-format data that includes the features, attributes, and topology necessary to calculate travel times from all residences to the nearest facility.<\/p>\n<p><img decoding=\"async\" alt=\"Screenshot of downtown Seattle GeoMap\" src=\"http:\/\/opentextbc.ca\/natureofgeographicinformation\/wp-content\/uploads\/sites\/19\/2014\/01\/drivetime_small.gif\" \/><\/p>\n<p>Trade areas defined by 3 miles travel distance (blue) and 8 minutes travel time (yellow). (Francica n.d.). Used by permission.<\/p>\n<p>Naturally, private businesses concerned with delivering products and services are keenly interested in service area delineation. The screen capture above shows two\u00a0<strong>trade areas<\/strong>\u00a0surrounding a retail store location (\u201cSeattle Downtown\u201d) in a network database.<\/p>\n<p>Former student Saskia Cohick (Winter 2006), who was then GIS Director for Tioga County, Pennsylvania, contributed another service area problem: \u201cThis is a topic that local governments are starting to deal with \u2026 To become Phase 2 wireless capable (that is, capable of finding a cell phone location from a 911 call center within 200 feet of the actual location), county call centers must have a layer called ESZs (Emergency Service Zones). This layer will tell the dispatcher who to send to the emergency (police, fire, medical, etc). The larger problem is to reach agreement between four fire companies (for example) as to where they do or do not respond.\u201d<\/p>\n<h2>4.14. Summary<\/h2>\n<p>To fulfill its mission of being the preeminent producer of attribute data about the population and economy of the United States, the U.S. Census Bureau also became an innovative producer of digital geographic data. The Bureau designed its MAF\/TIGER database to support automatic geocoding of address-referenced census data, as well as automatic data quality control procedures. The key characteristics of TIGER\/Line Shapefiles, including use of vector features to represent geographic entities, and address range attributes to enable address geocoding, are now common features of proprietary geographic databases used for trade area analysis, districting, routing, and allocation.<\/p>\n<h3><strong>QUIZ<\/strong><\/h3>\n<p>Registered Penn State students should return now to the Chapter 4 folder in ANGEL (via the Resources menu to the left) to access the graded quiz for this chapter. This one counts.\u00a0<strong>You may take graded quizzes only once.<\/strong><\/p>\n<p>The purpose of the quiz is to ensure that you have studied the text closely, that you have mastered the practice activities, and that you have fulfilled the chapter\u2019s learning objectives. You are free to review the chapter during the quiz. Once you\u2019ve submitted the quiz you will have completed Chapter 4.<\/p>\n<h3>COMMENTS AND QUESTIONS<\/h3>\n<p>Registered students are welcome to post comments, questions, and replies to questions about the text. Particularly welcome are anecdotes that relate the chapter text to your personal or professional experience. In addition, there are discussion forums available in the ANGEL course management system for comments and questions about topics that you may not wish to share with the whole world.<\/p>\n<p>To post a comment, scroll down to the text box under \u201cPost new comment\u201d and begin typing in the text box, or you can choose to reply to an existing thread. When you are finished typing, click on either the \u201cPreview\u201d or \u201cSave\u201d button (Save will actually submit your comment). Once your comment is posted, you will be able to edit or delete it as needed. In addition, you will be able to reply to other posts at any time.<\/p>\n<p>Note: the first few words of each comment become its \u201ctitle\u201d in the thread.<\/p>\n<h2>4.15. Bibliography<\/h2>\n<p>Charlotte-Mecklenberg Public Schools (n. d.). Retrieved July 19, 1999 from\u00a0<a href=\"http:\/\/www.cms.k12.nc.us\/\">http:\/\/www.cms.k12.nc.us<\/a><\/p>\n<p>Cooke, D. F. (1997). Topology and TIGER: The Census Bureau\u2019s Contribution. In T. W. Foresman (Ed.),\u00a0<em>The history of geographic information systems: Perspectives from the pioneers<\/em>. (pp. 47 \u2013 57). Upper Saddle River, NJ: Prentice Hall.<\/p>\n<p>Dangermond, J. (1982). A Classification of Software Components Commonly Used in Geographic Information Systems. In\u00a0<em>Proceedings of the U.S.\u2014Australia Workshop on the Design and Implementation of Computer-Based Geographic Information Systems<\/em>, Honolulu, HI, pp. 0-91. In Demers, M.N. (1997)\u00a0<em>Fundamentals of Geographic Information Systems.<\/em>\u00a0John Wiley &amp; Sons, Inc.<\/p>\n<p>Discreet Research (n.d.). Retrieved July 19, 1999 from<a href=\"http:\/\/www.dresearch.com\/\">http:\/\/www.dresearch.com<\/a><\/p>\n<p>ESRI (1998) Shapefile Technical Description, An ESRI White paper. Environmental Systems Research Institute, Inc. Retrieved October 4, 2010, from\u00a0<a href=\"http:\/\/www.esri.com\/library\/whitepapers\/pdfs\/shapefile.pdf\">http:\/\/www.esri.com\/library\/whitepapers\/pdfs\/shapefile.pdf<\/a><\/p>\n<p>Federal Geographic Data Committee (April 2006). Retrieved July 19, 1999 from\u00a0<a href=\"http:\/\/www.fgdc.gov\/\">http:\/\/www.fgdc.gov<\/a><\/p>\n<p>Framingham Public Schools (1998).\u00a0<em>Racial balance policy: Assignment of students to schools<\/em>. Retrieved July 19, 1999 from<a title=\"www.framingham.k12.ma.us\/update\/0198rbp.html\" href=\"http:\/\/www.framingham.k12.ma.us\/update\/0198rbp.html\">www.framingham.k12.ma.us\/update\/0198rbp.html<\/a>\u00a0(since retired).<\/p>\n<p>Francica, J. (n.d.).\u00a0<em>Geodezix Consulting<\/em>. Retrieved July 19, 1999 from<a title=\"www.geodezix.com\" href=\"http:\/\/www.geodezix.com\/\">www.geodezix.com<\/a>\u00a0(since retired).<\/p>\n<p>Galdi, D. (2005). Spatial Data Storage and Topology in the Redesigned MAF\/TIGER System. Retrieved 19 October 2010 from<a title=\"http:\/\/www.census.gov\/geo\/mtep_obj2\/topo_and_data_stor.html\" href=\"http:\/\/www.census.gov\/geo\/mtep_obj2\/topo_and_data_stor.html\">http:\/\/www.census.gov\/geo\/mtep_obj2\/topo_and_data_stor.html<\/a>\u00a0(since retired).<\/p>\n<p>MapQuest (n.d. a). Retrieved July 19, 1998 from<a href=\"http:\/\/www.mapquest.com\/\">http:\/\/www.mapquest.com<\/a><\/p>\n<p>MapQuest (n.d. b). Retrieved January 15, 2013 from<a href=\"http:\/\/www.mapquest.com\/\">http:\/\/www.mapquest.com<\/a><\/p>\n<p>Marx, R. M. (Ed.). (1990). The Census Bureau\u2019s TIGER system. [Special issue].\u00a0<em>Cartography and Geographic Information Systems<\/em>\u00a017:1.<\/p>\n<p>Navigation Technologies Inc. (2006).\u00a0<em>Welcome to NavTech<\/em>. Retrieved July 19, 1999 from\u00a0<a href=\"http:\/\/www.navtech.com\/\">http:\/\/www.navtech.com<\/a><\/p>\n<p>Rammage, S. and P. Woodsford (2002). The Benefits of Topoplogy in the Database. Retrieved October 6, 2010 from<a href=\"http:\/\/spatialnews.geocomm.com\/features\/laserscan2\/\">http:\/\/spatialnews.geocomm.com\/features\/laserscan2\/<\/a><\/p>\n<p>TeleAtlas (2006).\u00a0<em>Welcome to TeleAtlas<\/em>. Retrieved May 3, 2006 from<a href=\"http:\/\/www.teleatlas.com\/Pub\/Home\">http:\/\/www.teleatlas.com\/Pub\/Home<\/a>\u00a0(since retired).<\/p>\n<p>Theobald, D. M. (2001). Understanding Topology and Shapefiles.<em>ArcUser<\/em>\u00a0April-June 2001. Retrieved October 5, 2010 from<a href=\"http:\/\/www.esri.com\/news\/arcuser\/0401\/topo.html\">http:\/\/www.esri.com\/news\/arcuser\/0401\/topo.html<\/a><\/p>\n<p>U.S. Census Bureau (1997).\u00a0<em>TIGER\/Line Files (1997 Technical Documentation)<\/em>. Retrieved January 2, 1999 from<a title=\"http:\/\/www.census.gov\/geo\/tiger\/TIGER97C.pdf\" href=\"http:\/\/www.census.gov\/geo\/tiger\/TIGER97C.pdf\">http:\/\/www.census.gov\/geo\/tiger\/TIGER97C.pdf<\/a>\u00a0(since retired).<\/p>\n<p>U.S. Census Bureau (2003). TIGER\/Line Files, 2003 (metadata). Retrieved February 3, 2008 from<a href=\"http:\/\/www.census.gov\/geo\/www\/tlmetadata\/tl2003meta.txt\">http:\/\/www.census.gov\/geo\/www\/tlmetadata\/tl2003meta.txt<\/a><\/p>\n<p>U.S. Census Bureau (n. d.). 21st Century MAF\/TIGER Enhancements. Retrieved February 3, 2008 from<a title=\"http:\/\/www.census.gov\/geo\/mod\/overview.pdf\" href=\"http:\/\/www.census.gov\/geo\/mod\/overview.pdf\">http:\/\/www.census.gov\/geo\/mod\/overview.pdf<\/a>\u00a0(since retired).<\/p>\n<p>U.S. Census Bureau (2004). MAF\/TIGER Redesign Project Overview. Retrieved October 19, 2010 from<a title=\"http:\/\/www.census.gov\/geo\/mtep_obj2\/obj2_issuepaper12_2004.pdf\" href=\"http:\/\/www.census.gov\/geo\/mtep_obj2\/obj2_issuepaper12_2004.pdf\">http:\/\/www.census.gov\/geo\/mtep_obj2\/obj2_issuepaper12_2004.pdf<\/a>(since retired).<\/p>\n<p>U.S. Census Bureau (2005).\u00a0<em>Geography division map gallery.<\/em>\u00a0Retrieved July 19, 1999 from\u00a0<a href=\"http:\/\/www.census.gov\/geo\/www\/mapGallery\/\">http:\/\/www.census.gov\/geo\/www\/mapGallery\/<\/a><\/p>\n<p>U.S. Census Bureau (2012). TIGER\/Line Shapefiles Technical Documentation. Retrieved June, 2013 from of the<a href=\"http:\/\/www.census.gov\/geo\/maps-data\/data\/pdfs\/tiger\/tgrshp2012\/TGRSHP2012_TechDoc.pdf\">http:\/\/www.census.gov\/geo\/maps-data\/data\/pdfs\/tiger\/tgrshp2012\/TGRSHP2012_TechDoc.pdf<\/a><\/p>\n","protected":false},"author":1,"menu_order":1,"template":"","meta":{"pb_show_title":"on","pb_short_title":"","pb_subtitle":"","pb_authors":["david-dibiase"],"pb_section_license":""},"chapter-type":[],"contributor":[47],"license":[],"class_list":["post-274","chapter","type-chapter","status-publish","hentry","contributor-david-dibiase"],"part":82,"_links":{"self":[{"href":"https:\/\/opentextbc.ca\/natureofgeographicinformation\/wp-json\/pressbooks\/v2\/chapters\/274","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/opentextbc.ca\/natureofgeographicinformation\/wp-json\/pressbooks\/v2\/chapters"}],"about":[{"href":"https:\/\/opentextbc.ca\/natureofgeographicinformation\/wp-json\/wp\/v2\/types\/chapter"}],"author":[{"embeddable":true,"href":"https:\/\/opentextbc.ca\/natureofgeographicinformation\/wp-json\/wp\/v2\/users\/1"}],"version-history":[{"count":4,"href":"https:\/\/opentextbc.ca\/natureofgeographicinformation\/wp-json\/pressbooks\/v2\/chapters\/274\/revisions"}],"predecessor-version":[{"id":883,"href":"https:\/\/opentextbc.ca\/natureofgeographicinformation\/wp-json\/pressbooks\/v2\/chapters\/274\/revisions\/883"}],"part":[{"href":"https:\/\/opentextbc.ca\/natureofgeographicinformation\/wp-json\/pressbooks\/v2\/parts\/82"}],"metadata":[{"href":"https:\/\/opentextbc.ca\/natureofgeographicinformation\/wp-json\/pressbooks\/v2\/chapters\/274\/metadata\/"}],"wp:attachment":[{"href":"https:\/\/opentextbc.ca\/natureofgeographicinformation\/wp-json\/wp\/v2\/media?parent=274"}],"wp:term":[{"taxonomy":"chapter-type","embeddable":true,"href":"https:\/\/opentextbc.ca\/natureofgeographicinformation\/wp-json\/pressbooks\/v2\/chapter-type?post=274"},{"taxonomy":"contributor","embeddable":true,"href":"https:\/\/opentextbc.ca\/natureofgeographicinformation\/wp-json\/wp\/v2\/contributor?post=274"},{"taxonomy":"license","embeddable":true,"href":"https:\/\/opentextbc.ca\/natureofgeographicinformation\/wp-json\/wp\/v2\/license?post=274"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}