Archived Intelligent Transportation System Data Quality: Preliminary Analyses of San Antonio TransGuide Data

Author(s):  
Shawn Turner ◽  
Luke Albert ◽  
Byron Gajewski ◽  
William Eisele

Described are three data quality attributes that are considered relevant to intelligent transportation system (ITS) data archiving: suspect or erroneous data, missing data, and data accuracy. Preliminary analyses of loop detector data from the TransGuide system in San Antonio were performed to identify the nature and extent of these data quality concerns in typical archived ITS data. The findings of the analyses indicated that missing data were inevitable, accounting for about one in five of all possible data records. Error detection rules were developed to screen for suspect or erroneous data, which accounted for only 1 percent of all possible data records. Baseline testing of TransGuide detector accuracy showed mixed results; one location collected traffic volumes within 5 percent of ground truth, whereas traffic volumes at another location ranged from 12 to 38 percent of ground truth. It was concluded that data quality procedures will be essential for realizing the full potential of archived ITS data.

1998 ◽  
Vol 1625 (1) ◽  
pp. 124-130 ◽  
Author(s):  
Robert E. Brydia ◽  
Shawn M. Turner ◽  
William L. Eisele ◽  
Jyh C. Liu

The intelligent transportation system (ITS) components deployed in U.S. urban areas produce vast amounts of data. These ITS data often are used for real-time operations and then are discarded. Few transportation management centers have any mechanism for sharing the data resources among other transportation groups or agencies within the same jurisdiction. Meanwhile, transportation analysts and researchers often struggle to obtain accurate, reliable data about existing transportation performance and patterns. The development of an ITS data management system (referred to as ITS DataLink) that is used to store, access, analyze, and present data from the TransGuide center in San Antonio, Texas, is presented. Data outputs are both tabular and graphical. No user costs are associated with the system except for an Internet connection.


Sign in / Sign up

Export Citation Format

Share Document