spatial_extrapolation

Aug 29, 2017

685115c · Aug 29, 2017


Spatial Extrapolation

Purpose

To fill in gaps in the City of Toronto's count program in order to provide a complete picture of volumes across the entire city.

Methodology

Several methods were tested for this purpose; they are detailed below.

Average of Nearest Neighbours (confidence code = 3)
Nearest neighbours: up to 5 segments of the same road class within 300 m of the target segment (measured as the nearest point-to-point distance)
Linear Regression (Based on Proximity only) (confidence code = 4)
Use the volumes of the 5 nearest segments of the same road class (ordered by proximity) as the independent variables; the volume of the target segment is the dependent variable.
Linear Regression (Directional) (confidence code = 2)
Use the volumes of the 2 nearest parallel segments and the 2 nearest perpendicular segments as the independent variables; the volume of the target segment is the dependent variable.
Kriging
Implemented using the Gaussian Process model from scikit-learn
Input: (4-dimensional) from_x, from_y, to_x, to_y (coordinate information from the start and end of the segment)
Output: volume
Covariance matrix is constructed based on the coordinate information of the segments in order to find the spatial correlation of volumes.
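As a rough illustration of the Average of Nearest Neighbours method, the sketch below averages the volumes of up to 5 counted segments of the same road class within 300 m. It is not taken from spatial_extrapolation.py or the SQL queries: the `Segment` record, the `average_of_neighbours` function, and the use of a single representative point per segment (rather than the nearest point-to-point distance between geometries) are assumptions made for illustration.

```python
import math
from typing import NamedTuple, Optional


class Segment(NamedTuple):
    # Hypothetical record layout; the real tables may differ.
    seg_id: int
    road_class: str
    x: float  # representative point in metres (projected coordinates assumed)
    y: float
    volume: Optional[float]  # None when the segment has no count


def average_of_neighbours(target, segments, k=5, max_dist=300.0):
    """Mean volume of up to k counted same-class segments within max_dist metres."""
    candidates = []
    for s in segments:
        # Skip the target itself and segments without a count.
        if s.seg_id == target.seg_id or s.volume is None:
            continue
        if s.road_class != target.road_class:
            continue
        d = math.hypot(s.x - target.x, s.y - target.y)
        if d <= max_dist:
            candidates.append((d, s.volume))
    if not candidates:
        return None  # no usable neighbour, so no estimate
    candidates.sort(key=lambda pair: pair[0])  # closest first
    nearest = candidates[:k]
    return sum(v for _, v in nearest) / len(nearest)


# Toy data, invented for the example.
segments = [
    Segment(1, "Collector", 0.0, 0.0, None),       # target, uncounted
    Segment(2, "Collector", 100.0, 0.0, 1000.0),
    Segment(3, "Collector", 0.0, 200.0, 2000.0),
    Segment(4, "Collector", 400.0, 0.0, 9000.0),   # farther than 300 m
    Segment(5, "Local", 50.0, 0.0, 300.0),         # different road class
]
estimate = average_of_neighbours(segments[0], segments)
```

Only segments 2 and 3 qualify here, so the estimate is the mean of their two volumes. Returning `None` when no neighbour qualifies is one possible design choice; the source does not say how such segments are handled.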

Methodology Evaluation

Regression

Major Arterials

Metric	Linear Regression (proximity only)	Directional Linear Regression	Average of Nearest Neighbours
Scatter plot	(image)	(image)	(image)
Root Mean Squared Error	4374	4232	4554
Coefficient of Determination	0.480	0.542	0.492

Minor Arterials

Metric	Linear Regression (proximity only)	Directional Linear Regression	Average of Nearest Neighbours
Scatter plot	(image)	(image)	(image)
Root Mean Squared Error	2285	2143	2067
Coefficient of Determination	0.345	0.461	0.341

Collectors

Metric	Linear Regression (proximity only)	Directional Linear Regression	Average of Nearest Neighbours
Scatter plot	(image)	(image)	(image)
Root Mean Squared Error	1349	1263	1233
Coefficient of Determination	0.312	0.268	0.364

Locals

Metric	Linear Regression (proximity only)	Directional Linear Regression	Average of Nearest Neighbours
Scatter plot	(image)	(image)	(image)
Root Mean Squared Error	736	732	718
Coefficient of Determination	0.230	0.046	0.213
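For reference, the two metrics reported in the tables above can be computed as in the sketch below. It uses the standard definitions of root mean squared error and coefficient of determination; the source does not state which implementation was used, and the function names and sample values here are assumptions for illustration only.

```python
import math


def rmse(observed, predicted):
    """Root mean squared error between observed and predicted volumes."""
    n = len(observed)
    return math.sqrt(sum((o - p) ** 2 for o, p in zip(observed, predicted)) / n)


def r_squared(observed, predicted):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_obs = sum(observed) / len(observed)
    ss_res = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    ss_tot = sum((o - mean_obs) ** 2 for o in observed)
    return 1.0 - ss_res / ss_tot


# Invented sample values, not data from the count program.
observed = [1000.0, 2000.0, 3000.0, 4000.0]
predicted = [1100.0, 1900.0, 3200.0, 3800.0]

error = rmse(observed, predicted)
fit = r_squared(observed, predicted)
```

A lower RMSE and a higher coefficient of determination both indicate a better fit, which is why the two metrics can disagree on the best method for a given road class.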

Directional Regression Coefficients

Road Class	Perpendicular Segment Coefficients	Parallel Segment Coefficients
Major Arterials	0.0077 -0.0013	0.4404 0.4340
Minor Arterials	-0.0132 0.0429	0.4129 0.2954
Collectors	0.0104 0.0249	0.3937 0.1681
Locals	0.0037 0.0129	0.1779 0.2441

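The coefficients indicate a strong correlation between a segment and its parallel (upstream and downstream) segments, and a weak, if any, relationship with perpendicular segments. Moving from major arterials to locals, the relationship becomes less clear: the coefficient of determination of the directional regression falls from 0.542 to 0.046.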
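To make the size of these coefficients concrete, the sketch below applies the major arterial coefficients from the table to one hypothetical segment. Several things are assumed: the README does not report an intercept, so `intercept` is a placeholder defaulting to zero; the order of the two values in each cell is not stated, so the pairing of coefficients with neighbours is arbitrary; and the neighbour volumes are invented.

```python
# Major arterial coefficients copied from the table above.
PERPENDICULAR_COEFS = (0.0077, -0.0013)
PARALLEL_COEFS = (0.4404, 0.4340)


def directional_estimate(perp_volumes, par_volumes, intercept=0.0):
    """Linear combination of neighbour volumes.

    intercept is a placeholder: the source does not report one.
    """
    perp_part = sum(c * v for c, v in zip(PERPENDICULAR_COEFS, perp_volumes))
    par_part = sum(c * v for c, v in zip(PARALLEL_COEFS, par_volumes))
    return intercept + perp_part + par_part


# Invented neighbour volumes for illustration.
perp = (10000.0, 12000.0)
par = (20000.0, 22000.0)

estimate = directional_estimate(perp, par)
perp_share = sum(c * v for c, v in zip(PERPENDICULAR_COEFS, perp)) / estimate
```

With these inputs the perpendicular segments contribute well under 1% of the estimate, which matches the reading that the parallel segments carry almost all of the signal.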

Kriging

Road Class	Semivariogram
Major Arterial	(image)
Minor Arterial	(image)
Collector	(image)

The relationship between distance and volume is weak, and the semivariance does not fit any variogram model well. A Gaussian Process kriging model was nevertheless fitted to each road class, and its results were inferior to those of regression. Kriging is therefore not used in the actual implementation.
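For readers unfamiliar with semivariograms, the sketch below computes an empirical semivariogram with the classical estimator: for each distance bin, half the mean squared difference in volume over all pairs of points whose separation falls in that bin. It is not the logic of query_semi_variogram.sql; the function name, the `(x, y, volume)` input layout, the fixed-width binning, and the sample points are all assumptions.

```python
import math
from collections import defaultdict


def empirical_semivariogram(points, bin_width):
    """Classical semivariance estimate per distance bin.

    points: list of (x, y, volume) tuples (hypothetical layout).
    Returns {bin_start_distance: semivariance}.
    """
    sums = defaultdict(float)
    counts = defaultdict(int)
    for i in range(len(points)):
        xi, yi, zi = points[i]
        for j in range(i + 1, len(points)):
            xj, yj, zj = points[j]
            h = math.hypot(xi - xj, yi - yj)  # separation distance
            b = int(h // bin_width)           # index of the distance bin
            sums[b] += (zi - zj) ** 2
            counts[b] += 1
    # gamma(h) = sum of squared differences / (2 * number of pairs)
    return {b * bin_width: sums[b] / (2 * counts[b]) for b in sorted(counts)}


# Invented sample: three points along a line, 100 m apart.
points = [(0.0, 0.0, 10.0), (100.0, 0.0, 12.0), (200.0, 0.0, 18.0)]
gamma = empirical_semivariogram(points, bin_width=150)
```

When volumes are spatially correlated, semivariance rises with distance and then levels off. A flat or noisy curve, as described above, suggests that distance alone explains little of the variation in volume.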

Implementation

Road Class	Method
Major Arterials	Directional Linear Regression
Minor Arterials	Directional Linear Regression
Collectors	Average of Nearest Neighbours
Locals	Average of Nearest Neighbours

Note that expressways are not included. However, there are uncounted expressways that need to be included in the future.
