RA2CE feature: Optimal routes between origins and destinations#
This notebook contains an example of the optimal routes origin-destination analysis of the RA2CE model. This analysis finds the shortest (distance-weighed) or quickest (time-weighed) route between all Origins and all Destinations input by the user.
This notebook should be used in combination with the predefined data. A user could create their own examples, using the RA2CE functionality of their interest. This requires the right folder setup, and the correct settings in the configuration files (network.ini and analyses.ini).
In this notebook, we will guide you through the origin-destination analysis as if it were used in an emergency response setting.
First of all, we will import the packages we need to execute the notebook:
[ ]:
import geopandas as gpd
import matplotlib.pyplot as plt
from pathlib import Path
import rasterio
Then, we have to set some basic variables. Keep in mind that you will have to specify the path to your local machine yourself. First, we will set the path to the ra2ce folder and we will initialize the names for the network and analysis ini files.
Afterwards, let’s find the examples folder on your machine.
[ ]:
from ra2ce.ra2ce_handler import Ra2ceHandler #import the ra2cehandler to run ra2ce analyses
Emergency response example#
This example aims to show how emergency responders could use RA2CE to respond to a flood event, in Beira. Emergency responders can for example be interested in which roads are flooded in the area. They might also be interested in gaining insights into the locations of the residential areas impacted by the event and accessible routes to hospitals or locations that will function as shelters during a part of the recovery phase.
With these RA2CE analyses, it is for example possible to create analyses like the one below in Sint Maarten:
Note that these images were generating using QGIS
Optimal routes origin-destination analysis without hazard#
Firstly, define the folder in which the data exist:
[ ]:
# specify the path to the folder holding the RA2CE folder and input data
root_dir = Path("data" ,"optimal_routes_analysis_without_hazard")
assert root_dir.exists()
Creating origins and destinations data#
For origins, we want to know how many people live where. To estimate this, we make use of the global WorldPop database. This can be used as origin points, after a manual raster to point conversion in QGIS (this was already done). Bear in mind that the origin data needs to be of a specific type (point shapefile/gpkg) and have a specific structure: it should hold an ID-column and there should be a field with the origin_counts (these can also be 0). The origin_counts attribute can hold any numeric value which will be used to count how many units (in this example, people) use a certain road segment to reach its destination, and per destination how many units can reach that destination.
The same goes for the destinations. The destinations also needs to be a point shapefile/gpkg and needs an ID-column. It does not require an origin count column, however, destinations need a category column in which you specify the category of a destination (e.g., hospital, supermarket). For this example, we have used OpenStreetMaps (OSM) to derive the destination data.
The CRS of both origin and destination files is required to be in WGS84 EPSG:4326.
After creating this data, the user needs to save the data in the static/network folder.
Below, we first introduce the origins generated from WorldPop. These hold locations of residents and the number of residents in each point (generated from a grid).
[ ]:
origins_inspection = root_dir.joinpath("static", "network", "origins.shp")
origins_gdf = gpd.read_file(origins_inspection, driver = "SHP")
origins_gdf.head()
origins_gdf.explore(column = "POPULATION", cmap = "viridis_r", tiles="CartoDB dark_matter")
In this example, we look at the accessibility from residential homes to educational institutes and hospitals during the flood. We have created a layer of these points as destinations using the openOSM plugin in QGIS. You can inspect these files in QGIS. We will also load them here:
[ ]:
destinations_inspection = root_dir.joinpath("static", "network", "destinations.shp")
destinations_gdf = gpd.read_file(destinations_inspection, driver = "SHP")
destinations_gdf.explore(column = "category", cmap = ['Blue', 'Green'], tiles="CartoDB dark_matter")
Check whether the necessary attributes and values are in your files:
Note: Make sure your geometry is point and not multipoint.
[ ]:
origins_gdf.head()
[ ]:
destinations_gdf.head()
Specifying the .ini files#
Now, to tell RA2CE how to use the origins and destinations, the user needs to specify the file names in the network.ini.
Notice how you can keep the [network] parameters the same as in the RA2CE Basics example. RA2CE will automatically use the graph you have already created. Or, if you want to create a new graph, it can run both processes consecutively.
Network.ini content > [project] name = beira [network] directed = False source = OSM download primary_file = None diversion_file = None file_id = rfid_c polygon = region_polygon.geojson network_type = drive road_types = motorway,motorway_link,primary,primary_link,secondary,secondary_link,tertiary,tertiary_link,residential save_gpkg = True [origins_destinations] origins = origins.shp destinations = destinations.shp origins_names = A destinations_names = B id_name_origin_destination = OBJECTID origin_count = POPULATION origin_out_fraction = 1 category = category [hazard] hazard_map = None hazard_id = None hazard_field_name = None aggregate_wl = None hazard_crs = None [cleanup] snapping_threshold = None segmentation_length = None merge_lines = True merge_on_id = False cut_at_intersections = False
We now need to update our analysis initialisation files using the preferred OD-analysis (there are multiple). For now we will consider the optimal_route_origin_closest_destination analysis. This analysis finds from each origin location, the closest destination per category. Another analysis that RA2CE offers is an OD analysis which tries to find routes from all origins to all destinations, but we won’t run that one in this example. All types of losses analyses can be found here.
Navigate to the folder on your local machine with which you want to perform the analysis, and change the analysis.ini accordingly. Weighing defines the criterium based on which the optimal routes will be defined. It accepts time as well.
Analysis.ini content > [project] name = beira [analysis1] name = optimal route origin destination analysis = optimal_route_origin_destination weighing = distance save_gpkg = True save_csv = True
Set the paths to the initialization files and check if the files exist.
[ ]:
_network_ini_name = "network.ini" #set the name for the network.ini
_analysis_ini_name = "analysis.ini" #set the name for the analysis.ini
network_ini = root_dir.joinpath(_network_ini_name)
assert network_ini.is_file()
analysis_ini = root_dir.joinpath(_analysis_ini_name)
assert analysis_ini.is_file()
Run RA2CE. Notice the information RA2CE gives you. Education locations are referred to as D1 (destination 1) and hospital locations as D2 (destination 2).
[ ]:
handler = Ra2ceHandler(network=network_ini, analysis=analysis_ini)
handler.configure()
handler.run_analysis()
Inspecting results#
Let’s do some output exploration!
[ ]:
analysis_output_path = root_dir.joinpath("output", "optimal_route_origin_destination")
optimal_routes_gdf = gpd.read_file(analysis_output_path.joinpath('optimal_route_origin_destination.gpkg'))
optimal_routes_gdf ['origin_list'] = optimal_routes_gdf.apply(lambda row: row['origin'].split(','), axis=1)
optimal_routes_gdf ['destination_list'] = optimal_routes_gdf.apply(lambda row: row['destination'].split(','), axis=1)
optimal_routes_gdf.head() #show the origins
[ ]:
origin_destination_nodes_gdf = gpd.read_file(root_dir.joinpath('static', 'output_graph', 'origins_destinations_graph_nodes.gpkg'))
origin_destination_nodes_gdf ['od_list'] = origin_destination_nodes_gdf.apply(lambda row: row['od_id'].split(',') if row['od_id'] != 'nan' else None, axis=1)
origin_destination_nodes_gdf.head()
If we want to visualize this in a way that shows the routes clearly, it requires some filtering. You for example need to filter the destination of interest. You can do this here or in a GIS software. The optimal routes are shown with a colour scheme which represents the distances of the optimal paths.
[ ]:
optimal_routes_b_4_without_hazard_gdf = optimal_routes_gdf[optimal_routes_gdf['destination_list'].apply(lambda x: 'B_4' in x)] # filter on destination B4
b_4_gdf = origin_destination_nodes_gdf[origin_destination_nodes_gdf['od_list'].apply(lambda x: 'B_4' in x if isinstance(x,list) else False)] # filter on destination B4
origins_with_optimal_route_b_4 = origin_destination_nodes_gdf[origin_destination_nodes_gdf['od_id'].isin(optimal_routes_b_4_without_hazard_gdf['origin'])] # both the origins and the destinations file hold destination information
optimal_routes_b_4_with_hazard_map = optimal_routes_b_4_without_hazard_gdf.explore(column='length', tiles="CartoDB dark_matter")
b_4_map = b_4_gdf.explore(m=optimal_routes_b_4_with_hazard_map, color='blue', marker_kwds={'radius':10}, tiles="CartoDB dark_matter")
origins_with_optimal_route_b_4.explore(m=b_4_map, color='green', marker_kwds={'radius':5}, tiles="CartoDB dark_matter")
Optimal routes origin-destination analysis with hazard#
Firstly, define the folder in which the data exist:
[ ]:
# specify the path to the folder holding the RA2CE folder and input data
root_dir = Path("data", "optimal_routes_analysis_with_hazard")
assert root_dir.exists()
Secondly, we need a flood map! We have an example flood map in our example folder. Let’s inspect it.
Note: the flood map needs to be in .tif format
[ ]:
hazard_folder = root_dir.joinpath("static", "hazard") # find the hazard folder where you locate your floo dmap
hazard_map = hazard_folder.joinpath("max_flood_depth.tif") # set the location of the hazard map
# Open the TIF file using rasterio
with rasterio.open(hazard_map) as src:
# Read the TIF file as a numpy array
tif_array = src.read(1) # Change the band index (1) if necessary
plt.figure(figsize=(10, 10))
plt.imshow(tif_array, cmap='Blues') # Change the colormap if desired
plt.colorbar(label='Pixel Values')
plt.title('Flood map')
plt.show()
To use the flood map with RA2CE, we need to fill in the [hazard] section in the network.ini.
Specify the flood map name in the hazard_map parameter in network.ini. RA2CE expects the flood map to be located in the hazard folder. The aggregate_wl parameter in analysis.ini can be set to either ‘max’, ‘min’ or ‘mean’ to take the maximum, minimum or mean water depth per road segment when the exposure of the roads to a certain hazard (map) is determined.
Set the right CRS for the flood map in the hazard_crs parameter. This CRS can be different from the origins, destinations and road network. RA2CE will reproject the network to the CRS of the flood map and will reproject the road back to the original CRS when the CRS differs.
You can run a RA2CE analysis with only the hazard overlay on the roads. The results can be found in the output_graph folder. The data with *’*_hazard’*, contains the result of the overlay with the hazard. Here, we will load the data from the example graph folder.
If you want to practice with this, create a new project folder where you specify the hazard map and perform the hazard analysis with RA2CE
Notice the column EV1_ma. This refers to the hazard. This column holds the water depth of the road segment. ‘EV1’ stands for ‘Event 1’ (you can run multiple flood maps, the column results will be called EV1, EV2, EV3, etc.). ‘_ma’ refers to maximum flood depth, which is the parameter specified in the network.ini. We always use the maximum water depth for the OD analysis because a vehicle can only use a road segment when it can drive through the largest water depth on that road segment.
Specifying the .ini files#
Now, to tell RA2CE how to use the origins and destinations, the user needs to specify the file names in the network.ini.
Notice how you can keep the [network] parameters the same as in the RA2CE Basics example. RA2CE will automatically use the graph you have already created. Or, if you want to create a new graph, it can run both processes consecutively.
Network.ini content > [project] name = beira [network] directed = False source = OSM download primary_file = None diversion_file = None file_id = rfid_c polygon = region_polygon.geojson network_type = drive road_types = motorway,motorway_link,primary,primary_link,secondary,secondary_link,tertiary,tertiary_link,residential save_gpkg = True [origins_destinations] origins = origins.shp destinations = destinations.shp origins_names = A destinations_names = B id_name_origin_destination = OBJECTID origin_count = POPULATION origin_out_fraction = 1 category = category [hazard] hazard_map = max_flood_depth.tif hazard_id = None hazard_field_name = waterdepth aggregate_wl = max hazard_crs = EPSG:32736 [cleanup] snapping_threshold = None segmentation_length = None merge_lines = True merge_on_id = False cut_at_intersections = False
We now need to update our analysis initialisation files using the preferred OD-analysis (there are multiple). For now we will consider the optimal_route_origin_closest_destination analysis. This analysis finds from each origin location, the closest accessible non-flooded destination per category. Another analysis that RA2CE offers is an OD analysis which tries to find routes from all origins to all destinations, but we won’t run that one in this example. All types of losses analyses can be found here.
Navigate to the folder on your local machine with which you want to perform the analysis, and change the analysis.ini accordingly.
With the aggregate_wl parameter, the user can choose which type of aggregation of the water level on the road segment (max, mean, min) the analysis should consider. The threshold is the hazard intensity which determines road disruption, in this case it is set to 0.5, which relates to 0.5 m of water depth on the road segment. Weighing defines the criterium based on which the optimal routes will be defined. It accepts time as well. The analysis considers distance as measure for finding the optimal routes. With the parameter calculate_route_without_disruption set to True, RA2CE will calculate the routes with hazard disruption and without. Finally, with the parameters save_gpkg and save_csv the user can choose to save or not save resulting output shapefiles/gpkg (we are in a transition from shp to gpkg)/csvs.
Analysis.ini content > [project] name = beira [analysis1] name = optimal route origin destination analysis = optimal_route_origin_destination aggregate_wl = max threshold = 0.5 weighing = distance calculate_route_without_disruption = True save_gpkg = True save_csv = True
Set the paths to the initialization files and check if the files exist.
[ ]:
_network_ini_name = "network.ini" #set the name for the network.ini
_analysis_ini_name = "analysis.ini" #set the name for the analysis.ini
network_ini = root_dir.joinpath(_network_ini_name)
assert network_ini.is_file()
analysis_ini = root_dir.joinpath(_analysis_ini_name)
assert analysis_ini.is_file()
Run RA2CE. Notice the information RA2CE gives you. Education locations are referred to as D1 (destination 1) and hospital locations as D2 (destination 2).
[ ]:
handler = Ra2ceHandler(network=network_ini, analysis=analysis_ini)
handler.configure()
handler.run_analysis()
Inspecting results#
Let’s do some output exploration!
[ ]:
analysis_output_path = root_dir.joinpath("output", "multi_link_origin_destination")
optimal_routes_gdf = gpd.read_file(analysis_output_path.joinpath('multi_link_origin_destination.gpkg'))
optimal_routes_gdf ['origin_list'] = optimal_routes_gdf.apply(lambda row: row['origin'].split(','), axis=1)
optimal_routes_gdf ['destination_list'] = optimal_routes_gdf.apply(lambda row: row['destination'].split(','), axis=1)
optimal_routes_gdf.head()
Notice the different columns. Especially the columns ‘EV1_ma_PD1’ and ‘EV1_ma_PD2’ are of interest. They refer to ‘EV1’ (event 1), maximum water depth (ma) and Destination 1 (education) or 2 (hospital).
Below, we visualise which origins have access to their closest destination, given the disruption of the road network because of the flood.
Note: below we visualize the access to education (D1).
[ ]:
origin_destination_nodes_gdf = gpd.read_file(root_dir.joinpath('static', 'output_graph', 'origins_destinations_graph_nodes.gpkg'))
origin_destination_nodes_gdf ['od_list'] = origin_destination_nodes_gdf.apply(lambda row: row['od_id'].split(',') if row['od_id'] != 'nan' else None, axis=1)
origin_destination_nodes_gdf.head()
If we want to visualize this in a way that shows the routes clearly, it requires some filtering. You for example need to filter the destination of interest. You can do this here or in a GIS software. The optimal routes are shown with a colour scheme which represents the distances of the optimal paths.
[ ]:
optimal_routes_b_4_with_hazard_gdf = optimal_routes_gdf[optimal_routes_gdf['destination_list'].apply(lambda x: 'B_4' in x)] # filter on destination B4
b_4_gdf = origin_destination_nodes_gdf[origin_destination_nodes_gdf['od_list'].apply(lambda x: 'B_4' in x if isinstance(x,list) else False)] # filter on destination B4
origins_with_optimal_route_b_4 = origin_destination_nodes_gdf[origin_destination_nodes_gdf['od_id'].isin(optimal_routes_b_4_with_hazard_gdf['origin'])] # both the origins and the destinations file hold destination information
optimal_routes_b_4_with_hazard_map = optimal_routes_b_4_with_hazard_gdf.explore(column='length', tiles="CartoDB dark_matter")
b_4_map = b_4_gdf.explore(m=optimal_routes_b_4_with_hazard_map, color='blue', marker_kwds={'radius':10}, tiles="CartoDB dark_matter")
origins_with_optimal_route_b_4.explore(m=b_4_map, color='green', marker_kwds={'radius':5}, tiles="CartoDB dark_matter")
Now lets inspect the flooded roads (water depth bigger than 0.5 m). The colder the link colors, the deeper the water-level on the roads.
[ ]:
edges_gdf = gpd.read_file(root_dir.joinpath('static', 'output_graph', 'base_graph_hazard_edges.gpkg'))
impacted_edges_gdf = edges_gdf[edges_gdf['EV1_ma'] > 0.5]
impacted_edges_gdf.explore(column ='EV1_ma', tiles="CartoDB dark_matter")