Performing Spatial Joins

Opozorilo

This tutorial is now obsolete. A new and updated version is available at Prostorsko združevanje podatkov (QGIS3)

Spatial Join is a classic GIS problem - transferring attributes from one layer to another based on their spatial relationship. In QGIS, this functionality is available through the Join Attributes by Location tool.

Overview of the task

We will use 2 layers - A shapefile of burough boundaries of New York city and another shapefile of nursing home locations in New York city. We will use spatial join technique to find the total nursing home capacity for each of the buroughs.

Other skills you will learn

  • Deleting columns from the attribute table of a layer.

Get the data

NYC Open Data Portal is an excellent source of free data for New York city.

  1. Download the Borough Boundaries zip file using the Export option on the portal.

../_images/1193.png
  1. Download the Nursing Homes zip file using the Export option on the portal.

../_images/2159.png

For convenience, you may directly download a copy of the datasets from the links below:

nybb_12c.zip

OEM_NursingHomes_001.zip

Procedure

  1. Go to Layer ‣ Add Vector Layer. Browse to the downloaded zip file nybb_12c.zip and select Open.

../_images/3100.png
  1. Select the layer nybb.shp and click OK.

../_images/462.png
  1. Repeat the steps 3 and 4 for the another file OEM_NursingHomes_001.zip and load the OEM_NursingHomes_001.shp layer. Once both the layers are loaded, right-click on the OEM_NursingHomes_001 layer and select Open Attribute Table.

../_images/556.png
  1. Examine the attributes available for each feature. Since our task is to calculate the total nursing home capacity for each borough, we can use the attribute Capacity which can join to the boundaries layer.

../_images/654.png
  1. Go to Vector ‣ Data Mangement Tools ‣ Join attributes by location.

../_images/753.png
  1. The Target vector layer is the one we want to add the attributes to. In our case, this will be the boroughs boundary nybb layer. The Join vector layer will be the nursing homes OEM_NursingHomes_001 layer. As we want to sum the capacity of nursing homes, select Take summary of intersecting features and choose Sum. Name the output file as nyc_borough_join.shp. In the Output table select Keep all records`.

../_images/851.png
  1. Once the process finishes, select Yes when asked if you want to add the layer to TOC. The new layer nyc_borough_join would have the features from nybb layer along with spatially joined attributes from OEM_NursingHomes_001 layer. Right-click on the layer and select Open Attribute Table.

../_images/950.png
  1. You will see a column SUMCapacit in the attribute table. This is the sum of the Capacity attribute for the nursing homes that fall within each borough feature.

../_images/1058.png
  1. This is the answer we are looking for. But there are extra columns that we do not need in our output. Let’s clean up our output. Click on the Toggle editing button and then the Delete column button.

../_images/1194.png
  1. Press Control-A to select all columns in the Delete Attributes dialog. Next hold the Control key and de-select the columns you want to keep. Click OK.

../_images/1260.png
  1. In the attribute table, click Toggle editing button again to save the changes.

../_images/1357.png
  1. Back in the QGIS Canvas, use the Identify tool to verify that the output file has the desired attributes for each burough feature.

../_images/1454.png

If you want to give feedback or share your experience with this tutorial, please comment below. (requires GitHub account)