Sample datasets
Several sample datasets are available for testing Whitebox Workflows for Python. The available datasets are described below:
Dataset Name | File Name | Description | Compressed Size |
---|---|---|---|
Guelph_landsat | band1.tif...band7.tif | 7 bands of a sub-area of a Landsat 5 data set | 10.9 MB |
Grand_Junction | DEM.tif | A small digital elevation model (DEM) of a high-relief area | 5.8 MB |
GTA_lidar | GTA_lidar.laz | An airborne lidar point cloud, in LAZ format | 54.3 MB |
jay_brook | jay_brook.laz | An airborne lidar point cloud, in LAZ format | 76.3 MB |
Jay_State_Forest | DEM.tif | A lidar raster digital elevation model (DEM) | 27.7 MB |
Kitchener_lidar | Kitchener_lidar.laz | An airborne lidar point cloud, in LAZ format | 41.6 MB |
London_air_photo | London_air_photo.tif | A high-resolution RGB air photo | 87.3 MB |
mill_brook | mill_brook.laz | An airborne lidar point cloud, in LAZ format | 49.9 MB |
peterborough_drumlins | peterborough_drumlins.tif | A lidar raster digital elevation model (DEM) | 22.0 MB |
Southern_Ontario_roads | roads_utm.shp | Vector roads layer for a section of Southern Ontario | 7.1 MB |
StElisAk | StElisAk.laz | An airborne lidar point cloud, in LAZ format | 54.5 MB |
The data can be downloaded using the `download_sample_data` function within the `whitebox_workflows` module. For example:
```python
import whitebox_workflows as wbw
from whitebox_workflows import WbEnvironment

wbe = WbEnvironment()
wbe.working_directory = wbw.download_sample_data('Kitchener_lidar')
print(f'Data have been stored in: {wbe.working_directory}')
```
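Once a download has finished, it can be handy to confirm which files actually arrived, for instance the seven band files of `Guelph_landsat`. The following is a minimal sketch that uses only the Python standard library. `list_dataset_files` is a hypothetical helper name, not part of `whitebox_workflows`, and it assumes the dataset directory is an ordinary folder on disk.

```python
from pathlib import Path


def list_dataset_files(dataset_dir, suffix=None):
    """Return the sorted names of the files in dataset_dir.

    Hypothetical helper (not part of whitebox_workflows). It only reads a
    local directory, such as the one assigned to working_directory.
    If suffix is given (e.g. '.tif'), only files with that extension are
    returned; the comparison ignores case.
    """
    files = [p for p in Path(dataset_dir).iterdir() if p.is_file()]
    if suffix is not None:
        files = [p for p in files if p.suffix.lower() == suffix.lower()]
    return sorted(p.name for p in files)
```

For example, `list_dataset_files(wbe.working_directory, '.laz')` would be expected to list the LAZ files in the current working directory.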
The data will be downloaded to a location within your HOME directory, and the `download_sample_data` function will return the path of the dataset directory. This can be useful for setting the `WbEnvironment.working_directory` property, as in the script above. The data are downloaded as a compressed (zip) file, which is automatically decompressed once the download has completed. The `download_sample_data` function will automatically terminate after ten minutes, so you may hit this limit if you attempt to download some of the larger sample datasets over a slow Internet connection.
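To judge in advance whether a dataset is likely to hit the ten-minute limit, the compressed sizes from the table can be compared with your connection speed. The sketch below is only a rough estimate. It assumes 1 MB equals 8 megabits, a steady transfer rate, and that the limit applies to the transfer of the compressed file. `estimated_download_seconds` and `fits_in_time_limit` are hypothetical helper names, not part of `whitebox_workflows`.

```python
TIME_LIMIT_SECONDS = 10 * 60  # the ten-minute limit of download_sample_data

# Compressed sizes in MB, copied from the table of sample datasets.
COMPRESSED_SIZE_MB = {
    "Guelph_landsat": 10.9,
    "Grand_Junction": 5.8,
    "GTA_lidar": 54.3,
    "jay_brook": 76.3,
    "Jay_State_Forest": 27.7,
    "Kitchener_lidar": 41.6,
    "London_air_photo": 87.3,
    "mill_brook": 49.9,
    "peterborough_drumlins": 22.0,
    "Southern_Ontario_roads": 7.1,
    "StElisAk": 54.5,
}


def estimated_download_seconds(dataset_name, speed_mbps):
    """Estimate the transfer time in seconds at speed_mbps (megabits per second).

    Assumption: 1 MB = 8 megabits and the rate stays constant.
    """
    if speed_mbps <= 0:
        raise ValueError("speed_mbps must be positive")
    size_megabits = COMPRESSED_SIZE_MB[dataset_name] * 8
    return size_megabits / speed_mbps


def fits_in_time_limit(dataset_name, speed_mbps, limit_seconds=TIME_LIMIT_SECONDS):
    """Return True if the estimated transfer time is within limit_seconds."""
    return estimated_download_seconds(dataset_name, speed_mbps) <= limit_seconds
```

Under these assumptions, `London_air_photo` (87.3 MB) would need roughly 698 seconds on a 1 Mbit/s connection, which exceeds the limit, but only about 349 seconds at 2 Mbit/s.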