The following are the basic steps for setting up DuckDB and its spatial extension, PostGEESE. This notebook can be used as a starting point for building a poor man's data lake (or, to be precise, a data 'pond') for geospatial data science analytics using DuckDB's latest Spatial Extension.
Credit to:
- Mark Litwintschik, who wrote one of the very first articles introducing the wonders of DuckDB to the geospatial community
- Mark Forrest for the inspiration.
- Wei-Meng Lee for the examples on JupySQL
- Download the data from here and convert the CSV file to GeoJSON using QGIS. Name it london-trees.geojson. We can also use Parquet, which is smaller in size. The spatial extension can also read CSVs using its GDAL driver (see the docs for st_read() here). Change the code accordingly.
- Put the file in the same folder as this notebook
- Fire up your Jupyter Notebook or upload to Google Colab
- Have fun!
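
The setup steps above can be sketched in Python roughly as follows. This is a minimal sketch, not the notebook's actual code: it assumes the `duckdb` package is installed (for example via `pip install duckdb`), that the machine can download the extension, and that london-trees.geojson sits in the working directory. The helper names (`spatial_setup_sql`, `read_query`, `load_trees`) and the table name `london_trees` are hypothetical and chosen only for illustration.

```python
def spatial_setup_sql() -> list[str]:
    """SQL statements that install and load DuckDB's spatial extension."""
    return ["INSTALL spatial;", "LOAD spatial;"]


def read_query(path: str, table: str = "london_trees") -> str:
    """Build a statement that loads a vector file into a table via ST_Read.

    The table name is hypothetical; ST_Read is the spatial extension's
    GDAL-backed reader mentioned in the steps above.
    """
    if not table.isidentifier():
        raise ValueError(f"not a valid table name: {table!r}")
    safe_path = path.replace("'", "''")  # escape single quotes for SQL
    return (
        f"CREATE OR REPLACE TABLE {table} AS "
        f"SELECT * FROM ST_Read('{safe_path}');"
    )


def load_trees(path: str = "london-trees.geojson") -> int:
    """Open an in-memory database, load the file, and return the row count."""
    import duckdb  # imported here so the helpers above work without it

    con = duckdb.connect()  # in-memory database
    for statement in spatial_setup_sql():
        con.execute(statement)
    con.execute(read_query(path))
    return con.execute("SELECT count(*) FROM london_trees").fetchone()[0]
```

Calling `load_trees()` from a notebook cell should return the number of features read from the file. If you use a Parquet or CSV file instead, the query built by `read_query` would need to change accordingly.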