OGC GeoParquet best practices

The OGC offers a best practices guide for distributing #GeoParquet files, covering compression, spatial indexing and ordering, row group sizes, partitioning, and metadata. The guide also lists concrete commands for common tools so you can produce optimally distributed files right away.
Author
Published

May 22, 2026

As part of its GeoParquet format repository, the OGC1 has published a guide to best practices for distributing data in GeoParquet forma. The goal behind the guide is offering more clarity how to optimally distribute GeoParquet files, given the many options of various libraries and tools working with Parquet files (that may not be ideal for GeoParquet handling).

Optimised ordering of rows in a GeoParquet file: The file loads globally and keeps loading until all data is fetched. (source: OGC)

Non-optimal ordering of rows in a GeoParquet file (source: OGC)

The guide covers topics such as compression, spatial indexing and ordering, row group size, partitioning, and metadata. It also briefly lists commands for common tools, both explaining what the default options will do as well as listing a commands that will yield GeoParquet files that follow the guide’s recommendations.

If this is interesting to you, see also geoparquet-io and the GeoStandards.ch guide on COGs.

Footnotes

  1. Open Geospatial Consortium.↩︎