Hi Michael,

This error comes very deep inside libparquet, actually in libthrift. The more natural assumption would be that it would be due to a corrupted Parquet file. If you disabled multithreading (GDAL_NUM_THREADS=1) and enabled --debug on, perhaps this would happen on the same file ? I was wondering also if that couldn't come from RAM exhaustion, but that doesn't seem likely (did you monitor the RAM consumption?). It would be interesting to see if that would also occur after fetching locally the files under overturemaps-us-west-2/release/2024-07-22.0/theme=buildings and running the conversion from the local files

Even

Le 30/07/2024 à 20:37, Michael Smith via gdal-dev a écrit :

Hi all,

I was converting the overture maps parquet data to geopackage and got this error using gdal master (via conda).

GDAL_NUM_THREADS=ALL_CPUS CPL_TMPDIR=/data2 ogr2ogr -f gpkg /data/overture_buildings.gpkg /vsis3/overturemaps-us-west-2/release/2024-07-22.0/theme=buildings "theme=buildings"  -progress  -nlt PROMOTE_TO_MULTI -nln buildings -skipfailures

ERROR 1: ReadNext() failed: Couldn't deserialize thrift: TProtocolException: Exceeded size limit

Not sure what size limit it exceeded? I’ve had this happen several times but at very different points in the process.

Ideas?

Mike

--

Michael Smith

US Army Corps of Engineers

Remote Sensing/GIS Center


_______________________________________________
gdal-dev mailing list
gdal-dev@lists.osgeo.org
https://lists.osgeo.org/mailman/listinfo/gdal-dev

--
http://www.spatialys.com
My software is free, but my time generally not.
_______________________________________________
gdal-dev mailing list
gdal-dev@lists.osgeo.org
https://lists.osgeo.org/mailman/listinfo/gdal-dev

Reply via email to