Documentation Index
Fetch the complete documentation index at: https://mintlify.com/terrafloww/rasteret/llms.txt
Use this file to discover all available pages before exploring further.
Function Signature
Description
Wrap a read-ready Arrow object as a Collection. This is the lightweight re-entry path for workflows where you already have a table derived from an existing Collection and want to keep using Rasteret reads without re-running ingest/enrichment. Unlikebuild_from_table(), this function performs no COG enrichment, normalization, or persistence. It validates the read contract and wraps the provided Arrow object as-is.
Use build_from_table() for first-time external Parquet ingest.
Parameters
Arrow object to wrap.
pyarrow.dataset.Dataset is recommended for large collections to keep scans lazy. Despite the parameter name, both table and dataset inputs are first-class.Optional collection name.
Optional data source identifier. If omitted, Rasteret attempts to infer it from schema metadata or the
collection column.Optional collection description.
Optional temporal start to attach to the Collection object.
Optional temporal end to attach to the Collection object.
When
True (default), require at least one *_metadata column and validate those columns are struct-typed with required COG metadata fields.Returns
A wrapped Collection ready for
get_numpy(), get_xarray(), and to_torchgeo_dataset() when the necessary band metadata columns are present.Raises
- TypeError: If the input is not a
pyarrow.Tableorpyarrow.dataset.Dataset. - ValueError: If required columns are missing or band metadata is invalid.
- UserWarning: If a large in-memory
pyarrow.Tableis provided (>2 GiB or >40% of system RAM).
Usage Example
Performance Notes
Related Functions
- build_from_table() - Build from external Parquet with normalization/enrichment
- load() - Load a persisted Collection
- Collection - Collection class reference