akimbo.read_parquet

akimbo.read_parquet(url: str, storage_options: dict | None = None, extract: bool = True, backend: str = 'pandas', **kwargs)

Read a Parquet dataset with nested data into a Series or DataFrame.

This can handle some deeply nested structures that pandas cannot read on its own.

You can pass a selection of columns to read as a list of strings; columns outside the selection are not parsed into memory. Each label may name the root of a more deeply nested struct, or use "*" globbing.
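As a minimal sketch of column selection: the signature above has no explicit columns argument, so this assumes a `columns` keyword is accepted through `**kwargs` and forwarded to the underlying Parquet reader. The helper name `read_selected` and the field names `event_id` and `muons` are hypothetical.

```python
def read_selected(url, columns):
    """Read only the requested columns from a Parquet dataset.

    Assumption: ``columns`` is accepted via ``**kwargs`` of
    ``akimbo.read_parquet``; it is not named in the documented signature.
    """
    # Imported inside the function so the helper can be defined
    # even where akimbo is not installed.
    import akimbo

    return akimbo.read_parquet(url, columns=columns)


# Hypothetical selection: one plain column, plus a glob intended to
# match every field nested under a struct column called "muons".
EXAMPLE_COLUMNS = ["event_id", "muons.*"]
```

A call such as `read_selected("events.parquet", EXAMPLE_COLUMNS)` (hypothetical path) would then leave the remaining columns unparsed, if the assumption about `columns` holds.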

Parameters:
  • url (str) – Data location: a directory of data files, a single file, or a glob pattern.

  • storage_options (dict | None) – Any arguments for an fsspec backend.

  • extract (bool) – Whether to turn top-level records into a DataFrame. If False, a Series is returned.

  • backend (str) – One of "pandas", "polars" or "dask".
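The parameters above can be combined as in the following sketch, which keeps the records as a single Series by passing `extract=False`. The wrapper `read_as_series` and its up-front backend check are illustrative and not part of akimbo; the URL and storage options in the closing comment are hypothetical.

```python
# Backend names as listed in the parameter documentation.
VALID_BACKENDS = ("pandas", "polars", "dask")


def read_as_series(url, backend="pandas", storage_options=None):
    """Read a nested Parquet dataset and return it as one Series.

    Illustrative wrapper only: it rejects unknown backend names early,
    then delegates to ``akimbo.read_parquet`` with ``extract=False``.
    """
    if backend not in VALID_BACKENDS:
        raise ValueError(
            f"backend must be one of {VALID_BACKENDS}, got {backend!r}"
        )

    # Imported lazily so the check above runs without akimbo installed.
    import akimbo

    return akimbo.read_parquet(
        url,
        storage_options=storage_options,  # handed to the fsspec backend
        extract=False,                    # keep top-level records in one Series
        backend=backend,
    )


# Hypothetical remote read; "anon" here is an option of the s3 filesystem:
# series = read_as_series("s3://my-bucket/data/*.parquet",
#                         storage_options={"anon": True})
```

Leaving `extract` at its default of True would instead return a DataFrame built from the top-level records.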