akimbo.read_json

akimbo.read_json(url: str, storage_options: dict | None = None, schema: dict | None = None, extract: bool = True, backend: str = 'pandas', **kwargs)[source]

Read a JSON dataset with nested data into a Series or DataFrame.

You can pass a selection of columns to read (list or jsonschema format), using schema=, and other columns will not be parsed into memory. See the docs for ak.from_json for further details.

(examples to come)

Parameters:
  • url (data location (may include glob characters))

  • storage_options (any arguments for an fsspec backend)

  • schema (if given, the JSONschema expected in the data; this allows for) – selecting only some part of the record structure, this saving on some parsing time and potentially a lot of memory footprint. Even if reading all the data, providing a schema will lead to better performance.

  • extract (whether to turn top-level records into a dataframe. If False,) – will return a series.

  • backend (one of "pandas", "polars" or "dask")