parquet

parquet is a column-oriented data storage format.

Created: by Pradeep Gowda Updated: Sep 16, 2023 Tagged: parquet

Parquet: more than just “Turbo CSV” a quick intro to why you should add parquet to your toolchain as a data engineer, via csvbase by cal peterson.

Chapter I: The birth of Parquet | The Sympathetic Ink Blog


manojkarthick/pqrs: Command line tool for inspecting Parquet files

brew install manojkarthick/tap/pqrs
# or if you have rust toolchain installed, as I do.
cargo install pqrs

multiprocessio/dsq: Commandline tool for running SQL queries against JSON, CSV, Excel, Parquet, and more.

maxcountryman/warc-parquet: 🗄️ A simple CLI for converting WARC to Parquet.

cargo install warc-parquet