
Data | Apache Arrow

Apache Arrow [1] is a columnar in-memory format for tabular data that provides a fast, memory-efficient data processing model unified across multiple languages.

The main selling points are:

  1. The Arrow library provides fast (de)serialization of tabular data, similar to something like FST [2]. Beyond the advantages for storing and retrieving data, this makes it possible to work with datasets that are too large to fit into local memory [6].

  2. It is optimized for speed and memory usage for some common operations [4] [5].

  3. It has a common specification implemented across multiple languages, which makes it easy and fast to interoperate between any languages that implement it.

Resources:
