Ben Chuanlong Du's Blog

It is never too late to learn.

Data Frame Implementations in Rust

The content on this page consists of fragmentary and immature notes/thoughts of the author. Please read it with your own judgement!

Polars

Polars is a fast multi-threaded DataFrame library in Rust and Python.

datafusion

datafusion is an extensible query execution framework, written in Rust, that uses Apache Arrow as its in-memory format …

Hands on the Python module dask

Installation

  1. You have to install the complete version of Dask (using the command pip3 install "dask[complete]") if you need support for extended memory (for handling big data) and schedulers (for performance). The default installation (pip3 install dask) does not include those features out of the box.
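
A quick way to see which installation you ended up with is to check whether the optional packages are importable. The snippet below is only a sketch: the list of module names is an assumption about what the complete installation pulls in (it is not taken from the Dask documentation), and check_modules is a hypothetical helper name. It uses only the Python standard library, so it runs whether or not Dask is installed.

```python
import importlib.util

# Assumption: top-level modules expected to be present after installing
# dask[complete]. Adjust this tuple to match the features you rely on.
ASSUMED_MODULES = ("dask", "distributed", "numpy", "pandas", "bokeh")


def check_modules(names=ASSUMED_MODULES):
    """Map each module name to True if it can be found, without importing it."""
    return {name: importlib.util.find_spec(name) is not None for name in names}


if __name__ == "__main__":
    for name, found in check_modules().items():
        print(f"{name}: {'found' if found else 'missing'}")
```

If some entries are reported as missing after a plain pip3 install dask, that is consistent with the note above; reinstalling with the complete extra should be what fills them in.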