Ben Chuanlong Du's Blog

It is never too late to learn.

Run Commands on Remote Machines

Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!

On a Single Machine

SSH

  1. The pipeline command is run locally. If you want the pipeline command to run remotely, place the whole command to be run remotely in double/single …
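The quoting rule above can be sketched in Python. This is only an illustration under stated assumptions: the host `user@example.com`, the pipeline `ls /tmp | wc -l`, and the helper names `build_ssh_argv` and `build_ssh_shell_line` are all hypothetical and do not come from the original post. The idea is that the whole remote pipeline travels as one quoted argument, so the local shell never interprets the `|`.

```python
import shlex


def build_ssh_argv(host, remote_command):
    """Return an argv list (hypothetical helper) for running remote_command on host.

    The pipeline is a single list element, so no local shell ever splits it
    at the "|"; ssh hands the string to the remote shell.
    """
    return ["ssh", host, remote_command]


def build_ssh_shell_line(host, remote_command):
    """Return the equivalent line to type in a local shell (hypothetical helper).

    shlex.quote wraps the pipeline in single quotes so the local shell
    passes it to ssh untouched.
    """
    return f"ssh {shlex.quote(host)} {shlex.quote(remote_command)}"


# Hypothetical host and pipeline, for illustration only.
host = "user@example.com"
remote_pipeline = "ls /tmp | wc -l"

print(build_ssh_shell_line(host, remote_pipeline))
# To actually run it (requires a reachable host and the ssh client):
# import subprocess
# subprocess.run(build_ssh_argv(host, remote_pipeline), check=True)
```

Without the quotes, `ssh user@example.com ls /tmp | wc -l` would run only `ls /tmp` remotely and pipe its output into a local `wc -l`.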

Tips on Apache Arrow

Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!

[Feather vs Parquet](https://github.com/wesm/feather/issues/188)

References

https://github.com/wesm/feather

Hands on the json Module in Python

Tips and Traps

  1. It is suggested that you avoid using JSON for serializing and deserializing data. Please refer to Shortcomings of JSON for a detailed discussion of this. TOML and YAML are better text-based alternatives to JSON. If serialization and deserialization are done in Python only, pickle
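As a small illustration of the kind of problem the tip may have in mind (an assumption, since the linked discussion is not reproduced here), the sketch below round-trips a made-up `record` through both `json` and `pickle`. JSON silently turns the integer key into a string and the tuple into a list, while pickle returns an equal Python object.

```python
import json
import pickle

# A made-up record with an integer key and a tuple value.
record = {1: (1, 2), "when": None}

# JSON object keys must be strings and JSON has no tuple type,
# so both are changed on the way through.
via_json = json.loads(json.dumps(record))

# pickle is Python-specific and preserves the key and value types.
via_pickle = pickle.loads(pickle.dumps(record))

print(via_json)              # {'1': [1, 2], 'when': None}
print(via_pickle == record)  # True
```

Note that `pickle.loads` can execute arbitrary code, so it should only be used on data from a trusted source.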