Ben Chuanlong Du's Blog

It is never too late to learn.

Docker APIs

Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!

Python

docker-py

docker-py is a Python library for the Docker Engine API. It lets you do anything the docker command does, but from within Python apps – run containers, manage containers, manage …

Work With Multiple Spark Installations

Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!

spark-submit and spark-shell

Overwrite the PATH environment variable before invoking spark-submit and/or spark-shell often resolves the issue.

Spark in Jupyter/Lab Notebooks

Remove or reset the environment variable HADOOP_CONF_DIR resolves …