Ben Chuanlong Du's Blog

It is never too late to learn.

Collection Functions in Spark

Tips and Traps

  1. If you use PySpark instead of Spark/Scala, pandas udf is a great alternative to all those (complicated) collections functions discussed here. Leveraging pandas udf, each partition of a Spark DataFrame can be converted to a pandas DataFrame without copying the underlying data, you can then do transforms on pandas DataFrames which will be converted back to partitons of a Spark DataFrame.

Slice in Rust

Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!

Comparison of Collections in C++ and Java

Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!

Plain Old Array

  1. The length/size of array is as in the declaration. Each element of the array is initialized to the default value (null for object).

  2. Array in Java does …