Ben Chuanlong Du's Blog

It is never too late to learn.

Use the ulimit Command to Control the Stack Size

Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!

http://en.tldp.org/LDP/solrhe/Securing-Optimizing-Linux-RH-Edition-v1.3/x4733.html

https://stackoverflow.com/questions/14471564/what-does-ulimit-s-unlimited-do

https://access.redhat.com/solutions/384633

Hands on pandas.Series in Python

pandas.Series.str

  1. The pandas.Series.str accessor can only be used on a Series of str values. If you invoke it on a Series of non-string values, you will either get an AttributeError (Can only use .str accessor with string values, which use np.object_ dtype in pandas) or a Series of NaNs. If you have control of the DataFrame, the preferred way is to cast the column to str
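A minimal sketch of the behaviour described above, assuming pandas is installed. The Series contents and variable names are made up for illustration and are not from the original post.

```python
import pandas as pd

# A purely numeric Series: the .str accessor is rejected outright.
nums = pd.Series([1, 2, 3])
try:
    nums.str.upper()
    raised = False
except AttributeError:
    raised = True

# A mixed-type (object) Series: .str works, but non-string entries become NaN.
mixed = pd.Series(["a1", 2, 3.5])
n_missing = int(mixed.str.upper().isna().sum())

# Casting to str first makes every element a string, so nothing is lost.
upper = mixed.astype(str).str.upper()
```

Casting with astype(str) turns every value into its string representation, so it is only appropriate when that representation is what you actually want to process.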

Split a Dataset into Train and Test Datasets in Python

Scikit-learn Compatible Packages

sklearn.model_selection.train_test_split is the best way to split a dataset into train and test subsets for scikit-learn compatible packages (scikit-learn, XGBoost, LightGBM, etc.). It supports splitting both array-like objects (numpy array, list, pandas Series) and pandas DataFrames. When splitting an array-like object, it returns (train, test), where train and test have the same type as the input (a list gives lists, a numpy array gives numpy arrays). When splitting a pandas DataFrame, it returns (train, test)
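As a rough illustration of the call described above, the sketch below splits a small DataFrame and a plain list. The data, the 80/20 ratio, and the random_state value are arbitrary assumptions chosen for the example.

```python
import pandas as pd
from sklearn.model_selection import train_test_split

# Hypothetical toy data: 10 rows with one feature and one label.
df = pd.DataFrame({"x": range(10), "y": [0, 1] * 5})

# Splitting a DataFrame yields two DataFrames (8 rows and 2 rows here).
train_df, test_df = train_test_split(df, test_size=0.2, random_state=42)

# Splitting a plain list yields two lists.
values = list(range(10))
train_vals, test_vals = train_test_split(values, test_size=0.2, random_state=42)
```

Fixing random_state makes the shuffle reproducible, which is convenient when comparing models on the same split.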

Tips on Darknet and Yolo

https://pjreddie.com/darknet/tiny-darknet/

https://pjreddie.com/darknet/

Build Spark from Source

You can download a prebuilt Spark binary at https://spark.apache.org/downloads.html. This is where you should get started, and it will likely satisfy your needs most of the time …