Ben Chuanlong Du's Blog

It is never too late to learn.

Tips on AutoGluon

Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!

https://github.com/awslabs/autogluon

https://autogluon.mxnet.io/

AutoGluon automatically inferences the problem type. However, you are still able to specify the probelm type if AutoGluon fails to infer the …

Split a Dataset into Train and Test Datasets in Python

Scikit-learn Compatible Packages

sklearn.model_selection.train_test_split is the best way to split a dataset into train and test subset for scikit-learn compatible packages (scikit-learn, XGBoost, LightGBM, etc.). It supports splitting both iterable objects (numpy array, list, pandas Series) and pandas DataFrames. When splitting an iterable object, it returns (train, test) where train and test are lists. When splitting a pandas DataFrame, it returns (train, test)

Tips on LSTM

Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!

http://colah.github.io/posts/2015-08-Understanding-LSTMs/