Ben Chuanlong Du's Blog

It is never too late to learn.

Public Cloud Offering GPU Support

Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!

https://lambdalabs.com/

https://www.floydhub.com/

https://colab.research.google.com/notebooks/intro.ipynb#recent=true

Amazon AWS

Below is a list of instances that have 1 GPU on Amazon …

Tips on AutoGluon

https://github.com/awslabs/autogluon

https://autogluon.mxnet.io/

AutoGluon automatically infers the problem type. However, you can still specify the problem type explicitly if AutoGluon fails to infer the …

Split a Dataset into Train and Test Datasets in Python

Scikit-learn Compatible Packages

sklearn.model_selection.train_test_split is the standard way to split a dataset into train and test subsets for scikit-learn compatible packages (scikit-learn, XGBoost, LightGBM, etc.). It supports splitting both sequence-like objects (numpy array, list, pandas Series) and pandas DataFrames. When splitting a sequence-like object, it returns (train, test), where train and test have the same type as the input (a list yields lists, a numpy array yields numpy arrays). When splitting a pandas DataFrame, it returns (train, test)
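
A minimal sketch of the splitting behavior described above. The toy data, the variable names, and the choice of test_size=0.2 and random_state=0 are assumptions made for illustration only; they do not come from the original post.

```python
import numpy as np
import pandas as pd
from sklearn.model_selection import train_test_split

# Toy data, made up purely for illustration.
values = list(range(10))
frame = pd.DataFrame({"x": range(10), "y": [0, 1] * 5})

# Splitting a plain list: both outputs are lists.
train_list, test_list = train_test_split(values, test_size=0.2, random_state=0)

# Splitting a numpy array: both outputs are numpy arrays.
train_arr, test_arr = train_test_split(
    np.array(values), test_size=0.2, random_state=0
)

# Splitting a DataFrame: both outputs are DataFrames that keep
# the original index labels of the selected rows.
train_df, test_df = train_test_split(frame, test_size=0.2, random_state=0)

print(type(train_list).__name__, len(train_list), len(test_list))
print(type(train_arr).__name__, train_arr.shape, test_arr.shape)
print(type(train_df).__name__, train_df.shape, test_df.shape)
```

With 10 rows and test_size=0.2, each call puts 8 items in the train part and 2 in the test part. Fixing random_state makes the shuffle reproducible, which is usually what you want when comparing models on the same split.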