Ben Chuanlong Du's Blog

It is never too late to learn.

Tree-Based Models in scikit-learn

Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!

Comments

  1. If you want a simple decision tree,

DecisionTreeRegressor is prefer to DecisionTreeClassifier as DecisionTreeRegressor is more flexible.

As a matter of fact, ensemble tree-based models (no matter classifiers or regressors) are based on DecisionTreeRegressor instead of DecisionTreeClassifier .

Tips on the Mojo Programming Language

Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!

Check installed Python packages in Mojo notebook.

%%python
import os
os.system("pip list")

References

https://playground.modular.com/

Double Dipping in Machine Learning

Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!

Double dipping is a term for overfitting a model through both building and evaluating the model on the same data-set, yielding inappropriately high statistical significance and circular logic.

References

Preparing Data for AI

Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!

General Tips

  1. When you label individual images, it is better to use numerical labels (even though text labels are easier to understand) so that you can avoid mapping between numbers (use …