Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!
Tips and Traps¶
fit_transform
is used on training data whiletransform
is used on test data.
References¶
What and why behind fit_transform() and transform() in scikit-learn!
Categorical Data in Pandas
Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!
Tree-Based Models in scikit-learn
Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!
Comments¶
- If you want a simple decision tree,
DecisionTreeRegressor is prefer to DecisionTreeClassifier as DecisionTreeRegressor is more flexible.
As a matter of fact, ensemble tree-based models (no matter classifiers or regressors) are based on DecisionTreeRegressor instead of DecisionTreeClassifier .
Visualize Decision Trees¶
What is the meaning of "value" in a node in sklearn decisiontree plot_tree
Tips on the Mojo Programming Language
Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!
Check installed Python packages in Mojo notebook.
%%python
import os
os.system("pip list")
References
https://playground.modular.com/
Get Location of Max Value in a Pandas Series
Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!
idxmax Return the row label of the maximum value.
argmax: Return int position of the largest value in the Series.
Ruff Is An Extremely Fast Python Linter Written in Rust
Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!
https://github.com/pylint-dev/pylint/blob/47cb11f4cb01a61f83d915d88e828f103a479980/pyproject.toml