Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!
Tips and Traps¶
fit_transform
is used on training data whiletransform
is used on test data.
References¶
What and why behind fit_transform() and transform() in scikit-learn!
Tree-Based Models in scikit-learn
Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!
Comments¶
- If you want a simple decision tree,
DecisionTreeRegressor is prefer to DecisionTreeClassifier as DecisionTreeRegressor is more flexible.
As a matter of fact, ensemble tree-based models (no matter classifiers or regressors) are based on DecisionTreeRegressor instead of DecisionTreeClassifier .
Visualize Decision Trees¶
What is the meaning of "value" in a node in sklearn decisiontree plot_tree
Tips on the Mojo Programming Language
Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!
Check installed Python packages in Mojo notebook.
%%python
import os
os.system("pip list")
References
https://playground.modular.com/
Double Dipping in Machine Learning
Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!
Double dipping is a term for overfitting a model through both building and evaluating the model on the same data-set, yielding inappropriately high statistical significance and circular logic.
References
Tips on Large Language Models
Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!
https://github.com/jmorganca/ollama Get up and running with large language models locally
https://github.com/OpenBMB/ToolBench 🔨This project (ToolLLM) aims …
Tips on FeatureTools
Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!
Tool Review: Lessons learned from using FeatureTools to simplify the process of Feature Engineering