Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!
The picture comes from Machine Learning Algorithms Mindmap.
Feature Engineering
Handling Categorical Variables in Machine Learning
Regularization in Machine Learning Models
Ensemble
Frameworks
Libraries for Gradient Boosting
Big-data (Spark) Friendly Frameworks
https://mmlspark.blob.core.windows.net/website/index.html
AutoML
Questions
Random Forest
- Are discrete variables easier to handle than continuous variables in a random forest? Is there any advantage to discretizing variables? The essential question is how categorical variables are handled in RF: does RF use categorical variables directly, or does it have to convert them to numerical values somehow? (See the first sketch after this list.)
- Random forest has a way to impute missing values. What if I treat missing values in categorical predictors as a new class? It sounds like a good ... (see the second sketch after this list).
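On the first question: scikit-learn's RandomForestClassifier cannot split on string categories directly, so categorical predictors have to be encoded as numbers first (ordinal codes or one-hot dummies), whereas implementations such as R's randomForest, LightGBM, and H2O can handle categorical splits natively. A minimal sketch, using a made-up toy DataFrame:

```python
import pandas as pd
from sklearn.ensemble import RandomForestClassifier
from sklearn.preprocessing import OrdinalEncoder

# Toy data; the column names and values are made up for illustration.
df = pd.DataFrame(
    {
        "color": ["red", "green", "blue", "green", "red", "blue"],
        "size": [1.0, 2.5, 3.0, 2.0, 1.5, 3.5],
        "label": [0, 1, 1, 0, 0, 1],
    }
)

# scikit-learn trees only accept numeric features, so encode the category first.
# Integer (ordinal) codes are usually fine for tree models; one-hot encoding
# (pd.get_dummies) is a safer default when categories have no order and their
# number is small.
X = df[["color", "size"]].copy()
X["color"] = OrdinalEncoder().fit_transform(X[["color"]]).ravel()

clf = RandomForestClassifier(n_estimators=100, random_state=0)
clf.fit(X, df["label"])
print(clf.predict(X))
```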
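On the second question: treating missing values in a categorical predictor as an extra category is a common trick, and it lets the trees learn whether missingness itself is predictive. A rough sketch of the idea; the placeholder label `__missing__` and the toy data are made up:

```python
import pandas as pd
from sklearn.ensemble import RandomForestClassifier
from sklearn.preprocessing import OrdinalEncoder

# Hypothetical categorical column with missing values.
df = pd.DataFrame(
    {
        "city": ["NYC", None, "LA", "SF", None, "LA"],
        "label": [1, 0, 1, 0, 0, 1],
    }
)

# Instead of imputing a "real" city, make missingness its own category so that
# the trees can split on it directly.
df["city"] = df["city"].fillna("__missing__")

X = OrdinalEncoder().fit_transform(df[["city"]])
clf = RandomForestClassifier(n_estimators=50, random_state=0).fit(X, df["label"])
print(clf.predict(X))
```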
Imputation
- Mean, median, etc. (see the first sketch after this list).
- SVD imputation: approximate the high-dimensional data matrix with a low-dimensional (low-rank) reconstruction and use it to fill in the missing entries (see the second sketch after this list).
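A minimal sketch of mean/median imputation with scikit-learn's SimpleImputer (the data here is made up):

```python
import numpy as np
from sklearn.impute import SimpleImputer

# Made-up matrix with missing entries.
X = np.array([[1.0, 2.0], [np.nan, 3.0], [7.0, np.nan]])

# Replace each missing entry with its column median ("mean" and
# "most_frequent" are the other common strategies).
imputer = SimpleImputer(strategy="median")
print(imputer.fit_transform(X))
```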
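For the SVD idea, one simple scheme is iterative low-rank imputation: initialize the missing entries with column means, then repeatedly replace them with the corresponding entries of a truncated-SVD reconstruction. A rough sketch; the function name, rank, and iteration count are illustrative choices, not a specific library API:

```python
import numpy as np


def svd_impute(X, rank=1, n_iter=50):
    """Fill missing entries of X using a rank-`rank` SVD approximation.

    Missing entries are initialized with column means, then repeatedly
    replaced by the corresponding entries of the truncated-SVD
    reconstruction of the current filled-in matrix.
    """
    X = np.asarray(X, dtype=float)
    missing = np.isnan(X)
    filled = np.where(missing, np.nanmean(X, axis=0), X)
    for _ in range(n_iter):
        U, s, Vt = np.linalg.svd(filled, full_matrices=False)
        low_rank = (U[:, :rank] * s[:rank]) @ Vt[:rank, :]
        filled[missing] = low_rank[missing]
    return filled


# Rows of this made-up matrix are multiples of [1, 2, 3], so a rank-1
# reconstruction recovers the missing entries well.
X = np.array(
    [
        [1.0, 2.0, 3.0],
        [2.0, np.nan, 6.0],
        [3.0, 6.0, np.nan],
        [np.nan, 8.0, 12.0],
    ]
)
print(svd_impute(X, rank=1))
```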
Tips on Kaggle
Machine Learning Resources
AI Tools
https://openai.com/blog/dall-e/
References
- https://github.com/academic/awesome-datascience
- Essential Cheat Sheets for Machine Learning and Deep Learning Engineers
- https://rushter.com/dsreader/