Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!
Tips and Traps¶
The YAML specification does not forbid duplicated keys.
serde-yaml
ignores duplicated keys except the last one silently, by default. To overwrite this behavior, you have to define your own struct and use the tag serde_with::rust::maps_duplicate_key_is_error
Effect of Duplicating Observations in Linear Models
Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!
coefficients don't change but variance become smaller. use formula to show it ...
Complete Duplication of All Data Points
Complete Duplication of Some Data Points
Duplication with Noise
common in computer vision …
Spark Issue: Duplicated Partitions
Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!
There seems to be an issue in Spark that it might fail to overwrite files
even if mode of spark.write
is set to be "overwrite".