Ben Chuanlong Du's Blog

It is never too late to learn.

Lightdm Tips

Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!

  1. it seems that you can choose to enable VNC while starting lightdm. Does this solves the problem of auto-login? If you are allowed to login into the computer via VNC without …

Tips on Multimedia

Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!

  1. XMP is the recommended way to tag many media files

Experiment Design

Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!

  1. Complete Randomized Design (CRD)

  2. Randomized Complete Block Design (CBD)

    • same RNE as CRD
  3. Latin Square Design (LSD)
    • same RNE as CRD
  4. Balanced Incomplete Block Design

    • all treatments cannot fit in any …

Sampling Methods

Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!

Probability Sampling

  • Random Sampling

  • Systematic Sampling

  • Stratified Sampling

Non-probability Sampling

  • Convenience Sampling

  • Judgement Sampling

  • Quota Sampling

  • Snowball Sampling

bias

Model Fitting in ANOVA Analysis

Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!

Contrast

  1. Usually it does not matter what contrast(s) you use for factors in linear model problems[^20], so you can choose appropriate contrast(s) so that your problem is most …

Read/Write CSV in PySpark

Load Data in CSV Format

  1. .load is a general method for reading data in different format. You have to specify the format of the data via the method .format of course. .csv (both for CSV and TSV), .json and .parquet are specializations of .load. .format is optional if you use a specific loading function (csv, json, etc.).