Ben Chuanlong Du's Blog

It is never too late to learn.

GPU Related Issues and Solutions

Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!

Tips

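  1. Training a model requires significantly more CPU/GPU memory than running inference with the same model.

  2. torch.cuda.empty_cache() does not help if the GPU simply does not have enough memory. It only releases unused cached memory held by PyTorch's caching allocator; memory occupied by live tensors stays allocated.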

  3. It is suggested that you …

Benchmark of Popular Deep Learning Models

Memory

  1. Inference with ResNet152 requires slightly less than 2 GB of GPU memory, and training can be done with 8 GB of GPU memory if a small batch size is used. Deeper ResNet …

Fix the CrashLoopBackOff Issue of Pod in Kubernetes

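Define the container command as ["/busybox/sh", "-c", "tail -f /dev/null"] instead of ["/busybox/sh", "-c", "tail", "-f", "/dev/null"]. With sh -c, only the first argument after -c is treated as the command string; any remaining arguments become the positional parameters $0, $1, and so on. In the second form the shell therefore runs a bare tail without -f /dev/null, which typically exits right away, so the pod keeps restarting.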
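
The way sh -c treats its arguments can be checked locally without a cluster. The following is a minimal sketch, assuming a POSIX sh is on the PATH; it uses echo in place of tail so that the commands terminate, and the helper name run_sh is made up for this illustration.

```python
import subprocess


def run_sh(args):
    """Run `sh` with the given arguments and return what it wrote to stdout.

    run_sh is a hypothetical helper for this illustration only.
    """
    result = subprocess.run(
        ["sh", *args],
        capture_output=True,
        text=True,
        stdin=subprocess.DEVNULL,  # no stdin attached to the process
    )
    return result.stdout


# Single command string after -c: the shell parses and runs "echo hello world".
joined = run_sh(["-c", "echo hello world"])

# Split form: only "echo" is the command string. "hello" becomes $0 and
# "world" becomes $1, so echo runs without arguments and prints an empty line.
split = run_sh(["-c", "echo", "hello", "world"])

# The extra arguments are still reachable as positional parameters.
params = run_sh(["-c", 'echo "$0 $1"', "hello", "world"])

print(repr(joined))
print(repr(split))
print(repr(params))
```

The same splitting applies to the pod command: in the five-element form, "-f" and "/dev/null" are handed to the shell as $0 and $1 rather than to tail.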