Ben Chuanlong Du's Blog

It is never too late to learn.

Scaling Law for LLM

Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!

Beyond neural scaling laws – Paper Explained

Scaling laws refer to the observed tendency of some machine learning architectures (notably transformers) to improve their performance along a predictable power law when given more …
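
To make the "predictable power law" idea concrete, here is a minimal sketch of how such a trend could be estimated. A power law y = a * x^b is a straight line in log-log space, so an ordinary least-squares fit on the logarithms recovers both parameters. The function name `fit_power_law` and the synthetic data (coefficient 5.0, exponent -0.1) are assumptions made up for illustration; they are not taken from any paper.

```python
import math


def fit_power_law(xs, ys):
    """Fit y = a * x**b by least squares on (log x, log y)."""
    log_x = [math.log(x) for x in xs]
    log_y = [math.log(y) for y in ys]
    n = len(log_x)
    mean_x = sum(log_x) / n
    mean_y = sum(log_y) / n
    # Slope of the regression line in log-log space is the exponent b.
    sxx = sum((u - mean_x) ** 2 for u in log_x)
    sxy = sum((u - mean_x) * (v - mean_y) for u, v in zip(log_x, log_y))
    b = sxy / sxx
    # The intercept is log(a), so exponentiate to recover a.
    a = math.exp(mean_y - b * mean_x)
    return a, b


# Synthetic (hypothetical) data: loss = 5.0 * size**-0.1, with no noise.
sizes = [1e6, 1e7, 1e8, 1e9]
losses = [5.0 * n ** -0.1 for n in sizes]

a, b = fit_power_law(sizes, losses)
print(f"a = {a:.4f}, b = {b:.4f}")
```

With real measurements the points will not sit exactly on a line, and the fitted exponent is only trustworthy within the range of scales that were actually measured.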

Retrieval-Augmented Generation for LLM

What is RAG

What is retrieval-augmented generation? https://research.ibm.com/blog/retrieval-augmented-generation-RAG

What is retrieval-augmented generation? @ YouTube https://www.youtube.com/watch?v=T-D1OfcDW1M

Fine-Tune LLM

LoRA: Low-Rank Adaptation of Large Language Models

https://arxiv.org/abs/2106.09685
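
As a rough illustration of the idea in the LoRA paper: the pretrained weight matrix W0 (d x k) stays frozen, and only a low-rank update B A is trained, with B of shape d x r and A of shape r x k, scaled by alpha / r. The sketch below uses plain Python lists rather than a real deep learning framework, and the helper names (`matmul`, `lora_weight`) and the tiny dimensions are assumptions chosen for readability.

```python
import random


def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def lora_weight(w0, b, a, alpha, r):
    """Return the effective weight W0 + (alpha / r) * B A."""
    scale = alpha / r
    delta = matmul(b, a)
    return [
        [w + scale * d for w, d in zip(w_row, d_row)]
        for w_row, d_row in zip(w0, delta)
    ]


# Hypothetical toy sizes: an 8 x 6 weight matrix with rank-2 adapters.
d, k, r, alpha = 8, 6, 2, 4
rng = random.Random(0)

w0 = [[rng.gauss(0, 1) for _ in range(k)] for _ in range(d)]  # frozen weights
A = [[rng.gauss(0, 0.01) for _ in range(k)] for _ in range(r)]  # small random init
B = [[0.0] * r for _ in range(d)]  # zero init, so the update starts at zero

w = lora_weight(w0, B, A, alpha, r)

full_params = d * k        # parameters touched by full fine-tuning
lora_params = r * (d + k)  # trainable parameters with LoRA
print(full_params, lora_params)
```

Because B starts at zero, the adapted model initially behaves exactly like the pretrained one. The parameter saving is modest at this toy size, but it grows quickly when d and k are in the thousands and r stays small.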

Tips on ONNX

ONNX Runtime

ONNX Runtime is a cross-platform inference and training machine-learning accelerator. ONNX Runtime inference can enable faster customer experiences and lower costs, supporting models from deep learning frameworks such as …

JIT and Domain Specific Compilers

numba

Hummingbird

Hummingbird compiles trained ML models into tensor computation for faster inference.

jax

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

XLA

deep learning …