Ben Chuanlong Du's Blog

It is never too late to learn.

Extracting Data from PDF Files

Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!

Sometimes a PDF file is corrupted or encrypted, making it hard to extract data from it directly. In this case, you can convert a PDF page to an image first and …

Scaling Law for LLM

Beyond neural scaling laws – Paper Explained

Scaling laws refer to the observed tendency of some machine learning architectures (notably transformers) to improve their performance according to a predictable power law when given more …
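To make "predictable power law" concrete, here is a minimal sketch that fits a curve of the form loss = a * N^(-b) by linear regression in log-log space and then extrapolates it. Everything specific in it is an assumption for illustration only: the choice of parameter count N as the scaled resource, the helper name `fit_power_law`, and the synthetic data points, which are generated from a made-up law and are not measurements from any real model.

```python
import math


def fit_power_law(xs, ys):
    """Fit y = a * x**(-b) by ordinary least squares on (log x, log y).

    Hypothetical helper for illustration; returns the pair (a, b).
    """
    log_x = [math.log(x) for x in xs]
    log_y = [math.log(y) for y in ys]
    n = len(xs)
    mean_x = sum(log_x) / n
    mean_y = sum(log_y) / n
    # Slope and intercept of the best-fit line log y = intercept + slope * log x.
    sxx = sum((lx - mean_x) ** 2 for lx in log_x)
    sxy = sum((lx - mean_x) * (ly - mean_y) for lx, ly in zip(log_x, log_y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    # A power law is a straight line in log-log space: a = e^intercept, b = -slope.
    return math.exp(intercept), -slope


# Synthetic data generated from an assumed law loss = 5.0 * N**(-0.1).
param_counts = [1e6, 1e7, 1e8, 1e9]
losses = [5.0 * n ** -0.1 for n in param_counts]

a, b = fit_power_law(param_counts, losses)

# Extrapolate the fitted law to a model 10x larger than any in the data.
predicted = a * 1e10 ** -b
print(f"a={a:.3f}, b={b:.3f}, predicted loss at N=1e10: {predicted:.3f}")
```

Because the synthetic points lie exactly on a power law, the fit recovers the generating constants. Real measurements are noisy and may deviate from a pure power law, so an extrapolation like this should be treated as a rough guide rather than a guarantee.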

Retrieval-Augmented Generation for LLM

What is RAG

What is retrieval-augmented generation? https://research.ibm.com/blog/retrieval-augmented-generation-RAG

What is retrieval-augmented generation? @ YouTube https://www.youtube.com/watch?v=T-D1OfcDW1M

Fine-Tune LLM

LoRA: Low-Rank Adaptation of Large Language Models

https://arxiv.org/abs/2106.09685

Tips on ONNX

ONNX Runtime

ONNX Runtime is a cross-platform inference and training machine-learning accelerator. ONNX Runtime inference can enable faster customer experiences and lower costs, supporting models from deep learning frameworks such as …