Save and Load PyTorch Models

Apr 09, 2020

Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!

PyTorch uses pickle to serialize and deserialize objects.
The PyTorch convention is to use the file extension .pt or .pth for saving model (or its parameters) and use the file extension …

Tips on Deep Graph Learning

Apr 06, 2020

Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!

https://github.com/dmlc/dgl

Tips on the Transformers Python Library for NLP

Mar 06, 2020

Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!

https://github.com/huggingface/transformers#quick-tour

https://github.com/huggingface/transformers

https://huggingface.co/transformers/

BERT

GPT 2

XLNet

Tokenization in NLP

Mar 06, 2020

Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!

Libraries

SentencePiece

SentencePiece is an unsupervised text tokenizer for Neural Network-based text generation.

Subword Algorithms for NLP

Mar 06, 2020

Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!

Classic word representation cannot handle unseen word or rare word well. Character embeddings is one of the solution to overcome out-of-vocabulary (OOV). However, it may be too fine-grained and miss some …

Terminologies and Concepts in NLP

Mar 06, 2020

Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!

Word Embedding Character Embedding Subword Embeddling Tokenization

General Language Understanding Evaluation (GLUE)

Natural Language Generation (NLG) Natural Language Generation, as defined by Artificial Intelligence: Natural Language Processing Fundamentals, is the “process …

← Older Newer →

Ben Chuanlong Du's Blog

It is never too late to learn.

Save and Load PyTorch Models

Tips on Deep Graph Learning

Tips on the Transformers Python Library for NLP

Tokenization in NLP

Libraries

SentencePiece

Subword Algorithms for NLP

Terminologies and Concepts in NLP