Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!
Code for the paper "Language Models are Unsupervised Multitask Learners"
The project huggingface/transfomer has implementation of transfomer based modles (such as GPT-2 ) .
References
https://openai …