Tips on Transformer in NLP

Jan 13, 2020

Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!

http://nlp.seas.harvard.edu/2018/04/03/attention.html

https://blog.floydhub.com/the-transformer-in-pytorch/

http://jalammar.github.io/illustrated-transformer/

https://towardsdatascience.com/transformers-141e32e69591

Feature Scaling in Machine Learning

Jan 13, 2020

Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!

https://en.wikipedia.org/wiki/Feature_scaling

https://www.jeremyjordan.me/batch-normalization/

How to use Data Scaling Improve Deep Learning Model Stability and Performance

https://medium.com/@urvashilluniya/why-data-normalization-is-necessary-for-machine-learning-models-681b65a05029

Understand Attention in NLP

Jan 08, 2020

Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!

http://www.wildml.com/2016/01/attention-and-memory-in-deep-learning-and-nlp/

https://medium.com/@joealato/attention-in-nlp-734c6fa9d983

Tips on Word2Vec

Jan 08, 2020

Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!

Word2Vec

https://code.google.com/archive/p/word2vec/

Hierarchical Softmasx

Negative Sampling

Google Word2Vec claims that hierarchical softmax is better for infrequent words while negative sampling is better for frequent words …

Compresion of Deep Learning Models

Jan 08, 2020

Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!

Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

MobileNet

一、网络修剪

网络修剪，采用当网络权重非 …

Difference Between torch.nn.Module and torch.nn.functional

Jan 07, 2020

Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!

Modules in torch.nn are internal implemented based on torch.nn.functional. Modules in torch.nn are easier to use while torch.nn.functional is more flexible. It is recommended to …

← Older Newer →

Ben Chuanlong Du's Blog

It is never too late to learn.

Tips on Transformer in NLP

Feature Scaling in Machine Learning

Understand Attention in NLP

Tips on Word2Vec

Word2Vec

Hierarchical Softmasx

Negative Sampling

Compresion of Deep Learning Models

Difference Between torch.nn.Module and torch.nn.functional