Bernoulli distribution

BERT

Geometric distribution

Model Compression

Quantization

sampling

Weight Pruning

word embeddings

word2vec