PAPER NOTES: PARAMETER-EFFICIENT FINE-TUNING
A collection of lightweight fine-tuning methods.
PAPER NOTES: LOW-RANK ADAPTATION OF LARGE LANGUAGE MODELS
LoRA is a lightweight fine-tuning technique for large models.
PAPER NOTES: ATTENTION IS ALL YOU NEED
Notes on this classic masterpiece: Attention is All You Need.
DEFINITION AND ALGORITHMS OF TRIE
Trie Algorithm
HOW TO TRAIN A CHINESE TOKENIZER MODEL USING SENTENCEPIECE
What is SentencePiece
DEEP LEARNING: WHAT IS REGULARISATION ?
Categories and Comparison of Regularization Methods