(Print) Use this randomly generated list as your call list when playing the game. There is no need to say the BINGO column name. Place a mark (such as an X, a checkmark, a dot, or a tally mark) on each cell as you announce it to keep track. You can also cut out the items, place them in a bag, and pull words from the bag.
DyT (tanh norm)
Causal Decoder
Adapter tuning
Prefix Decoder
Encoder-Decoder
Teacher forcing
RMSNorm
DeepSeek
Post-LN
Prompt tuning
KV caching
Supervised Fine-Tuning (SFT)
Multi-Query Attention (MQA)
Llama
Hallucinations
Multi-Head Latent Attention (MLA)
Incremental Decoding
RLAIF
Alignment
Chain-of-Thought
Training difficulties
ICL
Knowledge recency
GPT
Loss Spikes
GLaM
GLU
Gemini
Swish
DeepNorm
Pre-LN
DPO
Prefix tuning
SwiGLU
Instruction Tuning
RLHF
Low-Rank Adaptation (LoRA)
Grouped-Query Attention (GQA)
Paged Attention