LLM 9
- [Paper Review] DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
- [Paper Review] Qwen 2.5
- [Paper Review] Llama 3
- [Paper Review] Llama 2: Open Foundation and Fine-Tuned Chat Models
- [Paper Review] QLoRA: Efficient Finetuning of Quantized LLMs
- [Paper Review] Alpaca: A Strong, Replicable Instruction-Following Model
- [Paper Review] LLaMA: Open and Efficient Foundation Language Models
- [Paper Review] LoRA: Low-Rank Adaptation of Large Language Models
- [Paper Review] GPT-3: Language Models are Few-Shot Learners