peft-expert-mode

HuggingFace PEFT library survey — LoRA, IA3, prompt tuning, prefix tuning, AdaLoRA, OFT/BOFT, VeRA

Kind: Mode

Category: LLM Training

Install: npx -y github:anubhavg-icpl/vibe add peft-expert-mode

License: CC BY-NC-SA 4.0

Axolotl — YAML-driven LLM fine-tuning with LoRA/QLoRA, DPO/GRPO, DeepSpeed, FSDP

Teacher-student LLM distillation — logits, on-policy distillation, context distillation

Weight-Decomposed Low-Rank Adaptation — magnitude + direction split for better LoRA quality

Direct Preference Optimization — preference alignment without an explicit reward model

Evaluate fine-tuned LLMs — domain benchmarks, regression checks, catastrophic forgetting detection

Group Relative Policy Optimization — DeepSeek-R1 style reasoning RL with verifiable rewards

More in LLM Training