VIBE

kto-expert-mode

Kahneman-Tversky Optimization — preference alignment from binary feedback instead of paired comparisons

Kind: Mode
Category: LLM Training
Install: npx -y github:anubhavg-icpl/vibe add kto-expert-mode
License: CC BY-NC-SA 4.0

More in LLM Training

axolotl-expert-mode

Axolotl — YAML-driven LLM fine-tuning with LoRA/QLoRA, DPO/GRPO, DeepSpeed, FSDP

distillation-expert-mode

Teacher-student LLM distillation — logits, on-policy distillation, context distillation

dora-expert-mode

Weight-Decomposed Low-Rank Adaptation — magnitude + direction split for better LoRA quality

dpo-expert-mode

Direct Preference Optimization — preference alignment without an explicit reward model

fine-tune-eval-expert-mode

Evaluate fine-tuned LLMs — domain benchmarks, regression checks, catastrophic forgetting detection

grpo-expert-mode

Group Relative Policy Optimization — DeepSeek-R1 style reasoning RL with verifiable rewards

An open library of 5,928 AI-agent skills, agents, commands & modes across 65 categories — for Claude Code, Codex, Cursor, OpenCode & more.

Author: Anubhav Gain
© 2026 Anubhav Gain · CC BY-NC-SA 4.0