Fine-tune LLMs using reinforcement learning with TRL - SFT for instruction tuning, DPO for preference alignment, PPO/GRPO for reward optimization, and reward model training. Use when you need RLHF,...
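A minimal SFT sketch with TRL, assuming a recent TRL release; the base model, the public trl-lib/Capybara dataset, and the step count are placeholders:

```python
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer
from trl import SFTConfig, SFTTrainer

model_name = "Qwen/Qwen2.5-0.5B"  # placeholder base model; swap in your own
model = AutoModelForCausalLM.from_pretrained(model_name)
tokenizer = AutoTokenizer.from_pretrained(model_name)

# Conversational dataset in the "messages" format TRL handles natively
dataset = load_dataset("trl-lib/Capybara", split="train")

trainer = SFTTrainer(
    model=model,
    processing_class=tokenizer,  # named `tokenizer=` in older TRL releases
    train_dataset=dataset,
    args=SFTConfig(output_dir="sft-output", max_steps=100),
)
trainer.train()
```

The same trainer pattern carries over to TRL's DPOTrainer and reward-model trainers, with the dataset swapped for preference pairs.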
Use when adapting large language models to specific tasks, domains, or behaviors - covers LoRA, QLoRA, PEFT, instruction tuning, and full fine-tuning strategies.
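As a sketch of the LoRA setup this entry refers to, using the PEFT library (the model id is a placeholder, and the module names assume a Llama/Qwen-style architecture):

```python
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("Qwen/Qwen2.5-0.5B")  # placeholder
config = LoraConfig(
    r=16,                  # adapter rank
    lora_alpha=32,         # scaling factor
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],  # attention projections; names vary by architecture
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, config)
model.print_trainable_parameters()  # typically well under 1% of total parameters
```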
Generate optimized instructions for Claude (Project instructions, Skills, or standalone prompts). Use when users request creating project setups, writing effective prompts, building Skills, or...
Create visual parameter tuning panels for iterative adjustment of animations, layouts, colors, typography, physics, or any numeric/visual values. Use when the user asks to "create a tuning panel",...
Parameter-efficient fine-tuning for LLMs using LoRA, QLoRA, and 25+ methods. Use when fine-tuning large models (7B-70B) with limited GPU memory, when you need to train <1% of parameters with...
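For the limited-GPU-memory case this entry describes, a QLoRA-style sketch: 4-bit NF4 quantization of the frozen base plus LoRA adapters on top. It assumes bitsandbytes is installed and a recent PEFT release; the model id is illustrative.

```python
import torch
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

# 4-bit NF4 quantization per the QLoRA recipe; base weights stay frozen
bnb = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_use_double_quant=True,
    bnb_4bit_compute_dtype=torch.bfloat16,
)
model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-3.1-8B",  # example 8B model; any causal LM works
    quantization_config=bnb,
)
model = prepare_model_for_kbit_training(model)  # fixes layer norms, enables input grads
model = get_peft_model(
    model,
    LoraConfig(r=16, lora_alpha=32, target_modules="all-linear", task_type="CAUSAL_LM"),
)
```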
Guide model fine-tuning processes to tailor AI performance to specific tasks and domains.
Use when fine-tuning LLMs, training custom models, or optimizing model performance for specific tasks. Invoke for parameter-efficient methods, dataset preparation, or model adaptation.
Analyzes conversation history to improve CLAUDE.md files. Use when you notice patterns in how Claude misunderstands requests, want to consolidate repeated guidance, or improve instruction clarity...
Primary Instruction Protocol for Senior Engineering Agents. Expert in Cognitive Architectures, Memory Systems, and 2026 Context Engineering (Updated for v0.27.0).
LLM Tuning Patterns
Master fine-tuning of large language models for specific domains and tasks. Covers data preparation, training techniques, optimization strategies, and evaluation methods. Use when adapting models...
Optimize vector index performance for latency, recall, and memory. Use when tuning HNSW parameters, selecting quantization strategies, or scaling vector search infrastructure.
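HNSW tuning usually centers on three knobs: graph degree M, build-time efConstruction, and query-time efSearch. A sketch using FAISS as one common implementation (parameter values are illustrative starting points, not recommendations):

```python
import faiss
import numpy as np

d = 768                              # embedding dimensionality
index = faiss.IndexHNSWFlat(d, 32)   # M=32: graph degree; more memory, better recall
index.hnsw.efConstruction = 200      # build-time candidate list; affects index quality
index.hnsw.efSearch = 64             # query-time candidate list; raise for recall, lower for latency

vectors = np.random.rand(10_000, d).astype("float32")
index.add(vectors)
distances, ids = index.search(vectors[:5], 10)  # top-10 neighbors for 5 queries
```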
Generates comprehensive synthetic fine-tuning datasets in ChatML format (JSONL) for use with Unsloth, Axolotl, and similar training frameworks. Gathers requirements, creates datasets with diverse...
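The ChatML-format JSONL output looks roughly like this: one record per line, in the `messages` schema that Unsloth and Axolotl both accept (contents are illustrative):

```python
import json

record = {
    "messages": [
        {"role": "system", "content": "You are a concise technical assistant."},
        {"role": "user", "content": "What does LoRA train?"},
        {"role": "assistant", "content": "Small low-rank adapter matrices added to frozen weights."},
    ]
}

# Append one JSON object per line - the JSONL convention training frameworks expect
with open("train.jsonl", "a", encoding="utf-8") as f:
    f.write(json.dumps(record, ensure_ascii=False) + "\n")
```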
This skill should be used when the user asks to "optimize a DSPy program", "use MIPROv2", "tune instructions and demos", "get best DSPy performance", "run Bayesian optimization", mentions...
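A minimal MIPROv2 sketch, assuming a recent DSPy release; the LM id is a placeholder and the one-example trainset is only for shape (real runs want dozens of labeled examples):

```python
import dspy
from dspy.teleprompt import MIPROv2

dspy.configure(lm=dspy.LM("openai/gpt-4o-mini"))  # placeholder model id

program = dspy.ChainOfThought("question -> answer")

trainset = [
    dspy.Example(question="What is 2 + 2?", answer="4").with_inputs("question"),
    # ... add many more labeled examples for a real optimization run
]

# MIPROv2 jointly tunes instructions and few-shot demos via Bayesian optimization
optimizer = MIPROv2(metric=dspy.evaluate.answer_exact_match, auto="light")
optimized = optimizer.compile(program, trainset=trainset)
```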
Designs and optimizes prompts for large language models including system prompts, agent signals, and few-shot examples. Covers instruction design, prompt security, chain-of-thought reasoning, and...
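The few-shot structure such prompts follow can be sketched as a plain messages list, with a system prompt that pins the output format and worked examples ahead of the real query (all content strings here are illustrative):

```python
system_prompt = "You are a code reviewer. Answer with a one-line verdict, then bullets."

# Few-shot examples teach the output format before the real query arrives
messages = [
    {"role": "system", "content": system_prompt},
    {"role": "user", "content": "Review: def add(a, b): return a - b"},
    {"role": "assistant", "content": "Verdict: bug.\n- Subtracts instead of adding."},
    {"role": "user", "content": "Review: <the code under review>"},
]
```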