Search: model-evaluation | AgentSkillsRepo

ready ~/ agentskillsrepo

login

5368 results (41.4ms) page 5 / 269

Classification Modeling 0.00

aj-geddes / useful-ai-prompts-classification-modeling exact

Build binary and multiclass classification models using logistic regression, decision trees, and ensemble methods for categorical prediction and classification

★ 55 ai

model-quantization 0.00

martinholovsky / claude-skills-generator-model-quantization exact

Expert skill for AI model quantization and optimization. Covers 4-bit/8-bit quantization, GGUF conversion, memory optimization, and quality-performance tradeoffs for deploying LLMs in...

★ 20 ai

agent-evaluation 0.00

halay08 / fullstack-agent-skills-agent-evaluation exact

Testing and benchmarking LLM agents including behavioral testing, capability assessment, reliability metrics, and production monitoring—where even top agents achieve less than 50% on real-world...

★ 0 ai

agent-evaluation 0.00

404kidwiz / agent-skills-backup-agent-evaluation exact

Testing and benchmarking LLM agents including behavioral testing, capability assessment, reliability metrics, and production monitoring—where even top agents achieve less than 50% on real-world...

★ 0 ai

agent-evaluation 0.00

Ianfr13 / claude-code-plugins-agent-evaluation exact

Testing and benchmarking LLM agents including behavioral testing, capability assessment, reliability metrics, and production monitoring—where even top agents achieve less than 50% on real-world...

★ 0 ai

agent-evaluation 0.00

ramidamolis-alt / agent-skills-workflows-agent-evaluation exact

Testing and benchmarking LLM agents including behavioral testing, capability assessment, reliability metrics, and production monitoring—where even top agents achieve less than 50% on real-world...

★ 0 ai

agent-evaluation 0.00

ngxtm / devkit-agent-evaluation exact

Testing and benchmarking LLM agents including behavioral testing, capability assessment, reliability metrics, and production monitoring—where even top agents achieve less than 50% on real-world...

★ 0 ai

agent ai automation claude

agent-evaluation 0.00

sickn33 / antigravity-awesome-skills-agent-evaluation exact

Testing and benchmarking LLM agents including behavioral testing, capability assessment, reliability metrics, and production monitoring—where even top agents achieve less than 50% on real-world...

★ 2,844 ai

agentic-skills ai-agents antigravity autonomous-coding

agent-evaluation 0.00

automindtechnologie-jpg / ultimate-skill-md-agent-evaluation exact

Testing and benchmarking LLM agents including behavioral testing, capability assessment, reliability metrics, and production monitoring—where even top agents achieve less than 50% on real-world...

★ 0 ai

agent-evaluation 0.00

cleodin / antigravity-awesome-skills-agent-evaluation exact

Testing and benchmarking LLM agents including behavioral testing, capability assessment, reliability metrics, and production monitoring—where even top agents achieve less than 50% on real-world...

★ 1 ai

agentic-skills ai-agents antigravity antigravity-ide

tech-stack-evaluator 0.00

matteocervelli / llms-tech-stack-evaluator exact

Auto-activates during requirements analysis to evaluate technical stack

★ 16 tools

cofounder-evaluator 0.00

shipshitdev / library-cofounder-evaluator exact

Use this skill when users need to evaluate potential co-founders, assess founder compatibility, design equity splits, or navigate co-founder relationships. Activates for "should I work with this...

★ 4 ai

claude-code codex commands skills

model-optimization 0.00

omer-metin / skills-for-antigravity-model-optimization exact

Use when reducing model size, improving inference speed, or deploying to edge devices - covers quantization, pruning, knowledge distillation, ONNX export, and TensorRT optimizationUse when ", " mentioned.

★ 5 ai

ai-agents antigravity antigravity-ide skills

risk-modeling 0.00

omer-metin / skills-for-antigravity-risk-modeling exact

Use when building VaR models, stress testing portfolios, Monte Carlo simulations, or implementing enterprise risk management - covers market risk, credit risk, and operational risk frameworksUse...

★ 5 ai

ai-agents antigravity antigravity-ide skills

style-modeler 0.00

dongbeixiaohuo / writing-agent-style-modeler exact

当用户需要学习某种风格、提取写作配方、建立风格库或模仿特定作者时调用。深度解构文本的15个维度，包括作者画像、思维内核、创作路径、互动设计等，建模为可精准复制的风格文件。触发词：风格建模、提取风格、学习风格、模仿写作、解构文章、写作配方、风格库。

★ 42 ai

ai-writing claude-code content-generation deepseek

ml-model-explainer 0.00

dkyazzentwatwa / chatgpt-skills-ml-model-explainer exact

Explain ML model predictions using SHAP values, feature importance, and decision paths with visualizations.

★ 7 ai

chatgpt claude-skills

ML Model Explanation 0.00

aj-geddes / useful-ai-prompts-ml-model-explanation exact

Interpret machine learning models using SHAP, LIME, feature importance, partial dependence, and attention visualization for explainability

★ 55 ai

data-model-creation 0.00

TencentCloudBase / skills-data-model-creation exact

Optional advanced tool for complex data modeling. For simple table creation, use relational-database-tool directly with SQL statements.

★ 6 data

creating-financial-models 0.00

ronnycoding / claude-creating-financial-models exact

This skill provides an advanced financial modeling suite with DCF analysis, sensitivity testing, Monte Carlo simulations, and scenario planning for investment decisions

★ 8 development

developing-flax-models 0.00

yonesuke / skills-developing-flax-models exact

A comprehensive guide for developing, training, and managing neural networks using Flax NNX. Use when defining models, managing state, or writing training loops.

★ 0 ai