2747 results (33.1ms) page 36 / 138
zechenzhangAGI / ai-research-skills-grpo-rl-training exact

Expert guidance for GRPO/RL fine-tuning with TRL for reasoning and task-specific model training

ovachiever / droid-tings-grpo-rl-training exact

Expert guidance for GRPO/RL fine-tuning with TRL for reasoning and task-specific model training

agamm / claude-code-owasp-owasp-security exact

Use when reviewing code for security vulnerabilities, implementing authentication/authorization, handling user input, or discussing web application security. Covers OWASP Top 10:2025, ASVS 5.0,...

ralph 0.00
jeninh / ampskills-dotfile-ralph exact

Autonomous feature development - setup and execution. Triggers on: ralph, set up ralph, run ralph, run the loop, implement tasks. Two phases: (1) Setup - chat through feature, create tasks with...

ralph 0.00
homburg / agent-skills-ralph exact

Autonomous feature development - setup and execution. Triggers on: ralph, set up ralph, run ralph, run the loop, implement tasks. Two phases: (1) Setup - chat through feature, create tasks with...

ralph 0.00
snarktank / amp-skills-ralph exact

Autonomous feature development - setup and execution. Triggers on: ralph, set up ralph, run ralph, run the loop, implement tasks. Two phases: (1) Setup - chat through feature, create tasks with...

vladm3105 / aidoc-flow-framework-google-adk exact

Develop agentic software and multi-agent systems using Google ADK in Python

0xbeedao / agentic-tools-python-testing-patterns exact

Implement comprehensive testing strategies with pytest, fixtures, mocking, and test-driven development. Use when writing Python tests, setting up test suites, or implementing testing best practices.

ovachiever / droid-tings-python-testing-patterns exact

Implement comprehensive testing strategies with pytest, fixtures, mocking, and test-driven development. Use when writing Python tests, setting up test suites, or implementing testing best practices.

jimobrien / moltcities-skill exact

Publish and manage your cryptographically-verified site on MoltCities β€” the residential layer of the agent internet. Includes inbox, messaging, and agent discovery.

eugenepyvovarov / mcpbundler-agent-skills-marketplace-model-trainer exact

This skill should be used when users want to train or fine-tune language models using TRL (Transformer Reinforcement Learning) on Hugging Face Jobs infrastructure. Covers SFT, DPO, GRPO and reward...

huggingface / skills-hugging-face-model-trainer exact

This skill should be used when users want to train or fine-tune language models using TRL (Transformer Reinforcement Learning) on Hugging Face Jobs infrastructure. Covers SFT, DPO, GRPO and reward...