Simple Preference Optimization for LLM alignment. Reference-free alternative to DPO with better performance (+6.4...
AI & LLM
LLM integrations, prompt engineering, and AI orchestration
Guide for creating effective skills. This skill should be used when users want to create a new skill (or update an...
A skill that creates new Claude skills and automatically shares them on Slack using Rube for seamless team...
Guide users through creating Agent Skills for Claude Code. Use when the user wants to create, write, author, or...
Toolkit for creating animated GIFs optimized for Slack, with validators for size constraints and composable...
Accelerate LLM inference using speculative decoding, Medusa multiple heads, and lookahead decoding techniques. Use...
Use this skill for reinforcement learning tasks including training RL agents (PPO, SAC, DQN, TD3, DDPG, A2C, etc.),...
Statistical modeling toolkit. OLS, GLM, logistic, ARIMA, time series, hypothesis tests, diagnostics, AIC/BIC, for...
Use this skill when working with symbolic mathematics in Python. This skill should be used for symbolic computation...
|
Visualize training metrics, debug models with histograms, compare experiments, visualize model graphs, and profile...
Optimizes LLM inference with NVIDIA TensorRT for maximum throughput and lowest latency. Use for production...
Toolkit for styling artifacts with a theme. These artifacts can be slides, docs, reportings, HTML landing pages,...
Use this skill when working with scientific research tools and workflows across bioinformatics, cheminformatics,...
Graph Neural Networks (PyG). Node/graph classification, link prediction, GCN, GAT, GraphSAGE, heterogeneous graphs,...
Graph-based drug discovery toolkit. Molecular property prediction (ADMET), protein modeling, knowledge graph...
This skill should be used when working with pre-trained transformer models for natural language processing, computer...
Fine-tune LLMs using reinforcement learning with TRL - SFT for instruction tuning, DPO for preference alignment,...
UI design system toolkit for Senior UI Designer including design token generation, component documentation,...
Expert guidance for fast fine-tuning with Unsloth - 2-5x faster training, 50-80% less memory, LoRA/QLoRA optimization
Use this skill for processing and analyzing large tabular datasets (billions of rows) that exceed available RAM....
Serves LLMs with high throughput using vLLM's PagedAttention and continuous batching. Use when deploying production...
Track ML experiments with automatic logging, visualize training in real-time, optimize hyperparameters with sweeps,...
OpenAI's general-purpose speech recognition model. Supports 99 languages, transcription, translation to English, and...