Search: tensor | AgentSkillsRepo

docstring 0.30

pytorch / pytorch-docstring exact

Write docstrings for PyTorch functions and methods following PyTorch conventions. Use when writing or updating docstrings in PyTorch code.

★ 97,036 ai

neural-network autograd gpu numpy

metal-kernel 0.30

pytorch / pytorch-metal-kernel exact

Write Metal/MPS kernels for PyTorch operators. Use when adding MPS device support to operators, implementing Metal shaders, or porting CUDA kernels to Apple Silicon. Covers native_functions.yaml...

★ 97,036 ai

neural-network autograd gpu numpy

at-dispatch-v2 0.29

pytorch / pytorch-at-dispatch-v2 exact

Convert PyTorch AT_DISPATCH macros to AT_DISPATCH_V2 format in ATen C++ code. Use when porting AT_DISPATCH_ALL_TYPES_AND*, AT_DISPATCH_FLOATING_TYPES*, or other dispatch macros to the new v2 API....

★ 97,036 development

neural-network autograd gpu numpy

add-uint-support 0.26

pytorch / pytorch-add-uint-support exact

Add unsigned integer (uint) type support to PyTorch operators by updating AT_DISPATCH macros. Use when adding support for uint16, uint32, uint64 types to operators, kernels, or when user mentions...

★ 97,036 ai

neural-network autograd gpu numpy

serving-llms-vllm 0.25

ovachiever / droid-tings-serving-llms-vllm exact

Serves LLMs with high throughput using vLLM's PagedAttention and continuous batching. Use when deploying production LLM APIs, optimizing inference latency/throughput, or serving models with...

★ 19 ai

serving-llms-vllm 0.25

zechenzhangAGI / ai-research-skills-serving-llms-vllm exact

Serves LLMs with high throughput using vLLM's PagedAttention and continuous batching. Use when deploying production LLM APIs, optimizing inference latency/throughput, or serving models with...

★ 1,712 ai

ai ai-research claude claude-code

domain-ml 0.25

actionbook / rust-skills-domain-ml exact

Use when building ML/AI apps in Rust. Keywords: machine learning, ML, AI, tensor, model, inference, neural network, deep learning, training, prediction, ndarray, tch-rs, burn, candle, 机器学习, 人工智能, 模型推理

★ 596 ai

training-llms-megatron 0.25

ovachiever / droid-tings-training-llms-megatron exact

Trains large language models (2B-462B parameters) using NVIDIA Megatron-Core with advanced parallelism strategies. Use when training models >1B parameters, need maximum GPU efficiency (47% MFU on...

★ 19 ai

training-llms-megatron 0.25

zechenzhangAGI / ai-research-skills-training-llms-megatron exact

Trains large language models (2B-462B parameters) using NVIDIA Megatron-Core with advanced parallelism strategies. Use when training models >1B parameters, need maximum GPU efficiency (47% MFU on...

★ 1,712 ai

ai ai-research claude claude-code

triaging-issues 0.23

pytorch / pytorch-triaging-issues exact

Triages GitHub issues by routing to oncall teams, applying labels, and closing questions. Use when processing new PyTorch issues or when asked to triage an issue.

★ 97,036 ai

neural-network autograd gpu numpy

skill-writer 0.23

pytorch / pytorch-skill-writer exact

Guide users through creating Agent Skills for Claude Code. Use when the user wants to create, write, author, or design a new Skill, or needs help with SKILL.md files, frontmatter, or skill structure.

★ 97,036 ai

neural-network autograd gpu numpy

pytorch-fsdp 0.07

ovachiever / droid-tings-pytorch-fsdp exact

Expert guidance for Fully Sharded Data Parallel training with PyTorch FSDP - parameter sharding, mixed precision, CPU offloading, FSDP2

★ 19 ai

pytorch-fsdp 0.07

zechenzhangAGI / ai-research-skills-pytorch-fsdp exact

Expert guidance for Fully Sharded Data Parallel training with PyTorch FSDP - parameter sharding, mixed precision, CPU offloading, FSDP2

★ 1,712 ai

ai ai-research claude claude-code

torchforge-rl-training 0.07

zechenzhangAGI / ai-research-skills-torchforge-rl-training exact

Provides guidance for PyTorch-native agentic RL using torchforge, Meta's library separating infra from algorithms. Use when you want clean RL abstractions, easy algorithm experimentation, or...

★ 1,712 ai

ai ai-research claude claude-code

qutip 0.07

K-Dense-AI / claude-scientific-skills-qutip exact

Quantum physics simulation library for open quantum systems. Use when studying master equations, Lindblad dynamics, decoherence, quantum optics, or cavity QED. Best for physics research, open...

★ 6,907 ai

ai-scientist bioinformatics chemoinformatics claude

runtime-skills 0.07

llama-farm / llamafarm-runtime-skills exact

Universal Runtime best practices for PyTorch inference, Transformers models, and FastAPI serving. Covers device management, model loading, memory optimization, and performance tuning.

★ 810 ai

ai edge edge-computing llama3

deepspeed 0.07

zechenzhangAGI / ai-research-skills-deepspeed exact

Expert guidance for distributed training with DeepSpeed - ZeRO optimization stages, pipeline parallelism, FP16/BF16/FP8, 1-bit Adam, sparse attention

★ 1,712 ai

ai ai-research claude claude-code

deepspeed 0.07

ovachiever / droid-tings-deepspeed exact

Expert guidance for distributed training with DeepSpeed - ZeRO optimization stages, pipeline parallelism, FP16/BF16/FP8, 1-bit Adam, sparse attention

★ 19 ai

torch-geometric 0.07

ovachiever / droid-tings-torch-geometric exact

Graph Neural Networks (PyG). Node/graph classification, link prediction, GCN, GAT, GraphSAGE, heterogeneous graphs, molecular property prediction, for geometric deep learning.

★ 19 ai

torch-geometric 0.07

K-Dense-AI / claude-scientific-skills-torch-geometric exact

Graph Neural Networks (PyG). Node/graph classification, link prediction, GCN, GAT, GraphSAGE, heterogeneous graphs, molecular property prediction, for geometric deep learning.

★ 6,907 ai

ai-scientist bioinformatics chemoinformatics claude

Confirm

Submit a Skill