aiconfigurator
NVIDIA AIConfigurator: optimal LLM serving configuration for disaggregated or aggregated deployments, covering parallelism selection (TP/PP/EP/DP), quantization, and MoE planning. Use when planning model deployment topology on NVIDIA GPUs.
Source: GitHub https://github.com/tylertitsworth/skills