aiconfigurator
NVIDIA AIConfigurator: optimal LLM serving configuration for disaggregated/aggregated deployments, parallelism selection (TP/PP/EP/DP), quantization, and MoE planning. Use when planning model deployment topology on NVIDIA GPUs.
Changelog: Source: GitHub https://github.com/tylertitsworth/skills