Joe DiPrima
fbdcc807b3
Integrate NVIDIA Spark playbooks: CUDA sm_121, TensorRT-LLM, fine-tuning, Ollama, ComfyUI
Phase 4: Parsed all 9 playbooks from build.nvidia.com/spark.
Key findings: CUDA compute capability sm_121, toolkit 13.0, TensorRT-LLM
confirmed, fine-tuning scripts (SFT/LoRA/QLoRA up to 70B), Nemotron-3-Nano
30B MoE, speculative decoding (EAGLE-3/Draft-Target), ComfyUI image gen,
Ollama+Open WebUI, RAPIDS scientific computing, DGX Dashboard on port 11000,
NVIDIA Sync full documentation. 11 questions resolved.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
1 month ago

| .. | | |
| llm-memory-estimation.md | Initial knowledge base for Dell Pro Max GB10 expert agent | 1 month ago |