• v6.5 b2f8626fee

    Refactor validate modes, fix benchmark report and IPMI power

    mchus released this 2026-04-08 00:42:12 +03:00 | 224 commits to main since this release

    • Replace diag level 1-4 dropdown with Validate/Stress radio buttons
    • Validate: dcgmi L2, 60s CPU, 256MB/1p memtester, SMART short
    • Stress: dcgmi L3 + targeted_stress in Run All, 30min CPU, 1GB/3p memtester, SMART long/NVMe extended
    • Parallel GPU mode: spawn single task for all GPUs instead of splitting per model
    • Benchmark table: per-GPU columns for sequential runs, server-wide column for parallel
    • Benchmark report converted to Markdown with server model, GPU model, version in header; only steady-state charts
    • Fix IPMI power parsing in benchmark (was looking for 'Current Power', correct field is 'Instantaneous power reading')

    Co-Authored-By: Claude Sonnet 4.6 noreply@anthropic.com

    Downloads