-
Notifications
You must be signed in to change notification settings - Fork 173
Issues: NVIDIA/dcgm-exporter
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Add cURL, wget or something similar for basic localhost URL checks that metrics are being produced.
enhancement
New feature or request
#453
opened Feb 11, 2025 by
hassanbabaie
Restart using the 'latest' tag for new container image releases
#451
opened Feb 9, 2025 by
andscape-dev
DCP metrics not collected due to missing DCGM modules
bug
Something isn't working
#449
opened Feb 3, 2025 by
age9990
Running latest DCGM exporter not working on GKE
question
Further information is requested
#448
opened Feb 2, 2025 by
puneetloya
Missing NVSwitch Bandwidth (RX / TX) in dcgm-exporter
question
Further information is requested
#446
opened Jan 24, 2025 by
suranchoi
DCGM-Exporter crashes on startup when using MIG w/ A100
bug
Something isn't working
#443
opened Jan 16, 2025 by
csibbitt
Move exporter-metrics-volume to daemonset.yaml
enhancement
New feature or request
#442
opened Jan 15, 2025 by
ysk24ok
Data Collection Issue with A100 GPUs and MIG Mode using DCGM Exporter
bug
Something isn't working
#441
opened Jan 15, 2025 by
doddisam
Skipping line 3 ('DCGM_FI_PROF_GR_ENGINE_ACTIVE'): metric not enabled
question
Further information is requested
#439
opened Jan 14, 2025 by
SeungyeopShin
Data Collection Issue with H100 GPUs and MIG Mode using DCGM Exporter
question
Further information is requested
#434
opened Dec 19, 2024 by
coreminw
After the dcp indicator is enabled, dcgm-exporter reports an error
question
Further information is requested
#431
opened Dec 10, 2024 by
15234660879
Exporter does not provide any of the DCGM_FI_DEV_*_UTIL metrics
question
Further information is requested
#430
opened Dec 4, 2024 by
kt-pham
Memory usage increased 2.25x after upgrading from 3.3.6-3.4.2 to 3.3.9-3.6.1
#425
opened Nov 22, 2024 by
age9990
Support collecting pod labels
enhancement
New feature or request
#423
opened Nov 20, 2024 by
mtparet
dcgm-exporter counter value goes down
bug
Something isn't working
#417
opened Nov 14, 2024 by
luccabb
Not collecting GPU metrics; Error getting devices count: Cannot perform the requested operation because NVML doesn't exist on this system
question
Further information is requested
#416
opened Nov 13, 2024 by
saichanumolu9
Checksum mismatch for github.com/emicklei/go-restful/[email protected]
bug
Something isn't working
#415
opened Nov 7, 2024 by
WilliamVenner
Segfaults with dcgm-exporter 3.3.0 and higher
bug
Something isn't working
#412
opened Oct 30, 2024 by
andrewjamesbrown
Segmentation fault when running with the default configuration for the GPU Operator on kind
bug
Something isn't working
#409
opened Oct 29, 2024 by
klueska
Previous Next
ProTip!
Find all open issues with in progress development work with linked:pr.