Files
claude-config/skills/rocm-gpu.md

1.5 KiB

ROCm & AMD GPU Computing Skill

Hardware

  • GPU: AMD Radeon RX 6800 XT (Navi 21)
  • VRAM: 16GB
  • Architecture: gfx1030
  • Kernel: 6.8.0-55-generic (with amdgpu-dkms)

Installed ROCm Components

  • ROCm 6.3.2
  • rocminfo
  • rocm-smi
  • hip-runtime-amd
  • miopen-hip
  • rocblas
  • rocfft
  • rocrand

Essential Commands

GPU Monitoring

rocm-smi                           # GPU status
rocm-smi --showproductname         # GPU info
rocm-smi --showpid                 # Show processes
rocminfo                           # Detailed ROCm info

Environment Variables

# Add to ~/.bashrc for permanent setup
export PATH=/opt/rocm/bin:$PATH
export LD_LIBRARY_PATH=/opt/rocm/lib:$LD_LIBRARY_PATH

PyTorch with ROCm

# Install PyTorch for ROCm
pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/rocm6.0

# Verify GPU access
python -c "import torch; print(torch.cuda.is_available()); print(torch.cuda.get_device_name(0))"

Docker with ROCm

docker run -it --device=/dev/kfd --device=/dev/dri --group-add video rocm/pytorch:latest

Performance Tuning

  • Set HSA_OVERRIDE_GFX_VERSION=10.3.0 for compatibility
  • GPU temperature: Check with rocm-smi (normal <85°C)
  • Power limit: 264W (default)

Common Issues

  • "No ROCm-capable GPU": Check kernel is 6.8.0-55, not 6.17
  • Missing libraries: Ensure /opt/rocm/lib is in LD_LIBRARY_PATH
  • Permission denied: User must be in 'render' and 'video' groups