
If I do a regular stress test on both the CPU and GPU (either the Power option in OCCT or separate Prime95 + MSI Kombustor tests), stability seems fine (no driver crashing or display blanking). This still seems to happen even with a new GPU. The RAM is G.SKILL (stock 2133MHz, XMP 3600MHz running at 1.4V). I have a 4K display connected over HDMI at 60Hz. Windows, DirectX, Vulkan, VC++ redists, chipset drivers, and BIOS are all up-to-date. I can't quite remember whether this happened on Windows 10 LTSC/1809 with WDDM 2.5. Happens at both stock and XMP RAM speeds/timings.
Doesn't seem to happen with general desktop usage (mostly web browsing) on Windows or Linux. No thermal throttling (CPU and GPU stay below 80C even under stress tests). Happens at both stock voltages/clocks and maxed 1.2V voltages (with stock clocks) across all states in Wattman on core and memory, along with the maxed 30% power limit. Happens on both PCI-E slots (PCI-E 3.0 x8 and x16). Happens on both the 20.8.1 and 20.8.2 AMD drivers. Also tried the V19 Enterprise graphics drivers (from before 2020; Q4, I think). Happens on both WDDM 2.6 (with the Enterprise drivers above) and 2.7. Happens at random points in FFXIV and Age of Empires 2 DE (both with their native renderers and with DXVK). I can reproduce it very quickly with the OCCT GPU Memtest. Happens on a few randomly-attempted VBIOSes.
This happens on both the gaming and compute BIOS. I have a replacement GPU arriving at some point (I've tried every other solution I could think of, and I'm thinking the GPU is defective), but I'd like to know if there's anything else I can try. Strangely, I noticed that it hasn't happened at all when I'm in VR, though I've only tried Beat Saber and Blade & Sorcery, and I'm not sure if that's just because I don't play for long (normally I do 30-minute sessions).
This normally happens after 5-10 minutes, but it can sometimes take up to an hour. In games, I usually get TDRs (the game freezes, the screen goes black for under a minute, then returns with the game either still running or crashed).