Anyone with insight or ideas is highly appreciated. I’ve had this issue for a few weeks now. 1660 Super, Ryzen 5800x. I suspect this issue to possibly be related to the fact my system seems unaware of the fan RPM of my GPU, but I don’t know how to resolve that or if it’s the cause. inxi output (https://clbin.com/kxLGy):
When playing any game (Yuzu, Genshin Impact, GTAV) after a certain time spent playing my GPU fans will instantly spike up to 100% and stay there. Even if I close the game/app, the fans will still stay maxxed out until I restart the computer. I have MangoHUD running in all of my games, and I can see that my GPU temperatures are never getting even close to high enough to warrant this. It seems like once the card reaches a certain (relatively low) temprature threshold, maybe around 55c, it just goes into killer mode and stays there for no reason.
I reinstalled Windows to test, and that was the only solution I’ve found for this issue (no fan problems on Win10).
Issue is present:
On Manjaro, Endeavour, Pop!_OS
On kernels 5.10, 5.13, and 5.15
On nvidia drivers 495, 475, and 470
I’ve also attempted setting the “Coolbits” option to various points, however utilities like GreenWithEnvy seem unable to see the fan RPM. I know the coolbits options are otherwise working, because it did open the overclock options. Attempting to set a manual fan profile has no effect.
Please let me know if there are any tests I should run, or logs when attempting to launch some of these games. I’m not entirely sure where or how to capture information from what’s happening.
These are the fans on the shroud of the 1660S, so the only plug on the card is the 6 pin PCI-E plug. I did open the case and confirm everything was seated correctly and plugged in without issue.
I installed Win10 as a way of trying to make sure it wasn’t a hardware issue, after having it affect all the Linux flavors I had listed. On the same hardware with no touches, Win10 didn’t have issue. I was also able to run this card on Linux for the last year+, and only a few weeks ago this began, but don’t know what started it.
I’ve also checked BIOS settings and there’s nothing there that I can see that should impact this.
So it’s the GPU fans you were talking about. Funny you mentioned it because i have a GTX 1060 and i have never seen the fans run ever. Supposed they don’t even come on until it reaches 50 degrees? I don’t know if they even work? It seems bizarre but i have no issues with it paired up to a 5 Ghz Intel i7-8086K It’s very quite even with the case fans. I don’t think it’s ever gotten hot enough.
Edit: I also just wanted to mention on my Ryzen 3800X i have a Gigabyte RX 590 8 GB card and it has lighting on the top of the card that say’s Gigabyte and Fan Stop. So it has RGB lighting that lights the Fan Stop when they are OFF.
I think the fans are running when it’s under 50c, pretty sure. I’m able to run the computer without issue until I start stressing the graphic card hard enough by playing a game.
It’s happened before in Yuzu, but it’s actually very rare for Yuzu to spike it. It seems more GPU demanding things are quicker to set it off - seems like it’s hitting a threshold, spiking the fans to 100%, and then staying there for some reason. I know for sure I’ve never seen MangoHUD show the temperature go over 60c, on anything.
I feel like this is possibly related, but don’t know the cause:
Sensors:
System Temperatures: cpu: N/A mobo: N/A gpu: nvidia temp: 30 C Fan Speeds (RPM): N/A gpu: nvidia fan: 0%
GreenWithEnvy and other utilities have also been unable to identify the fan speed/usage.
That is strange as well as my Nvidia is strange not even running as far as i can tell. One of these days I’m going to look and see if they actually are? I don’t do games so i really have overkill considering the processor has HD 630 Intel graphics also.
I’d check for a BIOS update, and a VBIOS update, and if it happens with several different distros (and in Windows?) then it might be a hardware issue and therefore worth an RMA.
No i just meant his motherboard is already the latest Bios update. Sorry I should have worded that differently. But i am interested in Graphics card update as that is something i have never done not being a gamer and all.
I’m not sure I can update the firmware on the graphics card directly. Searching Nvidia’s website and a quick online search, I don’t think there’s anything available for the 1660 Super. When I went back to Windows 10 for a day to test, I downloaded the latest 497 drivers from Nvidia’s website and installed them, I’m not sure if there could have been any firmware update bundled with that but regardless the issue was still present when returning to EndeavourOS.
Jonathon, just to clarify, Windows doesn’t exhibit this problem. For right now it’s been every Linux distro I’ve attempted (Manjaro, Endeavour, Pop!)
I’m really curious if there’s some sort of package, configuration, or whatever I could use for Linux to see the fan RPM off of the graphics card. I have no way of knowing for sure but I feel like that could be related.
Same as GreenWithEnvy or the inxi output, nvidia-settings is also unable to determine the fan speed. I get a generic error attempting to upload my screenshot, but my temperature was 31c, and under “fan information” ID, RPM, % are all 0. My “Control type” and “Cooling target” are both the same as yours. GPU fan settings below are disabled, unlike yours mine also shows “0” for fan 0 speed.
I confirmed I had lm_sensors installed and ran through sensors-detect as well, answering “yes” to all and rebooting.
I’m going to undo those changes to Xwrapper since it didn’t resolve the problem and I also don’t want X running as root every time I turn the PC on.
To make things more confusing/weird, sometimes when the fan spikes after I reboot the fan noticeably drops in RPM, but still runs high (maybe 50-70% fan speed). Even AFTER rebooting, the fan (sometimes) seems to stay at that percentage until I reboot once again, at which point I can hear the fan go silent.
Absolutely no idea what’s going on here, I really don’t want to format this PC again much less stick it back on Windows…