Failed to boot after swapping Nvidia GPU for AMD GPU

Endeavour is failing to boot after removing my GTX1070 and replacing it with an RX6700. I am able to boot to a live disk and found the following in the logs. Can someone point me in the right direction to resolve these errors? I would expect the Nvidia errors, but am assuming the boot issues is with the second and third block of errors

Feb 25 15:06:58 stephen-ms7c02 kernel: NVRM: No NVIDIA GPU found.
Feb 25 15:06:58 stephen-ms7c02 kernel: nvidia-nvlink: Unregistered Nvlink Core, major device number 238
Feb 25 15:06:58 stephen-ms7c02 kernel: nvidia-nvlink: Nvlink Core is being initialized, major device number 238
Feb 25 15:06:58 stephen-ms7c02 kernel: NVRM: No NVIDIA GPU found.
Feb 25 15:06:58 stephen-ms7c02 kernel: nvidia-nvlink: Unregistered Nvlink Core, major device number 238
Feb 25 15:06:58 stephen-ms7c02 kernel: nvidia-nvlink: Nvlink Core is being initialized, major device number 238
Feb 25 15:06:58 stephen-ms7c02 kernel: NVRM: No NVIDIA GPU found.

Feb 25 15:06:58 stephen-ms7c02 systemd[1]: Found device Samsung_SSD_850_EVO_250GB 1.
Feb 25 15:06:58 stephen-ms7c02 systemd[1]: Found device Samsung_SSD_850_EVO_250GB 1.
Feb 25 15:06:58 stephen-ms7c02 systemd[1]: Starting File System Check on /dev/disk/by-uuid/11F1-FE06...
Feb 25 15:06:58 stephen-ms7c02 systemd[1]: Starting File System Check on /dev/sdb1...
Feb 25 15:06:58 stephen-ms7c02 systemd-fsck[532]: fsck.ext4: Device or resource busy while trying to open /dev/sdb1
Feb 25 15:06:58 stephen-ms7c02 systemd-fsck[532]: Filesystem mounted or opened exclusively by another program?
Feb 25 15:06:58 stephen-ms7c02 systemd-fsck[528]: fsck failed with exit status 8.
Feb 25 15:06:58 stephen-ms7c02 systemd-fsck[528]: Ignoring error.
Feb 25 15:06:58 stephen-ms7c02 systemd-fsck[531]: fsck.ext4: Bad magic number in super-block while trying to open /dev/sdb1
Feb 25 15:06:58 stephen-ms7c02 systemd-fsck[531]: /dev/sdb1:
Feb 25 15:06:58 stephen-ms7c02 systemd-fsck[531]: The superblock could not be read or does not describe a valid ext2/ext3/ext4
Feb 25 15:06:58 stephen-ms7c02 systemd-fsck[531]: filesystem.  If the device is valid and it really contains an ext2/ext3/ext4
Feb 25 15:06:58 stephen-ms7c02 systemd-fsck[531]: filesystem (and not swap or ufs or something else), then the superblock
Feb 25 15:06:58 stephen-ms7c02 systemd-fsck[531]: is corrupt, and you might try running e2fsck with an alternate superblock:
Feb 25 15:06:58 stephen-ms7c02 systemd-fsck[531]:     e2fsck -b 8193 <device>
Feb 25 15:06:58 stephen-ms7c02 systemd-fsck[531]:  or
Feb 25 15:06:58 stephen-ms7c02 systemd-fsck[531]:     e2fsck -b 32768 <device>
Feb 25 15:06:58 stephen-ms7c02 systemd-fsck[531]: /dev/sdb1 contains a vfat file system
Feb 25 15:06:58 stephen-ms7c02 systemd-fsck[525]: fsck failed with exit status 8.
Feb 25 15:06:58 stephen-ms7c02 systemd-fsck[525]: Ignoring error.
Feb 25 15:06:58 stephen-ms7c02 systemd[1]: Finished File System Check on /dev/sdb1.
Feb 25 15:06:58 stephen-ms7c02 kernel: piix4_smbus 0000:00:14.0: SMBus Host Controller at 0xb00, revision 0
Feb 25 15:06:58 stephen-ms7c02 kernel: piix4_smbus 0000:00:14.0: Using register 0x02 for SMBus port selection
Feb 25 15:06:58 stephen-ms7c02 kernel: piix4_smbus 0000:00:14.0: Auxiliary SMBus Host Controller at 0xb20
Feb 25 15:06:58 stephen-ms7c02 kernel: ccp 0000:2a:00.1: enabling device (0000 -> 0002)
Feb 25 15:06:58 stephen-ms7c02 kernel: ccp 0000:2a:00.1: ccp: unable to access the device: you might be running a broken BIOS.
Feb 25 15:06:58 stephen-ms7c02 kernel: ccp 0000:2a:00.1: psp enabled
Feb 25 15:06:58 stephen-ms7c02 kernel: input: PC Speaker as /devices/platform/pcspkr/input/input2
Feb 25 15:06:58 stephen-ms7c02 kernel: r8168 Gigabit Ethernet driver 8.051.02-NAPI loaded
Feb 25 15:06:58 stephen-ms7c02 kernel: nvidia-nvlink: Nvlink Core is being initialized, major device number 238
Feb 25 15:06:58 stephen-ms7c02 kernel: 
Feb 25 15:06:58 stephen-ms7c02 kernel: RAPL PMU: API unit is 2^-32 Joules, 1 fixed counters, 163840 ms ovfl timer
Feb 25 15:06:58 stephen-ms7c02 kernel: RAPL PMU: hw unit of domain package 2^-16 Joules
Feb 25 15:06:58 stephen-ms7c02 kernel: NVRM: No NVIDIA GPU found.
Feb 25 15:06:58 stephen-ms7c02 systemd[1]: Finished File System Check on /dev/disk/by-uuid/11F1-FE06.
Feb 25 15:06:58 stephen-ms7c02 kernel: nvidia-nvlink: Unregistered Nvlink Core, major device number 238
Feb 25 15:06:58 stephen-ms7c02 kernel: cryptd: max_cpu_qlen set to 1000
Feb 25 15:06:59 stephen-ms7c02 kernel: EXT4-fs (sdb1): VFS: Can't find ext4 filesystem
Feb 25 15:06:59 stephen-ms7c02 systemd[1]: Finished Load Kernel Modules.
Feb 25 15:06:59 stephen-ms7c02 systemd[1]: mnt-SSD.mount: Mount process exited, code=exited, status=32/n/a
Feb 25 15:06:59 stephen-ms7c02 systemd[1]: mnt-SSD.mount: Failed with result 'exit-code'.
Feb 25 15:06:59 stephen-ms7c02 systemd[1]: Failed to mount /mnt/SSD.
Feb 25 15:06:59 stephen-ms7c02 systemd[1]: Dependency failed for Local File Systems.
Feb 25 15:06:59 stephen-ms7c02 systemd[1]: local-fs.target: Job local-fs.target/start failed with result 'dependency'.
Feb 25 15:06:59 stephen-ms7c02 systemd[1]: local-fs.target: Triggering OnFailure= dependencies.
Feb 25 15:06:59 stephen-ms7c02 systemd[1]: systemd-ask-password-console.path: Deactivated successfully.
Feb 25 15:06:59 stephen-ms7c02 systemd[1]: Stopped Dispatch Password Requests to Console Directory Watch.
Feb 25 15:06:59 stephen-ms7c02 systemd[1]: systemd-ask-password-wall.path: Deactivated successfully.
Feb 25 15:06:59 stephen-ms7c02 systemd[1]: Stopped Forward Password Requests to Wall Directory Watch.
Feb 25 15:06:59 stephen-ms7c02 systemd[1]: Reached target Sound Card.
Feb 25 15:06:59 stephen-ms7c02 systemd[1]: Reached target Timer Units.
Feb 25 15:06:59 stephen-ms7c02 systemd[1]: Commit a transient machine-id on disk was skipped because of an unmet condition check (ConditionPathIsMountPoint=/etc/machine-id).
Feb 25 15:06:59 stephen-ms7c02 systemd[1]: Reached target Network.
Feb 25 15:06:59 stephen-ms7c02 systemd[1]: Reached target Network is Online.
Feb 25 15:06:59 stephen-ms7c02 systemd[1]: Reached target Socket Units.
Feb 25 15:06:59 stephen-ms7c02 systemd[1]: Started Emergency Shell.
Feb 25 15:06:59 stephen-ms7c02 systemd[1]: Reached target Emergency Mode.
Feb 25 15:06:59 stephen-ms7c02 systemd[1]: Rebuild Dynamic Linker Cache was skipped because no trigger condition checks were met.
Feb 25 15:06:59 stephen-ms7c02 systemd[1]: Starting Set Up Additional Binary Formats...

I think your system is still trying to boot with the non existent nvidia card, I would uninstall the drivers and install the amd ones, as well as remove the nvidia drm kernel parameter.

Not sure if you can get into a tty and remove all the nvidia drivers? The AMD runs on amdgpu kernel module but if other nvidia stuff is trying to load because it hasn’t been removed then it’s not going to boot.

I did attempt to remove the nvidia package, but get an error at the end

[root@EndeavourOS /]# sudo pacman -Rs nvidia-dkms 
checking dependencies...

Packages (1) nvidia-dkms-525.89.02-2

Total Removed Size:  68.66 MiB

:: Do you want to remove these packages? [Y/n] y
:: Running pre-transaction hooks...
(1/1) Remove DKMS modules
==> dkms remove --no-depmod nvidia/525.89.02 -k 6.1.12-arch1-1
==> depmod 6.1.12-arch1-1
:: Processing package changes...
(1/1) removing nvidia-dkms                                                                                                     [----------------------------------------------------------------------------] 100%
:: Running post-transaction hooks...
(1/2) Arming ConditionNeedsUpdate...
(2/2) eos-nvidia-fix.hook
==> Building image from preset: /etc/mkinitcpio.d/linux.preset: 'default'
  -> -k /boot/vmlinuz-linux -c /etc/mkinitcpio.conf -g /boot/initramfs-linux.img
==> Starting build: 6.1.12-arch1-1
  -> Running build hook: [base]
  -> Running build hook: [udev]
  -> Running build hook: [autodetect]
  -> Running build hook: [modconf]
  -> Running build hook: [block]
==> WARNING: Possibly missing firmware for module: xhci_pci
  -> Running build hook: [keyboard]
  -> Running build hook: [keymap]
  -> Running build hook: [filesystems]
  -> Running build hook: [fsck]
==> ERROR: module not found: `nvidia'
==> ERROR: module not found: `nvidia_uvm'
==> ERROR: module not found: `nvidia_drm'
==> Generating module dependencies
==> Creating zstd-compressed initcpio image: /boot/initramfs-linux.img
==> WARNING: errors were encountered during the build. The image may not be complete.
==> Building image from preset: /etc/mkinitcpio.d/linux.preset: 'fallback'
  -> -k /boot/vmlinuz-linux -c /etc/mkinitcpio.conf -g /boot/initramfs-linux-fallback.img -S autodetect
==> Starting build: 6.1.12-arch1-1
  -> Running build hook: [base]
  -> Running build hook: [udev]
  -> Running build hook: [modconf]
  -> Running build hook: [block]
==> WARNING: Possibly missing firmware for module: bfa
==> WARNING: Possibly missing firmware for module: qed
==> WARNING: Possibly missing firmware for module: qla2xxx
==> WARNING: Possibly missing firmware for module: qla1280
==> WARNING: Possibly missing firmware for module: aic94xx
==> WARNING: Possibly missing firmware for module: wd719x
==> WARNING: Possibly missing firmware for module: xhci_pci
  -> Running build hook: [keyboard]
  -> Running build hook: [keymap]
  -> Running build hook: [filesystems]
  -> Running build hook: [fsck]
==> ERROR: module not found: `nvidia'
==> ERROR: module not found: `nvidia_uvm'
==> ERROR: module not found: `nvidia_drm'
==> Generating module dependencies
==> Creating zstd-compressed initcpio image: /boot/initramfs-linux-fallback.img
==> WARNING: errors were encountered during the build. The image may not be complete.
error: command failed to execute correctly

I don’t want to tell you to do something that is not going to work so maybe would be best for someone else to answer you on this. I think if it was me i would have removed all the drivers first then shutdown and removed the nvidia card and installed the new amd card. This is the way i would approach this on Windows also.

Edit: Things have changed recently with the newer installs using dracut. Not sure if you have a newer install with dracut & are using grub or systemd?

Edit2: The way i solved this for another user but they were using grub and an older install without dracut.

Edit: Not sure what cpu you have either so you may not need ucode.

Looks like file /etc/mkinitcpio.conf has nvidia leftovers in MODULES.

2 Likes