HarvesterOfEyes

joined 11 months ago
[–] HarvesterOfEyes@piefed.social 4 points 2 months ago

Okay, what's the biggest and most active gamer community on Matrix?

As far as I know, https://friendlylinuxplayers.org/ . I'm sure it's not as active as whatever Discord communities you're in but it's fairly active and actually friendly.

[–] HarvesterOfEyes@piefed.social 1 points 3 months ago

I use mkinitcpio. You mean regenerating the initramfs? If so, yeah I've done that a few times without much success. I suppose reinstalling the bootloader could be an option but I've tried using a live ISO from another distro (Ubuntu) and the problem still manifested itself. But it is something to keep in mind, so thanks.

Anyway, so far it seems to be a faulty PCIe slot, so I changed the GPU to another slot and things seem to be working fine. But I'll wait until tomorrow to do a final edit to my post.

[–] HarvesterOfEyes@piefed.social 2 points 3 months ago* (last edited 3 months ago)

PieFed isn't letting me edit the OP due to an unexpected error. The errors keep piling up, haha! [EDIT: It's fixed now!]

Just wanted to thank all of you wonderful people for all the help you've given me. I love each and everyone of you (even the ones who skimmed through my post :p). A user on the other thread I created in the Arch Linux community suggested I add the nomedeset parameter, with which I managed to boot into the system. I updated it and installed linux-lts along with linux-lts-headers. Adjusted /boot/loader/entries/arch_linux.conf to switch to the lts kernel by default and rebooted the PC. Unfortunately, didn't work but I got logs! Here's the relevant part, I think:

mai 03 11:04:23 arch kernel: amdgpu: [powerplay] Failed message: 0xe, input parameter: 0x0, error code: 0xffffffff  
mai 03 11:04:23 arch kernel: amdgpu: [powerplay] Failed message: 0x4, input parameter: 0x2000000, error code: 0xffffffff  
mai 03 11:04:23 arch kernel: [drm:resource_construct [amdgpu]] *ERROR* DC: unexpected audio fuse!  
mai 03 11:04:23 arch kernel: [drm] Display Core v3.2.316 initialized on DCE 12.0  
mai 03 11:04:23 arch kernel: amdgpu 0000:0a:00.0: [drm] *ERROR* No EDID read.  
mai 03 11:04:23 arch kernel: amdgpu 0000:0a:00.0: [drm] *ERROR* No EDID read.  
mai 03 11:04:23 arch kernel: amdgpu 0000:0a:00.0: [drm] *ERROR* No EDID read.  
mai 03 11:04:23 arch kernel: amdgpu 0000:0a:00.0: [drm] *ERROR* No EDID read.  
mai 03 11:04:23 arch kernel: [drm] Timeout wait for RLC serdes 0,0  
mai 03 11:04:23 arch kernel: [drm] kiq ring mec 2 pipe 1 q 0  
mai 03 11:04:23 arch kernel: amdgpu 0000:0a:00.0: [drm:amdgpu_ring_test_helper [amdgpu]] *ERROR* ring kiq_0.2.1.0 test failed (-110)  
mai 03 11:04:23 arch kernel: [drm:amdgpu_gfx_enable_kcq [amdgpu]] *ERROR* KCQ enable failed  
mai 03 11:04:23 arch kernel: [drm:amdgpu_device_init.cold [amdgpu]] *ERROR* hw_init of IP block <gfx_v9_0> failed -110  
mai 03 11:04:23 arch kernel: amdgpu 0000:0a:00.0: amdgpu: amdgpu_device_ip_init failed  
mai 03 11:04:23 arch kernel: amdgpu 0000:0a:00.0: amdgpu: Fatal error during GPU init  
mai 03 11:04:23 arch kernel: amdgpu 0000:0a:00.0: amdgpu: amdgpu: finishing device.  

I did a search and it seems like it's the GPU's fault due to the ring errors. I think. I remembered I have an old nvidia GPU laying around so I'm going to try to reseat the current GPU and, if that doesn't work, try the old one. Not sure if I have to uninstall the amd drivers or if it's ok to have both the amd and nvidia drivers installed. If that doesn't work, I'm going to go through all the other suggestions y'all gave me to try and pinpoint the problem.

Again, thank you so much!

[–] HarvesterOfEyes@piefed.social 2 points 3 months ago* (last edited 3 months ago)

debug nomedeset

It worked! I managed to boot into the system. Updated it and installed linux-lts along with linux-lts-headers. Adjusted /boot/loader/entries/arch_linux.conf to switch to the lts kernel by default and rebooted the PC. Unfortunately, didn't work but I got logs! Here's the relevant part, I think:

mai 03 11:04:23 arch kernel: amdgpu: [powerplay] Failed message: 0xe, input parameter: 0x0, error code: 0xffffffff  
mai 03 11:04:23 arch kernel: amdgpu: [powerplay] Failed message: 0x4, input parameter: 0x2000000, error code: 0xffffffff  
mai 03 11:04:23 arch kernel: [drm:resource_construct [amdgpu]] *ERROR* DC: unexpected audio fuse!  
mai 03 11:04:23 arch kernel: [drm] Display Core v3.2.316 initialized on DCE 12.0  
mai 03 11:04:23 arch kernel: amdgpu 0000:0a:00.0: [drm] *ERROR* No EDID read.  
mai 03 11:04:23 arch kernel: amdgpu 0000:0a:00.0: [drm] *ERROR* No EDID read.  
mai 03 11:04:23 arch kernel: amdgpu 0000:0a:00.0: [drm] *ERROR* No EDID read.  
mai 03 11:04:23 arch kernel: amdgpu 0000:0a:00.0: [drm] *ERROR* No EDID read.  
mai 03 11:04:23 arch kernel: [drm] Timeout wait for RLC serdes 0,0  
mai 03 11:04:23 arch kernel: [drm] kiq ring mec 2 pipe 1 q 0  
mai 03 11:04:23 arch kernel: amdgpu 0000:0a:00.0: [drm:amdgpu_ring_test_helper [amdgpu]] *ERROR* ring kiq_0.2.1.0 test failed (-110)  
mai 03 11:04:23 arch kernel: [drm:amdgpu_gfx_enable_kcq [amdgpu]] *ERROR* KCQ enable failed  
mai 03 11:04:23 arch kernel: [drm:amdgpu_device_init.cold [amdgpu]] *ERROR* hw_init of IP block <gfx_v9_0> failed -110  
mai 03 11:04:23 arch kernel: amdgpu 0000:0a:00.0: amdgpu: amdgpu_device_ip_init failed  
mai 03 11:04:23 arch kernel: amdgpu 0000:0a:00.0: amdgpu: Fatal error during GPU init  
mai 03 11:04:23 arch kernel: amdgpu 0000:0a:00.0: amdgpu: amdgpu: finishing device.  

I did a search and it seems like it's the GPU's fault due to the ring errors. I think. I remembered I have an old nvidia GPU laying around so I'm going to try to reseat the current GPU and, if that doesn't work, try the old one. Not sure if I have to uninstall the amd drivers or if it's ok to have both the amd and nvidia drivers installed.

EDIT in case you missed it: So, I changed the GPU to a different PCIe slot and everything's working fine so far. I'm not celebrating just yet because when this first happened a few months ago, I'd hard reset the PC and everything would work fine. But if I shut it down and let it pass like 12 hours before I'd power it on again, the problem would reappear. So I'm just basically waiting for tomorrow now.

[–] HarvesterOfEyes@piefed.social 7 points 3 months ago

Will do it tomorrow, thanks!

[–] HarvesterOfEyes@piefed.social 4 points 3 months ago

I tried adding the kernel parameter mentioned in that thread but it didn't work. But thank you anyway!

[–] HarvesterOfEyes@piefed.social 2 points 3 months ago

Regarding your edit: no, I haven't tried that, but I will keep those suggestions in mind, thanks!

[–] HarvesterOfEyes@piefed.social 8 points 3 months ago (2 children)

Yeah, it might be the dreaded hardware problem, then.

[–] HarvesterOfEyes@piefed.social 1 points 3 months ago

Yep, has to be for the future, unfortunately, can't access my Arch system in any way right now, even with a live iso :(. But thanks!

[–] HarvesterOfEyes@piefed.social 4 points 3 months ago (1 children)

No, I had to use the latest one. Nope, tried the Ubuntu live ISO but it also didn't work.

[–] HarvesterOfEyes@piefed.social 1 points 3 months ago (2 children)

Will try that after doing what Zikeji suggested. Thanks!

[–] HarvesterOfEyes@piefed.social 1 points 3 months ago

Those are all good tips, thanks! Will do that tomorrow and report back.

view more: ‹ prev next ›