Opened 4 years ago

Last modified 6 months ago

#15781 new bug

Kernel panic: "General Protection Exception" during running SBCL regression tests

Reported by: alpopa Owned by: nobody
Priority: normal Milestone: Unscheduled
Component: System/Kernel Version: R1/Development
Keywords: Cc: korli
Blocked By: #14802 Blocking: #15761
Platform: x86-64

Description

This bug was reported by Douglas Katzman, an SBCL hacker. It is different from #15761 in that Douglas applied some more patches to SBCL. Here is the original message:

There are some issues exposed by the regression suite (things I didn't finish), but I can't actually complete the run because I reliably get a kernel panic (below). I have no vested interest in subscribing to the Haiku bug tracker to file a bug. So if you haven't already reported this exact panic, it may behoove you to do so. I'm running with 8 simulated cores. I could try fewer, but I don't feel like hammering on this problem.

Attachments (17)

Kernel.log (2.7 KB ) - added by alpopa 4 years ago.
CPU.jpg (179.3 KB ) - added by alpopa 4 years ago.
pulse.jpg (263.8 KB ) - added by alpopa 4 years ago.
input_events.jpg (1.6 MB ) - added by alpopa 4 years ago.
sbcl-test-logs-1037.zip (10.2 KB ) - added by alpopa 4 years ago.
sc-d.txt (3.0 KB ) - added by alpopa 4 years ago.
KDL1.jpg (3.3 MB ) - added by alpopa 4 years ago.
KDL2.jpg (3.5 MB ) - added by alpopa 4 years ago.
KDL3.jpg (3.5 MB ) - added by alpopa 4 years ago.
KDL4.jpg (3.6 MB ) - added by alpopa 4 years ago.
syslog.compilation-long (512.0 KB ) - added by alpopa 4 years ago.
kdl-vm.png (112.4 KB ) - added by alpopa 4 years ago.
sc-d-vm.png (77.3 KB ) - added by alpopa 4 years ago.
kdl2-vm.png (64.8 KB ) - added by alpopa 4 years ago.
haiku.json (1.4 KB ) - added by alpopa 4 years ago.
KDL-hrev55161.jpg (3.6 MB ) - added by alpopa 3 years ago.
KDL on hrev55161
syslog.hrev55161 (512.0 KB ) - added by alpopa 3 years ago.
Syslog on hrev55161 (hopefully it is relevant)

Change History (38)

by alpopa, 4 years ago

Attachment: Kernel.log added

comment:1 by waddlesplash, 4 years ago

It is different from #15761 in that Douglas applied some more patches to SBCL.

This is probably irrelevant; the kernel panics are likely the same. Nobody has reported the actual backtrace in #15761, so I'm just going to close that one as a dupe of this.

comment:2 by waddlesplash, 4 years ago

Blocking: 15761 added

comment:3 by waddlesplash, 4 years ago

Actually, this backtrace is missing due to the "READ/WRITE FAULT". Please run sc -d at the KDL prompt the next time you encounter this; that should get past that and show the real backtrace.

comment:4 by alpopa, 4 years ago

Douglas has KDL during running in tests folder the following:

./parallel-exec.sh 8

I performed this test several times and, while the system froze with CPU usage almost 100%, could not reproduce KDL. It may be unrelated to SBCL but to running Haiku in Qemu (I tested on real hardware). I will test this scenario more extensively to see if I can find any pattern.

by alpopa, 4 years ago

Attachment: CPU.jpg added

by alpopa, 4 years ago

Attachment: pulse.jpg added

by alpopa, 4 years ago

Attachment: input_events.jpg added

by alpopa, 4 years ago

Attachment: sbcl-test-logs-1037.zip added

by alpopa, 4 years ago

Attachment: sc-d.txt added

comment:5 by alpopa, 4 years ago

Attached file sc-d.txt as output of sc -d by Douglas.

comment:6 by waddlesplash, 4 years ago

and, while the system froze with CPU usage almost 100%, could not reproduce KDL.

If you have a PS/2 keyboard, try running Alt+PrntScrn+D; this will drop you directly to the KDL prompt and you can run a backtrace from there.

comment:7 by waddlesplash, 4 years ago

Cc: korli added
Component: SystemSystem/Kernel

I'm not sure what to make of this backtrace; it looks like it's occuring in _start directly, but how can that be when the system has been running for a while? CC'ing korli for his analysis.

comment:8 by alpopa, 4 years ago

If you have a PS/2 keyboard, try running Alt+PrntScrn+D; this will drop you directly to the KDL prompt and you can run a backtrace from there.

I test on iMac which doesn't have PS/2 port. I will try to do the same on PC with this port and PS/2 keyboard.

by alpopa, 4 years ago

Attachment: KDL1.jpg added

by alpopa, 4 years ago

Attachment: KDL2.jpg added

by alpopa, 4 years ago

Attachment: KDL3.jpg added

by alpopa, 4 years ago

Attachment: KDL4.jpg added

comment:9 by alpopa, 4 years ago

I also had KDL several times (attachments KDL1.jpg, KDL2.jpg, KDL3.jpg, KDL4.jpg). The exception is reported as "x87 Floating-Point Error".

comment:10 by waddlesplash, 4 years ago

Blocked By: 14802 added

That's #14802.

comment:11 by alpopa, 4 years ago

It is possible that KDL on my system is identical to #14802 but it seems to be different from Douglas' KDL reported here.

by alpopa, 4 years ago

Attachment: syslog.compilation-long added

comment:12 by alpopa, 4 years ago

New attachment when the system restarted after quite long compilation - syslog.compilation-long. It may be related to #15817.

by alpopa, 4 years ago

Attachment: kdl-vm.png added

by alpopa, 4 years ago

Attachment: sc-d-vm.png added

comment:13 by alpopa, 4 years ago

Retested in Qemu. KDL and result of sc -d are attached: kdl-vm.png and sc-d-vm.png.

by alpopa, 4 years ago

Attachment: kdl2-vm.png added

comment:14 by alpopa, 4 years ago

After KDL during compilation, I restarted the system and then new KDL occurred during booting, very similar to Douglas' one - kdl2-vm.png.

comment:15 by alpopa, 4 years ago

Haiku hrev54010 x86_64 is used in Qemu testing.

comment:16 by X512, 4 years ago

What Qemu version are you using? Does hardware acceleration enabled? If enabled, what virtualization system is used (KVM etc) and what CPU model is on host system?

comment:17 by alpopa, 4 years ago

I am using Qemu 4.2.0 for Windows. KVM is disabled, TCG is enabled. The host system is Intel Core i7 7820HQ 2.9 GHz with 32 GB RAM.

by alpopa, 4 years ago

Attachment: haiku.json added

comment:18 by alpopa, 4 years ago

Complete VM configuration is attached - haiku.json.

comment:19 by korli, 3 years ago

alpopa, any change to check this bug on a current nightly image? It could be fixed as of hrev55135.

comment:20 by alpopa, 3 years ago

Unfortunately, the KDL is still here. I will attach new logs and screenshots.

by alpopa, 3 years ago

Attachment: KDL-hrev55161.jpg added

KDL on hrev55161

by alpopa, 3 years ago

Attachment: syslog.hrev55161 added

Syslog on hrev55161 (hopefully it is relevant)

comment:21 by korli, 6 months ago

Please check with hrev57363 or newer

Note: See TracTickets for help on using tickets.