Opened 8 years ago
Closed 6 years ago
#13191 closed bug (fixed)
[Flurry] screensaver crashes in __strtod_internal
Reported by: | diver | Owned by: | waddlesplash |
---|---|---|---|
Priority: | normal | Milestone: | R1/beta1 |
Component: | Add-Ons/Screen Savers | Version: | R1/Development |
Keywords: | Cc: | kallisti5 | |
Blocked By: | Blocking: | #14023 | |
Platform: | All |
Description
hrev50849 x86_64.
Selecting Flurry screensaver crashes ScreenSaver preflet.
Not reproducible in x86_gcc2.
Attachments (16)
Change History (59)
by , 8 years ago
Attachment: | ScreenSaver-9243-debug-10-01-2017-18-16-42.report added |
---|
comment:1 by , 8 years ago
Owner: | changed from | to
---|---|
Status: | new → assigned |
by , 8 years ago
Attachment: | ScreenSaver-788-debug-16-01-2017-19-04-05.report added |
---|
Another crash report.
comment:2 by , 7 years ago
This is because the FlurryView::AttachedToWindow()
logic looks like the following:
void FlurryView::AttachedToWindow() { LockGL(); BGLView::AttachedToWindow(); // Calls UnlockGL() // Do stuff assuming a context is active }
When glGetString(GL_VERSION)
is called, it returns a null string. The GLife screensaver has the same problem, but instead of crashing it just renders incorrectly.
For both Flurry and GLife, moving the LockGL
call *after* the call to BGLView::AttachedToWindow
fixes the problem.
I'll try to create a patch tomorrow.
by , 7 years ago
Attachment: | 0001-Ensure-screen-savers-have-an-active-GL-context.patch added |
---|
comment:3 by , 7 years ago
patch: | 0 → 1 |
---|
comment:4 by , 7 years ago
(Maybe BGLView::AttachedToWindow()
shouldn't unconditionally unlock the GL context; I can persue that route instead if it's preferable)
comment:6 by , 7 years ago
The BeBook seems to indicate that the existing behavior of the screensavers should be correct, meaning my patch should be rejected. The current BGLView::AttachedToWindow()
locking logic was added to Haiku in fafc6e4b776ba7c9eff20696976d9fdb85113682 (2010) without explanation.
There are two options I can see:
- Revert that part of the change, making Haiku match BeOS more closely based on the BeBook's code samples, but breaking applications written to depend on Haiku's current behavior.
- Only call
UnlockGL()
iff the initial state ofBGLView::fDisplayLocker::IsLocked()
is false. That is, restore the original state right after setting up the glViewport.
I'll pursue the second option as a pull request to the mesa package in haikuports, if that's the right procedure?
comment:7 by , 7 years ago
patch: | 1 → 0 |
---|
comment:8 by , 7 years ago
Cc: | added |
---|
CC'ing kallisti5 as he's in charge of Mesa-related stuff.
Yes, that's probably correct. Note that there are 2 versions of Mesa though (Mesa 7 for GCC2 and current Mesa for GCC5+) and both will need to be patched I think.
comment:9 by , 7 years ago
Aren't LockGL() and UnlockGL() recursive locks? (sorry, I'm not familiar with the API)
If they are, then there should be no problem here - AttachedToWindow could safely call LockGL then UnlockGL which would restore the initial state of the lock.
A patch was submitted here: https://github.com/haikuports/haikuports/pull/2070/files Submitting directly to github.com/haiku/mesa (for gcc2) and upstream mesa (for gcc5) would be better than having the patch in haikuports.
comment:10 by , 7 years ago
LockGL()
and UnlockGL()
currently don't behave recursively. I think they probably should?
I appreciate everybody taking the time to help me muddle my way through what a proper fix for this looks like :D
comment:11 by , 7 years ago
The Be Book does not specify whether or not they are recursive. Guess someone needs to fire up BeOS and find out...
comment:12 by , 7 years ago
Let's dig out that BeOS machine. Maybe someone has code I could easily test?
comment:13 by , 7 years ago
Blocking: | 14023 added |
---|
comment:14 by , 7 years ago
I can assist work on a 'clean room' Mesa 17.1.10 re-implementation for Haiku - if we all come to the same conclusions.
comment:15 by , 7 years ago
The core problem is people keep bumping the version of Mesa in our Haikuports repository without testing if it actually works. Mesa 12-15 were stable, but without the bandwith to work on it bugs creep in.
I've been trying to keep up, but real life and all the other Haiku work I have get in the way.
The SCons build system we use in Mesa is slowly being decommissioned. I've added Haiku support to their new Meson build system... but haven't had time to track down all the bugs.
Help wanted!
And *please* don't start talking of forking Mesa. The project moves fast... lets just not bump versions until we know that the version we bump to is stable + functional :-)
comment:16 by , 7 years ago
We need a release branch at haikuports and not let random people push/merge changes to it; at least for packages required for Haiku to run.
comment:18 by , 7 years ago
Issue (Haiku hrev51877 x86_64, Mesa 17.1.7-7):
Flurry - crashes before Screensaver preview. GLinfo - Segmentation fault on exit with Mesa 17.1.7 GLife - Misrendering (cubes larger than normal) Gravity - Misrendering (cubes larger than normal)
by , 7 years ago
Attachment: | GLInfo-647-debug-14-04-2018-14-20-17.report added |
---|
GLInfo crash on exit
by , 7 years ago
Attachment: | ScreenSaver-858-debug-14-04-2018-14-23-28.report added |
---|
Flurry crash on hrev51877 x86_64
comment:19 by , 7 years ago
See attachment: https://dev.haiku-os.org/attachment/ticket/13191/haiku_flurry_x64.jpeg
Confirming that GLife, GL Info, and Flurry all work, exit, and render properly on Haiku hrev49856 x86_64:
Software Rasterizer Mesa Project 2.1 Mesa 10.5.2 GLU 1.3, GLU 1.5
IMPORTANT: Nightly ISO x86_64 image builds with Mesa 17.1.7 have the issue.
by , 7 years ago
Attachment: | haiku_flurry_x64.jpeg added |
---|
Haiku hrev49856 x86_64 w/Mesa 10.5.2 working properly with GLife, GL Info, and Flurry.
by , 7 years ago
Attachment: | ken-haiku51904-mesa-17.1.png added |
---|
Mesa-swrast review on Haiku hrev51904 x86_64
follow-up: 22 comment:21 by , 7 years ago
Should we downgrade Mesa to 10.5.2 for beta1 to get rid of crashing bugs in the release?
comment:23 by , 7 years ago
Should we downgrade Mesa to 10.5.2 for beta1 to get rid of crashing bugs in the release?
Yes.
10.5.2 was the last time I had a lot of free time to work on Mesa. Someone has been blindly bumping the version we use and not testing it.
Mesa moves fast, and we use a lot of non-public API calls to get the bare minimum done for rendering. The Mesa guys do their best with Haiku support, but without a full-time dev Haiku falls behind.
comment:24 by , 7 years ago
The reason we've been updating Mesa "blindly" is because of LLVM breaking its API with every major release. Mesa keeps up, but the old version will of course not work; so in order to go back we'd need to patch Mesa 10 to work with current LLVM. From the patches I've read, that is probably not an easy thing to do at all...
comment:25 by , 7 years ago
The Mesa 17.1.10 library is fine and tested for stability. Built on Haiku x86_gcc2 and x86_64.
comment:26 by , 7 years ago
Actually, it looks like this PR fixes the issue: https://github.com/haikuports/haikuports/pull/2070
comment:27 by , 7 years ago
Since the Be Book didn't say whether or not [Un]LockGL is recursive, and there's no way to determine if a BGLView is locked or not, I looked at the disassembly of R5's libGL. It seems that it's unlocking BLockers; probably these BLockers. However, it also seems that the context is not switched on UnlockGL, only on LockGL, or in other words, the last context stays current until a different LockGL is called.
But my assembly-fu is poor, so that could also be incorrect. At any rate, it seems a recursive lock is the way to go.
comment:28 by , 7 years ago
It seems that TinyGL's implementation of GLView was a recursive lock also: https://github.com/mojaves/TinyGL/blob/master/BeOS/GLView.cpp#L37
comment:29 by , 7 years ago
Ha you used my R5 headers! I wouldn't necessarily follow BeOS's implementation here in terms of locking.
comment:30 by , 7 years ago
We have to follow R5 closely, otherwise we run into compatibility problems, as people often used the locking in er... "creative" ways: unlocking from a different thread, recursive locking, ...
Rather than disassembling, we could also write a test app and run it on both R5 and Haiku. If you have something to test, I still have an R5 machine around for that purpose (as I already mentionned a few years ago in comment 12, but I still don't have any code to test :)).
comment:31 by , 7 years ago
Ref: #6400 - Upon program exit, some OpenGL programs not closing same thread as opened.
comment:33 by , 7 years ago
in morph3d.c, you might want to change #include "glut_wrapper.h"
to #include <GL/freeglut.h>
if testing.
by , 7 years ago
Attachment: | flurry_haiku-hrev52138_x64-cocobean.jpeg added |
---|
Haiku hrev52138 x86_64 w/Mesa 17.1.10 working properly with GLife, Gravity, and Flurry screensavers (swrast driver)
by , 7 years ago
Attachment: | flurry_haiku-hrev52138_x64_test2_cocobean.jpeg added |
---|
comment:35 by , 6 years ago
Please stop adding useless comments. The issue is already well-understood and only needs someone to sit down and code a fix.
comment:36 by , 6 years ago
+1. We know sw_rast works. It also can't render much more than a teapot. (which is why I dropped it in favor of our llvmpipe driver)
The issue is the llvmpipe driver uses some internal hooks and private void* which have changed in recent releases. The mesa guys have been great to try and "tiptoe" around the Haiku code, but without a dedicated developer, it falls apart within a year or two.
I have a branch here working to improve the llvmpipe situation using their new API's: https://gitlab.freedesktop.org/kallisti5/mesa/commits/hgl-modernize-drawable
Currently, things crash though. Help always welcome.
Testing process:
- git clone https://gitlab.freedesktop.org/kallisti5/mesa.git
- git branch -u origin/hgl-modernize-drawable
- git checkout hgl-modernize-drawable
- scons
- find libGL.so in the build directory and put in ~/config/non-packaged/lib/
- mkdir ~/config/non-packaged/add-ons/opengl
- find libswpipe.so in the build directory and put in ~/config/non-packaged/add-ons/opengl/
- run GLInfo
- debug :-)
comment:37 by , 6 years ago
@kallisti5 - Applied patches and built Mesa 18.2 (git). I reviewed the screensavers per the 'Be Advanced Topics' specs and was able to fix them to work with mesa-swpipe by doing something similar to i80and's approach. All the other GL apps I tested properly exit and kill app threads without the BeOS-specific code.
Tested with Mesa mesa-swpipe 17.1.10, 18.1.5, & 18.2 (git)
by , 6 years ago
Flurry screensaver for Haiku hrev52153 x86_64 (fully tested/working)
comment:39 by , 6 years ago
patch: | 0 → 1 |
---|
by , 6 years ago
Attachment: | haiku-flurry.patch added |
---|
Flurry screensaver patch based on i80and patch (tested on Haiku x86_64, mesa-swpipe 17.1.10)
comment:40 by , 6 years ago
Again, please stop "testing". The problem is already well understood, and I just need to take the time to patch Mesa. i80and's approach is partially correct but not entirely so; patching Flurry to "workaround" such issues is not acceptable.
comment:41 by , 6 years ago
Milestone: | Unscheduled → R1/beta1 |
---|---|
Owner: | changed from | to
Platform: | x86-64 → All |
comment:42 by , 6 years ago
Status: | assigned → in-progress |
---|
comment:43 by , 6 years ago
Resolution: | → fixed |
---|---|
Status: | in-progress → closed |
It looks like it crashes here https://cgit.freedesktop.org/mesa/glu/tree/src/libutil/mipmap.c#n3414