Summary: | [r600g, bisected] regression: NI/Turks WebGL (FishGL) massive speed decrease ~33% | ||
---|---|---|---|
Product: | Mesa | Reporter: | Dieter Nützel <Dieter> |
Component: | Drivers/Gallium/r600 | Assignee: | Default DRI bug account <dri-devel> |
Status: | RESOLVED FIXED | QA Contact: | Default DRI bug account <dri-devel> |
Severity: | normal | ||
Priority: | medium | CC: | maraeo, mario.kleiner, nhaehnle |
Version: | git | ||
Hardware: | x86-64 (AMD64) | ||
OS: | Linux (All) | ||
Whiteboard: | |||
i915 platform: | i915 features: |
Description
Dieter Nützel
2016-08-10 04:12:46 UTC
Argh. http://www.fishgl.com/ Should read (the other way around): look at the aquarium: ~30 fps -> ~20 fps and below look from inside (diver): ~60 fps -> ~40 fps All the other stuff stays. SOLVED with Mario's commit 2cc880c If I revert this speed is BAD as with Nicolai's 3735a925ef5692c836c4d26d6adee370dae1c2b0 commit. commit 2cc880cba54d687a122298c8187ecc31b4a0ee2d Author: Mario Kleiner <mario.kleiner.de@gmail.com> Date: Fri Aug 26 18:59:05 2016 +0200 r600: increase performance for DRI PRIME offloading if 2nd GPU is Evergreen+ This is a direct port of Marek Olšáks patch "radeonsi: increase performance for DRI PRIME offloading if 2nd GPU is CIK or VI" to r600. It uses SDMA for the detiling blit from renderoffload VRAM to GTT, as SDMA is much faster for tiled->linear blits from VRAM to GTT. Testing on a dual Radeon HD-5770 setup reduced the time for the render offload gpu to get its rendering into system RAM from approximately 16 msecs for simple rendering at 1920x1080 pixel 32 bpp to 5 msecs, a > 3x speedup! This was measured using ftrace to trace the time the radeon kms driver waited on the dmabuf fence of the renderoffload gpu to complete. All in all this brought the time for a flip down from 20 msecs to 9 msecs, so the prime setup can display at full 60 fps instead of barely 30 fps vsync'ed. The current r600 implementation supports SDMA on Evergreen and later, but not R600/R700 due to some bugs apparently present in their SDMA implementation. Signed-off-by: Mario Kleiner <mario.kleiner.de@gmail.com> Cc: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Marek Olšák <marek.olsak@amd.com> :040000 040000 16967e652cc0708f670ab8b6d63e5eb629fbd6a0 e62fa916bd1706eb1d61975765d77d76cfae0fd2 Msrc So I'm somewhat unsure if I should close this. Mario, Marek, Nicolai could it be that we get another boost if both patches 'work together'? Does reverting / disabling Nicolai's change still increase performance? (In reply to Michel Dänzer from comment #3) Hello Michel, sorry that I haven't had you on my radar...,-) > Does reverting / disabling Nicolai's change still increase performance? I've checked. (Only disabled like above 'cause it do not revert clean.) No, not that I can measure (with fishgl / digikam / Blender / FreeCAD). So I tend to say really NO. But the whole system feels snappier than ever since Mario's commit. Writing this with Nicolai's change disabled... I need some sleep. Then let's call this fixed. |
Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.