Summary: | [BDW ppgtt Bisected]igt/pm_rps/reset sporadically causes system hang | ||||||
---|---|---|---|---|---|---|---|
Product: | DRI | Reporter: | lu hua <huax.lu> | ||||
Component: | DRM/Intel | Assignee: | Nick Hoath <nicholas.hoath> | ||||
Status: | CLOSED FIXED | QA Contact: | Intel GFX Bugs mailing list <intel-gfx-bugs> | ||||
Severity: | critical | ||||||
Priority: | highest | CC: | intel-gfx-bugs, przanoni | ||||
Version: | unspecified | ||||||
Hardware: | All | ||||||
OS: | Linux (All) | ||||||
Whiteboard: | |||||||
i915 platform: | i915 features: | ||||||
Attachments: |
|
Description
lu hua
2015-01-21 07:26:22 UTC
b8d24a06568368076ebd5a858a011699a97bfa42 is the first bad commit.

commit b8d24a06568368076ebd5a858a011699a97bfa42
Author:     Mika Kuoppala <mika.kuoppala@linux.intel.com>
AuthorDate: Wed Jan 28 17:03:14 2015 +0200
Commit:     Daniel Vetter <daniel.vetter@ffwll.ch>
CommitDate: Thu Jan 29 18:03:07 2015 +0100

    drm/i915: Remove nested work in gpu error handling

    Now when we declare gpu errors only through our own dedicated hangcheck workqueue there is no need to have a separate workqueue for handling the resetting and waking up the clients as the deadlock concerns are no more.

    The only exception is i915_debugfs::i915_set_wedged, which triggers error handling through process context. However as this is only used through test harness it is responsibility for test harness not to introduce hangs through both debug interface and through hangcheck mechanism at the same time.

    Remove gpu_error.work and let the hangcheck work do the tasks it used to.

    v2: Add a big warning sign into i915_debugfs::i915_set_wedged (Chris)

    Cc: Chris Wilson <chris@chris-wilson.co.uk>
    Signed-off-by: Mika Kuoppala <mika.kuoppala@intel.com>
    Reviewed-by: Chris Wilson <chris@chris-wilson.co.uk>
    Signed-off-by: Daniel Vetter <daniel.vetter@ffwll.ch>

(In reply to Ding Heng from comment #1)
> b8d24a06568368076ebd5a858a011699a97bfa42 is the first bad commit.
> commit b8d24a06568368076ebd5a858a011699a97bfa42
> Author: Mika Kuoppala <mika.kuoppala@linux.intel.com>
> AuthorDate: Wed Jan 28 17:03:14 2015 +0200
> Commit: Daniel Vetter <daniel.vetter@ffwll.ch>
> CommitDate: Thu Jan 29 18:03:07 2015 +0100
>
> drm/i915: Remove nested work in gpu error handling

It's a separate bug, reported as bug 88928.

I tested the drm-intel-nightly kernel (98592c_20150122) with i915.enable_execlists=0, and it works well. On the latest drm-intel-nightly kernel (8b4216_20150203), the test hits bug 88928. With i915.enable_ppgtt=0 added, it works well. Bisected it:

There are only 'skip'ped commits left to test.
The first bad commit could be any of:
2d12955a3e539f0938b4b90d1eade852105ba290
72f95afa5faaf899f7344879b6ccd5f0cb271b28
We cannot bisect more!

Test failure should be fixed by the same fix as bug 88096. Warning trace should probably be investigated separately.

The patch mentioned in comment #6 is merged now. Can you please retest against -nightly?

Passes with the latest nightly branch, b18ca534ab790c19aefe8ecbec46d1bc7a31ce1e (2015-02-26). Changing state to verified.

Closing old verified.
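For readers unfamiliar with the "There are only 'skip'ped commits left to test" result reported in this thread, the following is a minimal sketch of why a bisection that has to skip untestable commits ends with a list of candidates rather than a single commit. It assumes a linear history and is not git's actual algorithm; the function name `bisect_with_skips` and the commit labels `c0` to `c5` are hypothetical and chosen only for illustration.

```python
from typing import Callable, List


def bisect_with_skips(commits: List[str], test: Callable[[str], str]) -> List[str]:
    """Return the candidate first-bad commits.

    `commits` is ordered oldest to newest; commits[0] is known good and
    commits[-1] is known bad. `test` returns "good", "bad", or "skip".
    """
    good, bad = 0, len(commits) - 1
    skipped = set()
    while True:
        # Commits strictly between the newest good and the oldest bad
        # that have not been marked as untestable.
        untested = [i for i in range(good + 1, bad) if i not in skipped]
        if not untested:
            # Only skipped commits remain: the culprit is one of them
            # or the oldest commit known to be bad.
            return commits[good + 1 : bad + 1]
        mid = untested[len(untested) // 2]
        verdict = test(commits[mid])
        if verdict == "good":
            good = mid
        elif verdict == "bad":
            bad = mid
        else:
            skipped.add(mid)


# Hypothetical history: c3 cannot be tested (for example, it does not boot).
history = ["c0", "c1", "c2", "c3", "c4", "c5"]
verdicts = {"c0": "good", "c1": "good", "c2": "good",
            "c3": "skip", "c4": "bad", "c5": "bad"}
candidates = bisect_with_skips(history, verdicts.__getitem__)
print(candidates)
```

In this sketch the search narrows to the skipped commit and the first commit known to be bad, which mirrors the two-commit candidate list in the bisect output above.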