Bug 107033 - Randomly black screen on boot
Summary: Randomly black screen on boot
Status: RESOLVED MOVED
Alias: None
Product: DRI
Classification: Unclassified
Component: DRM/Intel (show other bugs)
Version: DRI git
Hardware: x86-64 (AMD64) Linux (All)
: medium major
Assignee: Manasi
QA Contact: Intel GFX Bugs mailing list
URL:
Whiteboard: Triaged, ReadyForDev
Keywords:
Depends on:
Blocks:
 
Reported: 2018-06-25 14:59 UTC by Andriy Sa
Modified: 2019-11-29 17:48 UTC (History)
6 users (show)

See Also:
i915 platform: BSW/CHT
i915 features: display/eDP


Attachments
dmesg fail drm-tip (277.29 KB, text/plain)
2018-06-25 14:59 UTC, Andriy Sa
no flags Details
intel_reg_dumper on fail (83.37 KB, text/plain)
2018-06-25 15:01 UTC, Andriy Sa
no flags Details
vbios on fail (64.00 KB, application/octet-stream)
2018-06-25 15:17 UTC, Andriy Sa
no flags Details
intel_reg_dumper on OK (81.80 KB, text/plain)
2018-06-25 15:19 UTC, Andriy Sa
no flags Details
Xorg on fail (20.49 KB, text/plain)
2018-06-26 05:00 UTC, Andriy Sa
no flags Details
dpcd dump on monitor OK (1.00 MB, application/octet-stream)
2018-06-26 19:46 UTC, Andriy Sa
no flags Details
dmesg black screen, i915.disable_power_well=0 (93.98 KB, text/plain)
2018-07-01 20:06 UTC, Andriy Sa
no flags Details
Patch to increase the panel power sequencing delay (1.30 KB, patch)
2019-05-22 22:56 UTC, Manasi
no flags Details | Splinter Review

Description Andriy Sa 2018-06-25 14:59:30 UTC
Created attachment 140328 [details]
dmesg fail drm-tip

Hello,

The machine randomly start with black screen (there are park of this machines. Some of them work OK, some have issue and some have issue more often)
after again reboot it may be Ok or may be fail and start only after 3-th time
some times every restart is ok (10..20 in a row)

System work and I have access over ssh

According to dmesg, the eDP port can not turn on the power and switch to HDMI-A-2

[    2.418002] [drm:intel_dp_init_panel_power_sequencer_registers [i915]] panel power sequencer register settings: PP_ON 0x87d00001, PP_OFF 0x1f40001, PP_DIV 0x270f06
[    2.418059] [drm:edp_panel_vdd_on [i915]] Turning eDP port C VDD on
[    2.418114] [drm:edp_panel_vdd_on [i915]] PP_STATUS: 0x80000008 PP_CONTROL: 0xabcd000b
[    2.420630] [drm:intel_dp_aux_xfer [i915]] dp_aux_ch timeout status 0x70150064
[    2.423130] [drm:intel_dp_aux_xfer [i915]] dp_aux_ch timeout status 0x70150064
...
[    2.498094] [drm:drm_dp_dpcd_access [drm_kms_helper]] Too many retries, giving up. First error: -110
[    2.498096] [drm] failed to retrieve link info, disabling eDP


-- chipset: Intel(R) Celeron(R) CPU  N3060 
    Gigabyte Technology Co., Ltd. N3060TN-EM
    Intel Corporation Device 22b1 (rev 35) (prog-if 00 [VGA controller])
-- system architecture: x86_64
-- kernel: 4.10 4.16 4.18(drm-tip/2018-06-23)
-- Linux distribution: Ubuntu 16.04
-- Display connector: eDP

cat /sys/class/drm/card0/error - is empty

On normal boot
ls -l /sys/class/drm/card0/
total 0
drwxr-xr-x 5 root root    0 Jun 25 14:07 card0-DP-1
drwxr-xr-x 6 root root    0 Jun 25 14:07 card0-eDP-1
drwxr-xr-x 3 root root    0 Jun 25 14:07 card0-HDMI-A-1
...

On black screen 
ls -l /sys/class/drm/card0/
total 0
drwxr-xr-x 5 root root    0 Jun 25 13:04 card0-DP-1
drwxr-xr-x 3 root root    0 Jun 25 13:04 card0-HDMI-A-1
drwxr-xr-x 3 root root    0 Jun 25 13:04 card0-HDMI-A-2
...
Comment 1 Andriy Sa 2018-06-25 15:01:41 UTC
Created attachment 140329 [details]
intel_reg_dumper on fail
Comment 2 Andriy Sa 2018-06-25 15:17:46 UTC
Created attachment 140330 [details]
vbios on fail
Comment 3 Andriy Sa 2018-06-25 15:19:46 UTC
Created attachment 140331 [details]
intel_reg_dumper on OK
Comment 4 Andriy Sa 2018-06-26 05:00:26 UTC
Created attachment 140340 [details]
Xorg on fail
Comment 5 Jani Saarinen 2018-06-26 06:04:10 UTC
Has this worked before so being regression?
Comment 6 Andriy Sa 2018-06-26 08:22:25 UTC
Kernels older than 4.10 were not used, so there is no information about regression.
I checked 4.16 and 4.18, and the problem is present
I'll check older kernels.
Comment 7 Ville Syrjala 2018-06-26 13:57:13 UTC
Smells like some kind of semi-broken eDP->LVDS bridge chip again.

When the display works please do a 'dd if=/dev/drm_dp_aux0 of=dpcd.dump' and attach the resulting file to this bug. It should come out 1 MiB in size.
Comment 8 Andriy Sa 2018-06-26 19:46:54 UTC
Created attachment 140353 [details]
dpcd dump on monitor OK
Comment 9 Ville Syrjala 2018-06-26 20:37:03 UTC
(In reply to Andriy Sa from comment #8)
> Created attachment 140353 [details]
> dpcd dump on monitor OK

Absolutely nothing identifiable there :( Can you check the board to see which eDP->LVDS chip is mounted there? I would assume it's close by to the LVDS connector. I couldn't find any high res images of a N3060TN board (in fact I can't find anything really). I did find a picture of a N3160TN board but couldn't tell from that which chip it has.
Comment 10 Manasi 2018-06-28 00:47:27 UTC
From the logs it looks like the eDP panel is not turned on fully before link training and so you see the AUX timeouts.
Could you test it with i915.disable_power_well=0 boot parameter?

You can also tell if eDP is on if intel_digital_port_connected() is returning true at that point where you see aux timeouts.

Manasi
Comment 11 Andriy Sa 2018-07-01 20:04:05 UTC
(In reply to Ville Syrjala from comment #9)
> Absolutely nothing identifiable there :( Can you check the board to see
> which eDP->LVDS chip is mounted there? I would assume it's close by to the
> LVDS connector. I couldn't find any high res images of a N3060TN board (in
> fact I can't find anything really). I did find a picture of a N3160TN board
> but couldn't tell from that which chip it has.
Sorry, but it takes time to check the board.


(In reply to Manasi from comment #10)
> From the logs it looks like the eDP panel is not turned on fully before link
> training and so you see the AUX timeouts.
> Could you test it with i915.disable_power_well=0 boot parameter?
i915.disable_power_well=0 did not affect the problem

> You can also tell if eDP is on if intel_digital_port_connected() is
> returning true at that point where you see aux timeouts.
It looks like eDP is off at that point, intel_digital_port_connected() return false

I add additional debug print in drivers/gpu/drm/i915/intel_dp.c; intel_dp_aux_ch()
...
if (status & DP_AUX_CH_CTL_TIME_OUT_ERROR) {
        bool port_status = intel_digital_port_connected(dev_priv, dp_to_dig_port(intel_dp));
        DRM_DEBUG_KMS("dp_aux_ch intel_digital_port_connected = %d\n", port_status);
	DRM_DEBUG_KMS("dp_aux_ch timeout status 0x%08x\n", status);
...


on dmesg
...
[    2.333004] [drm:intel_dp_aux_ch [i915]] dp_aux_ch intel_digital_port_connected = 0
[    2.333055] [drm:intel_dp_aux_ch [i915]] dp_aux_ch timeout status 0x70150064
[    2.335550] [drm:intel_dp_aux_ch [i915]] dp_aux_ch intel_digital_port_connected = 0
[    2.335608] [drm:intel_dp_aux_ch [i915]] dp_aux_ch timeout status 0x70150064
[    2.335623] [drm:drm_dp_dpcd_access [drm_kms_helper]] Too many retries, giving up. First error: -110
[    2.335624] [drm] failed to retrieve link info, disabling eDP
...
Comment 12 Andriy Sa 2018-07-01 20:06:08 UTC
Created attachment 140417 [details]
dmesg black screen, i915.disable_power_well=0
Comment 13 Lakshmi 2018-08-26 05:32:12 UTC
Manasi, Ville attached logs are helpful to proceed further?
Comment 14 Manasi 2019-05-22 22:07:52 UTC
It could be the case of the eDP panels where the panel power sequencing delay needs to increase (increase T11/T12 delay).
@Ville, @Clint any thoughts?

Manasi
Comment 15 Manasi 2019-05-22 22:56:30 UTC
Created attachment 144326 [details] [review]
Patch to increase the panel power sequencing delay

Could you please test this patch to see if this fixes the black screen issue?
This increases the panel power cycle delay so that eDP panel gets sufficient time to turn on before link training
Comment 16 Lakshmi 2019-07-13 18:24:10 UTC
@Andriy, any feedback with the attached patch from Manasi?
Comment 17 Lakshmi 2019-08-27 10:39:26 UTC
(In reply to Lakshmi from comment #16)
> @Andriy, any feedback with the attached patch from Manasi?

No feedback from more than one month. Dropping the priority of the bug to Medium.

@Andriy, can you provide the feedback as requested?
Comment 18 Martin Peres 2019-11-29 17:48:05 UTC
-- GitLab Migration Automatic Message --

This bug has been migrated to freedesktop.org's GitLab instance and has been closed from further activity.

You can subscribe and participate further through the new bug through this link to our GitLab instance: https://gitlab.freedesktop.org/drm/intel/issues/124.


Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.