Opened 3 years ago

Closed 3 years ago

Last modified 4 weeks ago

#17448 closed bug (fixed)

KDL when checkout out stress-ng on 32bit

Reported by: Begasus Owned by: nobody
Priority: normal Milestone: R1/beta4
Component: System/Kernel Version: R1/beta3
Keywords: Cc:
Blocked By: Blocking: #17041
Platform: All

Description

Running stress-ng --cpu 4 --vm 2 --hdd 1 --fork 8 --timeout 2m --metrics in Terminal sends me into KDL

Attachments (1)

DSC_0387.JPG (3.5 MB ) - added by Begasus 3 years ago.

Change History (11)

by Begasus, 3 years ago

Attachment: DSC_0387.JPG added

comment:1 by waddlesplash, 3 years ago

Reproducible?

comment:2 by Begasus, 3 years ago

Yes, tried with stress-ng --cpu 2 --vm 2 --hdd 1 --fork 8 --timeout 2m --metrics as I only have 2 cpu's here, still send me into KDL

comment:3 by waddlesplash, 3 years ago

Component: - GeneralSystem/Kernel

comment:4 by waddlesplash, 3 years ago

This is reproducible on x86_64, though it seems a bit intermittently.

comment:5 by waddlesplash, 3 years ago

Dumping a team structure in the KDL:

TEAM: 0xffffffff86d36408
id:               637 (0x27d)
serial_number:    252
name:             'stress-ng'
args:             '/bin/stress-ng --cpu 4 --vm 2 --hdd 1 --fork 8 --timeout 2m --m'
hash_next:        0x0000000000000000
parent:           0xffffffff86d3f008 (id = 624)
children:         0x0000000000000000
num_threads:      1
state:            1
flags:            0x0
io_context:       0xffffffff86ee4c00
address_space:    0xffffffff86ac4ca0
user data:        0x00007f613b102000 (area 15547)
free user thread: 0x0000000000000000
main_thread:      0xffffffff86fdcfc0
thread_list:      0xffffffff86fdcfc0
group_id:         624
session_id:       411

Note that state=1, i.e. TEAM_STATE_BIRTH. That's a little unexpected, shouldn't it be TEAM_STATE_NORMAL at the point we enter userland?

comment:6 by waddlesplash, 3 years ago

The problem is that the process has no group, i.e. in LockTeamAndProcessGroup. Is that a legal state for a team to have? Because quite a lot of this code does not handle that case...

comment:7 by waddlesplash, 3 years ago

After making some other tweaks, it seems group_id is always non-zero while group is still NULL. So that indicates something else is going wrong, indeed.

The state being 1 is an oversight in fork_team and isn't related.

comment:8 by waddlesplash, 3 years ago

Ah, I think this occurs in the middle of the team destruction; the team structures above aren't the one actually being modified, and the one that is seems to be gone from the teams hash at the point we reach LockTeamAndProcessGroup.

comment:9 by waddlesplash, 3 years ago

Milestone: UnscheduledR1/beta4
Resolution: fixed
Status: newclosed

Fixed in hrev55886.

comment:10 by waddlesplash, 4 weeks ago

Blocking: 17041 added
Note: See TracTickets for help on using tickets.