Note: This is a public test instance of Red Hat Bugzilla. The data contained within is a snapshot of the live data so any changes you make will not be reflected in the production Bugzilla. Email is disabled so feel free to test any aspect of the site that you want. File any problems you find or give feedback at bugzilla.redhat.com.

Bug 100691

Summary: kernel-smp unstable - apps crash and/or oopses
Product: [Retired] Red Hat Linux Beta Reporter: Tom Wood <woodt>
Component: kernelAssignee: Dave Jones <davej>
Status: CLOSED WORKSFORME QA Contact: Brian Brock <bbrock>
Severity: high Docs Contact:
Priority: medium    
Version: beta1CC: ivo, pfrields, riel, wtogami
Target Milestone: ---   
Target Release: ---   
Hardware: i386   
OS: Linux   
Whiteboard:
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2003-10-21 20:14:15 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On:    
Bug Blocks: 100643    
Attachments:
Description Flags
output of "lspci -v"
none
oops number 1
none
oops none

Description Tom Wood 2003-07-24 13:06:45 UTC
From Bugzilla Helper:
User-Agent: Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.4) Gecko/20030703

Description of problem:
SMP kernel unstable.  Randomly placed oopses and/or application segfaults when
able to boot fully.  Cannot duplicate any single instance reliably, but can get
errors on demand.  Single processor kernel works fine on same machine.  ACPI
on/off, pci=noacpi doesn't matter.  Motherboard is a Tyan Tiger MPX with USB
support on the mobo (earlier Tiger MPX mobos had no USB support because 760MPX
chipset is broken w.r.t. USB).  BIOS version is 4.0.5, which is the latest and
greatest at the time of this writing.

Version-Release number of selected component (if applicable):
kernel-smp-2.4.21-20.1.2024.2.1.nptl

How reproducible:
Always

Steps to Reproduce:
1. Boot SMP kernel on Tyan Tiger MPX
2. Experience random oopses during boot and/or app failures afterward.
3.
    

Actual Results:  Seemingly random oopses and segfaults.

Expected Results:  No problems.

Additional info:

Attaching lspci output

Comment 1 Tom Wood 2003-07-24 13:07:42 UTC
Created attachment 93102 [details]
output of "lspci -v"

Comment 2 Bill Nottingham 2003-07-28 20:35:58 UTC
Can you post some of the oopses/crashes?

Comment 3 Tom Wood 2003-07-30 20:01:07 UTC
Can post oopses but can't run them through ksymoops because ksymoops isn't
included in serern.  Will the stock one mentioned in the ksymoops README work
for this beta kernel?

Comment 4 Arjan van de Ven 2003-07-30 20:03:11 UTC
the kernel itself will ksymoops the oopses, if you do it by hand they just get
corrupted

Comment 5 Tom Wood 2003-07-30 20:04:01 UTC
Created attachment 93271 [details]
oops number 1

oops in ext3_write_super - looks bad, fellas.  Only occurs when booted into SMP
kernel.  System ran without a hitch for a week with UP kernel.

Comment 6 Tom Wood 2003-07-30 20:04:51 UTC
Created attachment 93272 [details]
oops

oops in ext3_new_block - another nasty looking beast.

Comment 7 Tom Wood 2003-08-20 14:49:15 UTC
Installing RHES on this same machine generated a CPU1 machine check exception. 
So, I suspect that the hardware is to blame - probably the second CPU is toast
(not hard to do with exposed silicon without heatspreader protection and a
cranky heatsink mechanism).  It would be nice if the Severn/Cambridge kernels
also trapped these exceptions well enough to report this.

Comment 8 Bill Nottingham 2003-10-21 20:14:15 UTC
Closing as working for us, then.