[sf-lug] Pointers for how to track down source of system hang

Charles N Wyble charles at thewybles.com
Thu Jan 31 15:19:03 PST 2008



Kristian,

I can say unequivocally that I *am* using the proprietary nvidia 
driver.  This is to provide the 3D support that Compiz requires.  I can 
certainly try running the system without it installed for some period of 
time.

[Charles N Wyble] 

I believe in a previous message you stated that you saw the hang even with
Compiz turned off? Is the hardware 32 or 64 bit and if 64bit are you running
a 
32 or 64 bit version of Ubuntu? 


As to the gap time between faults, I can only give a rough estimate, as 
it's variable.  Some times the system hangs multiple time in one day 
with a gap time ranging from < an hour to > 4+ hours.  However, other 
times the system remains stable for several days.

[Charles N Wyble] 

This sounds like it may be a hardware issue especially the fact that it
doesn't respond to sysrq sequences. Although this may be due to the kernel
not supporting those. 

May I recommend the following page:
https://help.ubuntu.com/community/DebuggingProcedures

also 

https://wiki.ubuntu.com/LinuxKernelCrashDumpSpec 
http://packages.ubuntu.com/gutsy/utils/crash
https://wiki.ubuntu.com/KernelTeamBugPolicies

may be of interest.

Installing the network crash dump utility is highly recommended. 
Also see 
http://packages.ubuntu.com/dapper/devel/lcrash


You may also find Oprofile useful.

I know lots of info. :) 

Linux crash reporting/handling isn't the best unfortunately. 







As of today I've gone back to running without Compiz but have re-enabled 
the audio (it's a necessity for some of the development work I'm doing) 
and thus far it's remained stable, but that's no indication of whether 
or not it will remain so.

[Charles N Wyble] 

Interesting. Are you running a Xen kernel by chance? 


As an aside, it has at times remained very stable over as long as a week 
or more with both Compiz running and the sound driver installed, which 
makes me think it's going to be something else.  The key for me is to be 
sure I'm checking all relevant logs and that I have all possible logging 
enabled.  I've stated the four (4) files I'm aware of, but would love 
feedback on other sources of information.

 [Charles N Wyble] 

Well the system logs are useful for a lot of things. However a kernel
crash (called an oops) requires a fair amount of special monitoring
tools. I wish all the distros came with it turned on and configured
out of the box. 

Sorry for the brain dump format of this e-mail. I hope to get the
Info out there and spark more discussion. 





More information about the sf-lug mailing list