linux server lockup with slimserver referenced in the crash



bklaas
2005-07-13, 08:33
This morning my Linux machine locked up completely and had to be rebooted, the first time that has happened in about 5 years of use.

I'm not sure if this /var/log/messages dump is useful, but slimserver is referenced in the log.

I am using SlimServer Version: 6.0.2 - trunk to power two squeezeboxen, one wired, one wireless. At the time of the freeze, neither was playing and they were configured to sync with each other.
Server is a PIII/766MHz running the 2.6.9-1.681_FC3 kernel.

[root@shaggy ~]# cat /var/log/messages | grep '10:13:26'
Jul 13 10:13:26 shaggy kernel: Unable to handle kernel paging request at virtual address 005da358
Jul 13 10:13:26 shaggy kernel: printing eip:
Jul 13 10:13:26 shaggy kernel: 0211ce7c
Jul 13 10:13:26 shaggy kernel: *pde = 00000000
Jul 13 10:13:26 shaggy kernel: Oops: 0000 [#1]
Jul 13 10:13:26 shaggy kernel: Modules linked in: parport_pc lp parport autofs4 i2c_dev i2c_core sunrpc microcode sr_mod ide_scsi dm_mod sd_mod usb_storage scsi_mod joydev usblp uhci_hcd hw_random snd_ens1371 snd_rawmidi snd_seq_device snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd_page_alloc snd_ac97_codec snd soundcore gameport e100 mii floppy ext3 jbd
Jul 13 10:13:26 shaggy kernel: CPU: 0
Jul 13 10:13:26 shaggy kernel: EIP: 0060:[<0211ce7c>] Not tainted VLI
Jul 13 10:13:26 shaggy kernel: EFLAGS: 00010016 (2.6.9-1.681_FC3)
Jul 13 10:13:26 shaggy kernel: EIP is at remove_wait_queue+0xa/0xee
Jul 13 10:13:26 shaggy kernel: eax: 005da358 ebx: 005da358 ecx: 1c7f7050 edx: 1c7f7028
Jul 13 10:13:26 shaggy kernel: esi: 1c7f7028 edi: 00000216 ebp: 00004000 esp: 0bd3eedc
Jul 13 10:13:26 shaggy kernel: ds: 007b es: 007b ss: 0068
Jul 13 10:13:26 shaggy kernel: Process slimserver.pl (pid: 11399, threadinfo=0bd3e000 task=186fa630)
Jul 13 10:13:26 shaggy kernel: Stack: 1c7f7024 1c7f7000 00000000 0217adc3 186f4e60 00000000 0217b2d9 00002800
Jul 13 10:13:26 shaggy kernel: 00000000 000028f0 00000000 00000000 00000000 0000000e 000028f0 06b8fbec
Jul 13 10:13:26 shaggy kernel: 06b8fbe8 06b8fbe4 06b8fbf8 06b8fbf4 06b8fbf0 00000000 0000000e 00000000
Jul 13 10:13:26 shaggy kernel: Call Trace:
Jul 13 10:13:26 shaggy kernel: [<0217adc3>] poll_freewait+0x1a/0x38
Jul 13 10:13:26 shaggy kernel: [<0217b2d9>] do_select+0x3bf/0x3d3
Jul 13 10:13:26 shaggy kernel: [<0217ade1>] __pollwait+0x0/0x94
Jul 13 10:13:26 shaggy kernel: [<0215ee70>] get_user_size+0x30/0x57
Jul 13 10:13:26 shaggy kernel: [<0217b62a>] sys_select+0x32a/0x43e
Jul 13 10:13:26 shaggy kernel: [<02128c29>] update_wall_time+0x9/0x31
Jul 13 10:13:26 shaggy kernel: [<02124abb>] sys_gettimeofday+0x25/0x55
Jul 13 10:13:26 shaggy kernel: Code: <3>Debug: sleeping function called from invalid context at include/linux/rwsem.h:43
Jul 13 10:13:26 shaggy kernel: in_atomic():0[expected: 0], irqs_disabled():1
Jul 13 10:13:26 shaggy kernel: [<0211cbcb>] __might_sleep+0x7d/0x8a
Jul 13 10:13:26 shaggy kernel: [<0215e726>] rw_vm+0x20e/0x47a
Jul 13 10:13:26 shaggy kernel: [<0211ce51>] add_wait_queue_exclusive+0xbc/0xdd
Jul 13 10:13:26 shaggy kernel: [<0211ce51>] add_wait_queue_exclusive+0xbc/0xdd
Jul 13 10:13:26 shaggy kernel: [<0215ee70>] get_user_size+0x30/0x57
Jul 13 10:13:26 shaggy kernel: [<0211ce51>] add_wait_queue_exclusive+0xbc/0xdd
Jul 13 10:13:26 shaggy kernel: [<0210682b>] show_registers+0x109/0x15e
Jul 13 10:13:26 shaggy kernel: [<02106a2f>] die+0x14a/0x241
Jul 13 10:13:26 shaggy kernel: [<0211937e>] do_page_fault+0x0/0x511
Jul 13 10:13:26 shaggy kernel: [<0211937e>] do_page_fault+0x0/0x511
Jul 13 10:13:26 shaggy kernel: [<02119733>] do_page_fault+0x3b5/0x511
Jul 13 10:13:26 shaggy kernel: [<0211ce7c>] remove_wait_queue+0xa/0xee
Jul 13 10:13:26 shaggy kernel: [<022cec5e>] tcp_sendmsg+0xdb4/0xe50
Jul 13 10:13:26 shaggy kernel: [<0211b101>] recalc_task_prio+0x128/0x133
Jul 13 10:13:26 shaggy kernel: [<02306598>] schedule+0x478/0x5a4
Jul 13 10:13:26 shaggy kernel: [<0211937e>] do_page_fault+0x0/0x511
Jul 13 10:13:26 shaggy kernel: [<0211ce7c>] remove_wait_queue+0xa/0xee
Jul 13 10:13:26 shaggy kernel: [<0217adc3>] poll_freewait+0x1a/0x38
Jul 13 10:13:26 shaggy kernel: [<0217b2d9>] do_select+0x3bf/0x3d3
Jul 13 10:13:26 shaggy kernel: [<0217ade1>] __pollwait+0x0/0x94
Jul 13 10:13:26 shaggy kernel: [<0215ee70>] get_user_size+0x30/0x57
Jul 13 10:13:26 shaggy kernel: [<0217b62a>] sys_select+0x32a/0x43e
Jul 13 10:13:26 shaggy kernel: [<02128c29>] update_wall_time+0x9/0x31
Jul 13 10:13:26 shaggy kernel: [<02124abb>] sys_gettimeofday+0x25/0x55
Jul 13 10:13:26 shaggy kernel: Bad EIP value.
[root@shaggy ~]#
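
For anyone skimming the dump, the handful of fields that matter (faulting address, where EIP was, which process was running, and the call trace) can be pulled out with a short script. This is only a minimal sketch, assuming the syslog line format shown above; the function name `summarize_oops` and the regular expressions are made up for illustration and are not part of any kernel or SlimServer tool. The sample text is a few lines copied from the log above; a real run would read /var/log/messages instead.

```python
import re

# A few lines copied verbatim from the dump above (illustration only).
SAMPLE = """\
Jul 13 10:13:26 shaggy kernel: Unable to handle kernel paging request at virtual address 005da358
Jul 13 10:13:26 shaggy kernel: EIP: 0060:[<0211ce7c>] Not tainted VLI
Jul 13 10:13:26 shaggy kernel: EIP is at remove_wait_queue+0xa/0xee
Jul 13 10:13:26 shaggy kernel: Process slimserver.pl (pid: 11399, threadinfo=0bd3e000 task=186fa630)
Jul 13 10:13:26 shaggy kernel: [<0217adc3>] poll_freewait+0x1a/0x38
Jul 13 10:13:26 shaggy kernel: [<0217b2d9>] do_select+0x3bf/0x3d3
"""

# Hypothetical patterns, written against the line format seen in this log.
FAULT_RE = re.compile(r"kernel paging request at virtual address ([0-9a-f]+)")
EIP_RE = re.compile(r"EIP is at (\S+)")
PROC_RE = re.compile(r"Process (\S+) \(pid: (\d+)")
# Require the symbol+0xOFF/0xLEN form so the "EIP: 0060:[<...>] Not tainted"
# line is not mistaken for a stack frame.
FRAME_RE = re.compile(r"\[<[0-9a-f]+>\] (\w+)\+0x[0-9a-f]+/0x[0-9a-f]+")


def summarize_oops(text):
    """Collect the key fields of a kernel oops from syslog text."""
    summary = {"fault_address": None, "eip": None,
               "process": None, "pid": None, "frames": []}
    for line in text.splitlines():
        m = FAULT_RE.search(line)
        if m:
            summary["fault_address"] = m.group(1)
            continue
        m = EIP_RE.search(line)
        if m:
            summary["eip"] = m.group(1)
            continue
        m = PROC_RE.search(line)
        if m:
            summary["process"] = m.group(1)
            summary["pid"] = int(m.group(2))
            continue
        m = FRAME_RE.search(line)
        if m:
            # Frames are kept in the order they appear in the log.
            summary["frames"].append(m.group(1))
    return summary


if __name__ == "__main__":
    for key, value in summarize_oops(SAMPLE).items():
        print(key, "=", value)
```

Note that the process named in an oops is only whichever task was running when the kernel faulted, so the summary shows where the fault surfaced rather than what caused it.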

stinkingpig
2005-07-16, 18:57
bklaas wrote:
> this morning my linux machine locked up completely and had to be
> rebooted, marking the first time in about 5 years of use that has
> happened.
>
> I'm not sure if this /var/log/messages dump is useful, but slimserver
> is referenced in the log.
>
> I am using SlimServer Version: 6.0.2 - trunk to power two squeezeboxen,
> one wired, one wireless. At the time of the freeze, neither was playing
> and they were configured to sync with each other.
> Server is a PIII/766MHz running the 2.6.9-1.681_FC3 kernel.
>
> [root@shaggy ~]# cat /var/log/messages | grep '10:13:26'
> Jul 13 10:13:26 shaggy kernel: Unable to handle kernel paging request
> at virtual address 005da358

As I'm sure you know after 5 years, Perl is a user-space process and
very unlikely to crash any OS. However, it is certainly possible for the
load generated during a database scan to exacerbate a physical
heat-dissipation, memory or disk problem, which is what I'd look for here.

--
Jack at Monkeynoodle dot Org: It's a Scientific Venture...
Riding the Emergency Third Rail Power Trip since 1996!