Kernel crash?

Rene Scholz mrz at informatik.uni-jena.de
Wed May 30 10:56:01 CEST 2001


The new web server stopped working yesterday, which I only noticed during the night.

The following is from syslog.0:



May 28 02:36:01 tantalus /USR/SBIN/CRON[16409]: (www) CMD (/usr/lib/cgi-bin/guestbook.pl -b -v >$HOME/guestbook.log)
May 28 02:40:01 tantalus /USR/SBIN/CRON[16412]: (www) CMD (/usr/lib/cgi-bin/guestbook.pl -b -v >$HOME/guestbook.log)
May 28 02:44:01 tantalus /USR/SBIN/CRON[16416]: (www) CMD (/usr/lib/cgi-bin/guestbook.pl -b -v >$HOME/guestbook.log)
May 28 02:48:01 tantalus /USR/SBIN/CRON[16419]: (www) CMD (/usr/lib/cgi-bin/guestbook.pl -b -v >$HOME/guestbook.log)
May 28 02:52:01 tantalus /USR/SBIN/CRON[16422]: (www) CMD (/usr/lib/cgi-bin/guestbook.pl -b -v >$HOME/guestbook.log)
May 28 02:56:01 tantalus /USR/SBIN/CRON[16426]: (www) CMD (/usr/lib/cgi-bin/guestbook.pl -b -v >$HOME/guestbook.log)
May 28 03:00:01 tantalus /USR/SBIN/CRON[16429]: (www) CMD (/usr/lib/cgi-bin/guestbook.pl -b -v >$HOME/guestbook.log)
May 28 03:04:01 tantalus /USR/SBIN/CRON[16434]: (root) CMD (find / -name core -type f > /root/find.core.log)
May 28 03:04:01 tantalus /USR/SBIN/CRON[16436]: (www) CMD (/usr/lib/cgi-bin/guestbook.pl -b -v >$HOME/guestbook.log)
May 28 03:04:05 tantalus kernel: Unable to handle kernel paging request at virtual address 0006b7d1
May 28 03:04:05 tantalus kernel: current->tss.cr3 = 025b9000, %%cr3 = 025b9000
May 28 03:04:05 tantalus kernel: *pde = 00000000
May 28 03:04:05 tantalus kernel: Oops: 0000
May 28 03:04:05 tantalus kernel: CPU:    0
May 28 03:04:05 tantalus kernel: EIP:    0010:[vsprintf+861/1240]
May 28 03:04:05 tantalus kernel: EFLAGS: 00010097
May 28 03:04:05 tantalus kernel: eax: 0006b7d1   ebx: ffffffff   ecx: 0006b7d1   edx: fffffffe
May 28 03:04:05 tantalus kernel: esi: c02fdd2d   edi: c6b37e64   ebp: 00000000   esp: c6b37e00
May 28 03:04:05 tantalus kernel: ds: 0018   es: 0018   ss: 0018
May 28 03:04:05 tantalus kernel: Process find (pid: 16435, process nr: 60, stackpage=c6b37000)
May 28 03:04:05 tantalus kernel: Stack: cc81a260 00000001 ffffffff 00000000 c6b37e28 0000004e c724a370 c0115635
May 28 03:04:05 tantalus kernel:        c02fdd03 c0268b85 c6b37e64 c96edd38 c937aca7 cc81a260 00000001 cb0e8798
May 28 03:04:05 tantalus kernel:        c724a350 00000296 00000286 c96edd38 c01333bc c0268b64 c937aca7 c96edd90
May 28 03:04:06 tantalus kernel: Call Trace: [printk+33/360] [cprt+281/17985] [dput+308/324] [cprt+248/17985] [prune_dcache+234/272] [try_to_free_inodes+188/252] [__brelse+19/84]
May 28 03:04:06 tantalus kernel:        [grow_inodes+32/408] [get_new_inode+185/292] [iget4+113/124] [iget+17/24] [ext2_lookup+90/140] [real_lookup+77/160] [lookup_dentry+268/428] [__namei+38/88]
May 28 03:04:06 tantalus kernel:        [sys_newlstat+13/96] [system_call+52/56] [startup_32+43/285]
May 28 03:04:06 tantalus kernel: Code: 80 38 00 74 07 40 4a 83 fa ff 75 f4 29 c8 89 44 24 10 f7 c5
May 28 03:08:01 tantalus /USR/SBIN/CRON[16439]: (www) CMD (/usr/lib/cgi-bin/guestbook.pl -b -v >$HOME/guestbook.log)
May 28 03:12:01 tantalus /USR/SBIN/CRON[16442]: (www) CMD (/usr/lib/cgi-bin/guestbook.pl -b -v >$HOME/guestbook.log)
May 28 03:16:01 tantalus /USR/SBIN/CRON[16446]: (www) CMD (/usr/lib/cgi-bin/guestbook.pl -b -v >$HOME/guestbook.log)
May 28 03:20:01 tantalus /USR/SBIN/CRON[16449]: (www) CMD (/usr/lib/cgi-bin/guestbook.pl -b -v >$HOME/guestbook.log)
May 28 03:24:01 tantalus /USR/SBIN/CRON[16453]: (www) CMD (/usr/lib/cgi-bin/guestbook.pl -b -v >$HOME/guestbook.log)
May 28 03:28:01 tantalus /USR/SBIN/CRON[16456]: (www) CMD (/usr/lib/cgi-bin/guestbook.pl -b -v >$HOME/guestbook.log)
May 28 03:32:01 tantalus /USR/SBIN/CRON[16459]: (www) CMD (/usr/lib/cgi-bin/guestbook.pl -b -v >$HOME/guestbook.log)
May 28 03:36:01 tantalus /USR/SBIN/CRON[16463]: (www) CMD (/usr/lib/cgi-bin/guestbook.pl -b -v >$HOME/guestbook.log)
May 28 03:40:01 tantalus /USR/SBIN/CRON[16466]: (www) CMD (/usr/lib/cgi-bin/guestbook.pl -b -v >$HOME/guestbook.log)
May 28 03:44:01 tantalus /USR/SBIN/CRON[16470]: (www) CMD (/usr/lib/cgi-bin/guestbook.pl -b -v >$HOME/guestbook.log)
May 28 03:48:01 tantalus /USR/SBIN/CRON[16473]: (www) CMD (/usr/lib/cgi-bin/guestbook.pl -b -v >$HOME/guestbook.log)
May 28 03:52:01 tantalus /USR/SBIN/CRON[16476]: (www) CMD (/usr/lib/cgi-bin/guestbook.pl -b -v >$HOME/guestbook.log)
May 28 03:56:01 tantalus /USR/SBIN/CRON[16480]: (www) CMD (/usr/lib/cgi-bin/guestbook.pl -b -v >$HOME/guestbook.log)
May 28 04:00:02 tantalus /USR/SBIN/CRON[16484]: (postgres) CMD ([ -x /usr/lib/postgresql/bin/do.maintenance ] && /usr/lib/postgresql/bin/do.maintenance -a)
May 28 04:12:29 tantalus -- MARK --
May 28 04:32:29 tantalus -- MARK --
May 28 04:52:29 tantalus -- MARK --
May 28 05:12:29 tantalus -- MARK --
May 28 05:32:29 tantalus -- MARK --
May 28 05:52:29 tantalus -- MARK --
May 28 06:12:29 tantalus -- MARK --
May 28 06:32:29 tantalus -- MARK --
May 28 06:52:29 tantalus -- MARK --
May 28 07:12:29 tantalus -- MARK --
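
To line the Oops up with whatever was running at the time, the kernel lines can be reduced to a timestamp, process name and PID. The following is only a minimal sketch: the function name `summarize_oops` and both regular expressions are my own assumptions about this syslog layout, not part of any existing tool.

```python
import re

# A few lines copied from the syslog excerpt above.
SAMPLE = """\
May 28 03:04:01 tantalus /USR/SBIN/CRON[16434]: (root) CMD (find / -name core -type f > /root/find.core.log)
May 28 03:04:05 tantalus kernel: Unable to handle kernel paging request at virtual address 0006b7d1
May 28 03:04:05 tantalus kernel: Oops: 0000
May 28 03:04:05 tantalus kernel: Process find (pid: 16435, process nr: 60, stackpage=c6b37000)
"""

# Assumed layout: "<Mon> <day> <hh:mm:ss> <host> <tag>: <message>"
LINE_RE = re.compile(r"^(\w{3}\s+\d+ \d\d:\d\d:\d\d) (\S+) ([^:]+): (.*)$")
# Matches the "Process find (pid: 16435, ..." line of an Oops report.
PROCESS_RE = re.compile(r"Process (\S+) \(pid: (\d+)")


def summarize_oops(lines):
    """Return one dict per 'Oops:' line, with the process that triggered it."""
    events = []
    for line in lines:
        match = LINE_RE.match(line)
        if not match or match.group(3) != "kernel":
            continue  # not a kernel message (cron entries, MARK lines, ...)
        stamp, _host, _tag, message = match.groups()
        if message.startswith("Oops:"):
            events.append({"time": stamp, "process": None, "pid": None})
        elif events:
            proc = PROCESS_RE.search(message)
            if proc:
                events[-1]["process"] = proc.group(1)
                events[-1]["pid"] = int(proc.group(2))
    return events


if __name__ == "__main__":
    for event in summarize_oops(SAMPLE.splitlines()):
        print(event)
```

On the sample lines this reports a single Oops at 03:04:05 in the process `find` with PID 16435, which is one higher than the PID of the cron child that started the job.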



At 3:04 this cron job runs:

4       3       *       *       *       find / -name core -type f > /root/find.core.log
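
For anyone reading the entry quickly: the first five columns are minute, hour, day of month, month and day of week, so this job fires daily at 03:04, the same minute as the Oops. A minimal sketch that splits such a line, assuming a per-user crontab without a user column (the helper name `parse_crontab_line` is hypothetical):

```python
FIELDS = ("minute", "hour", "day_of_month", "month", "day_of_week")


def parse_crontab_line(line):
    """Split a per-user crontab entry into its schedule fields and command."""
    parts = line.split(None, 5)  # five schedule columns, then the command
    if len(parts) < 6:
        raise ValueError("expected five schedule fields and a command")
    schedule = dict(zip(FIELDS, parts[:5]))
    return schedule, parts[5]


if __name__ == "__main__":
    entry = "4       3       *       *       *       find / -name core -type f > /root/find.core.log"
    schedule, command = parse_crontab_line(entry)
    print(schedule["hour"], schedule["minute"], command)
```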



So was that a crash or not?

And why did the kernel keep running afterwards?

Some time later this showed up in syslog.0:


May 29 08:52:29 tantalus -- MARK --
May 29 09:12:29 tantalus -- MARK --
May 29 09:32:29 tantalus -- MARK --
May 29 09:52:01 tantalus /usr/sbin/cron[17104]: (CRON) error (can't fork)
May 29 09:56:01 tantalus /usr/sbin/cron[17104]: (CRON) error (can't fork)
May 29 10:00:01 tantalus /usr/sbin/cron[17104]: (CRON) error (can't fork)
May 29 10:04:01 tantalus /usr/sbin/cron[17104]: (CRON) error (can't fork)
May 29 10:08:01 tantalus /usr/sbin/cron[17104]: (CRON) error (can't fork)
May 29 10:12:01 tantalus /usr/sbin/cron[17104]: (CRON) error (can't fork)
May 29 10:16:01 tantalus /usr/sbin/cron[17104]: (CRON) error (can't fork)
May 29 10:20:01 tantalus /usr/sbin/cron[17104]: (CRON) error (can't fork)
May 29 10:24:01 tantalus /usr/sbin/cron[17104]: (CRON) error (can't fork)



The kernel was:
Linux tantalus 2.2.18pre21 #1 Sat Nov 18 18:47:15 EST 2000 i686 unknown

Here is the dmesg output:


tantalus:/var/log# dmesg
Linux version 2.2.18pre21 (herbert at arnor) (gcc version 2.7.2.3) #1 Sat Nov 18 18:47:15 EST 2000
Detected 350801 kHz processor.
Console: colour VGA+ 80x25
Calibrating delay loop... 699.59 BogoMIPS
Memory: 256864k/262080k available (1728k kernel code, 412k reserved, 2928k data, 148k init)
Dentry hash table entries: 32768 (order 6, 256k)
Buffer cache hash table entries: 262144 (order 8, 1024k)
Page cache hash table entries: 65536 (order 6, 256k)
VFS: Diskquotas version dquot_6.4.0 initialized
Intel machine check architecture supported.
Intel machine check reporting enabled on CPU#0.
512K L2 cache (4 way)
CPU: L2 Cache: 512K
CPU: Intel Pentium II (Deschutes) stepping 02
Checking 386/387 coupling... OK, FPU using exception 16 error reporting.
Checking 'hlt' instruction... OK.
Checking for popad bug... OK.
POSIX conformance testing by UNIFIX
mtrr: v1.35a (19990819) Richard Gooch (rgooch at atnf.csiro.au)
PCI: PCI BIOS revision 2.10 entry at 0xfb1d0
PCI: Using configuration type 1
PCI: Probing PCI hardware
Linux NET4.0 for Linux 2.2
Based upon Swansea University Computer Society NET3.039
NET4: Linux TCP/IP 1.0 for NET4.0
IP Protocols: ICMP, UDP, TCP, IGMP
TCP: Hash tables configured (ehash 262144 bhash 65536)
Starting kswapd v 1.5
Detected PS/2 Mouse Port.
pty: 256 Unix98 ptys configured
Real Time Clock Driver v1.09
RAM disk driver initialized:  16 RAM disks of 4096K size
loop: registered device at major 7
PIIX4: IDE controller on PCI bus 00 dev 39
PIIX4: not 100% native mode: will probe irqs later
    ide0: BM-DMA at 0xf000-0xf007, BIOS settings: hda:DMA, hdb:DMA
    ide1: BM-DMA at 0xf008-0xf00f, BIOS settings: hdc:DMA, hdd:DMA
Floppy drive(s): fd0 is 1.44M
FDC 0 is a post-1991 82077
md driver 0.36.6 MAX_MD_DEV=4, MAX_REAL=8
(scsi0) <Adaptec AHA-294X Ultra2 SCSI host adapter> found at PCI 0/16/0
(scsi0) Wide Channel, SCSI ID=7, 32/255 SCBs
(scsi0) Downloading sequencer code... 392 instructions downloaded
scsi: <fdomain> Detection failed (no card)
NCR53c406a: no available ports found
sym53c416.c: Version 1.0.0
Failed initialization of WD-7000 SCSI card!
IBM MCA SCSI: Version 3.2
IBM MCA SCSI: No Microchannel-bus present --> Aborting.
              This machine does not have any IBM MCA-bus
              or the MCA-Kernel-support is not enabled!
megaraid: v1.11 (Aug 23, 2000)
aec671x_detect:
3w-xxxx: tw_findcards(): No cards found.
scsi0 : Adaptec AHA274x/284x/294x (EISA/VLB/PCI-Fast SCSI) 5.1.31/3.2.4
       <Adaptec AHA-294X Ultra2 SCSI host adapter>
scsi : 1 host.
(scsi0:0:0:0) Synchronous at 80.0 Mbyte/sec, offset 15.
  Vendor: IBM       Model: DDRS-34560D       Rev: DC1B
  Type:   Direct-Access                      ANSI SCSI revision: 02
Detected scsi disk sda at scsi0, channel 0, id 0, lun 0
(scsi0:0:4:0) Synchronous at 20.0 Mbyte/sec, offset 16.
  Vendor: NEC       Model: CD-ROM DRIVE:465  Rev: 1.03
  Type:   CD-ROM                             ANSI SCSI revision: 02
Detected scsi CD-ROM sr0 at scsi0, channel 0, id 4, lun 0
scsi : detected 1 SCSI cdrom 1 SCSI disk total.
sr0: scsi3-mmc drive: 14x/32x cd/rw xa/form2 cdda tray
Uniform CD-ROM driver Revision: 3.11
SCSI device sda: hdwr sector= 512 bytes. Sectors= 8925000 [4357 MB] [4.4 GB]
Partition check:
 sda: sda1 sda2 sda3 < sda5 sda6 sda7 >
apm: BIOS version 1.2 Flags 0x07 (Driver version 1.13)
apm: disabled on user request.
VFS: Mounted root (ext2 filesystem) readonly.
Freeing unused kernel memory: 148k freed
NET4: Unix domain sockets 1.0 for Linux NET4.0.
Adding Swap: 216836k swap-space (priority -1)
tulip.c:v0.91g-ppc 7/16/99 becker at cesdis.gsfc.nasa.gov
eth0: Digital DC21041 Tulip rev 33 at 0xe800, 00:80:C8:57:B1:FB, IRQ 5.
eth0: 21041 Media table, default media 0800 (Autosense).
eth0:  21041 media #0, 10baseT.
eth0:  21041 media #4, 10baseT-FD.
eth0:  21041 media #1, 10base2.
Serial driver version 4.27 with no serial options enabled
ttyS00 at 0x03f8 (irq = 4) is a 16550A
ttyS01 at 0x02f8 (irq = 3) is a 16550A
eth0: 21143 10baseT link beat good.
tantalus:/var/log#




Do I need to worry now?
This machine is supposed to become the new web server for www.thur.de, so
stability is very important.

Is kernel 2.4.5 stable enough?

Is ReiserFS (with the quota patches) also stable enough for this?
(I have to partition 40 GB for /home and /var ...)


rene
-- 
"Those who give up freedom to gain security
 will in the end lose both"                     (Benjamin Franklin)
==> Voland @IRC <==  2048/0xF11D6871  2A8D 3F92 4EB8 E55C  3605 D571 38C8 E2B8
mrz at informatik.uni-jena.de                         http://www.thur.de/~Voland/

