on 07-16-2010 7:49 AM
Hi All,
I have setup of ECC production in ditribution enviornment as follows.
Total no of servers are 5
Two servers (DB and CI) are configured in os clustring (redhat) with drac (fence device).
All the servers are RHEL4 (AS 64Bit) with kernel 4 update with oracle 10G.
Past few months we are facing the following error message and our cluster stop their serivces and try to relocate the resources to another node. So please help me to find out the reason of cluster failure.
Error logs are as follows.
Jul 3 01:43:06 sid kernel: fs.sh[28164]: segfault at 0000000000000008 rip 0000000000432098 rsp 0000007fbfffda50 error 4
Jul 3 01:43:06 sid clurgmgrd[10579]: <notice> Stopping service DB-ecc
Jul 3 01:43:07 sid clurgmgrd: [10579]: <info> Executing /home/keenable/dbfailover stop
Jul 3 01:43:07 sid su(pam_unix)[28190]: session opened for user sidadm by (uid=0)
Jul 3 01:43:19 sid rsh(pam_unix)[11875]: session closed for user sidadm
Jul 3 01:50:01 sid crond(pam_unix)[31589]: session opened for user root by (uid=0)
Jul 3 01:50:01 sid crond(pam_unix)[31589]: session closed for user root
Jul 3 01:55:27 sid su(pam_unix)[28190]: session closed for user sidadm
Jul 3 01:55:27 sid su(pam_unix)[2136]: session opened for user orasid by (uid=0)
Jul 3 01:55:27 sid su(pam_unix)[2136]: session closed for user orasid
Jul 3 01:55:27 sid clurgmgrd: [10579]: <info> Removing IPv4 address 172.xx.0.xx from eth0
Jul 3 01:55:37 sid clurgmgrd: [10579]: <info> unmounting /oracle/LED/102_64
Jul 3 01:55:38 sid clurgmgrd: [10579]: <info> unmounting /oracle/LED/sapbackup
Jul 3 01:55:38 sid clurgmgrd: [10579]: <info> unmounting /oracle/LED/sapcheck
Jul 3 01:55:38 sid clurgmgrd: [10579]: <info> unmounting /oracle/LED/sapdata1
Jul 3 01:55:38 sid clurgmgrd: [10579]: <info> unmounting /oracle/LED/sapdata2
Jul 3 01:55:38 sid clurgmgrd: [10579]: <info> unmounting /oracle/LED/sapdata3
Jul 3 01:55:38 sid clurgmgrd: [10579]: <info> unmounting /oracle/LED/sapdata4
Jul 3 01:55:39 sid clurgmgrd: [10579]: <info> unmounting /oracle/LED/mirrlogA
Jul 3 01:55:39 sid clurgmgrd: [10579]: <info> unmounting /oracle/LED/mirrlogB
Jul 3 01:55:39 sid clurgmgrd: [10579]: <info> unmounting /oracle/LED/origlogA
Jul 3 01:55:39 sid clurgmgrd: [10579]: <info> unmounting /oracle/LED/origlogB
Thanks,
Kamal Kishore
> Jul 3 01:43:06 sid kernel: fs.sh[28164]: segfault at 0000000000000008 rip 0000000000432098 rsp 0000007fbfffda50 error
Whatever that "fs.sh" is - it's not working properly.
What cluster software do you use?
Markus
You must be a registered user to add a comment. If you've already registered, sign in. Otherwise, register and sign in.
You must be a registered user to add a comment. If you've already registered, sign in. Otherwise, register and sign in.