on 03-26-2008 6:31 PM
We installed the an NTW04s ABAP+JAVA in MSCA.
So on the first node is instalelled (locally) the CI, while on the second node is installed one Dialog Instance.
On both the nodes we installed the ASCS + SCS services, and for all of them, all both the nodes, we installed
the relative Enqueue Replicator Servers.
After the installation we noticed problems doing the move group for the SAP group with the Cluster Manager.
The ASCS and the SCS resources go in "Failed" status on the other node, and did not come up.
Also to move back the SAP group do not solve the problem, as the ASCS and the SCS resources continue to staty in
failed status.
What we noticed is that as the move group action is started there is a message coming from the operating system that sound something like:
" an error occured on the X: drive..trying to write (or save) the
sapstart.env (or sapstartsrv.exe) file...data has been lost..."
After that the only solution to make the ASCS and the SCS services able to start again is to replace the sapstartsrv.exe on the global sapmnt with a backup copy.
The sizes of the these files is not changed due the move group action, by the way replacing them with a backup
copy allows the services to start again.
So it seems these executable are corrupted during the failover.
We are using the 133 pl kernel.
We need to clarify if this behaviour can be caused from disk errors on the MSCS nodes or instead from a missing Sap patch level.
Someone experimented this strange error ?
Marco,
installing SAP systems under MSCS is clearly described in the install guides. You must follow these instructions especially regarding the server setup. Please go through this again and check whether there are any discontinuities.
Regards,
Benny
You must be a registered user to add a comment. If you've already registered, sign in. Otherwise, register and sign in.
Tnks for the feedback.
have to fix my first description of the scenario.
The ASCS e SCS are installed on shared disks, not locally on both nodes.
On both nodes we run two times the installation of the ENqueue replication
service, the first time indicating the ASCS, the second time the SCS instance number.
In short we did the installation according the Sap manual.
On both nodes the Enqueue Replication services have the same system numbers.
The Primary and the Secondary application servers (Abap+Java) are installed locally on different nodes and uses different sistem numbers.
Any idea why the sapstartsrv.exe should be corrupted by a failover of the SAP cluster group ?
applied patches attached to note 1043592
You must be a registered user to add a comment. If you've already registered, sign in. Otherwise, register and sign in.
Hi Marco,
There is nothing to do with patch level, it seem's there is some issue with services only. If you don't move the services in the consistant mode( closing the instance and stopping the services) this kind of problems will arise.
Regards,
Hari.
You must be a registered user to add a comment. If you've already registered, sign in. Otherwise, register and sign in.
User | Count |
---|---|
90 | |
10 | |
10 | |
10 | |
7 | |
7 | |
6 | |
5 | |
4 | |
3 |
You must be a registered user to add a comment. If you've already registered, sign in. Otherwise, register and sign in.