Hello all,
I am running into a recurring issue with a PureDisk 6.6.3a environment: no scheduled backup jobs are running.
Details from the various logs:
slapd.log:
Jan 29 11:17:35 nbupd-som slapd[6118]: connection_read(18): no connection!
Jan 29 11:25:30 nbupd-som slapd[6118]: connection_read(22): no connection!
Jan 29 11:30:16 nbupd-som slapd[6118]: connection_read(18): no connection!
pdweb-error.log:
[Tue Jan 29 11:15:31 2013] [error] [client ::1] client denied by server configuration: /opt/pdweb/htdocs/
[Tue Jan 29 11:15:31 2013] [error] [client ::1] client denied by server configuration: /opt/pdweb/error
Controller.log:
Tue Jan 29 2013 11:15:05.800240 INFO (1074796864): Connection from 10.25.84.10 activated : pdagent 46
Tue Jan 29 2013 11:15:11.224889 INFO (1077176640): Timeout reached for connection from 10.25.60.10
Tue Jan 29 2013 11:15:11.241392 INFO (1077176640): Timeout reached for connection from 10.25.75.10
Tue Jan 29 2013 11:15:11.271076 INFO (1077176640): Timeout reached for connection from 10.25.54.10
Tue Jan 29 2013 11:15:17.147818 INFO (1075853632): Connection from 10.25.60.10 activated : pdagent 15
Tue Jan 29 2013 11:15:55.717928 INFO (1076382016): Connection from 10.25.54.10 activated : pdagent 8
Tue Jan 29 2013 11:16:00.980828 INFO (1074268480): Connection from 10.25.75.10 activated : pdagent 5
Tue Jan 29 2013 11:16:11.496577 INFO (1077176640): Timeout reached for connection from 10.25.70.10
Tue Jan 29 2013 11:16:11.512855 INFO (1077176640): Timeout reached for connection from 10.25.75.19
Tue Jan 29 2013 11:16:11.581968 INFO (1077176640): Timeout reached for connection from 10.25.12.10
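In case it helps with diagnosis, here is a minimal Python sketch for counting the "Timeout reached" entries per client address in a Controller.log excerpt like the one above. It is not a PureDisk tool: the function name `count_timeouts` and the `SAMPLE` text are my own, and the pattern only assumes the message wording seen in this excerpt.

```python
import re
from collections import Counter

# Pattern based only on the wording seen in the Controller.log excerpt;
# other PureDisk versions may word this message differently (assumption).
TIMEOUT_RE = re.compile(
    r"Timeout reached for connection from (\d+\.\d+\.\d+\.\d+)"
)


def count_timeouts(log_text):
    """Return a Counter mapping client IP -> number of timeout entries."""
    return Counter(TIMEOUT_RE.findall(log_text))


# A few lines copied from the excerpt, used here as sample input.
SAMPLE = """\
Tue Jan 29 2013 11:15:05.800240 INFO (1074796864): Connection from 10.25.84.10 activated : pdagent 46
Tue Jan 29 2013 11:15:11.224889 INFO (1077176640): Timeout reached for connection from 10.25.60.10
Tue Jan 29 2013 11:15:11.241392 INFO (1077176640): Timeout reached for connection from 10.25.75.10
Tue Jan 29 2013 11:15:17.147818 INFO (1075853632): Connection from 10.25.60.10 activated : pdagent 15
"""

if __name__ == "__main__":
    # Print clients sorted by how often their connection timed out.
    for ip, hits in count_timeouts(SAMPLE).most_common():
        print(ip, hits)
```

To run it against the real file, read the log contents into a string and pass that to `count_timeouts` instead of `SAMPLE`.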
agent.log:
Tue Jan 29 2013 11:15:30.299310 ERROR (1081665856): The webservice returned an error : Cannot retrieve next job step for Agent nbupd-som.groupe.sa.colas.com (25000000): Network error while retrieving the response.
Tue Jan 29 2013 11:15:30.299459 ERROR (1081665856): WebService call failed
Tue Jan 29 2013 11:15:50.370577 INFO (1086687552): Execute scheduled task: next jobstep
Tue Jan 29 2013 11:17:30.733441 ERROR (1081665856): Network error during webservice call: Operation timed out after 120018 milliseconds with 0 bytes received
=> The only workaround we have found is to reboot the PureDisk server; after that, the jobs are launched correctly.
Is this a known issue? If so, is there a fix for it?
Thanks in advance for your help.
Regards,
Florent.