Downtime Tuesday July 9

Bernd Machenschalk
Bernd Machenschalk
Moderator
Administrator
Joined: 15 Oct 04
Posts: 4309
Credit: 250041949
RAC: 34400
Topic 231271

Next Tuesday, July 9, we will stop the project for migrating our DBs to new server hardware. The downtime will start at 8 AM UTC and shouldn't last longer than a few hours, certainly not extend the business day (4 PM UTC).

BM

GWGeorge007
GWGeorge007
Joined: 8 Jan 18
Posts: 3043
Credit: 4948324354
RAC: 1204969

 Bernd,Thanks for the

 

Bernd,

Thanks for the heads up!  We'll be prepared for your down time.

 

George

Proud member of the Old Farts Association

Bernd Machenschalk
Bernd Machenschalk
Moderator
Administrator
Joined: 15 Oct 04
Posts: 4309
Credit: 250041949
RAC: 34400

We're back!

We're back!

BM

Keith Myers
Keith Myers
Joined: 11 Feb 11
Posts: 4959
Credit: 18638928756
RAC: 5355359

Yes, the project is back but

Yes, the project is back but is NOT working correctly now.

All hosts are receiving an error message about missing scheduler URL's in the master_einstein.phys.uwm.edu.xml file.

Computer: Numbskull

8147    Einstein@Home    Jul 09, 2024, 02:57:03 PM    [error] No scheduler URLs found in master file

 

Computer: Numbskull

8149    Einstein@Home    Jul 09, 2024, 02:57:03 PM    [sched_op] Reason: 727 consecutive failures fetching scheduler list
 

 

Oliver Behnke
Oliver Behnke
Moderator
Administrator
Joined: 4 Sep 07
Posts: 982
Credit: 25170813
RAC: 0

Thanks. Looking into it...

Thanks. Looking into it...

Einstein@Home Project

Keith Myers
Keith Myers
Joined: 11 Feb 11
Posts: 4959
Credit: 18638928756
RAC: 5355359

Thanks the current master.xml

Thanks the current master.xml file being sent out is still the maintenance outage notice.

Site off-line

Einstein@Home is currently under maintenance. We should be back shortly. Thank you for your patience.

Copyright © 2024 Einstein@Home. All rights reserved.

 

Oliver Behnke
Oliver Behnke
Moderator
Administrator
Joined: 4 Sep 07
Posts: 982
Credit: 25170813
RAC: 0

Yep, I noticed that but I

Yep, I noticed that but I can't figure out why. The page isn't configured anymore and even now got moved away entirely, followed by multiple web server restarts. This doesn't make any sense. Also I can't reproduce this outside of BOINC...

Einstein@Home Project

Ian&Steve C.
Ian&Steve C.
Joined: 19 Jan 20
Posts: 3930
Credit: 46034142642
RAC: 64218472

Maybe local issues or issues

Maybe local issues or issues with site communication through certain nodes? 
 

All of my hosts are working normally. 

_________________________________________________________________________

Oliver Behnke
Oliver Behnke
Moderator
Administrator
Joined: 4 Sep 07
Posts: 982
Credit: 25170813
RAC: 0

Darn, I found it. It's the

Darn, I found it. It's the hideous on-disk cache used by our web server. A simple server restart doesn't purge it, so this needed manual treatment. I tested it locally and a simple "Update" in the BOINC Manager should now do the trick.

Sorry for the hickup!

Oliver

Einstein@Home Project

Keith Myers
Keith Myers
Joined: 11 Feb 11
Posts: 4959
Credit: 18638928756
RAC: 5355359

Thanks for the successful

Thanks for the successful bug-hunt, Oliver.  All my hosts are communicating again.

 

Ian&Steve C.
Ian&Steve C.
Joined: 19 Jan 20
Posts: 3930
Credit: 46034142642
RAC: 64218472

another artifact from the

another artifact from the project migration seems to be that stats exporting is no longer happening.

_________________________________________________________________________

Comment viewing options

Select your preferred way to display the comments and click "Save settings" to activate your changes.