An unscheduled power outage took all systems in the main LBNL datacenter down shortly after midnight on April 29.  Cascading impacts also impacted some services outside the datacenter.  Production business systems and most other services were restored by IT personnel during the night, though some services were not restored until early working hours on April 29.   HPC services will gradually be brought online throughout the day.  The outage caused at least one major hardware failure, which will delay some storage services for HPC users until April 30 when a replacement unit can be delivered (affected customers are being separately notified).

 

The root cause of the power disruption has not yet been identified.

 

 

 

  • No labels