Status with Crex (UPDATE: Rackham and Snowy /proj and /sw/data available again) closed
The project storage system Crex (/proj and /sw/data) for Rackham and Snowy is currently unavailable as we are investigating an issue with metadata. The issue is related to the work...
Updated:
Problem with incoming support mail closed
Mail sent to support@uppmax.uu.se is bouncing with an error message. We are investigating. Update 2022-05-31 15:20: This has now been resolved. The error was due to an upgrade in the...
Updated:
June maintenance window (/proj and Slurm NOT yet available on Rackham and Snowy!) closed
The June maintenance window will start at 09:00 CEST on June 1. Queues on Rackham and Snowy will be stopped as we work on the project storage system (Crex). Queues...
Updated:
Issues with Crex (file system for Rackham) closed
Access to project directories and files on Rackham may be slow. Some users have reported that commands such as “ls” or “ll” take a long time to complete, and...
Updated:
Slower allocation of GPU-nodes in Bianca closed
We are currently investigating an issue with slower allocation of GPU nodes in Bianca. We believe this is a result of NVIDIA rotating their CUDA keys; you can read more about this...
Updated:
Issue running node jobs in Bianca closed
We are currently investigating an issue in Bianca when running node jobs. This is most likely due to the recent Slurm upgrade. If you attempt to schedule a node job...
Updated:
May maintenance window closed
The May maintenance window will start at 09:00 CEST on May 4. Queues on Rackham and Snowy will be stopped as we perform work on migrating metadata on Crex to...
Updated:
Slow Slurm queue and issues creating login nodes in Bianca closed
There is currently an issue with the management plane in Bianca. New login nodes will take longer than expected to be created, and the Slurm queue might move slower than...
Updated:
April maintenance window closed
The April maintenance window will start at 09:00 CEST on April 6. Queues on Rackham and Snowy will be stopped as we perform work on expanding the project storage system...
Updated:
Slow Slurm queue and issues creating login nodes in Bianca closed
There is currently an issue with the management plane in Bianca. New login nodes will take longer than expected to be created, and the Slurm queue might move slower than...
Updated:
UPPMAX Cloud / EAST-1 is down closed
We are troubleshooting an issue with the UPPMAX cloud. You will receive a strange error if you try to log in to https://east-1.cloud.snic.se. The issue is related to a network update from...
Updated:
Crex is slow closed
Crex is not behaving well at the moment, and file system access is so slow that it is in practice not working. We upgraded Crex yesterday to the...
Updated:
March maintenance window closed
The March maintenance window will start at 09:00 CET on March 2. Queues on Rackham and Snowy will be stopped as we upgrade the project storage system. All systems will receive...
Updated:
Crex is slow closed
Saturday update: Crex is back on track again. The bugs leading to issues on Thursday will be addressed in file system upgrades on the next maintenance day, March 2. 17:00 Update...
Updated:
February maintenance window closed
The February maintenance window will start at 06:00 CET on February 2. All systems will be SHUT DOWN due to central cooling system maintenance. All systems will receive system updates and...
Updated:
Crex is down closed
Crex is now back up and seems to be working, but we’re investigating what happened with the vendor. Crex ran into new issues yesterday evening around 19:30 and was down...
Updated:
January maintenance window closed
The January maintenance window will start at 09:00 CET on January 12. No Slurm queues will be stopped. All systems will receive system updates and security fixes. Any disturbances on...
Updated:
Delay in Bianca jobs and login nodes being scheduled closed
There are currently issues for jobs scheduled on Bianca projects that do not already have access to a node of the right type. The movement of nodes between projects is...
Updated:
Crex problems closed
The main project file system Crex for the clusters Rackham and Snowy has been showing some problems over the last few days. We have been in contact with the vendor to...
Updated:
Shutdown of all systems on 2 February at 07:00 CET closed
The UPPMAX compute hall will be partially shut down on 2 February between 07:00 and 11:00 CET as Akademiska Hus performs work on the cooling circuit. The shutdown has been planned...
Updated: