UPPMAX Cloud network issues open

There is currently an issue related to the backend network infrastructure at the UPPMAX Cloud. Users may experience problems connecting to new VMs. Currently there is no workaround except to...

Issues with Crex (file system for Rackham) closed

We are experiencing problems with Snowy’s and Rackham’s storage system “Crex”. Many disks in the same enclosure are reported as failed. The disks are now rebuilding and the manufacturer is...


Issues with lupus (file system for irma) closed

We were notified that lupus (the file system for irma) was slow on some nodes and traced it to an issue with a specific disk target. While resolving that we...

UPPMAX Cloud login issues closed

There is currently an issue logging into UPPMAX Cloud using both the Web dashboard and APIs. This problem is central to SSC and unfortunately affects the other regions too. We...

February maintenance window closed

The service on Wednesday 6th of February begins at 09:00 and affects all systems to varying degrees. All systems and services will get bug and security updates. The queues on...


Problems accessing wharf 2019-01-16 closed

It seems a network hiccup earlier today (2019-01-16) caused issues for connections to the wharf for bianca. This has been resolved as far as we know and new connections should...

Security update and reboot closed

Due to a recent security issue in the Linux operating system, UPPMAX has decided to apply the newly released security fixes immediately. After applying the fix, we need to reboot...


January maintenance window closed

The service on Wednesday 9th of January begins at 09:00 and affects all systems. All systems will get bug and security updates. We will also perform a minor reorganization of...


Cooling issues in the UPPMAX compute room closed

We have an ongoing issue with cooling in the server hall. All systems have been closed down to prevent hardware damage. Final ticket report: All systems are up and running...


UPPMAX Account Request may take longer to process closed

UPPMAX is working on updating infrastructure services that affect the creation of new accounts. We will temporarily disable creation of new accounts at various times during the update. We expect...


Quota issues for some sllstore projects on crex (rackham/snowy) closed

Some sllstore projects ended up with incorrect quotas because of how data is handled when communicated to/from SUPR. Fixed data (and resulting quota) is being rolled out and this issue...


UPPMAX Cloud has experienced an error causing the system to be temporarily unavailable closed

OpenStack gathers metrics via the internal Ceilometer service and stores the logs. It seems like the container where the logs are stored became full, not allowing Ceilometer to write any...

Slow slurm on rackham closed

Slurm (the workload manager we use to schedule jobs) does not always like it when it has too many jobs to keep track of. This has happened quite a few...


Temporary problem with SSH keys closed

Due to a configuration error our login servers temporarily had incorrect SSH host keys. If you tried to connect during this time you may have got a warning like: $ ssh rackham.uppmax.uu.se...

Issues with running singularity containers with restricted permissions from /home closed

It seems singularity currently does not allow running containers from /home that have restricted access permissions, meaning the complete path to access the container file must have execute permissions for...


"No space left on device" on Rackham and Snowy closed

There is currently a problem writing data to the project storage system on Rackham and Snowy. The error message is “No space left on device”. The problem has affected jobs...

Issues with singularity since the latest maintenance window closed

Final ticket report: We believed this was fixed by the latest build installed. Update 2018-11-14 15:26: A build that should contain fixes for the issue container creation failed: mount error:...


Problems logging into Bianca closed

We have seen a problem with Bianca that prevented users from logging into the login node. We believe the problem is fixed. If you still have problems logging in you are...


Permission errors on bianca/castor closed

Final ticket report: Issues seem solved by a workaround. Depending on one’s history on UPPMAX and previous project memberships, it’s possible one may see permission issues after the recent maintenance...


Castor issues on compute nodes closed

Final ticket report: This was fixed by building new base images for the virtual machines. We’re still working on figuring out what happened. For some reason, compute nodes already started...

