Issues with running singularity containers with restricted permissions from /home open

It seems singularity currently does not allow running containers from /home that have restricted access permissions, meaning the complete path to access the container file must have execute permissions for...

Files may still appear hidden on Bianca open

We have received reports and seen a few cases from Bianca and Castor where files, unfortunately, may still appear as hidden. We are investigating this issue again, and advise users...


Quota issues for some sllstore projects on crex (rackham/snowy) closed

Some sllstore projects ended up with incorrect quotas because of how data is handled when communicated to/from SUPR. Fixed data (and resulting quota) is being rolled out and this issue...


UPPMAX cloud has experienced an error causing the system to be temporarily unavailable closed

OpenStack gathers metrics via the internal Ceilometer service and stores the logs. It seems the container where the logs are stored became full, not allowing Ceilometer to write any...

Slow slurm on rackham closed

Slurm (the workload manager we use to schedule jobs) does not always cope well when it has too many jobs to keep track of. This has happened quite a few...


Temporary problem with SSH keys closed

Due to a configuration error, our login servers temporarily had incorrect SSH host keys. If you tried to connect during this time, you may have got a warning like: $ ssh rackham.uppmax.uu.se...

"No space left on device" on Rackham and Snowy closed

There is currently a problem writing data to the project storage system on Rackham and Snowy. The error message is “No space left on device”. The problem has affected jobs...

Issues with singularity since the latest maintenance window closed

Final ticket report: We believed this was fixed by the latest build installed. Update 2018-11-14 15:26: A build that should contain fixes for the issue container creation failed: mount error:...


Problems logging into Bianca closed

We have seen a problem with Bianca that prevented users from logging into the login node. We believe the problem is fixed. If you still have problems logging in, you are...


Permission errors on bianca/castor closed

Final ticket report: Issues seem solved by a workaround. Depending on one’s history on UPPMAX and previous project memberships, it’s possible one may see permission issues after the recent maintenance...


Castor issues on compute nodes closed

Final ticket report: This was fixed by building new base images for the virtual machines. We’re still working on figuring out what happened. For some reason, compute nodes already started...


November maintenance window closed

The service window on Wednesday 7th of November begins at 09:00. It affects all systems to varying degrees. For Irma and Bianca the queues will be stopped. For all...


Issue with castor (filesystem for bianca) closed

We’re currently seeing an issue on one of the nodes providing castor (the file system for bianca). This may lead to failed reads/writes and possibly crashed jobs. We’re working to...


/sw/data temporarily invisible on crex closed

/sw/data was unavailable under that name on crex (the storage system for rackham) for a while today. Fixed at 19:00.

My VASP jobs trigger SIGSEGV closed

We have several reports of VASP jobs killed by SIGSEGV. The problem is believed to be caused by a recently added security fix to limit a certain memory space of...


There is currently a problem logging into Bianca closed

There is currently a problem logging into Bianca. The login nodes that get started during your login for some reason fail to become fully operational. This results in either broken...


Quota issues for some projects on crex (rackham) closed

A few projects on crex were subjected to the wrong quota recently. We have worked around the issue and are fixing the underlying data in SUPR.

September maintenance window closed

The service window on Wednesday 5th of September starts at 08:50 (10 min earlier than normal). It affects all systems to varying degrees. For Irma and Bianca the queues...


Cooling issues in the UPPMAX compute room closed

The storm that passed over Uppsala around 15:00 managed to turn off the cooling pumps, and we were forced to perform an emergency shutdown of Rackham, Irma, Dis and Bianca. As soon...


Memory issues on some rackham nodes closed

We’re seeing issues related to memory on some rackham nodes. This relates to the kernel at times being unable to get the memory it needs. This can show up as...

