lower bootstrax resources and use xenon_sectets
Created by: jorana
Two minor additions are made to bootstrax:
- lower the resources available to process a run to be more lenient on CPU/RAM when a run has failed 10 times or more. Was raised in https://github.com/XENONnT/straxen/issues/138. I could have implemented a roll-of where each fail gradually lowers the amount of resources. I choose not to do this as this might waste too much time on a problem that actually needs a proper fix or is simply not related to the CPU/RAM usage. Please do note that sub-optimal performance of the eventbuilders may stay undetected if this happens regularly! For example a few days I restarted the eventbuilders as their swap and RAM were clogged up with all kind of stuff. One could imagine that if this happens very often one would not notice after this PR. However, especially comming year, I will be paying close attention to the rundb and often query the runsdb with the following syntax to check that this doesn't happen too often:
{"bootstrax.n_failures":{"$gt":1}, "number":{"$gt":8000}}
-
use xenon_sectrets.py as a resource for rundb passwords. This is a technical issue of not being able to restart bootstrax as the environment variables are not available over
ssh -t 'bootstrax'
. Using this syntax allow us to store the passwords in the xenon_sectrets.py without having to mingle with the sshd_config of the eventbuilders. - Finally I've also upgraded the default of the number of cores and max messages as these values were ridiculously low.