-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
eScience Course: 31/10-11/11 2022 (plus 28-30 November for remote presentations of final results). #42
Comments
Great to know, thanks!
I discussed with CESNET and after 21st Oct we need to scale down the CLIVAR JupyterHub, removing first the worker nodes with the
Tests are still ongoing between @guillaumeeb, Miguel and myself. I would say that manual scaling is still preferred.
According to what I see in grafana, we should be covered after removing 15 VMs of the Additionally, please remember to submit an application to https://c-scale.eu/call-for-use-cases/ to gain access to additional resources to what you have in EGI-ACE so we could host multiple JupyterHub/DaskHub instances at the same time. Actually, I would be greateful if you could spread the link to the C-SCALE call through your networks (i.e. the European Pangeo community) so others can also benefit from these resources. |
I would suggest to deploy a new
Sorry, I won't be available for that. |
I started to look into building a new infrastructure, similar to the pangeo-clivar but using only the That works, but I found the deployment process really much slower than with the @sebastian-luna-valero: I would have liked to try also the @tinaok @annefou: the possible |
Thanks for starting the creation of the new jupyterhub for the eScience course. On my side, I have created (duplicated from clivar workshop) https://github.com/pangeo-data/escience-2022 I just ask the course organisers for the storage. I see that for the clivar bootcamp, they had 1TB and it is not full. I guess 2TB is OK. They may also be able to create minIO for reading some data from their own infrastructure (let's see if it works out). Thanks. |
Sorry, I won't be available for that neither.
Is it possible to separate the object storage with the usage of CLIVAR? For computing resource, may be I can make a meeting with each workgroup to understand when they will work intensively so that we can adjust the needs manually in advance? (and thank you jean for creating separating instance!) |
You are right, we need to make the storage separate. For the eScience course, they will work until the end of November. I guess we also want to onboard them more "permanently" e.g. we would like to provide a more long-term solution but it is probably only viable once the elastic part is in place. |
We could request the 30 vCPU option to be added but the
For me the keyword here is
Object storage is detached from JupyterHub deployments.
Remember that until we get automatic elasticiy in place, manually scaling up and down the cluster is possible. |
Sorry I was not clear enough. I was hoping that we do not give access to the e-science course students for vopangeo.eu in aai.egi.eu for accessing s3 storage just now (untie #17 is resolved), then they can write into 'ANY OF' vo.pangeo.eu-swift disk space. |
I see, thanks! If other object storage is not available, and until #17 is solved, we could also look into deploying our own MinIO. However, there is the extra effort required to deploy and maintain this operational, and I am not sure whether I will have the time. What about others? |
@sebastian-luna-valero @tinaok @annefou I did a quick test with Downloads were a lot slower however, but that could be related to the network?! If there are no other dramatic losses in performances I guess that it should be fine for the eScience course? This eosc infrastructure now has 16 WNs and the same values as clivar except for the amount of memory which is increased |
great!
Indeed, I would say so.
Looking at OpenStack I see:
@j34ni please replace |
As this storage is only use for home directory, I don't think we need so much. Are you planning to put huge scientific datasets there?
I won't have the time either. Would the solution to create yet another Openstack project for the escience workshop to host data containers here would be feasible? We cloud reduce object store quotas on both projects if needed.
@j34ni, I see that the jupyterhub is available at https://pangeo-eosc.vm.fedcloud.eu/jupyterhub/, so I guess you did not use the latest configuration provided here: https://github.com/pangeo-data/pangeo-eosc/blob/main/EGI.md? This is not crutial, but the setup is a bit simplified in this documentation. |
@sebastian-luna-valero @tinaok @annefou Should we fiddle with pangeo-clivar now, as it is being used, or should we wait before starting to remove the |
@guillaumeeb |
The pangeo-clivar cluster went down from 49 VMs to 31 VMs already, have you noticed any disruption? We agreed with CESNET the Again, I am here to help if needed.
Currently the quotas are 10TB for each project. Please confirm the new quota value and I will double check with CESNET. |
I am not sure these 49 - 31 = 18 VMs were configured/used at all in the infrastructure, they never showed up in the list of VMs in the IM Dashboard anyway, so I deleted them in openstack manually If I start to remove VMs from the instrastructure while they are in use the affected users will not be very happy @tinaok When will be a good time to do that? |
The quota was just a proposition to see if we could create another Openstack project to have an object storage space that has different access policy. Imagine a pangeo-escience Openstack project, maybe we would need to create another user group too on check-in? |
According to grafana, the cluster is quite now, and it's Friday afternoon, I would say it's a good time to reconfigure the cluster.
Sure, we can create a new group in check-in dedicated for the new object store. Please note that this would imply that every time a new user requests to enroll in the VO for the eScience Course, VO managers would have to manually add them to the new group. If that's not a problem for you, we can do it. Anyway, we would need to update |
OK, I'll give it a go |
Now I see a lot more VMs for pangeo-clivar on the IM Dashboard than when I refreshed a few minutes ago and these 18 "ghost" are suddenly back, what happened? |
I am sorry, but I can't check since don't have access to the pangeo-{clivar,eosc} clusters on my profile of IM Dashboard. As a last resort, I can send you my details so you can add me as owner to check further. |
Please do send me your credentials |
Will do. By the way:
|
they are both on the dev |
CESNET is happy to create another OpenStack project. There are two options
Please let me know your thoughts. |
I think option 1 is best. |
Thanks, please review #44 |
Closing as obsolete. |
Clivar is ending very soon and we have another course coming. Very similar needs e.g. 17 students + 10 mentors (see https://www.aces.su.se/research/projects/escience-tools-in-climate-science-linking-observations-with-modelling/). I will create a repo (same as for Clivar); the organisers would also be happy if someone can deliver some of the trainings (mostly focusing on Dask + kerchunk because the rest is covered in-house).
The text was updated successfully, but these errors were encountered: