Misc documentation updates I'd like to make #2041
Stuff resolved or tracked elsewhere
1 - resolved
# documented like this
python3 deployer generate-cluster <cluster-name> aws
# in practice done like this
pip install -e .
deployer generate-aws-cluster --cluster-name=ubc-eoas --hub-type=basehub --cluster-region=ca-central-1
2 - resolved
After creating a .jsonnet file, the zones I got didn't match the available availability zones. 1a, 1b, 1c were generated, but only 1a, 1b, 1d existed.
4 - resolved by #2056
Document experiences in upgrading beyond 1.22. I wrote about this in https://2i2c.slack.com/archives/CKJS000F4/p1671374097438499.
infrastructure/eksctl/template.jsonnet Lines 39 to 42 in 065989f
5 - resolved
jsonnet is a tool to install, but at no point was I asked to install it as a prerequisite to deploying a new hub.
6 - resolved
We .gitignore them, so one can't.
7 - resolved
Following a terraform step I got myself a
We shouldn't version control the
Action: add to .gitignore
8 - resolved
I've seen
10 - resolved
There is a link that is outdated and points to the wrong lines of relevance:
In practice, we seem to need to add things to the output list of the
11 - resolved
"deploy chart" should be "support chart"
12 - resolved
The subsequent indentation is off.
13 - resolved
Typo in:
15 - resolved by #2045
It's unclear to me in CILogon authentication if I should use CILogon directly or involve auth0 somehow.
16 - resolved
It's unclear to me how to configure
From discussion and trial, it seems that shown_idp should reference the EntityId exactly as listed via https://cilogon.org/idplist/
17 - tracked in 2i2c-org/default-hub-homepage#19
This page seems outdated, referencing https://2i2c.org/pilot and getting redirected to https://docs.2i2c.org/
18 - tracked in 2i2c-org/default-hub-homepage#18
Logout button issue in default login page template
19 - previously resolved
It is not clear where the central grafana URL is at this point.
20 - resolved
Followed by incorrect indentation
Decommissioning
21 - resolved
This section didn't provide a link to the list of github applications and had incorrect formatting.
22 - resolved
Mention that deleting data can take a long time in https://infrastructure.2i2c.org/en/latest/hub-deployment-guide/hubs/other-hub-ops/delete-hub.html#delete-data.
24 - resolved
"After" merged
26 - resolved
Typo at https://infrastructure.2i2c.org/en/latest/howto/grafana-github-auth.html (For example, ghttps://grafana.pilot.2i2c.cloud)
27 - resolved
I had to create a directory for my new cluster in
28 - resolved
I saw this, but CLUSTER_NAME wasn't prefixed with
29 - resolved
When configuring domain names via namecheap, it would be good to link directly to where this is done: https://ap.www.namecheap.com/Domains/DomainControlPanel/2i2c.cloud/advancedns instead of namecheap.com.
30 - resolved
This command should be without
31 - resolved
Update AWS account creation docs to suggest use of email sub-addressing, like
Related to https://github.com/2i2c-org/meta/issues/535
3 - resolved by #2082
Verification that this generated .jsonnet logic is correct and makes sense. This looks incorrect but I'm not sure. Or maybe this generates correctly even if basehub != daskhub, giving daskNodes a null value?
Most of the things, if not all, are resolved or tracked somewhere else.
I've updated the top post, removing items from it as I resolved them. I could close this or manage it privately, since it doesn't really merit attention from others. I'll open it for now.
What remains to be resolved
9
In this aws step about granting eksctl access to other users.
I wonder if we should even do that if we can use the deployer script to get credentials though, hmm. Also, I have now not created additional users for that account.
UPDATE: Yes, because we need such credentials when we do operations like adding/removing node pools as well. Why, one may ask? Because, for example, kubectl drain is used, which is a k8s API interaction. The action point is to make it clear why we add this permission.
14
It's unclear to me in the Enable authentication section if we are supposed to add 2i2c members to be able to authenticate, and if so, with what identity provider (GitHub team? Google email accounts from 2i2c.org?).
23
I saw no mention of cleaning up scratch buckets, but I think we should consider that as well in the decommissioning process.
24
This section didn't link to how to create an incident response issue. I asked myself, where? In 2i2c-org/infrastructure?
https://team-compass.2i2c.org/en/latest/projects/managed-hubs/incidents.html#key-terms
25
I'm not sure if /pd trigger works, or in what channel, or similar. I never managed to see a popup like the one described in https://team-compass.2i2c.org/en/latest/projects/managed-hubs/incidents.html#incident-response-process.
32
In this comment, in step 1, I ask the community reps to help authorize the github oauth application to receive organizational membership info from users instead of asking to become an owner and do it for them. With it, I provided a screenshot example.
#2323 (comment)
33
Setting up a new GCP project with the existing billing account should make it clear that cost exports only need to be configured for new billing accounts, not for new GCP projects linked to the existing 2i2c billing account.
https://infrastructure.2i2c.org/hub-deployment-guide/cloud-accounts/new-gcp-project/#create-a-new-gcp-project
34
There should be a final step linking to setting up quotas in https://infrastructure.2i2c.org/hub-deployment-guide/cloud-accounts/new-gcp-project
35
The docs for creating a new GCP project don't mention the ability to generate that and cluster config etc. via deployer generate-gcp-cluster
https://infrastructure.2i2c.org/hub-deployment-guide/cloud-accounts/new-gcp-project
36
When generating a new GCP cluster:
37
The GCP cluster variable prefix is used to generate resource names, and if it's more than 20 letters, generated names such as <prefix>-cluster-sa become longer than accepted. We could add validation to avoid this.
Is it okay to make catalystproject-latam become latam, for example?
I think so, because the following resources seemed to include it in their names
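The validation suggested in item 37 could look roughly like the sketch below. The function name and the constant are hypothetical, and the 20-character limit is taken from the note above rather than from a documented GCP limit, so treat it as an assumption to verify.

```python
# Hypothetical helper, not part of the deployer today.
# The limit of 20 comes from the observation in this issue; verify it
# against the resource name limits that actually apply.
MAX_PREFIX_LENGTH = 20


def validate_cluster_prefix(prefix: str, max_length: int = MAX_PREFIX_LENGTH) -> str:
    """Return the prefix unchanged, or raise if derived names would be too long."""
    if len(prefix) > max_length:
        raise ValueError(
            f"Prefix {prefix!r} is {len(prefix)} characters long; "
            f"use at most {max_length} so that names like "
            f"'{prefix}-cluster-sa' stay within the accepted length."
        )
    return prefix
```

With this check, catalystproject-latam (21 characters) would be rejected up front, while latam would pass.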
38
Wrong directory mentioned in the leading comment at https://infrastructure.2i2c.org/hub-deployment-guide/new-cluster/new-cluster/#exporting-and-encrypting-the-cluster-access-credentials. It's really about making sure that the deployer gets credentials to the cluster in a file enc-deployer-credentials.secret.json put in config/clusters/cluster_name.
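To make the intended location in item 38 concrete, here is a small sketch that builds the path described above. The helper name is hypothetical, and it assumes paths are relative to the root of the infrastructure repository.

```python
from pathlib import Path


def deployer_credentials_path(cluster_name: str, repo_root: Path = Path(".")) -> Path:
    """Build the path where the encrypted deployer credentials are expected.

    Hypothetical helper for illustration; repo_root is assumed to be the
    root of the infrastructure repository.
    """
    return (
        repo_root
        / "config"
        / "clusters"
        / cluster_name
        / "enc-deployer-credentials.secret.json"
    )
```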
39
From https://infrastructure.2i2c.org/hub-deployment-guide/hubs/new-hub/
Should be a "Helm chart configuration"
40
Actually, user servers are "containers" running in isolation from each other, but possibly on the same physical machine.
https://docs.2i2c.org/user/topics/data/filesystem/
41
Mention that https://cloud.google.com/logging/docs/view/query-library is a good reference for queries in GCP
42
In GPU setup, mention a check for GPU availability in zones
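One possible shape for the check in item 42, as a sketch only: list accelerator types with gcloud and look up which zones offer the GPU you want. The helper name is hypothetical, and the record layout (a "name" field and a "zone" field that may be a full resource URL) is an assumption about the JSON output that should be verified against real output.

```python
def zones_offering(accelerator: str, records: list[dict]) -> list[str]:
    """Return the sorted zone names in which the given accelerator type is listed.

    `records` is assumed to be the parsed JSON from something like:
        gcloud compute accelerator-types list --format=json
    The field names used here are assumptions about that output.
    """
    zones = set()
    for record in records:
        if record.get("name") == accelerator:
            # The zone may be a full resource URL; keep only the last segment.
            zone = record.get("zone", "").rsplit("/", 1)[-1]
            if zone:
                zones.add(zone)
    return sorted(zones)


# Made-up sample data for illustration, not real gcloud output.
sample_records = [
    {"name": "nvidia-tesla-t4", "zone": "https://example.invalid/zones/us-central1-a"},
    {"name": "nvidia-tesla-t4", "zone": "us-central1-b"},
    {"name": "nvidia-tesla-v100", "zone": "us-central1-a"},
]
```

Calling zones_offering with the sample data and "nvidia-tesla-t4" returns the two zones that list that accelerator, which can then be compared with the zone chosen for the cluster.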