Page not found
+The page you requested cannot be found (perhaps it was moved or renamed).
+You may want to try searching to find the page's new location, or use +the table of contents to find the page you are looking for.
++
The page you requested cannot be found (perhaps it was moved or renamed).
+You may want to try searching to find the page's new location, or use +the table of contents to find the page you are looking for.
+*If you haven’t yet read the getting started Wiki pages; start there
+Every chapter needs to start out with this chunk of code:
+*Every chapter also needs Learning objectives that will look like this:
+This chapter will cover:
+For this chapter, we’ll need the following packages attached:
+*Remember to add any additional packages you need to your course’s own docker image.
+ +Here’s a subheading and some text in this subsection!
+You can demonstrate code like this:
+output_dir <- file.path("resources", "code_output")
+if (!dir.exists(output_dir)) {
+ dir.create(output_dir)
+}
And make plots too:
+ + +You can also save these plots to file:
+ +## $breaks
+## [1] 4.0 4.5 5.0 5.5 6.0 6.5 7.0 7.5 8.0
+##
+## $counts
+## [1] 5 27 27 30 31 18 6 6
+##
+## $density
+## [1] 0.06666667 0.36000000 0.36000000 0.40000000 0.41333333 0.24000000 0.08000000
+## [8] 0.08000000
+##
+## $mids
+## [1] 4.25 4.75 5.25 5.75 6.25 6.75 7.25 7.75
+##
+## $xname
+## [1] "iris$Sepal.Length"
+##
+## $equidist
+## [1] TRUE
+##
+## attr(,"class")
+## [1] "histogram"
+
+## png
+## 2
+How to include a Google slide. It’s simplest to use the ottrpal
package:
But if you have the slide or some other image locally downloaded you can also use html like this:
+ +To show videos in your course, you can use markdown syntax like this:
+ +Alternatively, you can use knitr::include_url()
like this:
+Note that we are using echo=FALSE
in the code chunk because we don’t want the code part of this to show up.
+If you are unfamiliar with how R Markdown code chunks work, read this.
OR this works:
+ +Examples of including a website link.
+This works:
+ +OR this:
+OR this:
+ +We can put citations at the end of a sentence like this (Allaire et al. 2021). +Or multiple citations Xie, Allaire, and Grolemund (2018).
+but they need a ; separator (Allaire et al. 2021; Xie, Allaire, and Grolemund 2018).
+In text, we can put citations like this Allaire et al. (2021).
+There are some preset callout boxes available, which can be used like so:
+notice
:
Here’s something interesting.
+warning
:
Look out!
+dictionary
:
Here’s a definition.
+reflection
:
Consider this!
+github
:
Here’s how you use GitHub.
+wip
:
This section is a Work in Progress.
+You should print out session info when you have code for reproducibility purposes.
+ +## ─ Session info ───────────────────────────────────────────────────────────────
+## setting value
+## version R version 4.3.2 (2023-10-31)
+## os Ubuntu 22.04.4 LTS
+## system x86_64, linux-gnu
+## ui X11
+## language (EN)
+## collate en_US.UTF-8
+## ctype en_US.UTF-8
+## tz Etc/UTC
+## date 2024-08-07
+## pandoc 3.1.1 @ /usr/local/bin/ (via rmarkdown)
+##
+## ─ Packages ───────────────────────────────────────────────────────────────────
+## package * version date (UTC) lib source
+## askpass 1.2.0 2023-09-03 [1] RSPM (R 4.3.0)
+## bookdown 0.39.1 2024-06-11 [1] Github (rstudio/bookdown@f244cf1)
+## bslib 0.6.1 2023-11-28 [1] RSPM (R 4.3.0)
+## cachem 1.0.8 2023-05-01 [1] RSPM (R 4.3.0)
+## cli 3.6.2 2023-12-11 [1] RSPM (R 4.3.0)
+## curl 5.2.0 2023-12-08 [1] RSPM (R 4.3.0)
+## devtools 2.4.5 2022-10-11 [1] RSPM (R 4.3.0)
+## digest 0.6.34 2024-01-11 [1] RSPM (R 4.3.0)
+## ellipsis 0.3.2 2021-04-29 [1] RSPM (R 4.3.0)
+## evaluate 0.23 2023-11-01 [1] RSPM (R 4.3.0)
+## fansi 1.0.6 2023-12-08 [1] RSPM (R 4.3.0)
+## fastmap 1.1.1 2023-02-24 [1] RSPM (R 4.3.0)
+## fs 1.6.3 2023-07-20 [1] RSPM (R 4.3.0)
+## glue 1.7.0 2024-01-09 [1] RSPM (R 4.3.0)
+## highr 0.10 2022-12-22 [1] RSPM (R 4.3.0)
+## hms 1.1.3 2023-03-21 [1] RSPM (R 4.3.0)
+## htmltools 0.5.7 2023-11-03 [1] RSPM (R 4.3.0)
+## htmlwidgets 1.6.4 2023-12-06 [1] RSPM (R 4.3.0)
+## httpuv 1.6.14 2024-01-26 [1] RSPM (R 4.3.0)
+## httr 1.4.7 2023-08-15 [1] RSPM (R 4.3.0)
+## jquerylib 0.1.4 2021-04-26 [1] RSPM (R 4.3.0)
+## jsonlite 1.8.8 2023-12-04 [1] RSPM (R 4.3.0)
+## knitr 1.47.3 2024-06-11 [1] Github (yihui/knitr@e1edd34)
+## later 1.3.2 2023-12-06 [1] RSPM (R 4.3.0)
+## lifecycle 1.0.4 2023-11-07 [1] RSPM (R 4.3.0)
+## magrittr * 2.0.3 2022-03-30 [1] RSPM (R 4.3.0)
+## memoise 2.0.1 2021-11-26 [1] RSPM (R 4.3.0)
+## mime 0.12 2021-09-28 [1] RSPM (R 4.3.0)
+## miniUI 0.1.1.1 2018-05-18 [1] RSPM (R 4.3.0)
+## openssl 2.1.1 2023-09-25 [1] RSPM (R 4.3.0)
+## ottrpal 1.2.1 2024-06-11 [1] Github (jhudsl/ottrpal@828539f)
+## pillar 1.9.0 2023-03-22 [1] RSPM (R 4.3.0)
+## pkgbuild 1.4.3 2023-12-10 [1] RSPM (R 4.3.0)
+## pkgconfig 2.0.3 2019-09-22 [1] RSPM (R 4.3.0)
+## pkgload 1.3.4 2024-01-16 [1] RSPM (R 4.3.0)
+## png 0.1-8 2022-11-29 [1] CRAN (R 4.3.2)
+## profvis 0.3.8 2023-05-02 [1] RSPM (R 4.3.0)
+## promises 1.2.1 2023-08-10 [1] RSPM (R 4.3.0)
+## purrr 1.0.2 2023-08-10 [1] RSPM (R 4.3.0)
+## R6 2.5.1 2021-08-19 [1] RSPM (R 4.3.0)
+## Rcpp 1.0.12 2024-01-09 [1] RSPM (R 4.3.0)
+## readr 2.1.5 2024-01-10 [1] RSPM (R 4.3.0)
+## remotes 2.4.2.1 2023-07-18 [1] RSPM (R 4.3.0)
+## rlang 1.1.4 2024-06-04 [1] CRAN (R 4.3.2)
+## rmarkdown 2.27.1 2024-06-11 [1] Github (rstudio/rmarkdown@e1c93a9)
+## sass 0.4.8 2023-12-06 [1] RSPM (R 4.3.0)
+## sessioninfo 1.2.2 2021-12-06 [1] RSPM (R 4.3.0)
+## shiny 1.8.0 2023-11-17 [1] RSPM (R 4.3.0)
+## stringi 1.8.3 2023-12-11 [1] RSPM (R 4.3.0)
+## stringr 1.5.1 2023-11-14 [1] RSPM (R 4.3.0)
+## tibble 3.2.1 2023-03-20 [1] CRAN (R 4.3.2)
+## tzdb 0.4.0 2023-05-12 [1] RSPM (R 4.3.0)
+## urlchecker 1.0.1 2021-11-30 [1] RSPM (R 4.3.0)
+## usethis 2.2.3 2024-02-19 [1] RSPM (R 4.3.0)
+## utf8 1.2.4 2023-10-22 [1] RSPM (R 4.3.0)
+## vctrs 0.6.5 2023-12-01 [1] RSPM (R 4.3.0)
+## xfun 0.44.4 2024-06-11 [1] Github (yihui/xfun@9da62cc)
+## xml2 1.3.6 2023-12-04 [1] RSPM (R 4.3.0)
+## xtable 1.8-4 2019-04-21 [1] RSPM (R 4.3.0)
+## yaml 2.3.8 2023-12-11 [1] RSPM (R 4.3.0)
+##
+## [1] /usr/local/lib/R/site-library
+## [2] /usr/local/lib/R/library
+##
+## ──────────────────────────────────────────────────────────────────────────────
+
+There are several pieces of content that are useful across multiple books, such as “How to Create a Workspace”. To reduce maintenance burden, the AnVIL_Template
repository has a collection of _child
R Markdown documents that can be included in your books.
You can see all the available AnVIL modules in the AnVIL_Template book
+cow::borrow_chapter()
To add a module to your book, include a code chunk with the cow::borrow_chapter()
function, specifying the file for the module you want to borrow and the repository you want to borrow it from (the AnVIL_Template
repository).
See the OTTR_Template wiki page for the most recent instructions and additional details.
+Here is an example of including the _child_workspace_create.Rmd
with cow::borrow_chapter()
In the drop-down menu on the left, navigate to “Workspaces”. Click the triple bar in the top left corner to access the menu. Click “Workspaces”.
+Click on the plus icon near the top of left of the page.
+Name your Workspace and select the appropriate Billing Project. All activity in the Workspace will be charged to this Billing Project (regardless of who conducted it).
+If you are working with protected data, you can set the Authorization Domain to limit who can be added to your Workspace. Note that the Authorization Domain cannot be changed after the Workspace is created (i.e. there is no way to make this Workspace shareable with a larger audience in the future). Workspaces by default are only visible to people you specifically share them with. Authorization domains add an extra layer of enforcement over privacy, but by nature make sharing more complicated. We recommend using Authorization Domains in cases where it is extremely important and/or legally required that the data be kept private (e.g. protected patient data, industry data). For data you would merely prefer not be shared with the world, we recommend relying on standard Workspace sharing permissions rather than Authorization Domains, as Authorization Domains can make future collaborations, publications, or other sharing complicated.
+Click “CREATE WORKSPACE”. The new Workspace should now show up under your Workspaces.
+You can use the borrowed_chunk
custom div block to highlight borrowed content or any other content you choose.
Basic format:
+:::: {.borrowed_chunk}
+Code/Content here
+::::
+Rendered with highlighting:
+If you do not already have a Google account that you would like to use for accessing Terra, create one now.
+If you would like to create a Google account that is associated with your non-Gmail, institutional email address, follow these instructions.
+Rendered without highlighting:
+If you do not already have a Google account that you would like to use for accessing Terra, create one now.
+If you would like to create a Google account that is associated with your non-Gmail, institutional email address, follow these instructions.
+ +Modules about billing and Billing Projects on Google Cloud Platform and Terra.
+Log in to the Google Cloud Platform console using your Google ID. Make sure to use the same Google account ID you use to log into Terra.
If you are a first time user, don’t forget to claim your free credits! If you haven’t been to the console before, once you accept the Terms of Service you will be greeted with an invitation to “Try for Free.”
+Follow the instructions to sign up for a Billing Account and get your credits.
Choose “Individual Account”. This “billing account” is just for managing billing, so you don’t need to be able to add your team members. You will need to give either a credit card or bank account for security. Don’t worry! You won’t be billed until you explicitly turn on automatic billing.
+You can view and edit your new Billing Account, by selecting “Billing” from the left-hand menu, or going directly to the billing console console.cloud.google.com/billing
+Clicking on the Billing Account name will allow you to manage the account, including accessing reports, setting alerts, and managing payments and billing.
+At any point, you can create additional Billing Accounts using the Create Account button. We generally recommend creating a new Billing Account for each funding source.
+This gives Terra permission to create projects and send charges to the Google Billing Account, and must be done by an administrator of the Google Billing Account.
+Terra needs to be added as a “Billing Account User”:
+Log in to the Google Cloud Platform console using your Google ID.
Navigate to Billing
+You may be automatically directed to view a specific Billing Account. If you see information about a single account rather than a list of your Billing Accounts, you can get back to the list by clicking “Manage Billing Accounts” from the drop-down menu.
+Check the box next to the Billing Account you wish to add Terra to, click “ADD MEMBER”.
+Enter terra-billing@terra.bio
in the text box. In the drop-down menu, mouse over Billing, then choose “Billing Account User”.
Click “SAVE”.
+Anyone you wish to add to the Billing Account will need their own Google ID.
+To add a member to a Billing Project:
+Log in to the Google Cloud Platform console using your Google ID.
Navigate to Billing
+You may be automatically directed to view a specific Billing Account. If you see information about a single account rather than a list of your Billing Accounts, you can get back to the list by clicking “Manage Billing Accounts” from the drop-down menu.
+Check the box next to the Billing Account you wish to add a member to, click “ADD MEMBER”.
+Enter their Google ID in the text box. In the drop-down menu, mouse over Billing, then choose the appropriate role.
+Click “SAVE”.
+Log in to the Google Cloud Platform console using the Google ID associated with your Google Cloud projects.
Open the dropdown menu on the top left and click on Billing.
+You may be automatically directed to view a specific Billing Account. If you see information about a single account (and it’s not the one you’re interested in), you can get back to the list of all your Billing Accounts by clicking “Manage Billing Accounts” from the drop-down menu.
+Click on the name of the Billing Account you want to set alerts for.
+In the left-hand menu, click “Budgets & alerts”.
+Click the “Create Budget” tab.
+Enter a name for your budget, and then choose which projects you want to monitor. Then click “Next”.
+For Budget Type, select “Specified amount”. Enter the total budget amount for the month (you will set alerts at different thresholds in the next step). Click “Next” (do not click “Finish”).
+Enter the threshold amounts where you want to receive an alert. We recommend starting with 50% and 90%. You can set other alerts if you prefer.
+Check the box for “Email alerts to billing admins and users”, then click “Finish”. Now you (as the owner and admin), along with anyone you added with admin or user privileges (e.g. lab managers) will receive alerts when your lab members reach the specified spending thresholds. These emails will be sent to the Gmail accounts associated with the Billing Account.
+You can edit your budgets at any time by going to Billing > Budgets & alerts, and clicking on the name of the budget you want to edit.
+You can always check your current spend through the Google Billing console, but remember
+The Google Billing console displays information by Billing Account. To view spending:
+Log in to the Google Cloud Platform console using the Google ID associated with your Google Cloud projects.
Open the dropdown menu on the top left and click on Billing.
+You may be automatically directed to view a specific Billing Account. If you see information about a single account (and it’s not the one you’re interested in), you can get back to the list of all your Billing Accounts by clicking “Manage Billing Accounts” from the drop-down menu.
+Click on the name of the Billing Account for the project you want to view.
+Look at the top of the Overview tab to see your month-to-date spending.
+Scroll further down the Overview tab to show your top projects.
+Click on the Reports tab to see more detailed information about each of your projects. This is probably the most useful tab for exploring costs of individual projects over time.
+Click on the Cost table tab to obtain a convenient table of spending per project.
+Launch Terra and sign in with your Google account. If this is your first time logging in to Terra, you will need to accept the Terms of Service.
In the drop-down menu on the left, navigate to “Billing”. Click the triple bar in the top left corner to access the menu. Click the arrow next to your name to expand the menu, then click “Billing”. You can also navigate there directly with this link: https://anvil.terra.bio/#billing
+On the Billing page, click the “+ CREATE” button to create a new Billing Project. If prompted, select the Google account to use. If prompted, give Terra permission to manage Google Cloud Platform billing accounts.
+Enter a unique name for your Terra Billing Project and select the appropriate Google Billing Account. The name of the Terra Billing Project must:
+Select the Google Billing Account to use. All activities conducted under your new Terra Billing Project will charge to this Google Billing Account. If prompted, give Terra permission to manage Google Cloud Platform billing accounts.
+Click “CREATE BILLING PROJECT”.
+Your new Billing Project should now show up in the list of Billing Projects Owned by You. You can add additional members or can modify or deactivate the Billing Project at any time by clicking on its name in this list.
+The page doesn’t always update as soon as the Billing Project is created. If it’s been a couple of minutes and you don’t see a change, try refreshing the page.
+Launch Terra and sign in with your Google account.
In the drop-down menu on the left, navigate to “Billing”. Click the triple bar in the top left corner to access the menu. Click the arrow next to your name to expand the menu, then click “Billing”. You can also navigate there directly with this link: https://anvil.terra.bio/#billing
+Click “Owned by You” and find the Billing Project. If you do not see the Billing Project in this list, then you are not an Owner and do not have permission to add members.
+Click on the name of the Billing Project.
+Click on the “Members” tab to view and manage members. Then click the “Add User” button.
+Enter the email address of the user or group you’d like to add the the Billing Project.
+If this user or group will need to add and remove other users of the Billing Project, check the Owner box. Otherwise leave it unchecked.
+Click “ADD USER”.
+You should now see the user or group listed in the Billing Project members, along with the appropriate role. They should now be able to use the Billing Project to fund work on AnVIL.
+If you need to remove members or modify their roles, you can do so at any time by clicking the teardrop button next to their name.
+ +By default this module includes a warning to make sure people understand they will lose access to their Workspace buckets. You can remove the warning from this module by setting AnVIL_module_settings$warning
to FALSE
before running cow::borrow_chapter
:
AnVIL_module_settings <- list(
+ warning = FALSE
+)
+cow::borrow_chapter(
+ doc_path = "child/_child_terra_billing_project_disable.Rmd",
+ repo_name = "jhudsl/AnVIL_Template"
+)
+Disabling a Billing Project makes Workspace contents inaccessible!
+Disabling a Billing Project disables funding to all Workspaces funded by the Billing Project. You will be unable to compute in these Workspaces, and you will lose access to any data stored in the Workspace buckets. It is sometimes possible to restore access by reactivating billing, but Google makes no promises about whether or how long the data will be recoverable.
+Make sure everyone with Workspaces funded by the Billing Project has saved anything they want to keep in another location before disabling the Billing Project.
+To disable a Terra Billing Project (i.e. remove the Google Billing Account that funds the Terra Billing Project):
+Launch Terra and sign in with your Google account.
In the drop-down menu on the left, navigate to “Billing”. Click the triple bar in the top left corner to access the menu. Click the arrow next to your name to expand the menu, then click “Billing”. You can also navigate there directly with this link: https://anvil.terra.bio/#billing
+Click “Owned by You” and find the Billing Project. If you do not see the Billing Project in this list, then you are not an Owner and do not have permission to add members.
+Click on the name of the Billing Project.
+If you don’t see information about the Billing Account, click on “View billing account” to expand the Billing Account information. You may be prompted to enter your login information again.
+You should see the name of the Google Billing Account that is funding this Terra Billing Project. Click on the teardrop icon next to the name of the Billing Account.
+Click “Remove Billing Account”.
+Click OK to confirm that you want to disable funding for this Billing Project.
+The page should now indicate that there is no linked billing account.
+If necessary, you can restore funding to the Billing Project and associated Workspaces by clicking the teardrop icon and selecting “Change Billing Account”. However, Google makes no promises about how long the Workspace contents will remain available after you disable funding, so it is best not to rely on them.
+ +August 07, 2024
+This book is part of a series of books for the Genomic Data Science Analysis, Visualization, and Informatics Lab-space (AnVIL) of the National Human Genome Research Institute (NHGRI). Learn more about AnVIL by visiting https://anvilproject.org or reading the article in Cell Genomics.
+ +Please check out our full collection of AnVIL and related resources: https://hutchdatascience.org/AnVIL_Collection/
+Modules aimed at instructors.
+You may be wondering if AnVIL is a good choice for your class. We feel the answer is an unequivocal YES!
+AnVIL provides all the advantages of a cloud computing environment:
+AnVIL is purpose-built for genomic data science:
+Here is a checklist to help you keep track of the steps for setting up and teaching on AnVIL. Details about each of these steps can be found in the AnVIL Instructor Guide. Note that this checklist is written for workshop instructors as well as classroom instructors, so some of the “Event Setup” steps may not be applicable to you, depending on your teaching context.
+1+ month before:
+~1 week before:
+Day of (or shortly before):
+After:
+If you are new to AnVIL, we strongly recommend seeking funding through NHGRI / STRIDES for teaching your course. In order to prevent abuse, Google sets certain limits on cloud resources available to new users that can complicate teaching (e.g. project quotas). As you or your institution builds a payment history with Google, the limits on your accounts will be relaxed. In the meantime, STRIDES funding can help ensure your initial classes run smoothly.
+What is STRIDES? NHGRI’s strategic vision highlights the importance of training the next generation of genomic scientists. In collaboration with the NIH Office of Data Science Strategy, NHGRI makes funding available to AnVIL users through the STRIDES program, which aims to promote biological research in cloud environments.
+To inquire about funding your class on AnVIL, please contact help@lists.anvilproject.org.
+Joining a team on AnVIL.
+If you do not already have a Google account that you would like to use for accessing Terra, create one now.
+If you would like to create a Google account that is associated with your non-Gmail, institutional email address, follow these instructions.
+ +Modules aimed at students in a course or workshop.
+In order to run your analyses, you will use the AnVIL cloud computing platform, so that you do not need to install everything on your own computer. The AnVIL (Analysis Visualization and Informatics Lab-space) platform is specially designed for analyzing biological data, and is used by scientists doing all sorts of biological research.
+AnVIL in a nutshell
+First, you will need to set up a (free) Google account.
+If you do not already have a Google account that you would like to use for accessing AnVIL, create one now.
+Next, make sure you can log in to Terra – you will use Terra to perform computations on AnVIL.
+You can access Terra by going to anvil.terra.bio
, or by clicking the link on the AnVIL home page.
Open Terra, and you should be prompted to sign in with your Google account.
+These instructions can be customized to a specific workspace by setting certain variables before running cow::borrow_chapter()
. If these variables have not been set, reasonable defaults are provided (e.g. “ask your instructor”).
This will not work until your instructor has given you permission to spend money to “rent” the computers that will power your analyses (by adding you to a “Billing Project”).
+On AnVIL, you access files and computers through Workspaces. Each Workspace functions almost like a mini code laboratory - it is a place where data can be examined, stored, and analyzed. The first thing we want to do is to copy or “clone” a Workspace to create a space for you to experiment. This will give you access to
+Tip +At this point, it might make things easier to open up a new window in your browser and split your screen. That way, you can follow along with this guide on one side and execute the steps on the other.
+To clone an AnVIL Workspace:
+Open Terra - use a web browser to go to anvil.terra.bio
In the drop-down menu on the left, navigate to “Workspaces”. Click the triple bar in the top left corner to access the menu. Click “Workspaces”.
+You are automatically directed to the “MY WORKSPACES” tab. Here you can see any Workspaces that have been shared with you, along with your permission level. Depending on how your instructor has set things up, you may or may not see any Workspaces in this tab.
+Locate the Workspace specified by your instructor. (The images below show the SARS-CoV-2-Genome Workspace as an example, but you should look for the Workspace specified by your instructor.)
+Clone the workspace by clicking the teardrop button (). Select “Clone”. Or, if you have opened the Workspace, you can find the teardrop button on the top right of the Workspace.
++
You will see a popup box appear, asking you to configure your Workspace
+The new Workspace should now show up under “MY WORKSPACES”. You now have your own copy of the Workspace to work in.
This will not work until your instructor has given you permission to spend money to “rent” the computers that will power your analyses (by adding you to a “Billing Project”).
+On AnVIL, you access files and computers through Workspaces. Each Workspace functions almost like a mini code laboratory - it is a place where data can be examined, stored, and analyzed. The first thing we want to do is to copy or “clone” a Workspace to create a space for you to experiment. This will give you access to
+Tip +At this point, it might make things easier to open up a new window in your browser and split your screen. That way, you can follow along with this guide on one side and execute the steps on the other.
+To clone an AnVIL Workspace:
+Open Terra - use a web browser to go to anvil.terra.bio
In the drop-down menu on the left, navigate to “Workspaces”. Click the triple bar in the top left corner to access the menu. Click “Workspaces”.
+You are automatically directed to the “MY WORKSPACES” tab. Here you can see any Workspaces that have been shared with you, along with your permission level. Depending on how your instructor has set things up, you may or may not see any Workspaces in this tab.
+Locate the Workspace Example_Workspace. (The images below show the SARS-CoV-2-Genome Workspace as an example, but you should look for the Workspace Example_Workspace.)
+Clone the workspace by clicking the teardrop button (). Select “Clone”. Or, if you have opened the Workspace, you can find the teardrop button on the top right of the Workspace.
++
You will see a popup box appear, asking you to configure your Workspace
+The new Workspace should now show up under “MY WORKSPACES”. You now have your own copy of the Workspace to work in.
The module below is specially customized for students, allowing you to give more specific instructions on the settings for their Jupyter environment. There are several other general purpose modules that may also be useful for students (e.g. Pausing Jupyter, Deleting Jupyter) that can be found in other chapters of this book.
+The following instructions can be customized by setting certain variables before running cow::borrow_chapter()
. Developers should create these variables as a list AnVIL_module_settings
. The following variables can be provided:
audience
= Defaults to general
, telling them to use the default Jupyter settings. If audience
is set to student
, it gives more specific instructions.docker_image
= Optional, it will tell them how to set the image.startup_script
= Optional, it will tell them how to set the script.AnVIL is very versatile and can scale up to use very powerful cloud computers. It’s very important that you select the cloud computing environment described here to avoid runaway costs.
+Open Terra - use a web browser to go to anvil.terra.bio
In the drop-down menu on the left, navigate to “Workspaces”. Click the triple bar in the top left corner to access the menu. Click “Workspaces”.
+Click on the name of your Workspace. You should be routed to a link that looks like: https://anvil.terra.bio/#workspaces/<billing-project>/<workspace-name>
.
Click on the cloud icon on the far right to access your Cloud Environment options.
+In the dialogue box, click the “Settings” button under Jupyter.
+You will see some configuration options for the Jupyter cloud environment, and a list of costs because it costs a small amount of money to use cloud computing.
+Leave everything else as-is. To create your Jupyter Cloud Environment, scroll down and click the “CREATE” button.
+The dialogue box will close and you will be returned to your Workspace. You can see the status of your cloud environment by hovering over the Jupyter icon. It will take a few minutes for Terra to request computers and install software.
+When your environment is ready, its status will change to “Running”. Click on the “ANALYSES” tab to create or open a Jupyter Notebook.
+From the ANALYSES tab, you can click on the name of an existing Jupyter Notebook to view and launch it, or click the “START” button to create a new Notebook.
+AnVIL is very versatile and can scale up to use very powerful cloud computers. It’s very important that you select the cloud computing environment described here to avoid runaway costs.
+Open Terra - use a web browser to go to anvil.terra.bio
In the drop-down menu on the left, navigate to “Workspaces”. Click the triple bar in the top left corner to access the menu. Click “Workspaces”.
+Click on the name of your Workspace. You should be routed to a link that looks like: https://anvil.terra.bio/#workspaces/<billing-project>/<workspace-name>
.
Click on the cloud icon on the far right to access your Cloud Environment options.
+In the dialogue box, click the “Settings” button under Jupyter.
+You will see some configuration options for the Jupyter cloud environment, and a list of costs because it costs a small amount of money to use cloud computing.
+Under “Application configuration” you will see a dropdown menu. Choose “Custom Environment”. Then copy the following link into “Container image” textbox:
+example docker
Under “Startup script” you will see textbox. Copy the following link into the box:
+example startup script
Leave everything else as-is. To create your Jupyter Cloud Environment, scroll down and click the “CREATE” button.
+The dialogue box will close and you will be returned to your Workspace. You can see the status of your cloud environment by hovering over the Jupyter icon. It will take a few minutes for Terra to request computers and install software.
+When your environment is ready, its status will change to “Running”. Click on the “ANALYSES” tab to create or open a Jupyter Notebook.
+From the ANALYSES tab, you can click on the name of an existing Jupyter Notebook to view and launch it, or click the “START” button to create a new Notebook.
+The module below is specially customized for students, allowing you to give more specific instructions on the settings for their RStudio environment. There are several other general purpose modules that may also be useful for students (e.g. Pausing RStudio, Deleting RStudio) that can be found in other chapters of this book.
+The following instructions can be customized by setting certain variables before running cow::borrow_chapter()
. Developers should create these variables as a list AnVIL_module_settings
. The following variables can be provided:
audience
= Defaults to general
, telling them to use the default RStudio settings. If audience
is set to student
, it gives more specific instructions.docker_image
= Optional, it will tell them to open the customization dialogue and direct them on how to set the image.startup_script
= Optional, it will tell them to open the customization dialogue and direct them on how to set the script.AnVIL is very versatile and can scale up to use very powerful cloud computers. It’s very important that you select the cloud computing environment described here to avoid runaway costs.
+Open Terra - use a web browser to go to anvil.terra.bio
In the drop-down menu on the left, navigate to “Workspaces”. Click the triple bar in the top left corner to access the menu. Click “Workspaces”.
+Click on the name of your Workspace. You should be routed to a link that looks like: https://anvil.terra.bio/#workspaces/<billing-project>/<workspace-name>
.
Click on the cloud icon on the far right to access your Cloud Environment options.
+In the dialogue box, click the “Settings” button under RStudio.
+You will see some details about the default RStudio cloud environment, and a list of costs because it costs a small amount of money to use cloud computing.
+Click the “CREATE” button.
+The dialogue box will close and you will be returned to your Workspace. You can see the status of your cloud environment by hovering over the RStudio logo. It will take a few minutes for Terra to request computers and install software.
+When your environment is ready, its status will change to “Running”. Click on the RStudio logo to open a new dialogue box that will let you launch RStudio.
+Click the launch icon to open RStudio. This is also where you can pause, modify, or delete your environment when needed.
+You should now see the RStudio interface with information about the version printed to the console.
+AnVIL is very versatile and can scale up to use very powerful cloud computers. It’s very important that you select the cloud computing environment described here to avoid runaway costs.
+Open Terra - use a web browser to go to anvil.terra.bio
In the drop-down menu on the left, navigate to “Workspaces”. Click the triple bar in the top left corner to access the menu. Click “Workspaces”.
+Click on the name of your Workspace. You should be routed to a link that looks like: https://anvil.terra.bio/#workspaces/<billing-project>/<workspace-name>
.
Click on the cloud icon on the far right to access your Cloud Environment options.
+In the dialogue box, click the “Settings” button under RStudio.
+You will see some details about the default RStudio cloud environment, and a list of costs because it costs a small amount of money to use cloud computing.
+Click “CUSTOMIZE” to adjust the settings for your environment.
+Under “Application configuration” you will see a dropdown menu. You can also enter text here. Copy the following link into the box:
+example docker
Under “Startup script” you will see textbox. Copy the following link into the box:
+example startup script
Leave everything else as-is. To create your RStudio Cloud Environment, click on the “CREATE” button.
+The dialogue box will close and you will be returned to your Workspace. You can see the status of your cloud environment by hovering over the RStudio logo. It will take a few minutes for Terra to request computers and install software.
+When your environment is ready, its status will change to “Running”. Click on the RStudio logo to open a new dialogue box that will let you launch RStudio.
+Click the launch icon to open RStudio. This is also where you can pause, modify, or delete your environment when needed.
+You should now see the RStudio interface with information about the version printed to the console.
+Modules about Groups and user management
+Launch Terra and sign in with your Google account.
In the drop-down menu on the left, navigate to “Groups”. Click the triple bar in the top left corner to access the menu. Click the arrow next to your name to expand the menu, then click “Groups”. You can also navigate there directly with this link: https://anvil.terra.bio/#groups
+Click “+ Create a New Group”
+Enter a name for your group. Names must be unique, so it’s often helpful to include your team’s name, the purpose of the group, and optionally the timeframe, if you will have similar groups in the future.
+Click “CREATE GROUP”
+Your new Group should now show up on the Group Management screen. Take note of the email address associated with your group. You will use this email to grant the group access to Billing Projects and Workspaces.
+Launch Terra and sign in with your Google account.
In the drop-down menu on the left, navigate to “Groups”. Click the triple bar in the top left corner to access the menu. Click the arrow next to your name to expand the menu, then click “Groups”. You can also navigate there directly with this link: https://anvil.terra.bio/#groups
+Find the name of the Group you want to add someone to, and confirm that you have Admin privileges for the Group (you can only add and remove members to a Group if you are an Admin). Click on the name of the Group to view and manage members.
+Click on “+ Add User”. You will be prompted to enter the user’s AnVIL ID.
+Type in the user’s email address. Make sure this is the account that they will be using to access AnVIL.
+If this member will need to add and remove other members of the Group, check the box for “Can manage members (admin)”. This will add them as an “Admin” for the Group. Otherwise leave it unchecked, and they will be added as a “Member”.
+Click ADD USER. This will take you back to the Group administration page.
+The new Group member will now be shown in the list of group members, along with their role. They should now have access to anything that the Group has been given access to.
+If you need to remove members or modify their roles, you can do so at any time by clicking the teardrop button next to their name.
+ +Launch Terra and sign in with your Google account.
In the drop-down menu on the left, navigate to “Groups”. Click the triple bar in the top left corner to access the menu. Click the arrow next to your name to expand the menu, then click “Groups”. You can also navigate there directly with this link: https://anvil.terra.bio/#groups
+Find the name of the Group, then look in the “Group Email” column to find the email address associated with the Group. Copy this email address and paste it in as the username when adding people to Billing Projects and Workspaces. This will grant everyone in the Group access to the Billing Project or Workspace.
+Modules about opening, touring, and closing AnVIL platforms
+Here is a video tutorial that describes the basics of using Jupyter Notebook on AnVIL.
+ +The slides for this tutorial are are located here.
+AnVIL is very versatile and can scale up to use very powerful cloud computers. It’s very important that you select a cloud computing environment appropriate to your needs to avoid runaway costs. If you are uncertain, start with the default settings; it is fairly easy to increase your compute resources later, if needed, but harder to scale down.
+Note that, in order to use Jupyter, you must have access to a Terra Workspace with permission to compute (i.e. you must be a “Writer” or “Owner” of the Workspace).
+Open Terra - use a web browser to go to anvil.terra.bio
In the drop-down menu on the left, navigate to “Workspaces”. Click the triple bar in the top left corner to access the menu. Click “Workspaces”.
+Click on the name of your Workspace. You should be routed to a link that looks like: https://anvil.terra.bio/#workspaces/<billing-project>/<workspace-name>
.
Click on the cloud icon on the far right to access your Cloud Environment options.
+In the dialogue box, click the “Settings” button under Jupyter.
+You will see some configuration options for the Jupyter cloud environment, and a list of costs because it costs a small amount of money to use cloud computing.
+Configure any settings you need for your cloud environment. If you are uncertain about what you need, the default configuration is a reasonable, cost-conservative choice. It is fairly easy to increase your compute resources later, if needed, but harder to scale down. Scroll down and click the “CREATE” button when you are satisfied with your setup.
+The dialogue box will close and you will be returned to your Workspace. You can see the status of your cloud environment by hovering over the Jupyter icon. It will take a few minutes for Terra to request computers and install software.
+When your environment is ready, its status will change to “Running”. Click on the “ANALYSES” tab to create or open a Jupyter Notebook.
+From the ANALYSES tab, you can click on the name of an existing Jupyter Notebook to view and launch it, or click the “START” button to create a new Notebook.
+Clicking on a Notebook name will open a static preview of the Notebook. To edit and run the Notebook, click the “OPEN” button.
+Here is a video tutorial that describes the basics of using Galaxy on AnVIL.
+ +The slides for this tutorial are are located here.
+Note that, in order to use Galaxy, you must have access to a Terra Workspace with permission to compute (i.e. you must be a “Writer” or “Owner” of the Workspace).
+Open your Workspace, and click on the “NOTEBOOKS” tab. Next, click on “Create a Cloud Environment for Galaxy”. You should see a popup window on the right side of the screen. Click on “NEXT” and “CREATE” to keep all settings as-is. This will take 8-10 minutes. When it is done, click “LAUNCH GALAXY”.
+ +Once you are done with your activity, you’ll need to shut down your Galaxy cloud environment. This frees up the cloud resources for others and minimizes computing cost. The following steps will delete your work, so make sure you are completely finished at this point. Otherwise, you will have to repeat your work from the previous steps.
+Return to AnVIL, and find the Galaxy logo that shows your cloud environment is running. Click on this logo:
+ +Next, click on “DELETE ENVIRONMENT OPTIONS”:
+ +Finally, select “Delete everything, including persistent disk”. Make sure you are done with the activity and then click “DELETE”.
+ +Here is a video tutorial that describes the basics of using RStudio on AnVIL.
+ +The slides for this tutorial are are located here.
+AnVIL is very versatile and can scale up to use very powerful cloud computers. It’s very important that you select a cloud computing environment appropriate to your needs to avoid runaway costs. If you are uncertain, start with the default settings; it is fairly easy to increase your compute resources later, if needed, but harder to scale down.
+Note that, in order to use RStudio, you must have access to a Terra Workspace with permission to compute (i.e. you must be a “Writer” or “Owner” of the Workspace).
+Open Terra - use a web browser to go to anvil.terra.bio
In the drop-down menu on the left, navigate to “Workspaces”. Click the triple bar in the top left corner to access the menu. Click “Workspaces”.
+Click on the name of your Workspace. You should be routed to a link that looks like: https://anvil.terra.bio/#workspaces/<billing-project>/<workspace-name>
.
Click on the cloud icon on the far right to access your Cloud Environment options.
+In the dialogue box, click the “Settings” button under RStudio.
+You will see some details about the default RStudio cloud environment, and a list of costs because it costs a small amount of money to use cloud computing.
+If you are uncertain about what you need, the default configuration is a reasonable, cost-conservative choice. It is fairly easy to increase your compute resources later, if needed, but harder to scale down. Click the “Create” button.
+Otherwise, click “CUSTOMIZE” to modify the environment for your needs.
+The dialogue box will close and you will be returned to your Workspace. You can see the status of your cloud environment by hovering over the RStudio logo. It will take a few minutes for Terra to request computers and install software.
+When your environment is ready, its status will change to “Running”. Click on the RStudio logo to open a new dialogue box that will let you launch RStudio.
+Click the launch icon to open RStudio. This is also where you can pause, modify, or delete your environment when needed.
+You should now see the RStudio interface with information about the version printed to the console.
+Next, we will be using RStudio and the package Glimma
to create interactive plots. See this vignette for more information.
The Bioconductor team has created a very useful package to programmatically interact with Terra and Google Cloud. Install the AnVIL
package. It will make some steps easier as we go along.
You can now quickly install precompiled binaries using the AnVIL package’s install()
function. We will use it to install the Glimma
package and the airway
package. The airway
package contains a SummarizedExperiment
data class. This data describes an RNA-Seq experiment on four human airway smooth muscle cell lines treated with dexamethasone.
{Note: for some of the packages, you will have to install packaged from the CRAN repository, using the install.packages() function. The examples will show you which install method to use.}
+<img src="07-using_platforms_modules_files/figure-html//1BLTCaogA04bbeSD1tR1Wt-mVceQA6FHXa8FmFzIARrg_g11f12bc99af_0_56.png" alt="Screenshot of the RStudio environment interface. Code has been typed in the console and is highlighted." width="100%" />
+Load the example data.
+The multidimensional scaling (MDS) plot is frequently used to explore differences in samples. When this data is MDS transformed, the first two dimensions explain the greatest variance between samples, and the amount of variance decreases monotonically with increasing dimension. The following code will launch a new window where you can interact with the MDS plot.
+Change the colour_by
setting to “groups” so you can easily distinguish between groups. In this data, the “group” is the treatment.
You can download the interactive html file by clicking on “Save As”.
+You can also download plots and other files created directly in RStudio. To download the following plot, click on “Export” and save in your preferred format to the default directory. This saves the file in your cloud environment.
+You should see the plot in the “Files” pane.
+Select this file and click “More” > “Export”
+Select “Download” to save the file to your local machine.
+The right-hand side icon reminds you that you are accruing cloud computing costs. If you don’t see this icon, you may need to scroll to the right.
+You should minimize charges when you are not performing an analysis. You can do this by clicking on the RStudio icon and selecting “Pause”. This will release the CPU and memory resources for other people to use. Note that your work will be saved in the environment and continue to accrue a very small cost. This work will be lost if the cloud environment gets deleted. If there is anything you would like to save permanently, it’s a good idea to copy it from your compute environment to another location, such as the Workspace bucket, GitHub, or your local machine, depending on your needs.
+You can also pause your cloud environment(s) at https://anvil.terra.bio/#clusters.
+Pausing your cloud environment only temporarily stops your work. When you are ready to delete the cloud environment, click on the RStudio icon on the right-hand side and select “Settings”. If you don’t see this icon, you may need to scroll to the right.
+Click on “Delete Environment”.
+If you are certain that you do not need the data and configuration on your disk, you should select “Delete everything, including persistent disk”. If there is anything you would like to save, open the compute environment and copy the file(s) from your compute environment to another location, such as the Workspace bucket, GitHub, or your local machine, depending on your needs.
+Select “DELETE”.
+You can also delete your cloud environment(s) and disk storage at https://anvil.terra.bio/#clusters.
+Modules describing why AnVIL is useful for different personas.
+You may be wondering if AnVIL is a good choice for your class. We feel the answer is an unequivocal YES!
+AnVIL provides all the advantages of a cloud computing environment:
+AnVIL is purpose-built for genomic data science:
+Modules about Terra Workspaces
+Locate the Workspace you want to clone. If a Workspace has been shared with you ahead of time, it will appear in “MY WORKSPACES”. You can clone a Workspace that was shared with you to perform your own analyses. In the screenshot below, no Workspaces have been shared.
+If a Workspace hasn’t been shared with you, navigate to the “FEATURED” or “PUBLIC” Workspace tabs.
+Use the search box to find the Workspace you want to clone.
+Click the teardrop button on the far right next to the Workspace you want to clone. Click “Clone”. You can also clone the Workspace from the Workspace Dashboard instead of the search results.
++
You will see a popup box appear. Name your Workspace and select the appropriate Terra Billing Project. All activity in the Workspace will be charged to this Billing Project (regardless of who conducted it). Remember that each Workspace should have its own Billing Project.
+If you are working with protected data, you can set the Authorization Domain to limit who can be added to your Workspace. Note that the Authorization Domain cannot be changed after the Workspace is created (i.e. there is no way to make this Workspace shareable with a larger audience in the future). Workspaces by default are only visible to people you specifically share them with. Authorization domains add an extra layer of enforcement over privacy, but by nature make sharing more complicated. We recommend using Authorization Domains in cases where it is extremely important and/or legally required that the data be kept private (e.g. protected patient data, industry data). For data you would merely prefer not be shared with the world, we recommend relying on standard Workspace sharing permissions rather than Authorization Domains, as Authorization Domains can make future collaborations, publications, or other sharing complicated.
+Click “CLONE WORKSPACE”. The new Workspace should now show up under your Workspaces.
+In the drop-down menu on the left, navigate to “Workspaces”. Click the triple bar in the top left corner to access the menu. Click “Workspaces”.
+Click on the plus icon near the top of left of the page.
+Name your Workspace and select the appropriate Billing Project. All activity in the Workspace will be charged to this Billing Project (regardless of who conducted it).
+If you are working with protected data, you can set the Authorization Domain to limit who can be added to your Workspace. Note that the Authorization Domain cannot be changed after the Workspace is created (i.e. there is no way to make this Workspace shareable with a larger audience in the future). Workspaces by default are only visible to people you specifically share them with. Authorization domains add an extra layer of enforcement over privacy, but by nature make sharing more complicated. We recommend using Authorization Domains in cases where it is extremely important and/or legally required that the data be kept private (e.g. protected patient data, industry data). For data you would merely prefer not be shared with the world, we recommend relying on standard Workspace sharing permissions rather than Authorization Domains, as Authorization Domains can make future collaborations, publications, or other sharing complicated.
+Click “CREATE WORKSPACE”. The new Workspace should now show up under your Workspaces.
+In the drop-down menu on the left, navigate to “Workspaces”. Click the triple bar in the top left corner to access the menu. Click “Workspaces”.
+You are automatically directed to the “My Workspaces” tab. Here you can see any Workspaces that have been shared with you, along with your permission level.
+ +Click on the name of a Workspace to open it. Opening and viewing a Workspace does not cost anything.
+When you open a Workspace, you are directed to the Workspace Dashboard. This generally has a description of the Workspace contents, as well as some useful details about the Workspace itself.
+From here you can navigate through the different tabs of the Workspace, and if you have sufficient permission, you can start running analyses. If you are only a Reader, you may need to “clone” (make your own copy) of the Workspace before you can start working.
+