You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 5 Next »

An early version of JupyterHub is now available within the SAIL Databank for a limited number of users. As our collective experience of working with JupyterHub evolves and our technical offering, this guide will get updated in line with it.  

Getting started

  1. Note: Access to JupyterHub is only available for users on projects that have paid for additional processing power. 
  2. If you have requested JupyterHub and this has been confirmed you will need to log on to the SAIL Gateway as normal. Once within the Windows 10 environment, open up your Browser and navigate to - http://jupyterhub.sail.k8s.chi.swan.ac.uk/ 
  3. You may be prompted with an error message warning you that the site is not safe, we wish to assure you that it is safe but whilst in development, this site will appear to be unsafe. Please follow the browsers navigation options as detailed below 



4.  Following successfully navigating these warnings you will be logged into JupyterHub and will see a list of notebooks to choose from. Assuming that you are on a GPU project you will see a minimum of 3 options:

  • The first is a basic (non-GPU) notebook. This has python and R kernels installed, and also allows you to launch VS Code and RStudio from within the notebook if you prefer a more fully-featured IDE.
  • The second is a notebook with the same features as the first that will attach itself to a GPU, but that does not contain any GPU drivers or related python libraries. You will have to install all of your own GPU drivers from within the notebook if you select this. Not recommended.
  • The third is a GPU-attached notebook with CUDA 11.6.2, Tensorflow, and other common python ML libraries preinstalled. It is configured to automatically surface your specific project GPU to Tensorflow within the notebook. This notebook only supports Python. It also includes VS Code and Tensorboard, as well as an extension for monitoring your GPU resource usage.

5. If you are on more than 1 GPU project (e.g. project 1234 with GPU and project 1653 with GPU) you will see separate options in the notebook image list for each project. In this case, the list will look something like this:

  • Standard Jupyter notebook
  • Standard Jupyter notebook with GPU for project 1234
  • GPU-enabled Jupyter notebook with GPU for project 1234
  • Standard Jupyter notebook with GPU for project 1653
  • GPU-enabled Jupyter notebook with GPU for project 1653

All of your notebooks have the same underlying file system, so choosing a different notebook doesn't affect any access to the files that you keep on JupyterHub.

A request on notebook choice

We would like to strongly suggest that users only select GPU-enabled notebooks if they are intending to run workloads that will actually use the GPU. If a user is just doing data cleaning or exploratory analysis then the 'Standard Jupyter Notebook' is more than sufficient for this. It is also very important that users on multiple GPU projects select the correct notebook for the project that they will be working on. We will be monitoring resource usage to help us better understand the usage patterns and usage rates of projects with GPUs.


Inside the notebook

After selecting your notebook, your server will start up (after a very short delay) and the traditional Jupyter Lab interface will load. This should be an interface that is familiar. Our notebooks offer some special features, including:

  • CloudBeaver – this lets you connect to DB2 and run SQL directly in the window. 
  • VS Code - a fully featured IDE with several extensions pre-installed.
  • RStudio - the RStudio IDE.

Installing new packages

The notebooks are configured so that any new conda environment you create (see Installing Library Packages in Anaconda) will automatically create a corresponding kernel launcher on your Jupyter homepage. For this reason, we recommend that users primarily install packages via Anaconda, rather than pip or CRAN, where possible.

Python

  1. Open a Terminal window from the homepage.
  2. Create a new conda environment to install your package in, conda create --name myshinynewenv
  3. Activate your new environment, conda activate myshinynewenv
  4. Install your package, e.g. conda install -c conda-forge recordlinkage
  5. You can also install from pip in the traditional way, ensuring you activate the environment (step 3) that you want to install the package into first.
  6. When you close the Terminal window and return to the homepage you should see a new python kernel with the same name as your new conda environment.

R

  1. Open a Terminal window from the homepage.
  2. If you want to use your new package within R Studio then just install it into the base environment.
  3. If you want to use it within Jupyter R then create a new env as in steps 2 and 3 in the Python section above.
  4. Install your package, e.g. conda install -c r r-terra
  5. You can also install from CRAN in the traditional R way.

Using GIT

The notebooks are set up to be able to pull and push to the SAIL Gitlab. You can set this up how you set it up on your SAIL Desktop (see How to use GITLAB within SeRP, and instructions on the internal SAIL wiki).







  • No labels