As a preliminary step, each of you will:
Connect to a Data Science project
Create a Data Connection
Deploy a Data Science Pipeline Server
Launch a Workbench
Clone the git repo into your Workbench
The instructions below will guide you through these steps. Follow them carefully.
Click on the available project. You should land on a page similar to this one:
We have deployed an instance of Minio in the cluster to act as simple object storage for our purposes.
You will need to add a data connection that points to it. Scroll down to the bottom of the data science project page and click on “Data Connections”:
You will land on an empty state page. Click on “Add data connection”. Here is the information you need to enter:
Name: pipelines
Access Key: userX
Secret Key: minio123
Endpoint: https://minio-s3-minio.apps.crazy-train.sandbox1730.opentlc.com
Region: none
Bucket: userX
IMPORTANT: Once again, the bucket you use must match the user ID you were given (replace userX with your own user ID).
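If it helps to double-check the values before typing them into the form, the settings above can be written down as a small Python sketch. It assumes the six values are, in order, the name, access key, secret key, endpoint, region, and bucket. The helper names `data_connection_for` and `check_data_connection`, the dictionary keys, and the example user ID `user7` are hypothetical and exist only for illustration; they are not part of the product.

```python
from urllib.parse import urlparse

# Endpoint of the Minio instance given in the instructions above.
MINIO_ENDPOINT = "https://minio-s3-minio.apps.crazy-train.sandbox1730.opentlc.com"


def data_connection_for(user_id: str) -> dict:
    """Build the form values for one user (illustrative helper, not a product API)."""
    return {
        "name": "pipelines",
        "access_key": user_id,      # assumption: the access key is the user ID
        "secret_key": "minio123",
        "endpoint": MINIO_ENDPOINT,
        "region": "none",
        "bucket": user_id,          # the bucket must match the user ID
    }


def check_data_connection(conn: dict, user_id: str) -> list:
    """Return a list of problems; an empty list means the values look consistent."""
    problems = []
    if conn["bucket"] != user_id:
        problems.append("bucket does not match the user ID")
    parsed = urlparse(conn["endpoint"])
    if parsed.scheme != "https" or not parsed.netloc:
        problems.append("endpoint must be a full https:// URL")
    return problems


conn = data_connection_for("user7")
print(check_data_connection(conn, "user7"))
```

A mismatched bucket (for example, another attendee's user ID) would be reported by `check_data_connection` as a problem rather than passing silently.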
It is highly recommended to create your pipeline server before creating a workbench. So let’s do that now!
Select the Data Connection created earlier (pipelines) and click the Configure pipeline server button:
Wait until the pipeline server has been created. When it is ready, your screen will look like the following:
At this point, your pipeline server is ready and deployed.
IMPORTANT: You need to wait until that screen is ready. If it’s still spinning, wait for it to complete. If you continue and create your workbench before the pipeline server is ready, your workbench will not be able to submit pipelines to it.
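The "wait until ready, then continue" rule above is a plain polling pattern. The sketch below illustrates it in Python under stated assumptions: `wait_until_ready` is a hypothetical helper, and `fake_pipeline_server_ready` is a stand-in check that simply reports ready on its third call. Nothing here talks to the real pipeline server; in this lab the readiness check is you watching the screen.

```python
import time
from typing import Callable


def wait_until_ready(is_ready: Callable[[], bool],
                     timeout_s: float = 300.0,
                     interval_s: float = 5.0) -> bool:
    """Poll is_ready() until it returns True or the timeout elapses."""
    deadline = time.monotonic() + timeout_s
    while True:
        if is_ready():
            return True
        if time.monotonic() >= deadline:
            return False
        time.sleep(interval_s)


# Stand-in check (hypothetical): reports "ready" on the third call.
calls = {"n": 0}


def fake_pipeline_server_ready() -> bool:
    calls["n"] += 1
    return calls["n"] >= 3


ready = wait_until_ready(fake_pipeline_server_ready, timeout_s=1.0, interval_s=0.01)
if ready:
    print("Pipeline server is ready; safe to create the workbench.")
else:
    print("Still not ready; do not create the workbench yet.")
```

The point of the pattern is the ordering: the next step (creating the workbench) only runs after the check has succeeded, never on a timer alone.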