Install Red Hat OpenShift Data Science in Red Hat OpenShift Service on AWS

Red Hat OpenShift Data Science is a platform for data scientists and developers of artificial intelligence (AI) applications. It provides a fully supported environment that lets you rapidly develop, train, test, and deploy machine learning models on-premises and/or in the public cloud. 

OpenShift Data Science is provided as a managed cloud service add-on for Red Hat OpenShift or as self-managed software that you can install on-premise or in the public cloud on OpenShift.

In this learning path, you will install OpenShift Data Science on Red Hat OpenShift Service on AWS (ROSA), a fully managed application platform for building and deploying applications. If you can’t remember how to launch OpenShift Data Science, return to the Launch Red Hat OpenShift Data Science learning path.


Access the AWS web console

To follow along with this learning path, you will need an AWS account with ROSA and Red Hat OpenShift Data Science installed.

Log in to your AWS web console (Figure 1).

Enter openshift in the search menu.  In the search results, select Red Hat OpenShift Service on AWS (Figure 2).

Clicking this link takes you to the Red Hat OpenShift Service on AWS home page (Figure 3). 

Next, you will install the Red Hat OpenShift Data Science Operator.

 

To install OpenShift Data Science, click the Enable Red Hat OpenShift button and Download the CLI.

Next, you will need to log in to your Red Hat OpenShift Service on AWS account. Make sure that you log in as Cluster-Admin (see Figure 1).

The Red Hat OpenShift Service on AWS login screen.
Figure 1. The Red Hat OpenShift Service on AWS login screen.

 

Enter your username and password, as shown in Figure 2.

Enter your cluster-admin username and password.
Figure 2. Enter your cluster-admin username and password.

 

If your login was successful, you should see the ROSA overview page, as shown in Figure 3.

Landing page for Red Hat OpenShift Service on AWS.
Figure 3. Landing page for Red Hat OpenShift Service on AWS.

 

You are now ready to install Red Hat OpenShift Data Science. Select the waffle (grid) icon in the upper-right corner of the console (highlighted in Figure 4).

Use the waffle icon to navigate to the Red Hat Hybrid Cloud Console from the ROSA console.
Figure 4. Use the waffle icon to navigate to the Red Hat Hybrid Cloud Console from the ROSA console.

 

Choose Red Hat Hybrid Cloud Console from the options displayed and navigate to the Clusters option in the left navigation panel. You should now see your newly created cluster listed. See Figure 5.

The newly created rosa-wrh84 cluster is listed in the console.
Figure 5. The newly created rosa-wrh84 cluster is listed in the console.

 

To meet the prerequisites for installing OpenShift Data Science on your cluster, we need to create a machine pool with the necessary vCPU and memory requirements. The Install button will be disabled until we do so. 

Click on the Machine pools tab and then the Add machine pool button, as shown in Figure 6.

After clicking the Machine pools tab, you can add a machine pool.
Figure 6. After clicking the Machine pools tab, you can add a machine pool.

 

Give your machine pool a name, select the m5.4xlarge node instance from the dropdown menu, and click Add machine pool. See Figure 7.

Add a machine pool name and select the m5.4xlarge node instance.
Figure 7. Add a machine pool name and select the m5.4xlarge node instance.

 

Click the Add-ons tab. See Figure 8.

View available add-ons in the Add ons tab.
Figure 8. View available add-ons in the Add ons tab.

 

In the Add ons tab, you will see a list (in card format) of the applications and services that are available to install on your cluster. Select Red Hat OpenShift Data Science and click the Install button. See Figure 9.

Select Red Hat OpenShift Data Science and click the Install button.
Figure 9. Select Red Hat OpenShift Data Science and click the Install button.

 

Once Red Hat OpenShift Data Science has finished installing, click the Open in Console button. See Figure 10.

Click on the Open in Console button.
Figure 10. Click on the Open in Console button.

 

A login screen will appear, as shown in Figure 11. Log in with your ROSA credentials.

Log into ROSA.
Figure 11. Log into ROSA.

 

Once you have logged into ROSA, click on the waffle (grid) icon and choose the Red Hat OpenShift Data Science menu option. See Figure 12.

Click on the waffle icon and then choose Red Hat OpenShift Data Science.
Figure 12. Click on the waffle icon and then choose Red Hat OpenShift Data Science.

 

You will be prompted to log in again. Once logged in, you will be in the Red Hat OpenShift Data Science platform! From there, you can choose from a number of data science platform applications and services to work with, such as Jupyter notebooks.  

Conclusion

This ends our learning path. Explore additional resources for ROSA below.

Resources

Video tutorial

Watch this video to learn how to deploy a Red Hat OpenShift cluster to AWS using Red Hat OpenShift Service on AWS.

 

Documentation

Read the documentation for Deploying ROSA without AWS STS, which is a requirement for using OpenShift Data Science.

Learning paths

Looking for more ROSA learning paths? Check out:

Previous resource
Overview: Install Red Hat OpenShift Data Science in Red Hat OpenShift Service on AWS

Info alert: Install Red Hat OpenShift Data Science in Red Hat OpenShift Service on AWS