Model serving in RHODS

In this learning path, you will create a model server and deploy a model in your data science project from the Red Hat OpenShift Data Science dashboard.

We are now ready to create a model server!

In the Models and model servers section, click on Configure server (Figure 5).

Figure 5: Create a model server in the Models and model servers section.
Figure 5: Create a model server in the Models and model servers section.

Choose your configuration options for your model server.

  • Number of model server replicas: This is the number of instances of the model server engine that you want to deploy. You can scale it up as needed, depending on the number of requests you will receive.
  • Model server size: This is the amount of resources, CPU, and RAM that will be allocated to your server. Select the appropriate configuration for size and the complexity of your model.
  • Model route: Check this box if you want the serving endpoint (the model serving API) to be accessible outside of the OpenShift cluster through an external route.
  • Token authorization: Check this box if you want to secure or restrict access to the model by forcing requests to provide an authorization token.

When you have finished your configuration, click on Configure (Figure 6). 

Figure 6: Configure the model server.
Figure 6: Configure the model server.

Your model server is now ready (Figure 7).

Figure 7: The model server is ready.
Figure 7: The model server is ready.

If you want to see details for the model server, click on the Type tab (Figure 8).

Figure 8: This page shows the model server details.
Figure 8: This page shows the model server details.

You can also modify a model server configuration by clicking on the three dots on the right side, and selecting Edit model (Figure 9). This will bring back the same configuration page we used earlier. This menu also allows you to delete the model server.

Figure 9: How to edit the model server.
Figure 9: How to edit the model server.
Previous resource
Overview: Model serving in RHODS
Next resource
Deploy the model