Skip to main content
Redhat Developers  Logo
  • Products

    Featured

    • Red Hat Enterprise Linux
      Red Hat Enterprise Linux Icon
    • Red Hat OpenShift AI
      Red Hat OpenShift AI
    • Red Hat Enterprise Linux AI
      Linux icon inside of a brain
    • Image mode for Red Hat Enterprise Linux
      RHEL image mode
    • Red Hat OpenShift
      Openshift icon
    • Red Hat Ansible Automation Platform
      Ansible icon
    • Red Hat Developer Hub
      Developer Hub
    • View All Red Hat Products
    • Linux

      • Red Hat Enterprise Linux
      • Image mode for Red Hat Enterprise Linux
      • Red Hat Universal Base Images (UBI)
    • Java runtimes & frameworks

      • JBoss Enterprise Application Platform
      • Red Hat build of OpenJDK
    • Kubernetes

      • Red Hat OpenShift
      • Microsoft Azure Red Hat OpenShift
      • Red Hat OpenShift Virtualization
      • Red Hat OpenShift Lightspeed
    • Integration & App Connectivity

      • Red Hat Build of Apache Camel
      • Red Hat Service Interconnect
      • Red Hat Connectivity Link
    • AI/ML

      • Red Hat OpenShift AI
      • Red Hat Enterprise Linux AI
    • Automation

      • Red Hat Ansible Automation Platform
      • Red Hat Ansible Lightspeed
    • Developer tools

      • Red Hat Trusted Software Supply Chain
      • Podman Desktop
      • Red Hat OpenShift Dev Spaces
    • Developer Sandbox

      Developer Sandbox
      Try Red Hat products and technologies without setup or configuration fees for 30 days with this shared Openshift and Kubernetes cluster.
    • Try at no cost
  • Technologies

    Featured

    • AI/ML
      AI/ML Icon
    • Linux
      Linux Icon
    • Kubernetes
      Cloud icon
    • Automation
      Automation Icon showing arrows moving in a circle around a gear
    • View All Technologies
    • Programming Languages & Frameworks

      • Java
      • Python
      • JavaScript
    • System Design & Architecture

      • Red Hat architecture and design patterns
      • Microservices
      • Event-Driven Architecture
      • Databases
    • Developer Productivity

      • Developer productivity
      • Developer Tools
      • GitOps
    • Secure Development & Architectures

      • Security
      • Secure coding
    • Platform Engineering

      • DevOps
      • DevSecOps
      • Ansible automation for applications and services
    • Automated Data Processing

      • AI/ML
      • Data Science
      • Apache Kafka on Kubernetes
      • View All Technologies
    • Start exploring in the Developer Sandbox for free

      sandbox graphic
      Try Red Hat's products and technologies without setup or configuration.
    • Try at no cost
  • Learn

    Featured

    • Kubernetes & Cloud Native
      Openshift icon
    • Linux
      Rhel icon
    • Automation
      Ansible cloud icon
    • Java
      Java icon
    • AI/ML
      AI/ML Icon
    • View All Learning Resources

    E-Books

    • GitOps Cookbook
    • Podman in Action
    • Kubernetes Operators
    • The Path to GitOps
    • View All E-books

    Cheat Sheets

    • Linux Commands
    • Bash Commands
    • Git
    • systemd Commands
    • View All Cheat Sheets

    Documentation

    • API Catalog
    • Product Documentation
    • Legacy Documentation
    • Red Hat Learning

      Learning image
      Boost your technical skills to expert-level with the help of interactive lessons offered by various Red Hat Learning programs.
    • Explore Red Hat Learning
  • Developer Sandbox

    Developer Sandbox

    • Access Red Hat’s products and technologies without setup or configuration, and start developing quicker than ever before with our new, no-cost sandbox environments.
    • Explore Developer Sandbox

    Featured Developer Sandbox activities

    • Get started with your Developer Sandbox
    • OpenShift virtualization and application modernization using the Developer Sandbox
    • Explore all Developer Sandbox activities

    Ready to start developing apps?

    • Try at no cost
  • Blog
  • Events
  • Videos

InstructLab: Advancing generative AI through open source

Make your mark on AI

May 7, 2024
Alina Ryan
Related topics:
Artificial intelligenceLinuxOpen source
Related products:
Red Hat Enterprise LinuxRed Hat Enterprise Linux AI

Share:

    While large language models (LLMs) offer incredible potential, they also come with their share of challenges. Working with LLMs demands high-quality training data, specialized skills and knowledge, and extensive computing resources. The process of forking and retraining a model is also time consuming and expensive.

    The InstructLab project offers an open source approach to generative AI, sourcing community contributions to support regular builds of an enhanced version of an LLM. This approach is designed to lower costs, remove barriers to testing and experimentation, and improve alignment—that is, ensuring the model's answers are accurate, unbiased, and consistent with the values and goals of its users and creators. 

    What is InstructLab?

    Initiated by IBM and Red Hat, the InstructLab project aims to democratize generative AI through the power of an open source community. It simplifies the LLM training phase through community skills and knowledge submissions. 

    InstructLab leverages the LAB (Large-scale Alignment for chatBots) methodology to enable community-driven development and model evolution. To learn more about the science behind LAB approach, see the InstructLab research paper posted by IBM.

    Who is InstructLab for?

    You don’t have to be a rocket scientist to contribute to InstructLab (but it’s great if you are!). With the InstructLab approach, there is minimal technical experience required. Contributions to the model are accepted in the form of knowledge and skills, with topics ranging from Beyoncé facts to professional law. This broad scope of topics makes the process approachable and entertaining. 

    We especially encourage contributions from experts in non-technical fields. Not only will this enhance the model's performance on a topic, but contributions from non-technical industry experts give them a voice in the AI conversation. InstructLab offers a practical way for less technical folks to contribute to a technical space that is poised to have lasting impact on the world.

    What benefits does InstructLab offer?

    InstructLab provides a cost-effective, community-driven solution for improving the alignment of LLMs and makes it easy for those with minimal machine learning experience to contribute.

    Cost-effective

    An open source approach makes InstructLab accessible to individuals and organizations regardless of their financial resources. As long as you have access to a laptop, you can download and use InstructLab tools, as we've designed it to run on laptop hardware. Such accessibility promotes a more inclusive environment for both developers and contributors. 

    Community-driven instruction tuning also drives the cost of model training down. By relying on the community, users cover topic generation by adding tasks of interest via skills and knowledge contributions. The synthetic data generation approach also means a smaller amount of data is needed from those contributions to have an impact on the model during training. This can all be tuned into a small-parameter, open source licensed model that is relatively cheap to both tune and serve for inferencing.

    Community-driven

    Opening up the data generation for the instruction tuning phase of model training to a large pool of contributors helps address innovation challenges that often arise during LLM training. Having a community drives together diverse talent by fostering collaboration among individuals with different backgrounds, expertise, and perspectives. This in turn encourages a wide range of contributions to land into the models. In addition, feedback from the users, contributors, and code reviewers in the community can help inform topic selection instead of solely relying on performance analysis and benchmarking data. 

    Ease of use

    Non-technical people are typically deterred from contributing to software or AI due to perceived complexity and technical barriers. The vast array of models and tooling available and the perceived investment of time and effort required can be overwhelming for anyone, especially those without a technical background. 

    However, InstructLab removes most of these barriers. Thanks to YAML’s structured format and intuitive syntax, it’s easy to contribute knowledge and skill bounties in the form of a question-and-answer template. Contributors also benefit from an entire community with a wealth of resources including forums, docs, and user groups where individuals can seek support from one another. 

    Key features

    • Regularly released models built with community contributions: Stay up to date by creating an account on HuggingFace.co and ‘liking’ the model(s) from the InstructLab repository.
    • Pull request-focused contribution process in the open community: Keep track of new knowledge and skills contributions by watching the InstructLab/taxonomy repository on GitHub.
    • Enhanced CLI tooling for contributing skills and knowledge and the ability to smoke test them in a locally-built model: Stay tuned by following our GitHub repo.

    What’s next?

    • Discover the developer preview of Red Hat Enterprise Linux AI, a foundation model platform to develop, test, and run Granite family large language models for enterprise applications.
    • If you’re interested in building models that you can develop and serve yourself, check out Podman Desktop AI Lab, an open source extension for Podman Desktop to work with LLMs on a local environment.

    Get started

    • Check out the InstructLab community page to get started now. 
    • Explore LLM fine-tuning on Red Hat Developer.
    • Learn about InstructLab on Red Hat Enterprise Linux AI.
    Last updated: January 20, 2025

    Related Posts

    • Introducing Podman AI Lab: Developer tooling for working with LLMs

    • Red Hat OpenShift AI installation and setup

    • Access the OpenAI ChatGPT API in Quarkus

    • How to use LLMs in Java with LangChain4j and Quarkus

    • Integrate your Quarkus application with GPT4All

    • Model training in Red Hat OpenShift AI

    Recent Posts

    • Alternatives to creating bootc images from scratch

    • How to update OpenStack Services on OpenShift

    • How to integrate vLLM inference into your macOS and iOS apps

    • How Insights events enhance system life cycle management

    • Meet the Red Hat Node.js team at PowerUP 2025

    What’s up next?

    Learn how to access a large language model using Node.js and LangChain.js. You’ll also explore LangChain.js APIs that simplify common requirements like retrieval-augmented generation (RAG).

    Start the activity
    Red Hat Developers logo LinkedIn YouTube Twitter Facebook

    Products

    • Red Hat Enterprise Linux
    • Red Hat OpenShift
    • Red Hat Ansible Automation Platform

    Build

    • Developer Sandbox
    • Developer Tools
    • Interactive Tutorials
    • API Catalog

    Quicklinks

    • Learning Resources
    • E-books
    • Cheat Sheets
    • Blog
    • Events
    • Newsletter

    Communicate

    • About us
    • Contact sales
    • Find a partner
    • Report a website issue
    • Site Status Dashboard
    • Report a security problem

    RED HAT DEVELOPER

    Build here. Go anywhere.

    We serve the builders. The problem solvers who create careers with code.

    Join us if you’re a developer, software engineer, web designer, front-end designer, UX designer, computer scientist, architect, tester, product manager, project manager or team lead.

    Sign me up

    Red Hat legal and privacy links

    • About Red Hat
    • Jobs
    • Events
    • Locations
    • Contact Red Hat
    • Red Hat Blog
    • Inclusion at Red Hat
    • Cool Stuff Store
    • Red Hat Summit

    Red Hat legal and privacy links

    • Privacy statement
    • Terms of use
    • All policies and guidelines
    • Digital accessibility

    Report a website issue