Skip to main content
Redhat Developers  Logo
  • AI

    Get started with AI

    • Red Hat AI
      Accelerate the development and deployment of enterprise AI solutions.
    • AI learning hub
      Explore learning materials and tools, organized by task.
    • AI interactive demos
      Click through scenarios with Red Hat AI, including training LLMs and more.
    • AI/ML learning paths
      Expand your OpenShift AI knowledge using these learning resources.
    • AI quickstarts
      Focused AI use cases designed for fast deployment on Red Hat AI platforms.
    • No-cost AI training
      Foundational Red Hat AI training.

    Featured resources

    • OpenShift AI learning
    • Open source AI for developers
    • AI product application development
    • Open source-powered AI/ML for hybrid cloud
    • AI and Node.js cheat sheet

    Red Hat AI Factory with NVIDIA

    • Red Hat AI Factory with NVIDIA is a co-engineered, enterprise-grade AI solution for building, deploying, and managing AI at scale across hybrid cloud environments.
    • Explore the solution
  • Learn

    Self-guided

    • Documentation
      Find answers, get step-by-step guidance, and learn how to use Red Hat products.
    • Learning paths
      Explore curated walkthroughs for common development tasks.
    • Guided learning
      Receive custom learning paths powered by our AI assistant.
    • See all learning

    Hands-on

    • Developer Sandbox
      Spin up Red Hat's products and technologies without setup or configuration.
    • Interactive labs
      Learn by doing in these hands-on, browser-based experiences.
    • Interactive demos
      Click through product features in these guided tours.

    Browse by topic

    • AI/ML
    • Automation
    • Java
    • Kubernetes
    • Linux
    • See all topics

    Training & certifications

    • Courses and exams
    • Certifications
    • Skills assessments
    • Red Hat Academy
    • Learning subscription
    • Explore training
  • Build

    Get started

    • Red Hat build of Podman Desktop
      A downloadable, local development hub to experiment with our products and builds.
    • Developer Sandbox
      Spin up Red Hat's products and technologies without setup or configuration.

    Download products

    • Access product downloads to start building and testing right away.
    • Red Hat Enterprise Linux
    • Red Hat AI
    • Red Hat OpenShift
    • Red Hat Ansible Automation Platform
    • See all products

    Featured

    • Red Hat build of OpenJDK
    • Red Hat JBoss Enterprise Application Platform
    • Red Hat OpenShift Dev Spaces
    • Red Hat Developer Toolset

    References

    • E-books
    • Documentation
    • Cheat sheets
    • Architecture center
  • Community

    Get involved

    • Events
    • Live AI events
    • Red Hat Summit
    • Red Hat Accelerators
    • Community discussions

    Follow along

    • Articles & blogs
    • Developer newsletter
    • Videos
    • Github

    Get help

    • Customer service
    • Customer support
    • Regional contacts
    • Find a partner

    Join the Red Hat Developer program

    • Download Red Hat products and project builds, access support documentation, learning content, and more.
    • Explore the benefits

InstructLab: Advancing generative AI through open source

Make your mark on AI

May 7, 2024
Alina Ryan
Related topics:
Artificial intelligenceLinuxOpen source
Related products:
Red Hat Enterprise LinuxRed Hat Enterprise Linux AI

    While large language models (LLMs) offer incredible potential, they also come with their share of challenges. Working with LLMs demands high-quality training data, specialized skills and knowledge, and extensive computing resources. The process of forking and retraining a model is also time consuming and expensive.

    The InstructLab project offers an open source approach to generative AI, sourcing community contributions to support regular builds of an enhanced version of an LLM. This approach is designed to lower costs, remove barriers to testing and experimentation, and improve alignment—that is, ensuring the model's answers are accurate, unbiased, and consistent with the values and goals of its users and creators. 

    What is InstructLab?

    Initiated by IBM and Red Hat, the InstructLab project aims to democratize generative AI through the power of an open source community. It simplifies the LLM training phase through community skills and knowledge submissions. 

    InstructLab leverages the LAB (Large-scale Alignment for chatBots) methodology to enable community-driven development and model evolution. To learn more about the science behind LAB approach, see the InstructLab research paper posted by IBM.

    Who is InstructLab for?

    You don’t have to be a rocket scientist to contribute to InstructLab (but it’s great if you are!). With the InstructLab approach, there is minimal technical experience required. Contributions to the model are accepted in the form of knowledge and skills, with topics ranging from Beyoncé facts to professional law. This broad scope of topics makes the process approachable and entertaining. 

    We especially encourage contributions from experts in non-technical fields. Not only will this enhance the model's performance on a topic, but contributions from non-technical industry experts give them a voice in the AI conversation. InstructLab offers a practical way for less technical folks to contribute to a technical space that is poised to have lasting impact on the world.

    What benefits does InstructLab offer?

    InstructLab provides a cost-effective, community-driven solution for improving the alignment of LLMs and makes it easy for those with minimal machine learning experience to contribute.

    Cost-effective

    An open source approach makes InstructLab accessible to individuals and organizations regardless of their financial resources. As long as you have access to a laptop, you can download and use InstructLab tools, as we've designed it to run on laptop hardware. Such accessibility promotes a more inclusive environment for both developers and contributors. 

    Community-driven instruction tuning also drives the cost of model training down. By relying on the community, users cover topic generation by adding tasks of interest via skills and knowledge contributions. The synthetic data generation approach also means a smaller amount of data is needed from those contributions to have an impact on the model during training. This can all be tuned into a small-parameter, open source licensed model that is relatively cheap to both tune and serve for inferencing.

    Community-driven

    Opening up the data generation for the instruction tuning phase of model training to a large pool of contributors helps address innovation challenges that often arise during LLM training. Having a community drives together diverse talent by fostering collaboration among individuals with different backgrounds, expertise, and perspectives. This in turn encourages a wide range of contributions to land into the models. In addition, feedback from the users, contributors, and code reviewers in the community can help inform topic selection instead of solely relying on performance analysis and benchmarking data. 

    Ease of use

    Non-technical people are typically deterred from contributing to software or AI due to perceived complexity and technical barriers. The vast array of models and tooling available and the perceived investment of time and effort required can be overwhelming for anyone, especially those without a technical background. 

    However, InstructLab removes most of these barriers. Thanks to YAML’s structured format and intuitive syntax, it’s easy to contribute knowledge and skill bounties in the form of a question-and-answer template. Contributors also benefit from an entire community with a wealth of resources including forums, docs, and user groups where individuals can seek support from one another. 

    Key features

    • Regularly released models built with community contributions: Stay up to date by creating an account on HuggingFace.co and ‘liking’ the model(s) from the InstructLab repository.
    • Pull request-focused contribution process in the open community: Keep track of new knowledge and skills contributions by watching the InstructLab/taxonomy repository on GitHub.
    • Enhanced CLI tooling for contributing skills and knowledge and the ability to smoke test them in a locally-built model: Stay tuned by following our GitHub repo.

    What’s next?

    • Discover the developer preview of Red Hat Enterprise Linux AI, a foundation model platform to develop, test, and run Granite family large language models for enterprise applications.
    • If you’re interested in building models that you can develop and serve yourself, check out Podman Desktop AI Lab, an open source extension for Podman Desktop to work with LLMs on a local environment.

    Get started

    • Check out the InstructLab community page to get started now. 
    • Explore LLM fine-tuning on Red Hat Developer.
    • Learn about InstructLab on Red Hat Enterprise Linux AI.
    Last updated: January 20, 2025

    Related Posts

    • Introducing Podman AI Lab: Developer tooling for working with LLMs

    • Red Hat OpenShift AI installation and setup

    • Access the OpenAI ChatGPT API in Quarkus

    • How to use LLMs in Java with LangChain4j and Quarkus

    • Integrate your Quarkus application with GPT4All

    • Model training in Red Hat OpenShift AI

    Recent Posts

    • SQL Server HA on RHEL: Meet Pacemaker HA Agent v2 (tech preview)

    • Deploy with confidence: Continuous integration and continuous delivery for agentic AI

    • Every layer counts: Defense in depth for AI agents with Red Hat AI

    • Fun in the RUN instruction: Why container builds with distroless images can surprise you

    • Trusted software factory: Building trust in the agentic AI era

    What’s up next?

    Learn how to access a large language model using Node.js and LangChain.js. You’ll also explore LangChain.js APIs that simplify common requirements like retrieval-augmented generation (RAG).

    Start the activity
    Red Hat Developers logo LinkedIn YouTube Twitter Facebook

    Platforms

    • Red Hat AI
    • Red Hat Enterprise Linux
    • Red Hat OpenShift
    • Red Hat Ansible Automation Platform
    • See all products

    Build

    • Developer Sandbox
    • Developer tools
    • Interactive tutorials
    • API catalog

    Quicklinks

    • Learning resources
    • E-books
    • Cheat sheets
    • Blog
    • Events
    • Newsletter

    Communicate

    • About us
    • Contact sales
    • Find a partner
    • Report a website issue
    • Site status dashboard
    • Report a security problem

    RED HAT DEVELOPER

    Build here. Go anywhere.

    We serve the builders. The problem solvers who create careers with code.

    Join us if you’re a developer, software engineer, web designer, front-end designer, UX designer, computer scientist, architect, tester, product manager, project manager or team lead.

    Sign me up

    Red Hat legal and privacy links

    • About Red Hat
    • Jobs
    • Events
    • Locations
    • Contact Red Hat
    • Red Hat Blog
    • Inclusion at Red Hat
    • Cool Stuff Store
    • Red Hat Summit
    © 2026 Red Hat

    Red Hat legal and privacy links

    • Privacy statement
    • Terms of use
    • All policies and guidelines
    • Digital accessibility

    Chat Support

    Please log in with your Red Hat account to access chat support.