Skip to content

Welcome to the 🐶 InstructLab Project

Quick Links: Documentation | FAQ | Hugging Face | Calendar | Discord | Slack | YouTube | X | Reddit

Banner

InstructLab is an approachable open source AI community project. Our community's mission is to enable anyone to shape the future of generative AI via the collaborative improvement of open source-licensed Granite large language models (LLMs) using InstructLab's fine-tuning technology.


Community Announcement (Sept 2, 2025)

Over the past year, we’ve been honored by your creativity, insights, and shared passion for advancing generative AI through InstructLab. Whether you added a new “knowledge” via pull request, offered feedback, joined a community call, or helped translate documentation, you’ve shaped our project in meaningful ways. Thank you.

To better align with evolving technical needs, we’re announcing an evolution for the InstructLab community. We will be refactoring the project by separating the components out to improve its maintainability and usability, primarily as a framework SDK for model tuning.

What's Changing

To enhance the long-term viability and efficiency of the InstructLab project, a strategic decision has been made to relocate its foundational building blocks into separate, dedicated project repositories. This carefully considered shift is anticipated to yield substantial benefits, primarily in the areas of maintainability and independent component maturation. This independent development will foster greater agility, allowing for more focused improvements and faster iteration cycles for individual parts of the project.

Looking Ahead

We’re excited about this next chapter and believe it will lead to more robust, flexible, and powerful tools for the generative AI community. We encourage you to follow the individual component projects in their new homes and continue contributing to their growth.

SDG: https://github.com/Red-Hat-AI-Innovation-Team/sdg_hub

Training: https://github.com/Red-Hat-AI-Innovation-Team/training_hub


Additional information

InstructLab uses Large-Scale Alignment for ChatBots [1] (LAB), a new alignment tuning method for LLMs that leverages synthetic data. To learn more about InstructLab’s origins, visit the About Taxonomy page.

[1] Shivchander Sudalairaj*, Abhishek Bhandwaldar*, Aldo Pareja*, Kai Xu, David D. Cox, Akash Srivastava*. "LAB: Large-Scale Alignment for ChatBots", arXiv preprint arXiv: 2403.01081, 2024. (* denotes equal contributions)

Pinned Loading

  1. instructlab instructlab Public

    InstructLab Core package. Use this to chat with a model and execute the InstructLab workflow to train a model using custom taxonomy data.

    Python 1.3k 441

  2. taxonomy taxonomy Public

    Taxonomy tree that will allow you to create models tuned with your data

    Python 280 1.3k

  3. community community Public

    InstructLab Community wide collaboration space including contributing, security, code of conduct, etc

    Python 91 49

  4. dev-docs dev-docs Public

    Developer documents for the InstructLab organization

    Makefile 10 38

Repositories

Showing 10 of 20 repositories
  • instructlab Public

    InstructLab Core package. Use this to chat with a model and execute the InstructLab workflow to train a model using custom taxonomy data.

    instructlab/instructlab’s past year of commit activity
    Python 1,345 Apache-2.0 441 21 16 Updated Sep 4, 2025
  • community Public

    InstructLab Community wide collaboration space including contributing, security, code of conduct, etc

    instructlab/community’s past year of commit activity
    Python 91 Apache-2.0 49 7 9 Updated Sep 4, 2025
  • website Public
    instructlab/website’s past year of commit activity
    TypeScript 4 CC-BY-4.0 27 8 6 Updated Sep 3, 2025
  • .github Public

    InstructLab GitHub organization community files.

    instructlab/.github’s past year of commit activity
    Makefile 2 Apache-2.0 10 0 3 Updated Sep 3, 2025
  • sdg Public

    Python library for Synthetic Data Generation

    instructlab/sdg’s past year of commit activity
    Python 45 Apache-2.0 56 8 0 Updated Sep 2, 2025
  • training Public

    InstructLab Training Library - Efficient Fine-Tuning with Message-Format Data

    instructlab/training’s past year of commit activity
    Python 42 Apache-2.0 72 55 19 Updated Sep 1, 2025
  • eval Public

    Python library for Evaluation

    instructlab/eval’s past year of commit activity
    Python 15 Apache-2.0 27 15 10 Updated Sep 1, 2025
  • taxonomy Public

    Taxonomy tree that will allow you to create models tuned with your data

    instructlab/taxonomy’s past year of commit activity
    Python 280 Apache-2.0 1,257 5 45 Updated Aug 26, 2025
  • ui Public

    Place to hack on UI for InstructLab

    instructlab/ui’s past year of commit activity
    TypeScript 34 Apache-2.0 56 57 (18 issues need help) 33 Updated Aug 25, 2025
  • examples Public

    Contains examples and notebooks for InstructLab

    instructlab/examples’s past year of commit activity
    Jupyter Notebook 2 Apache-2.0 10 0 2 Updated Aug 21, 2025