Skip to content

Conversation

@crhysc
Copy link
Contributor

@crhysc crhysc commented Dec 16, 2025

This pull request includes functionality for loading gpt-oss and other Unsloth models using AtomGPT as well as formatting text into the Alpaca or Harmony template depending on which model is being used. The core functionality is an abstract factory, found in inverse_models/factories.py, and it separates gpt-oss loading from other model loading. This strategy resolved many overt and silent bugs found when loading gpt-oss, and training and inference with gpt-oss successfully executes. Interestingly, the text in this screenshot looks as if ChatGPT is reasoning about a crystal structure, further validating that gpt-oss is working as intended. Before merging this pull request, I need to double check that the harmony template is formatted correctly, and I need to do the README tests.
Screenshot 2025-12-14 at 5 34 33 AM

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant