Feature/loader factory #12

crhysc · 2025-12-16T17:39:37Z

This pull request includes functionality for loading gpt-oss and other Unsloth models using AtomGPT as well as formatting text into the Alpaca or Harmony template depending on which model is being used. The core functionality is an abstract factory, found in inverse_models/factories.py, and it separates gpt-oss loading from other model loading. This strategy resolved many overt and silent bugs found when loading gpt-oss, and training and inference with gpt-oss successfully executes. Interestingly, the text in this screenshot looks as if ChatGPT is reasoning about a crystal structure, further validating that gpt-oss is working as intended. Before merging this pull request, I need to double check that the harmony template is formatted correctly, and I need to do the README tests.

…mplates

crhysc and others added 30 commits October 28, 2025 18:34

initialize diffractgpt script

da41d09

Merge branch 'main' into feature/RamanGPT

ea4bfa5

initialize correct dataset script

bea7d59

initialize code to make train test alpaca jsons

5e6633b

version 1. should make sentences

a6d7842

initialize runner

fbb0c3a

name change

fae7eee

update prompt to include ()

b7a9cde

add cm^-1 to prompt

ff64e83

remove rounding bug

3b987f8

rest of the script

2425df9

freq normal and niggli reduce

9f0b3c2

add freq upper bound

2771adf

initial commit

69c3463

Update loader.py

1d15157

patch if load_in_4bit

8ff5cae

upgrade the required transformers version to 4.57.1

1cb88ef

add _get_dtype(dtype) to line 466

69226ee

add gpt_oss to def patch_peft_model()

f6efda5

patch num_logits_to_keep

f1bab0e

strip num_logits_to_keep in unsloth_fast_generate() for gpt-oss models

7d3d37b

Update gpt_oss.py

c02d6a5

patch pre_patch()

078175d

force progress bar

1be754c

add invalid structures error handling

85d2f82

add error handling for inverse_predict.py

9552f4e

num_proc

8902e70

if gen_mat = None: ...

431e3f9

rm "Here is the output"

3bbd9d2

terminate string literal

e3d22e1

ccamp104 and others added 21 commits December 1, 2025 12:03

let tokenizers be >= 0.22.0

d9c94af

hf hub >= 0.32.0

5b4ef60

hf-xet>=1.1.2

32e831d

print target and predicted structures if PRINT_STRUCTURES=1

006cf16

mv print statements before validation checks

786ddac

let the raw LLM output be printed if PRINT_STRUCTURES=1

76602a6

initialize abs factory for loading. add chat template stubs

33e52c8

add kwargs to format()

d0dd793

get harmony template in factory

1970a36

from typing import Any

11b2ab6

remove relative import

52228ed

import callable

1464ce1

remove import chattemplate

9793ad8

remove imports to non-interface objects for model loading and chat te…

3cfc59a

…mplates

add type checking if statement for the trainingpropconfig import

6e8771a

add unsloth>=2024.10,<2025.3

9a41780

arrange imports to debug

25661fe

from atomgpt.inverse_models.dataset_utils import make_alpaca_json

c8790f8

add imports for make_alpaca_json()

780d334

mv get_input() to dataset_utils

58bac34

rm resume from chkpt=true

73e3096

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Feature/loader factory #12

Feature/loader factory #12

Uh oh!

crhysc commented Dec 16, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Feature/loader factory #12

Are you sure you want to change the base?

Feature/loader factory #12

Uh oh!

Conversation

crhysc commented Dec 16, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant