[docs] Add gpt oss deployment example #56400

Aydin-ab · 2025-09-09T23:49:32Z

Why are these changes needed?

Adding new examples for gpt-oss, later released as an anyscale templates as well

Note:

Using our latest version ray==2.49.2 + vllm 0.10.2 breaks but this seems already fixed in nightly so we can change that requirement once the next release is out (mid october)

Also, our current ray-llm image is using vllm 0.10.0 so we have to build a new image with vllm>=0.10.1 this makes the anyscale service launch pretty long (~5min)

Checks

I've signed off every commit(by using the -s flag, i.e., git commit -s) in this PR.
I've run scripts/format.sh to lint the changes in this PR.
I've included any doc changes needed for https://docs.ray.io/en/master/.
- I've added any new APIs to the API Reference. For example, if I added a
  method in Tune, I've added it in doc/source/tune/api/ under the
  corresponding .rst file.
I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
Testing Strategy
- Unit tests
- Release tests
- This PR is not tested :(

gemini-code-assist

Code Review

This PR adds a new example for deploying gpt-oss models. The changes are mostly good, but there are a few issues. There's a critical copy-paste error in the Anyscale service configuration for gpt-oss-120b in the example notebook and README. There are also some inconsistencies in the documentation links, Docker image versions, and configuration values in the examples. The Dockerfile could also be improved for better practices. I've provided detailed comments and suggestions to address these points.

doc/source/serve/tutorials/deployment-serve-llm/gpt-oss/README.md

doc/source/serve/tutorials/deployment-serve-llm/gpt-oss/Dockerfile

doc/source/serve/tutorials/deployment-serve-llm/README.md

doc/source/serve/tutorials/deployment-serve-llm/gpt-oss/Dockerfile

doc/source/serve/tutorials/deployment-serve-llm/gpt-oss/notebook.ipynb

kunling-anyscale

.

doc/source/serve/tutorials/deployment-serve-llm/gpt-oss/README.md

kunling-anyscale

add few suggestions and comments

Aydin-ab · 2025-09-22T21:21:51Z

i notice the ray-llm nightly image is using vllm==0.10.2 so we can probably start the engineering review in the next release

Aydin-ab · 2025-10-03T00:02:25Z

i notice the ray-llm nightly image is using vllm==0.10.2 so we can probably start the engineering review in the next release

for context using our latest version ray==2.49.2 + vllm 0.10.2 breaks but this seems already fixed in nightly so we can change that requirement once the next release is out (mid october)

meanwhile we can continue and publish with this version of ray==2.49.0, vllm==0.10.1

doc/source/serve/tutorials/deployment-serve-llm/gpt-oss/serve_gpt_oss.py

Signed-off-by: Aydin Abiar <aydin@anyscale.com>

doc/source/serve/tutorials/deployment-serve-llm/ci/nb2py.py

doc/source/serve/tutorials/deployment-serve-llm/README.ipynb

doc/source/serve/tutorials/deployment-serve-llm/README.md

doc/source/serve/tutorials/deployment-serve-llm/gpt-oss/notebook.ipynb

Signed-off-by: Aydin Abiar <aydin@anyscale.com>

doc/source/serve/tutorials/deployment-serve-llm/gpt-oss/notebook.ipynb

Co-authored-by: angelinalg <122562471+angelinalg@users.noreply.github.com> Signed-off-by: Aydin Abiar <62435714+Aydin-ab@users.noreply.github.com>

Signed-off-by: Aydin Abiar <aydin@anyscale.com>

Signed-off-by: Josh Kodi <joshkodi@gmail.com>

Signed-off-by: xgui <xgui@anyscale.com>

Signed-off-by: Aydin Abiar <aydin@anyscale.com>

Signed-off-by: Future-Outlier <eric901201@gmail.com>

Aydin-ab requested review from a team as code owners September 9, 2025 23:49

Aydin-ab marked this pull request as draft September 9, 2025 23:49

gemini-code-assist bot reviewed Sep 9, 2025

View reviewed changes

kunling-anyscale reviewed Sep 9, 2025

View reviewed changes

doc/source/serve/tutorials/deployment-serve-llm/gpt-oss/notebook.ipynb Show resolved Hide resolved

kunling-anyscale reviewed Sep 9, 2025

View reviewed changes

doc/source/serve/tutorials/deployment-serve-llm/gpt-oss/notebook.ipynb Outdated Show resolved Hide resolved

kunling-anyscale reviewed Sep 10, 2025

View reviewed changes