[CHORE] Compute server E2E tests #85
Open
Description
This PR adds E2E tests for the compute server and updates the README and other files with the information required to run the E2E tests locally.
Note: One of the more open-ended tests (the security posture test case) is flakier than the rest, and it seems to be due to a bug in the MCP bridge that we use. It occasionally attempts to run servers/tools that don't actually exist, which causes a failure.
Example error:
2025-12-02 10:24:50.145 | ERROR | ollama_mcp_bridge.proxy_service:proxy_chat_with_tools:72 - Chat proxy failed: Tool oracle-oci-networking-mcp-server-get_subnet not found
When I send a request with the prompt "Can you review the security configuration of my compute instances in Ashburn and let me know if there are any recommended improvements or best practices to strengthen their security posture?", it tries to call
oracle-oci-networking-mcp-server-get_subnet
when it should be trying to call
oracle-oci-networking-mcp-server.get_subnet
Why?
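To make the mismatch concrete, here is a rough sketch of a tolerant tool-name lookup. This is not code from ollama_mcp_bridge, and I have not confirmed how the bridge registers or resolves tool names. `resolve_tool_name` and the `registered` set are hypothetical, and the only real tool name used is the one from the error above.

```python
from typing import Optional, Set


def resolve_tool_name(requested: str, registered: Set[str]) -> Optional[str]:
    """Hypothetical helper: map a requested tool name onto a registered one,
    treating '.' and '-' as interchangeable separators."""
    # An exact match always wins.
    if requested in registered:
        return requested

    # Compare names with every '.' rewritten to '-', so that
    # "server-get_subnet" can match the registered "server.get_subnet".
    normalized = requested.replace(".", "-")
    matches = [name for name in registered if name.replace(".", "-") == normalized]

    # Only accept an unambiguous match; otherwise report "not found".
    return matches[0] if len(matches) == 1 else None


# Assumed registry contents, for illustration only.
registered = {"oracle-oci-networking-mcp-server.get_subnet"}
resolved = resolve_tool_name("oracle-oci-networking-mcp-server-get_subnet", registered)
print(resolved)
```

If the bridge did something like this, the hyphenated name from the log would resolve to the dotted one, while a name with no counterpart would still fail. That would only mask the symptom, though; it does not explain why the hyphenated name is being produced in the first place.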
Type of change
Please delete options that are not relevant.
How Has This Been Tested?
Message me for the contents of my .env file if you want to replicate my testing; I don't want to post it in this public repo.
Checklist: