Use test-api-key to prevent login thread leaks #65

delner · 2025-12-19T00:35:38Z

Problem

Many of my new tests were failing locally with WebMock errors.

The issue is related to test duration, but it's more specifically about orphan background threads and stub lifecycle:                                                                                                                                        
                                                                                                                                                                                                                                                              
The sequence:                                                                                                                                                                                                                                                
                                                                                                                                                                                                                                                              
1. Test A creates State with api_key: "test-key" (no org_id) → spawns background login thread                                                                                                                                                                
2. Login fails (stub returns 500, or cassette doesn't have login stub) → thread enters exponential backoff retry                                                                                                                                             
3. Test A completes → WebMock.reset! clears stubs (or VCR cassette is ejected)                                                                                                                                                                               
4. Test B starts → registers new stub with different Authorization header (e.g., Bearer <BRAINTRUST_API_KEY>)                                                                                                                                                
5. Orphan thread wakes up, tries login with Authorization: Bearer test-key                                                                                                                                                                                   
6. Stub doesn't match → WebMock::NetConnectNotAllowedError                                                                                                                                                                                                   
                                                                                                                                                                                                                                                              
Why some CI jobs are more affected:                                                                                                                                                                                                                       
                                                                                                                                                                                                                                                                                              
- More tests = more opportunities for:                                                                                                                                                                                                                       
  - Threads to be spawned                                                                                                                                                                                                                                    
  - Threads to be in backoff when stubs change                                                                                                                                                                                                               
  - New tests to register incompatible stubs                                                                                                                                                                                                                 
- Longer test duration increases window for race condition

Solution

There's a magic test key in the code:                                                                                                                                                                                                                        
                                                                                                                                                                                                                                                               
# In lib/braintrust/api/internal/auth.rb                                                                                                                                                                                                                     
if api_key == "test-api-key"                                                                                                                                                                                                                                 
  Log.debug("Login: using test API key, returning fake auth")                                                                                                                                                                                                
  return AuthResult.new(org_id: "test-org-id", ...)                                                                                                                                                                                                          
end                                                                                                                                                                                                                                                          
                                                                                                                                                                                                                                                              
So api_key: "test-api-key" skips HTTP entirely and returns fake auth.

This also happens to short-circuit the login thread. Since this is the current idiomatic approach, apply it for now. In the future, we should eliminate this "magic" behavior and keep test behavior isolated to the test suite.

clutchski · 2025-12-19T00:41:57Z

test/braintrust/without_openai_test.rb

+    # Note: "test-api-key" triggers fake auth to avoid HTTP requests
    state = Braintrust.init(
-      api_key: "test-key",
+      api_key: "test-api-key",


Fixed: Missing test key for tests (leaking threads)

81a6517

delner self-assigned this Dec 19, 2025

delner requested review from clutchski and realark December 19, 2025 00:35

ibolmo approved these changes Dec 19, 2025

View reviewed changes

delner merged commit aeab52d into main Dec 19, 2025
7 checks passed

delner deleted the fix/login_thread_leak branch December 19, 2025 00:40

clutchski reviewed Dec 19, 2025

View reviewed changes

delner mentioned this pull request Dec 25, 2025

Fix leaking login thread in test_login_in_thread_retries_on_failure #68

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Use test-api-key to prevent login thread leaks #65

Use test-api-key to prevent login thread leaks #65

delner commented Dec 19, 2025

Uh oh!

Uh oh!

clutchski Dec 19, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Use test-api-key to prevent login thread leaks #65

Use test-api-key to prevent login thread leaks #65

Conversation

delner commented Dec 19, 2025

Problem

Solution

Uh oh!

Uh oh!

clutchski Dec 19, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants