feat: add API error detection and auto-recovery for peers #7

howold-lab · 2025-12-19T15:00:26Z

When a peer's CLI process is alive but API calls are failing (400, 429, 500, etc.), the current crash detection doesn't trigger because the pane isn't dead.

This adds:

New module: orchestrator/api_error_recovery.py
Detects API error patterns in pane output during health checks
Auto-restarts peer when errors detected (with debounce and limits)
Notifies user via outbox when restart occurs

Reuses existing restart infrastructure (count_recent_restarts, restart_peer) and follows the existing module pattern (make() factory function).

When a peer's CLI process is alive but API calls are failing (400, 429, 500, etc.), the current crash detection doesn't trigger because the pane isn't dead. This adds: - New module: orchestrator/api_error_recovery.py - Detects API error patterns in pane output during health checks - Auto-restarts peer when errors detected (with debounce and limits) - Notifies user via outbox when restart occurs Reuses existing restart infrastructure (count_recent_restarts, restart_peer) and follows the existing module pattern (make() factory function).

ChesterRa · 2025-12-19T15:30:14Z

Thanks a lot for your PR!
Would you please explain why RESTART helps when API calls are failing?
I'm a little confused...

howold-lab · 2025-12-20T05:18:06Z

hi, author
Because under certain models, such as CC, it's easy to encounter 400 errors, etc. As you know, in some countries, a VPN is required. so

waterbang · 2025-12-22T01:25:02Z

Hi, I feel we should let Foreman make autonomous decisions, what do you think?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: add API error detection and auto-recovery for peers #7

feat: add API error detection and auto-recovery for peers #7

Uh oh!

howold-lab commented Dec 19, 2025

Uh oh!

ChesterRa commented Dec 19, 2025

Uh oh!

howold-lab commented Dec 20, 2025

Uh oh!

waterbang commented Dec 22, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

feat: add API error detection and auto-recovery for peers #7

Are you sure you want to change the base?

feat: add API error detection and auto-recovery for peers #7

Uh oh!

Conversation

howold-lab commented Dec 19, 2025

Uh oh!

ChesterRa commented Dec 19, 2025

Uh oh!

howold-lab commented Dec 20, 2025

Uh oh!

waterbang commented Dec 22, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants