Skip to content

Conversation

@kimichenn
Copy link
Contributor

@kimichenn kimichenn commented Nov 22, 2025

closes #277

Update some yolo stuff to support the new structure of the model.

Notable changes:

  • Detection threshold bumped to 0.35 (from 0.05)
  • Threshold is now a setting in config jsons instead of scattered everywhere (the only exception is unit/integration tests, as I assume we want to mess with this number often)
  • Added NMS to deduplicate bboxes
  • Changed default image size from 640x640 to 1024x1024 (I tried both image sizes for training/inference, and the 1024 works better so we'll stick with that)

The model can be found here.

I don't foresee us using the old model, so this can be merged regardless of whether I make future updates to the model (the architecture should be the same), with a big caveat, as detailed below. I think we'll just stick to finetuned YOLO this year.

The caveat: it is important to note that this model does NOT work for any not-stolen images (this might change after we update not-stolen to only use human and tent images). This means that we cannot run the full sitl suite unless you manually add detection yourself in the reports page.

@kimichenn kimichenn marked this pull request as ready for review November 22, 2025 20:30
Copy link
Contributor

@AskewParity AskewParity left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I haven't actually looked to test this model on sitl, but the detection images you sent me look very promising.

@kimichenn kimichenn merged commit 8540778 into main Nov 23, 2025
2 checks passed
@kimichenn kimichenn deleted the feat/yolo-v2 branch November 23, 2025 23:10
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Implement Owlv2

3 participants