SORCE: Small Object Retrieval in Complex Environments

Chunxu Liu*, Chi Xie*, Xiaxu Chen, Feng Zhu, Rui Zhao, Limin Wang
Nanjing University, SenseTime Research

Overview

TL;DR: We introduce the Small Object Retrieval in Complex Environments (SORCE) task, a new subfield of text-to-image retrieval (T2IR) that focuses on retrieving small objects in complex images.

We introduce a new dataset, SORCE-1K, comprising 1,023 image-text pairs in which each caption describes only a localized object region. This design explicitly avoids providing contextual clues from the broader scene, thereby preventing models from exploiting shortcut cues.

Additionally, we demonstrate that, with simple yet effective Regional Prompts (ReP), multimodal large language models (MLLMs) can accurately attend to and embed the corresponding image regions. Our fine-tuned models are available for evaluation here.
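To make the retrieval setting concrete, here is a minimal sketch of how text-to-image retrieval with embeddings is commonly scored: each caption embedding is compared against every image embedding, and the images are ranked by cosine similarity. The toy vectors, the function names, and the Recall@K metric are assumptions made for illustration; this README does not state which metric is reported, and the actual evaluation logic lives in eval_sorce.py. In practice the embeddings would come from the MLLM rather than being written by hand.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm


def rank_images(text_emb, image_embs):
    """Return image indices sorted from most to least similar to the text."""
    scores = [cosine(text_emb, img) for img in image_embs]
    return sorted(range(len(image_embs)), key=lambda i: scores[i], reverse=True)


def recall_at_k(text_embs, image_embs, k):
    """Fraction of captions whose paired image (same index) ranks in the top k."""
    hits = 0
    for i, text_emb in enumerate(text_embs):
        if i in rank_images(text_emb, image_embs)[:k]:
            hits += 1
    return hits / len(text_embs)


# Toy embeddings (hypothetical values): caption i is paired with image i.
image_embs = [[1.0, 0.0], [0.0, 1.0], [0.7, 0.7]]
text_embs = [[0.9, 0.1], [0.1, 0.9], [1.0, 0.2]]

print(rank_images(text_embs[0], image_embs))     # ranking for the first caption
print(recall_at_k(text_embs, image_embs, k=1))   # share of captions retrieved at rank 1
```

In this toy setup the third caption lands closer to the first image than to its own, which is the kind of miss that a caption describing only a small region of a cluttered image can produce.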

Dataset Preparation

Please download the SORCE-1K dataset from Hugging Face and place it in the datasets folder.

mkdir -p datasets
huggingface-cli download --repo-type dataset --resume-download lcxrocks/sorce-1k --local-dir ./datasets/sorce-1k
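After the download finishes, a quick sanity check can confirm that files actually landed in the target folder. The sketch below is an optional helper, not part of the repository: the function name summarize_dataset_dir is hypothetical, and it makes no assumption about the dataset's internal layout beyond the --local-dir path used above. It simply counts files by extension and skips hidden folders (for example a download cache, if one is present).

```python
from pathlib import Path


def summarize_dataset_dir(root):
    """Count files under root by extension, ignoring hidden files and folders."""
    root = Path(root)
    if not root.is_dir():
        raise FileNotFoundError(f"{root} not found; run the download command first")
    counts = {}
    for path in root.rglob("*"):
        rel_parts = path.relative_to(root).parts
        # Skip anything inside a hidden folder, and hidden files themselves.
        if any(part.startswith(".") for part in rel_parts):
            continue
        if path.is_file():
            ext = path.suffix.lower() or "(no extension)"
            counts[ext] = counts.get(ext, 0) + 1
    return counts


# Path taken from the --local-dir argument of the download command.
target = Path("./datasets/sorce-1k")
if target.is_dir():
    for ext, n in sorted(summarize_dataset_dir(target).items()):
        print(f"{ext}: {n}")
else:
    print(f"{target} not found; run the download command first")
```

An empty result, or a missing folder, suggests the download did not complete or was written to a different directory.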

Environment Setup

Please make sure the transformers version is compatible.

conda create -n sorce python=3.11
conda activate sorce
pip install -r requirements.txt

Evaluation

To evaluate the model, run the following command, which downloads the pretrained model from 🤗 Hugging Face.

bash dist_eval.sh

Citation

If you find this project helpful for your research or applications, please consider leaving a star ⭐️ and citing our paper:


@misc{liu2025sorcesmallobjectretrieval,
      title={SORCE: Small Object Retrieval in Complex Environments}, 
      author={Chunxu Liu and Chi Xie and Xiaxu Chen and Wei Li and Feng Zhu and Rui Zhao and Limin Wang},
      year={2025},
      eprint={2505.24441},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2505.24441}, 
}

License and Acknowledgement

This project is released under the Apache 2.0 license. The code is based on E5-V; please also follow its license. Thanks to the authors for their awesome work!
