Skip to content
@evalplus

evalplus

Popular repositories Loading

  1. evalplus evalplus Public

    Rigourous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024

    Python 1.6k 185

  2. repoqa repoqa Public

    RepoQA: Evaluating Long-Context Code Understanding

    Python 125 7

  3. evalplus.github.io evalplus.github.io Public

    HTML 13 5

  4. humanevalplus_release humanevalplus_release Public

    Release repository for HumanEval+ data

    Python 4 1

  5. mbppplus_release mbppplus_release Public

    Release repository for MBPP+ data

    Python 1

  6. repoqa_release repoqa_release Public

    1 1

Repositories

Showing 8 of 8 repositories
  • evalplus Public

    Rigourous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024

    evalplus/evalplus’s past year of commit activity
    Python 1,642 Apache-2.0 185 53 (1 issue needs help) 2 Updated Oct 2, 2025
  • evalplus/evalplus.github.io’s past year of commit activity
    HTML 13 Apache-2.0 5 0 0 Updated Dec 26, 2024
  • repoqa Public

    RepoQA: Evaluating Long-Context Code Understanding

    evalplus/repoqa’s past year of commit activity
    Python 125 Apache-2.0 7 2 2 Updated Nov 1, 2024
  • evalplus/repoqa_release’s past year of commit activity
    1 Apache-2.0 1 0 0 Updated Oct 7, 2024
  • evalplus/evalperf_release’s past year of commit activity
    0 Apache-2.0 0 0 0 Updated Aug 6, 2024
  • humanevalplus_release Public

    Release repository for HumanEval+ data

    evalplus/humanevalplus_release’s past year of commit activity
    Python 4 Apache-2.0 1 0 0 Updated May 1, 2024
  • mbppplus_release Public

    Release repository for MBPP+ data

    evalplus/mbppplus_release’s past year of commit activity
    Python 1 Apache-2.0 0 0 0 Updated Apr 17, 2024
  • Cirron Public Forked from s7nfo/Cirron

    Cirron measures how many CPU instructions and system calls a piece of Python code executes.

    evalplus/Cirron’s past year of commit activity
    C 0 4 0 0 Updated Feb 18, 2024

Top languages

Python C HTML

Most used topics

Loading…