Ghidra model producing fewer variables than expected in the wild #11

@mahaloz

In a discussion with @edmcman here:
mahaloz/DAILA#32 (comment)

We have both independently noticed that VarBERT is producing fewer variables than expected on binaries such as /bin/ls. It is currently unclear which of the following (or which combination of them) is the cause:

  1. A bug in how we are passing data to VarBERT
  2. A poor choice of model for use at scale
  3. An incorrect model

We should investigate this to improve real-world use. It may also be worth revisiting our Ghidra test cases.
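As a starting point for that investigation, here is a minimal sketch of how the shortfall could be quantified per function. It is not VarBERT's or DAILA's API: `rename_coverage`, `per_function_coverage`, and the sample data are all hypothetical, and it assumes the decompiler's variable names and the model's predicted names have already been extracted into plain Python structures.

```python
from typing import Dict, List, Tuple


def rename_coverage(variables: List[str], predictions: Dict[str, str]) -> float:
    """Fraction of variables that received a new, non-empty predicted name.

    `variables` holds the decompiler's original names; `predictions` maps an
    original name to the model's suggestion (hypothetical input format).
    """
    if not variables:
        return 0.0
    renamed = [
        name for name in variables
        if predictions.get(name) and predictions[name] != name
    ]
    return len(renamed) / len(variables)


def per_function_coverage(
    functions: Dict[str, Tuple[List[str], Dict[str, str]]]
) -> Dict[str, float]:
    """Compute rename coverage for each function, keyed by function name."""
    return {
        func: rename_coverage(variables, predictions)
        for func, (variables, predictions) in functions.items()
    }


# Hypothetical sample data, made up purely for illustration.
sample = {
    "FUN_00101234": (["local_10", "local_18", "param_1"], {"local_10": "len"}),
    "FUN_00105678": (["local_8"], {}),
}

for func, coverage in per_function_coverage(sample).items():
    print(f"{func}: {coverage:.0%} of variables renamed")
```

Comparing these numbers between the test cases and a real binary such as /bin/ls might help separate a data-passing bug (coverage drops sharply only in the wild) from a limitation of the model itself.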

Metadata

Labels

bug: Something isn't working
