Add a new delegate to allow API tracing #505

daniel-raffler · 2025-08-14T11:43:46Z

This is a preliminary draft for adding API tracing to JavaSMT with the help of a new delegate. The idea is to record all API calls and generate a new Java program from them. Running this program then recreates the exact sequence of calls. The main application is debugging, where the traces allow us to create easy-to-reproduce examples for solver errors. This is especially useful when the error occurs as part of a larger program, where it can be hard to pin down the exact sequence of JavaSMT calls needed to trigger the bug.

We use a new delegate to implement this feature. Setting solver.trace to true will enable tracing, and the output will be stored in a file called trace*.java
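To illustrate the delegate idea, here is a minimal Python sketch (not the JavaSMT implementation, which is written in Java). `TraceLogger`, `TracingDelegate` and `ToyManager` are hypothetical names made up for this example. The wrapper forwards each call to the real object and records one line of generated code per call:

```python
import json


class TraceLogger:
    """Collects one line of generated code per recorded call."""

    def __init__(self):
        self.lines = []
        self.names = {}   # id(object) -> variable name used in the trace
        self.keep = []    # keeps traced objects alive so their ids stay unique
        self.counter = 0

    def name_of(self, obj):
        # Tracked objects are printed by name, everything else as a literal
        name = self.names.get(id(obj))
        return name if name is not None else json.dumps(obj)

    def record(self, receiver, method, args, result):
        var = f'var{self.counter}'
        self.counter += 1
        rendered = ', '.join(self.name_of(a) for a in args)
        self.lines.append(f'var {var} = {self.name_of(receiver)}.{method}({rendered});')
        self.names[id(result)] = var
        self.keep.append(result)


class TracingDelegate:
    """Forwards every method call to the wrapped object and records it."""

    def __init__(self, delegate, logger):
        self._delegate = delegate
        self._logger = logger

    def __getattr__(self, method):
        target = getattr(self._delegate, method)

        def traced(*args):
            result = target(*args)
            self._logger.record(self._delegate, method, args, result)
            return result

        return traced


class ToyManager:
    """Hypothetical stand-in for a formula manager."""

    def make_variable(self, name):
        return ('var', name)

    def add(self, left, right):
        return ('+', left, right)


logger = TraceLogger()
manager = ToyManager()
logger.names[id(manager)] = 'mgr'  # give the receiver a name in the trace
traced_manager = TracingDelegate(manager, logger)

x = traced_manager.make_variable('x')
y = traced_manager.make_variable('y')
traced_manager.add(x, y)
print('\n'.join(logger.lines))
```

Each recorded line refers to earlier results by variable name, which is what makes the trace replayable as a program.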

TODO

~~Finish the implementation. Currently we only have (parts of) the ArrayFormulaManager, IntegerFormulaManager, BooleanFormulaManager, UFManager and ProverEnvironment~~
~~Write the trace to a file while it's being created. We'll need this to debug segfaults as the trace is otherwise lost~~ done
~~Consider adding an option to skip duplicate calls. (The trace is currently way too long)~~ ~~Fixed, but not committed yet~~
~~Write a simple delta-debugger to shrink the trace down even further~~ Maybe later..

We're now using ddSmt, see comment #505 (comment)

Things left to do

~~Add support for missing formula managers in the script~~
~~Still missing: floating point, quantifier, strings and separation logic. At least the first two should still be added before merging~~
~~Handle solver options in the script~~
~~Fix undo point in the trace logger~~
~~Done, but we should double check the Rebuilder~~
Merge Add support for indexed functions #507
Add user documentation for debugging with the tracer (see the comment below)
Update the changelog
~~Run some tests in CPAchecker to see if there are still issues in the script~~
~~Add support for quantifiers and interpolation to the Smtlib translation script~~
~~Test with more solvers~~

daniel-raffler · 2025-08-18T15:53:34Z

Write a simple delta-debugger to shrink the trace down even further

I started working on the delta-debugger today and wrote a Python script to reduce the size of the traces. So far it does little more than some dead-code elimination, but that's already enough to bring down the size of the trace by a factor of ten. I believe that another factor of two should be possible with some aggressive optimization.

The issue now is that I don't quite know where to put such a script in JavaSMT. We could handle this as a separate project, or maybe include it in the JavaSMT source tree, similar to the Example projects. However, neither option seems ideal.

@baierd, @kfriedberger: What is your opinion?

Here is the file in question:

#!/usr/bin/env python3
import re
import sys
from collections import defaultdict
from pathlib import Path


# Read a trace file
def readTrace(path):
    with open(path) as file:
        return [line.rstrip() for line in file]


# Build a map with line numbers for all variable definitions
def getLinesForDefinitions(trace):
    lineNumber = 1
    lineDefs = dict()
    for line in trace:
        if line.find('=') >= 0:
            leftSide = line[0:(line.find('=') - 1)]
            name = re.match('var (.*)', leftSide)
            # Skip lines that contain '=' but are not of the form "var <name> = ..."
            if name is not None:
                lineDefs[name.group(1)] = lineNumber
        lineNumber = lineNumber + 1
    return lineDefs


# Build a dependency graph for the definitions
# Maps from variables to the places where they are used
def buildDependencies(lineDefs, trace):
    lineNumber = 1
    deps = defaultdict(list)
    for line in trace:
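        expr = line[(line.find('=') + 2):] if line.find('=') >= 0 else line
        # The receiver is the identifier before the first '.', e.g. "var3" in "var3.isUnsat()"
        # (renamed from "object", which shadows the Python builtin)
        receiver = expr[0:expr.find('.')] if expr.find('.') >= 0 else ''
        # Guard against empty lines and names without a recorded definition
        if receiver and receiver[0].islower() and receiver in lineDefs:
            deps[lineDefs[receiver]].append(lineNumber)
        # FIXME Parse the expression to get the variables
        for m in re.finditer('(config|logger|notifier|var[0-9]+)', expr):
            if m.group() in lineDefs:
                deps[lineDefs[m.group()]].append(lineNumber)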
        lineNumber += 1
    return deps


# Collect all top-level statements
# Top-level statements are:
#  *.addConstraint(*)
#  *.isUnsat()
#  *.getModel()
#  *.asList()
# FIXME Finish this list
def usedTopLevel(lineDefs, trace):
    tl = set()
    for line in trace:
        # Note the escaped '.' between the receiver and the method name
        m = re.fullmatch(
            'var (var[0-9]+) = (var[0-9]+)\\.(isUnsat\\(\\)|getModel\\(\\)|asList\\(\\)|addConstraint\\((var[0-9]+)\\));',
            line)
        if m is not None:
            tl.add(lineDefs[m.group(1)])
    return tl


# Calculate the closure of all used definitions, starting with the top-level statements
def usedClosure(tl, deps):
    cl = set()
    st = set(tl)
    while cl.union(st) != cl:
        cl = cl.union(st)
        st = set()
        for (key, val) in deps.items():
            if set(val).intersection(cl) != set():
                st.add(key)
    return cl


# Keep only statements and definitions that are used
def filterUnused(used, trace):
    lineNumber = 1
    reduced = []
    for line in trace:
        if line.find('=') == -1 or lineNumber in used:
            reduced.append(line)
        lineNumber += 1
    return reduced


# Remove all definitions that are not used (recursively)
def removeDeadCode(trace):
    lineDefs = getLinesForDefinitions(trace)
    deps = buildDependencies(lineDefs, trace)
    tl = usedTopLevel(lineDefs, trace)
    cl = usedClosure(tl, deps)
    return filterUnused(cl, trace)


# We'll use multiple passes to reduce the size of the trace:
# 1. Read the trace
# 2. Remove unused code
# 3. Remove unnecessary toplevel commands
# 4. Loop: Remove aliasing (by duplicating the definitions)
# 5.    Loop: Reduce terms
# 6. Remove unused prover environments
if __name__ == '__main__':
    if len(sys.argv) != 2:
        print('Expecting a path to a trace file as argument')
        sys.exit(1)

    path = Path(sys.argv[1])
    if not path.is_file():
        print(f'Could not find file "{path}"')
        sys.exit(1)

    # TODO Implement steps 3-6
    # TODO Check that the reduced trace still crashes

    trace = readTrace(path)
    for line in removeDeadCode(trace):
        print(line)

The idea is to run JavaSMT with solver.trace=true to collect a trace of the crash, and then use the script to reduce the trace. Since we're "crashing" (possibly with a segfault) there doesn't seem to be a good way to do this in one go.
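As a compact illustration of the dead-code elimination step, the same idea can be sketched as a single backward pass over the trace. This is a simplified variant, not the script above. It assumes a hypothetical trace format in which every definition has the form `var <name> = <expression>;`, every variable is assigned exactly once, and variables are defined before they are used. The example trace lines are made up for illustration:

```python
import re

# Assumed (hypothetical) trace format: "var <name> = <receiver>.<call>(...);"
DEFINITION = re.compile(r'var (\w+) = (.*);')
IDENTIFIER = re.compile(r'\b(?:config|logger|notifier|var[0-9]+)\b')
TOP_LEVEL = re.compile(r'\.(?:addConstraint|isUnsat|getModel|asList)\(')


def remove_dead_code(trace):
    """Keep top-level statements and every definition they depend on."""
    live = set()   # variables that are needed by a line we already kept
    kept = []
    for line in reversed(trace):
        m = DEFINITION.fullmatch(line)
        if m is None:
            # Not a definition: always keep it and treat its variables as used
            kept.append(line)
            live.update(IDENTIFIER.findall(line))
            continue
        name, expr = m.groups()
        if name in live or TOP_LEVEL.search(expr):
            kept.append(line)
            live.update(IDENTIFIER.findall(expr))
    kept.reverse()
    return kept


trace = [
    'var var0 = mgr.makeVariable("x");',
    'var var1 = mgr.makeVariable("y");',
    'var var2 = mgr.not(var1);',  # never reaches a prover call
    'var var3 = context.newProverEnvironment();',
    'var var4 = var3.addConstraint(var0);',
    'var var5 = var3.isUnsat();',
]
reduced = remove_dead_code(trace)
print('\n'.join(reduced))
```

Because the trace is straight-line code, walking it backwards visits every use before the corresponding definition, so one pass is enough and no fixed-point iteration is needed.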

It's possible to reorder instructions to avoid parallel prover instances. However, this risks changing the trace so much that it no longer crashes. We're therefore not doing this transformation automatically. In practice this rarely seems to cause issues as most traces don't use more than one prover at once

daniel-raffler · 2025-12-18T13:36:18Z

I believe this is ready for review now

The approach now uses a two-step process: we first run the program with trace=true to generate a JavaSMT trace, and then use a script to convert this trace into SMTLIB output. This allows us to use an existing delta-debugger like ddSmt to reduce the size of the SMTLIB file, as the trace would otherwise grow fairly large.
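For readers unfamiliar with delta debugging, the general idea can be sketched in a few lines of Python. This is not ddSmt's algorithm, which works on the term structure and is far more sophisticated. It is a simple greedy line reducer, and `still_fails` is a hypothetical stand-in for "run the solver on the candidate and check that the bug still occurs":

```python
def reduce_lines(lines, still_fails):
    """Greedily drop chunks of lines while still_fails(lines) stays true.

    Assumes still_fails(lines) is true for the initial input.
    """
    chunk = max(len(lines) // 2, 1)
    while chunk >= 1:
        start = 0
        while start < len(lines):
            candidate = lines[:start] + lines[start + chunk:]
            if still_fails(candidate):
                lines = candidate   # the chunk was irrelevant, drop it
            else:
                start += chunk      # the chunk is needed, move on
        chunk //= 2
    return lines


script = [
    '(declare-const x Int)',
    '(declare-const y Int)',
    '(assert (> y 5))',
    '(assert (> x 0))',
    '(assert (= y 7))',
    '(assert (< x 0))',
    '(check-sat)',
]

# Stand-in predicate: pretend the bug needs exactly these four commands
needed = {
    '(declare-const x Int)',
    '(assert (> x 0))',
    '(assert (< x 0))',
    '(check-sat)',
}


def still_fails(candidate):
    return needed <= set(candidate)


reduced = reduce_lines(script, still_fails)
print('\n'.join(reduced))
```

In practice the predicate is the expensive part, since every candidate requires a solver run, which is one reason to rely on an existing tool rather than a hand-written reducer.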

Some examples of how to use the tracing mode can be found in my earlier comments here and here. Note that traces are no longer generated automatically for the tests, so the second example requires you to first revert 3068d81 and ce46f96.

The last CI run with tracing enabled can be found here. On CVC5, MathSAT and Z3 most of the tests are passing just fine. Other solvers still have more issues, mostly due to limitations in the visitor and heavy formula rewriting.

Test results with tracing enabled:

The tracing delegate now supports all formula managers in JavaSMT. Trace translation to SMTLIB is still more limited: we don't support enumerations, strings or separation logic at this point, although these could easily be added. Translation to SMTLIB is also not possible if more than one SolverContext is needed, or if multiple ProverEnvironments are used at the same time (sequential use is fine).

For testing purposes I ran part of svcomp in CPAchecker with tracing enabled. You can find the results here. The largest issue is the lack of support for bvextract while tracing. However, this can be fixed later, once we have figured out how to handle indexed functions in JavaSMT. Other than that there were no new exceptions, and the generated traces seem to run fine on MathSAT.

@baierd, @kfriedberger
Is there anything that is still missing from this PR?

daniel-raffler added the enhancement label Aug 14, 2025

daniel-raffler mentioned this pull request Aug 14, 2025

MathSAT5 Returns Null for msat_model_create_iterator() #481

Open

daniel-raffler added 3 commits August 15, 2025 13:58

Trace: Add "resource" annotation for TraceModel.getModel

8ac6c0c

Trace: Add a superclass TraceBasicProverEnvironment for TraceProverEn…

d2958ed

…vironment (and later TraceInterpolatingProverEnvironment)

Trace: Apply error-prone patch

fb252cd

daniel-raffler added 24 commits August 19, 2025 14:22

Trace: Add bitvector support

fef1635

Trace: Add floating point support

cb7a7d2

Trace: Specify the solver backend in the trace when creating a new co…

7c432fc

…ntext

Trace: Add a note

c676391

Trace: Add printing support for ArrayFormulaTypes

299c901

Trace: Catch I/O exceptions directly in the logger

d229219

Trace: Add support for visitors

a187568

Trace: Allow parsing and printing

0590364

Trace: Fix checkstyle issue

b0dd829

Trace: Throw an exception when trying to get the variable for an obje…

2ed82e5

…ct that is not tracked

Trace: Make TraceLogger package-private

3ce7e7b

Trace: Add support for printing more formula types

2d35c3d

Trace: Apply refaster patch

b273c25

Trace: Remove duplicate "." when creating a trace for the configurati…

c7c15bf

…on builder

Trace: Refactor code and rebuild all terms that are returned by the s…

01cb338

…olver Rebuilding the terms makes sure that we don't encounter any unknown subformulas in the visitors

Trace: Add support for array constants

3df3d88

Trace: Avoid catching runtime exceptions while logging

6ad67ab

Trace: Add support for BooleanFormulaManager.xor

ab245e7

Trace: Add support for BooleanFormulaManager.extractVariablesAndUFs

f5f8f0f

Trace: Add support for creating integer constants from Strings

018dbb8

Trace: Add some missing logging for methods in TraceFormulaManager

01871ac

Trace: Make constructor for TraceFloatingPointFormulaManager package-…

bfc4a76

…private

Trace: Add support for creating FloatingPointFormulas from BigDecimals

adb0cbc

Trace: Fix default rounding mode for BigDecimals

73f6797

daniel-raffler added 23 commits December 14, 2025 23:34

traceToSmtlib: Don't write help message to stderr

48972fb

traceToSmtlib: Translate to an intermediate representation, rather th…

bd303b8

…an printing Smtlib directly

traceToSmtlib: Fix makeBitvector

a785545

traceToSmtlib: Fix overflow issues for large bitvector constants

68405c6

traceToSmtlib: Fix error message

2857b03

Trace: Always remove qf.eliminateQuantifiers from the log

82a1748

traceToSmtlib: Add quantifier support

9069b12

traceToSmtlib: Fix fp.makeValue for FloatingPointNumber

d3b2030

Trace: Discard calls to prover.unsatCoreOverAssumptions

4b1b2ff

Trace: Add a case for fp constants in the rebuilder

193f5c8

Trace: Add a case for fp equality in the rebuilder

3de9747

Trace: Allow formula translation between contexts

30b0a4f

traceToSmtlib: Clean up parser

86395a8

traceToSmtlib: Update grammar in the documentation

23c80ab

traceToSmtlib: Add pytest to requirements.txt

fd245cd

traceToSmtlib: Clean up some warnings

e4de5cb

traceToSmtlib: Close a todo

3104afa

We only have a single, global prover and the current options seem to be enough

Trace: Simplify Rebuilder for variables

26194c7

Trace: Simplify model.asList

66296a8

Trace: Fix numeral.sum in SmtInterpol if the sum has less than 2 terms

4cddd84

Trace: Revert changes to how equality to zero is handled in Princess

e276d72

Rewriting (= a 0) to (=0 a) causes some issues in our tests that expect the formula to have 2 subterms

Trace: Revert changes to the CI and disable tracing while running the…

3068d81

… tests

daniel-raffler requested review from baierd and kfriedberger December 18, 2025 09:26

daniel-raffler added 2 commits December 18, 2025 10:40

Trace: Revert changes to the CI

a9de0fd

Don't upload the traces

Trace: Remove testing script

ce46f96

daniel-raffler added 2 commits December 18, 2025 16:59

Trace: Simplify result of quantifier elimination in Z3

0b46e8e

Solves an issue where we would get a 0ary `and` instead of `true` from the solver See SolverVisitorTest.testTransformationInsideQuantifiersWithTrue

Trace: Close some todos

6340d3d
