Instrumentation for Lanchain4j's OpenAI Chat Model #24

realark · 2025-12-29T03:40:04Z

No description provided.

realark · 2025-12-29T18:54:19Z

src/test/java/dev/braintrust/instrumentation/langchain/BraintrustLangchainTest.java

+    @SneakyThrows
+    void testSyncChatCompletion() {
+        // Mock the OpenAI API response
+        wireMock.stubFor(


TODO: my next quality-of-life change will be to switch out stubs for VCR-like replay (wiremock supports this apparently). Some time in the next week or two

delner

Some questions but nothing blocking!

delner · 2025-12-30T02:17:45Z

src/main/java/dev/braintrust/instrumentation/langchain/BraintrustLangchain.java

+    public record Options(String providerName) {}
+
+    @SuppressWarnings("unchecked")
+    private static <T> T getPrivateField(Object obj, String fieldName)


This question is just for my own education: it looks like we're using reflection to access the private fields in order to instrument them, correct? What are the performance/stability risks associated with reflection? Are there other practical alternatives for instrumentation?

In the Ruby world, we generally would avoid accessing private fields because of the potential for instability (e.g. someone in a patch version changes the API.)

Yes that's right, we're using reflection. There isn't much risk in this case because we'll just fail to apply instrumentation if something goes wrong

Performance is pretty good with reflection, but even if it wasn't this is only done once during client build

There isn't a viable alternative right now, but once we get into auto instrumentation for java we'll have more options

delner · 2025-12-30T02:19:34Z

src/main/java/dev/braintrust/instrumentation/langchain/WrappedHttpClient.java

+        Span span = startNewSpan(getSpanName(providerInfo));
+        try (Scope scope = span.makeCurrent()) {
+            tagSpan(span, request, providerInfo);
+            final long startTime = System.nanoTime();


What is nanoTime? Is it wall clock time or is it something else?

Basically wall clock time. It's an increasing nanosecond counter from an arbitrary starting point

delner · 2025-12-30T02:22:11Z

src/test/java/dev/braintrust/TestHarness.java

+                                        + " after %d attempts",
+                                minSpanCount, spans.size(), attempts));
+            }
+            Thread.sleep(1000);


Why do you need to wait & sleep? To read off the OTel thread? Is there a faster, more directly way to do this synchronously in the test suite?

Usually waiting isn't needed but some of the streaming tests finish their spans after this method is invoked for the first time

I feel like there should be a better way to do this, but the only gotcha is I'm using the built in otel utils to collect spans so I'm not sure what hooks I would have to insert concurrency signaling stuff

I'm making some other changes to the test harness in another branch. I'll add this to that work. At the very least I can dial down the sleep time (10ms should be plenty)

realark added the enhancement New feature or request label Dec 29, 2025

realark force-pushed the ark/langchain4j-instrumentation branch 2 times, most recently from 8db59a5 to 8c94b9c Compare December 29, 2025 18:03

realark marked this pull request as ready for review December 29, 2025 18:34

langchain4j openai instrumentation

f09d73b

realark force-pushed the ark/langchain4j-instrumentation branch from 8c94b9c to f09d73b Compare December 29, 2025 18:51

realark commented Dec 29, 2025

View reviewed changes

realark requested review from clutchski and delner December 29, 2025 19:02

delner approved these changes Dec 30, 2025

View reviewed changes

realark merged commit 1b58fef into main Dec 30, 2025
1 check passed

realark deleted the ark/langchain4j-instrumentation branch December 30, 2025 09:30

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Instrumentation for Lanchain4j's OpenAI Chat Model #24

Instrumentation for Lanchain4j's OpenAI Chat Model #24

Uh oh!

realark commented Dec 29, 2025

Uh oh!

realark Dec 29, 2025 •

edited

Loading

Uh oh!

delner left a comment

Uh oh!

delner Dec 30, 2025

Uh oh!

realark Dec 30, 2025

Uh oh!

delner Dec 30, 2025

Uh oh!

realark Dec 30, 2025

Uh oh!

delner Dec 30, 2025

Uh oh!

realark Dec 30, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Instrumentation for Lanchain4j's OpenAI Chat Model #24

Instrumentation for Lanchain4j's OpenAI Chat Model #24

Uh oh!

Conversation

realark commented Dec 29, 2025

Uh oh!

realark Dec 29, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

delner left a comment

Choose a reason for hiding this comment

Uh oh!

delner Dec 30, 2025

Choose a reason for hiding this comment

Uh oh!

realark Dec 30, 2025

Choose a reason for hiding this comment

Uh oh!

delner Dec 30, 2025

Choose a reason for hiding this comment

Uh oh!

realark Dec 30, 2025

Choose a reason for hiding this comment

Uh oh!

delner Dec 30, 2025

Choose a reason for hiding this comment

Uh oh!

realark Dec 30, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

realark Dec 29, 2025 •

edited

Loading