LLM Observability: Part 1 (OpenAI) #1058

Merged: 10 commits into main, May 22, 2024
Conversation

@langchain4j (Owner) commented May 7, 2024

Issue

#199

Change

  • Added ModelListener, ChatLanguageModelRequest, and ChatLanguageModelResponse that are compatible (i.e., they have all the required attributes) with the OTel LLM semconv draft.
  • Added an option to attach multiple ModelListener<ChatLanguageModelRequest, ChatLanguageModelResponse> to OpenAiChatModel and OpenAiStreamingChatModel (pilot module).

ChatLanguageModelRequest

public class ChatLanguageModelRequest {

    private final String model;
    private final Double temperature;
    private final Double topP;
    private final Integer maxTokens;
    private final List<ChatMessage> messages;
    private final List<ToolSpecification> toolSpecifications;
}

ChatLanguageModelResponse

public class ChatLanguageModelResponse {

    private final String id;
    private final String model;
    private final TokenUsage tokenUsage;
    private final FinishReason finishReason;
    private final AiMessage aiMessage;
}

Example

ModelListener<ChatLanguageModelRequest, ChatLanguageModelResponse> modelListener =
        new ModelListener<ChatLanguageModelRequest, ChatLanguageModelResponse>() {

            @Override
            public void onRequest(ChatLanguageModelRequest request) {
                // handle request
            }

            @Override
            public void onResponse(ChatLanguageModelResponse response, ChatLanguageModelRequest request) {
                // handle response
            }

            @Override
            public void onError(Throwable error, ChatLanguageModelResponse response, ChatLanguageModelRequest request) {
                // handle error
            }
        };

OpenAiChatModel model = OpenAiChatModel.builder()
        .apiKey(...)
        .listeners(singletonList(modelListener))
        .build();

General checklist

  • There are no breaking changes
  • I have added unit and integration tests for my change
  • I have manually run all the unit and integration tests in the module I have added/changed, and they are all green
  • I have manually run all the unit and integration tests in the core and main modules, and they are all green
  • I have added/updated the documentation
  • I have added an example in the examples repo (only for "big" features)

@langchain4j (Owner, Author) commented May 7, 2024

@brunobat could you please take a look? 🙏

@geoand (Contributor) commented May 13, 2024

I'm also very interested in this so we can augment the observability features already present in the Quarkus implementation.

I also think that having something like this in sooner rather than later would be very helpful, as these new integration points could definitely prove useful for various tasks.

@geoand (Contributor) commented May 13, 2024

I believe that onResponse could also be passed ChatLanguageModelRequest

@langchain4j (Owner, Author) commented

@geoand yeah I was just adding it :)

@geoand (Contributor) commented May 13, 2024

💪🏼

@langchain4j (Owner, Author) commented May 13, 2024

@geoand do you think id can still be useful for something (now that request is passed to onResponse())?

@geoand (Contributor) commented May 13, 2024

I very much doubt it's useful

@langchain4j langchain4j marked this pull request as ready for review May 21, 2024 16:18
@langchain4j langchain4j changed the title Draft: LLM Observability LLM Observability May 21, 2024
@langchain4j langchain4j changed the title LLM Observability LLM Observability Part 1 May 22, 2024
@langchain4j langchain4j changed the title LLM Observability Part 1 LLM Observability: Part 1 May 22, 2024
@langchain4j langchain4j merged commit 6818e27 into main May 22, 2024
6 checks passed
@langchain4j langchain4j deleted the llm-observability branch May 22, 2024 11:14
@geoand (Contributor) commented May 23, 2024

I am actually thinking that the listener might need to be changed to:

public interface ModelListener<Request, Response> {

    default OnRequestResult<Request> onRequest(Request request) {
        return null;
    }

    default void onResponse(Response response, OnRequestResult<Request> onRequestResult) {

    }

    default void onError(Throwable error, Response response, OnRequestResult<Request> onRequestResult) {

    }

    /**
     * This name is horrible and should be replaced by something better
     */
    interface OnRequestResult<Request> {
        Request request();
    }
}

The reason is that an integration might want to include some custom data when the request is created and that data might need to be accessible when the result comes back (or an error occurs).
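
For example, a hypothetical listener against this interface could carry a start timestamp through OnRequestResult (a sketch; all names are illustrative):

public class ElapsedTimeListener implements ModelListener<ChatLanguageModelRequest, ChatLanguageModelResponse> {

    // Carries the original request plus the start time of the call
    static final class TimedRequest implements OnRequestResult<ChatLanguageModelRequest> {

        final ChatLanguageModelRequest request;
        final long startNanos = System.nanoTime();

        TimedRequest(ChatLanguageModelRequest request) {
            this.request = request;
        }

        @Override
        public ChatLanguageModelRequest request() {
            return request;
        }
    }

    @Override
    public OnRequestResult<ChatLanguageModelRequest> onRequest(ChatLanguageModelRequest request) {
        return new TimedRequest(request);
    }

    @Override
    public void onResponse(ChatLanguageModelResponse response, OnRequestResult<ChatLanguageModelRequest> onRequestResult) {
        long elapsedNanos = System.nanoTime() - ((TimedRequest) onRequestResult).startNanos;
        // record elapsedNanos in whatever metrics backend is in use
    }
}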

WDYT?

@langchain4j (Owner, Author) commented

@geoand can you provide an example when this might be required?

@geoand (Contributor) commented May 23, 2024

I am thinking that with OpenTelemetry, if I want to open a new span on request and close it when the call ends, I need a way to keep hold of that span. Now, OTel might have a way to do that already (with a ThreadLocal, or some pluggable strategy), but other APIs might not.
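
For illustration, a minimal sketch of that ThreadLocal workaround under the current API (this assumes onResponse/onError run on the same thread as onRequest, which streaming models may not guarantee):

import io.opentelemetry.api.trace.Span;
import io.opentelemetry.api.trace.Tracer;
import io.opentelemetry.context.Scope;

public class ThreadLocalOtelListener implements ModelListener<ChatLanguageModelRequest, ChatLanguageModelResponse> {

    private final Tracer tracer;

    // Per-thread state is the only place to stash the span/scope with the current API
    private final ThreadLocal<Span> currentSpan = new ThreadLocal<>();
    private final ThreadLocal<Scope> currentScope = new ThreadLocal<>();

    public ThreadLocalOtelListener(Tracer tracer) {
        this.tracer = tracer;
    }

    @Override
    public void onRequest(ChatLanguageModelRequest request) {
        Span span = tracer.spanBuilder("ChatCompletions " + request.model()).startSpan();
        currentSpan.set(span);
        currentScope.set(span.makeCurrent());
    }

    @Override
    public void onResponse(ChatLanguageModelResponse response, ChatLanguageModelRequest request) {
        endSpan(null);
    }

    @Override
    public void onError(Throwable error, ChatLanguageModelResponse response, ChatLanguageModelRequest request) {
        endSpan(error);
    }

    private void endSpan(Throwable error) {
        Scope scope = currentScope.get();
        if (scope != null) {
            scope.close();
            currentScope.remove();
        }
        Span span = currentSpan.get();
        if (span != null) {
            if (error != null) {
                span.recordException(error);
            }
            span.end();
            currentSpan.remove();
        }
    }
}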

@brunobat commented

We can add to the request object the OTel context
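
Purely as a sketch of that suggestion (hypothetical; no such field exists in the merged PR):

import io.opentelemetry.context.Context;

public class ChatLanguageModelRequest {

    private final String model;
    private final Double temperature;
    // ... other existing fields ...
    private final Context otelContext; // hypothetical: OTel context captured when the request is created
}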

@geoand (Contributor) commented May 23, 2024

Right, but other APIs might not have that kind of capability

@langchain4j langchain4j changed the title LLM Observability: Part 1 LLM Observability: Part 1 (OpenAI) May 23, 2024
@geoand (Contributor) commented May 24, 2024

Here is another example:

With the current API, how would one use Micrometer to add a timed metric of the operation?

P.S. I take complete responsibility for not spotting this issue earlier.
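
For context, the closest one could probably get with the current signatures is correlating the callbacks through a side map keyed on the request instance — workable but fragile (entries leak if neither callback fires), and the metric names below are made up:

import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

import io.micrometer.core.instrument.MeterRegistry;
import io.micrometer.core.instrument.Timer;

public class MicrometerTimingListener implements ModelListener<ChatLanguageModelRequest, ChatLanguageModelResponse> {

    private final MeterRegistry registry;

    // Correlate onRequest with onResponse/onError by request identity,
    // since the current API offers no per-call state object
    private final Map<ChatLanguageModelRequest, Timer.Sample> samples = new ConcurrentHashMap<>();

    public MicrometerTimingListener(MeterRegistry registry) {
        this.registry = registry;
    }

    @Override
    public void onRequest(ChatLanguageModelRequest request) {
        samples.put(request, Timer.start(registry));
    }

    @Override
    public void onResponse(ChatLanguageModelResponse response, ChatLanguageModelRequest request) {
        Timer.Sample sample = samples.remove(request);
        if (sample != null) {
            sample.stop(registry.timer("langchain4j.chat", "model", request.model()));
        }
    }

    @Override
    public void onError(Throwable error, ChatLanguageModelResponse response, ChatLanguageModelRequest request) {
        Timer.Sample sample = samples.remove(request);
        if (sample != null) {
            sample.stop(registry.timer("langchain4j.chat.error", "model", request.model()));
        }
    }
}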

@geoand (Contributor) commented May 24, 2024

> We can add to the request object the OTel context

Looking at this more, I don't see how one could close the Scope with the current LangChain4j API.

@geoand (Contributor) commented May 24, 2024

I have a draft proposal of what I would like to do here.

Using that in Quarkus LangChain4j, I would use it like so:

public class OpenTelemetryChatLanguageModelListener implements ModelListener<ChatLanguageModelRequest, OpenTelemetryChatLanguageModelListener.ScopeHoldingOnRequestResult, ChatLanguageModelResponse> {

    private final Tracer tracer;

    @Inject
    public OpenTelemetryChatLanguageModelListener(Tracer tracer) {
        this.tracer = tracer;
    }

    @Override
    public ScopeHoldingOnRequestResult onRequest(ChatLanguageModelRequest request) {
        String name = "ChatCompletions " + request.model();

        Span span = tracer.spanBuilder(name).startSpan();
        Scope scope = span.makeCurrent();

        // TODO: implement

        return new ScopeHoldingOnRequestResult(request, scope);
    }

    @Override
    public void onResponse(ChatLanguageModelResponse chatLanguageModelResponse, ScopeHoldingOnRequestResult request) {
        // TODO: implement
        request.scope.close();
    }

    @Override
    public void onError(Throwable error, ChatLanguageModelResponse chatLanguageModelResponse,
                        ScopeHoldingOnRequestResult request) {
        // TODO: implement
        request.scope.close();
    }

    public static class ScopeHoldingOnRequestResult implements OnRequestResult<ChatLanguageModelRequest> {

        private final ChatLanguageModelRequest request;
        private final Scope scope;

        private ScopeHoldingOnRequestResult(ChatLanguageModelRequest request, Scope scope) {
            this.request = request;
            this.scope = scope;
        }

        @Override
        public ChatLanguageModelRequest request() {
            return request;
        }
    }
}

@brunobat commented

Will comment on the PR
