OpenLLMetry does not set completion and total token counts on traces sent over OpenTelemetry to Dynatrace SaaS (HTTP) for a RAG chain built with the LangChain framework and the Anthropic Claude 3.5 foundation model, served on Bedrock and accessed via the Mosaic AI Gateway in Databricks.
Traces for prompt generation and the LLM request/response calls are captured correctly, but token counts are missing.
👟 Reproduction steps
Create a Python project and install the traceloop-sdk dependency along with langchain-core, databricks-langchain, and streamlit.
Create a sample application module named "sample.py" using the code sample below.
import uuid

import streamlit as st
from traceloop.sdk import Traceloop
from langchain_core.prompts import PromptTemplate
from langchain_databricks.chat_models import ChatDatabricks
from langchain_core.output_parsers import StrOutputParser

# Initialize Traceloop before attaching association properties.
Traceloop.init(app_name="sr sample", disable_batch=True)

user_id = "abc"
session_id = str(uuid.uuid4())  # association property values should be strings
Traceloop.set_association_properties({"user_id": user_id, "session_id": session_id})

st.title("Marketing Email Generator")
with st.form("my_form"):
    product_name = st.text_input("Product Name")
    product_features = st.text_input("Product Features")
    audience = st.text_input("Target Audience")
    submitted = st.form_submit_button("Submit")

if submitted:
    prompt_subject = PromptTemplate(
        input_variables=["product_name", "product_features"],
        template="""You are an email marketing campaign manager.
Generate a subject for the email for the product: {product_name}
and having features that include: {product_features}
Respond with only the subject line.
""",
    )
    prompt_email = PromptTemplate(
        input_variables=["subject", "audience"],
        template="""You are an email marketing manager.
Generate an email body for the email subject: {subject}
and for the audience: {audience}
""",
    )
    llm = ChatDatabricks(endpoint="claude-3-5-sonnet-20240620-v1-0", temperature=0)
    # Print the generated subject as a side effect, then pass it along.
    subject_chain = (
        prompt_subject
        | llm
        | (lambda subject: (subject, st.write(f"Title: {subject.content}"))[0])
        | StrOutputParser()
    )
    email_chain = prompt_email | llm
    final_chain = (
        subject_chain
        | (lambda subject: {"subject": subject, "audience": audience})
        | email_chain
    )
    response = final_chain.invoke(
        {"product_name": product_name, "product_features": product_features}
    )
    st.write(response.content)
Create a serving endpoint in the Databricks AI Gateway for the Claude 3.5 model with the name "claude-3-5-sonnet-20240620-v1-0".
Create an access token in Databricks that the code snippet above can use to interact with the Claude model served on the AI Gateway.
Create an access token in Dynatrace with permissions to ingest OpenTelemetry traces, metrics, and logs.
In the Python project, set the environment variable TRACELOOP_HEADERS using the Dynatrace token created above, and set TRACELOOP_BASE_URL to the Dynatrace URL https://.live.dynatrace.com/api/v2/otlp.
Set the DATABRICKS_HOST environment variable to the Databricks workspace URL.
Set the DATABRICKS_TOKEN environment variable to the Databricks access token created above.
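For illustration, the environment setup can be sketched as below. All values are placeholders, not real credentials; the Dynatrace URL uses a hypothetical environment ID, and the TRACELOOP_HEADERS format (URL-encoded header pairs with an Api-Token authorization) is an assumption based on the Traceloop SDK's header convention, so verify it against your SDK version.

```shell
# Placeholder values -- substitute your own environment ID and tokens.
export TRACELOOP_BASE_URL="https://abc12345.live.dynatrace.com/api/v2/otlp"
# Assumed format: URL-encoded "Header=Value" pairs, per Traceloop SDK docs.
export TRACELOOP_HEADERS="Authorization=Api-Token%20dt0c01.SAMPLETOKEN"
export DATABRICKS_HOST="https://my-workspace.cloud.databricks.com"
export DATABRICKS_TOKEN="dapi-SAMPLETOKEN"
```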
Run the Python module created above with the command "streamlit run sample.py".
In the browser, open the application started by Streamlit and enter traceloop, tracing, and GenAI engineer for the product name, product features, and audience respectively.
In Dynatrace, navigate to Distributed Tracing and check whether the trace has the token counts populated.
👍 Expected behavior
Completion tokens and total tokens should be populated in the llm.usage section in Dynatrace.
👎 Actual Behavior with Screenshots
Token counts are not populated. Refer to the attached screenshot.
🤖 Python Version
3.10
📃 Provide any additional context for the Bug.
No response
👀 Have you spent some time to check if this bug has been raised before?
I checked and didn't find a similar issue.
Are you willing to submit PR?
Yes I am willing to submit a PR!
Hey there, @sristiraj! I'm here to help you with any bugs, questions, or contributions. Let's tackle this together!
The issue you're experiencing with missing token counts in traces sent to Dynatrace might be related to how the Openllmetry library integrates with Langchain and handles token count extraction and reporting. Here are a few points to consider:
Token Count Extraction: The integration between Langchain and Openllmetry captures token usage by setting attributes on spans and recording metrics. Ensure that the code responsible for setting these attributes is correctly implemented and that the necessary attributes (LLM_USAGE_PROMPT_TOKENS, LLM_USAGE_COMPLETION_TOKENS, LLM_USAGE_TOTAL_TOKENS) are being set on the spans [1].
Configuration and Environment Variables: Verify that all necessary environment variables and configurations are correctly set. Although specific configuration options for enabling token count reporting weren't found, ensure that your setup aligns with the OpenTelemetry and Dynatrace requirements.
Compatibility with Dynatrace: There might be compatibility issues or specific configurations needed for Dynatrace to correctly interpret and display token counts. Double-check the Dynatrace setup and ensure that it is configured to ingest and display OpenTelemetry trace data correctly.
Error Handling: If there are errors or missing data when extracting token counts, ensure that the error handling in your code is robust enough to capture and log these issues for further investigation.
If these suggestions don't resolve the issue, consider reviewing the specific methods responsible for setting token counts, such as _set_token_usage and _process_response_item, to ensure they are functioning as expected [2][3].
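One plausible root cause when a gateway sits in front of the model is a key-name mismatch: Anthropic-on-Bedrock responses typically report input_tokens/output_tokens, while an instrumentation may look only for prompt_tokens/completion_tokens. The helper below is hypothetical (not part of OpenLLMetry; its name and key list are illustrative assumptions) and just shows how to probe what a response's metadata dict actually contains:

```python
# Hypothetical helper for inspecting token usage in response metadata.
# The key names checked are assumptions covering common provider formats.
def extract_usage(response_metadata: dict) -> dict:
    usage = (response_metadata.get("usage")
             or response_metadata.get("token_usage")
             or {})
    prompt = usage.get("prompt_tokens") or usage.get("input_tokens") or 0
    completion = usage.get("completion_tokens") or usage.get("output_tokens") or 0
    total = usage.get("total_tokens") or (prompt + completion)
    return {"prompt_tokens": prompt,
            "completion_tokens": completion,
            "total_tokens": total}

# Anthropic-on-Bedrock style payloads often use input_tokens/output_tokens:
print(extract_usage({"usage": {"input_tokens": 120, "output_tokens": 30}}))
# -> {'prompt_tokens': 120, 'completion_tokens': 30, 'total_tokens': 150}
```

Printing `response.response_metadata` from the ChatDatabricks call in the repro app and feeding it through a probe like this would show whether the gateway returns usage data at all, and under which key names.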
Which component is this bug for?
Langchain Instrumentation