Observe your LLMs with Langfuse

If you’re working in generative AI, chances are you’re using a large language model in your day-to-day work. It could be Azure OpenAI, Anthropic’s Claude, or even one from AWS Bedrock. Whichever LLM it is, there is a great deal to learn from analyzing the calls we make and the responses we get back.

This guide focuses on debugging Azure OpenAI calls made through LangChain.

Before we begin, be sure to install the following packages:

pip install -U -q langchain langchain_openai langfuse

Let’s start by defining our LLM:

import os

from langchain_openai import AzureChatOpenAI

llm = AzureChatOpenAI(
    azure_endpoint=os.getenv("AZURE_OPENAI_ENDPOINT"),
    deployment_name=os.getenv("DEPLOYMENT_NAME"),
    api_key=os.getenv("AZURE_OPENAI_API_KEY"),
    api_version=os.getenv("OPENAI_API_VERSION"),
    temperature=0.4,
    max_tokens=1000,
)
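
With the client configured, a quick sanity check confirms the connection works. A minimal sketch, assuming your Azure deployment exists and the environment variables above are set:

# Send a single prompt and print the model's reply
response = llm.invoke("Say hello in one short sentence.")
print(response.content)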

We will be using Langfuse, an open-source LLM tracing application, to monitor our LLM usage. LangSmith would have been my first choice, but it no longer provides a self-hosted option.

[Image: Langfuse]

Let’s first set up Langfuse locally; you will need Docker for this.

git clone https://github.com/langfuse/langfuse.git
cd langfuse
docker compose up

Now you can access your local Langfuse instance at http://localhost:3000/.
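
From the Langfuse UI you can create a project and copy its public and secret keys. With those in hand, wiring Langfuse into LangChain comes down to passing its callback handler on each call. A minimal sketch, assuming the Langfuse Python SDK v2's LangChain callback integration (langfuse.callback.CallbackHandler) and placeholder environment variable names for the keys:

import os

from langfuse.callback import CallbackHandler

# Point the handler at our self-hosted Langfuse and a project's API keys.
# The variable names are placeholders; use whatever you configured.
handler = CallbackHandler(
    public_key=os.getenv("LANGFUSE_PUBLIC_KEY"),
    secret_key=os.getenv("LANGFUSE_SECRET_KEY"),
    host="http://localhost:3000",
)

# Any call that carries the handler shows up as a trace in Langfuse.
response = llm.invoke(
    "Summarize what LLM tracing is in one sentence.",
    config={"callbacks": [handler]},
)
print(response.content)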