API Reference

This document details the API for the openai-structured library, which provides a Python interface for working with OpenAI Structured Outputs.

Version Compatibility

Python Support

Python 3.9+: Full support
Python 3.8: Limited support (no TypedDict)
Python 3.7 and below: Not supported

API Versions

OpenAI API: v2024-02-15 or later
JSON Schema: Draft 7
Pydantic: v2.0+

Client

The client module provides functions for working with OpenAI Structured Outputs, featuring streaming support and efficient buffer management.

Functions

openai_structured.client.async_openai_structured_stream(*, messages: List[Dict[str, str]], schema: Dict[str, Any], model: str = 'gpt-4o', temperature: float = 0.0, max_tokens: int | None = None, top_p: float = 1.0, frequency_penalty: float = 0.0, presence_penalty: float = 0.0, timeout: float = 60.0, stream_config: StreamConfig | None = None, validate_schema: bool = True, on_log: Callable[[str, Any], Awaitable[None]] | None = None) → AsyncGenerator[Dict[str, Any], None]

Make a streaming OpenAI API call using OpenAI Structured Outputs.

Parameters:

messages – List of chat messages in OpenAI format
schema – JSON Schema defining the expected response structure
model – Model to use (default: “gpt-4o”)
temperature – Sampling temperature (default: 0.0)
max_tokens – Maximum tokens to generate (default: model-specific)
top_p – Top-p sampling parameter (default: 1.0)
frequency_penalty – Frequency penalty (default: 0.0)
presence_penalty – Presence penalty (default: 0.0)
timeout – API timeout in seconds (default: 60.0)
stream_config – Stream configuration (default: None)
validate_schema – Whether to validate response against schema (default: True)
on_log – Optional callback for structured logging events (default: None) - Receives LogEvent objects with event type and data - Used for custom logging, monitoring, and debugging - Sensitive data is automatically redacted

Returns:

AsyncGenerator yielding structured data chunks

Raises:

Various exceptions (see Error Handling section)

Example:

schema = {
    "type": "object",
    "properties": {
        "summary": {"type": "string"},
        "key_points": {
            "type": "array",
            "items": {"type": "string"}
        }
    }
}

messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Analyze this text: " + text}
]

async for chunk in async_openai_structured_stream(
    messages=messages,
    schema=schema,
    model="gpt-4o",
    temperature=0.7,
    stream_config=StreamConfig(
        max_buffer_size=1024 * 1024,  # 1MB
        cleanup_threshold=512 * 1024   # 512KB
    )
):
    print(chunk)

openai_structured.client.supports_structured_output(model_name: str) → bool

Check if a model supports OpenAI Structured Outputs.

This function validates whether a given model name supports OpenAI Structured Outputs, handling both aliases and dated versions. For dated versions, it ensures they meet minimum version requirements.

Parameters:: model_name – The model name to validate. Can be either: - an alias (e.g., “gpt-4o”) - dated version (e.g., “gpt-4o-2024-08-06”) - newer version (e.g., “gpt-4o-2024-09-01”)
Returns:: True if the model supports OpenAI Structured Outputs, False otherwise

Example:

# Check alias
if supports_structured_output("gpt-4o"):
    print("Model supports OpenAI Structured Outputs")

# Check dated version
if supports_structured_output("gpt-4o-2024-08-06"):
    print("Version is supported")

# Check unsupported model
if not supports_structured_output("gpt-3.5-turbo"):
    print("Model does not support OpenAI Structured Outputs")

Notes:

Aliases (e.g., “gpt-4o”) are automatically resolved to the latest compatible version
Dated versions must meet minimum version requirements
For dated versions, both the base model and date are validated
Newer versions are accepted if the base model is supported

openai_structured.client.get_context_window_limit(model_name: str) → int

Get the context window limit (maximum total tokens) for a given model.

Parameters:: model_name – The model name (e.g., “gpt-4o”, “o1”, “o3-mini”)
Returns:: Maximum context window size in tokens

Example:

limit = get_context_window_limit("gpt-4o")  # Returns 128,000
limit = get_context_window_limit("o1")      # Returns 200,000
limit = get_context_window_limit("o3-mini") # Returns 200,000

openai_structured.client.get_default_token_limit(model_name: str) → int

Get the default output token limit for a given model.

Parameters:: model_name – The model name (e.g., “gpt-4o”, “o1”, “o3-mini”)
Returns:: Maximum output tokens allowed

Example:

limit = get_default_token_limit("gpt-4o")  # Returns 16,384
limit = get_default_token_limit("o1")      # Returns 100,000
limit = get_default_token_limit("o3-mini") # Returns 100,000

Note: The actual usable output tokens may be slightly less due to invisible reasoning tokens.

Classes

class openai_structured.client.StreamConfig

Configuration for streaming behavior with OpenAI Structured Outputs.

Parameters:

max_buffer_size – Maximum buffer size in bytes (default: 1MB)
cleanup_threshold – Buffer cleanup threshold in bytes (default: 512KB)
chunk_size – Stream chunk size in bytes (default: 8KB)
max_cleanup_attempts – Maximum number of cleanup attempts (default: 3)
max_parse_errors – Maximum number of parse errors before failing (default: 5)
log_size_threshold – Size change that triggers logging (default: 100KB)

Example:

config = StreamConfig(
    max_buffer_size=1024 * 1024,  # 1MB
    cleanup_threshold=512 * 1024,  # 512KB
    chunk_size=8192               # 8KB
)

class openai_structured.client.StreamBuffer

Internal buffer management for streaming OpenAI Structured Outputs responses.

Parameters:

config – StreamConfig instance controlling buffer behavior
schema – Optional Pydantic model class for validation

Attributes:

total_bytes: Current buffer size in bytes parse_errors: Number of parse errors encountered cleanup_attempts: Number of cleanup attempts performed _cleanup_stats: Dictionary tracking cleanup operations:

strategy: Cleanup strategy used (ijson_parsing or pattern_matching)

cleaned_bytes: Number of bytes cleaned

error_context: Context around errors when they occur

validation_error: Details of validation errors

json_error: Details of JSON parsing errors

Methods:

write(content: str) -> None: Write content to the buffer. Raises BufferOverflowError if size exceeds limit.
process_stream_chunk(content: str, on_log: Optional[Callable]) -> Optional[Any]: Process a stream chunk and return parsed content if complete.
cleanup() -> None: Attempt to clean the buffer by finding and preserving valid JSON.
reset() -> None: Reset the buffer state while preserving configuration.
close() -> Optional[BaseModel]: Close the buffer, attempt to extract a final response, and clean up resources.

Example:

buffer = StreamBuffer(
    config=StreamConfig(),
    schema=MyPydanticModel
)

try:
    result = buffer.process_stream_chunk(chunk)
    if result:
        print(f"Valid data: {result}")
except BufferError as e:
    print(f"Buffer error: {e}")

Errors

The errors module defines custom exceptions used by the library.

Exceptions

exception openai_structured.errors.APIResponseError

Base exception for API response errors. Contains detailed information about the failed response.

Attributes:

response_id (Optional[str]): The OpenAI response ID for tracking and debugging
content (Optional[str]): The raw response content that caused the error

Example:

try:
    result = await async_openai_structured_call(...)
except APIResponseError as e:
    print(f"Error ID: {e.response_id}")
    print(f"Error content: {e.content}")
    print(f"Error message: {str(e)}")

exception openai_structured.errors.InvalidResponseFormatError

Raised when the API response doesn’t match the expected format. Inherits from APIResponseError, providing response_id and content.

Example:

try:
    result = await async_openai_structured_call(...)
except InvalidResponseFormatError as e:
    print(f"Invalid format in response {e.response_id}")
    print(f"Raw content: {e.content}")

exception openai_structured.errors.EmptyResponseError

Raised when the API returns an empty response. Inherits from APIResponseError, providing response_id and content.

Example:

try:
    result = await async_openai_structured_call(...)
except EmptyResponseError as e:
    print(f"Empty response with ID: {e.response_id}")

exception openai_structured.errors.StreamBufferError

Raised when stream buffer limits are exceeded.

Causes:

Buffer size exceeds limit
Cleanup fails
Memory allocation fails

Example:

try:
    async for chunk in async_openai_structured_stream(...):
        process_chunk(chunk)
except StreamBufferError as e:
    print(f"Buffer overflow: {e}")

exception openai_structured.errors.StreamInterruptedError

Raised when the stream is interrupted unexpectedly.

Causes:

Network issues
API errors
Client disconnection
Timeouts

Example:

try:
    async for chunk in async_openai_structured_stream(...):
        process_chunk(chunk)
except StreamInterruptedError as e:
    print(f"Stream interrupted: {e}")

exception openai_structured.errors.StreamParseError

Raised when stream content cannot be parsed.

Causes:

Invalid JSON
Schema mismatch
Encoding issues
Partial response

Example:

try:
    async for chunk in async_openai_structured_stream(...):
        process_chunk(chunk)
except StreamParseError as e:
    print(f"Parse error: {e}")

exception openai_structured.errors.ValidationError

Raised when schema validation fails.

Causes:

Schema violations
Type mismatches
Missing fields
Format errors

Example:

try:
    async for chunk in async_openai_structured_stream(...):
        process_chunk(chunk)
except ValidationError as e:
    print(f"Validation error: {e}")

Note

Token limit validation is performed using the validate_token_limits function, which raises a ValueError if limits are exceeded.

exception openai_structured.errors.TokenLimitError

Raised when token limits are exceeded for a model.

Attributes:

requested_tokens (Optional[int]): The number of tokens requested
model_limit (Optional[int]): The maximum token limit for the model

Example:

try:
    result = await async_openai_structured_call(
        client=client,
        model="gpt-4o",
        max_tokens=20_000,  # Exceeds limit
        output_schema=MySchema,
        user_prompt="..."
    )
except TokenLimitError as e:
    print(f"Token limit exceeded: requested {e.requested_tokens} tokens")
    print(f"Model limit is {e.model_limit} tokens")

Note:: Token limits vary by model: - gpt-4o: 16,384 output tokens - o1: 100,000 output tokens - o3-mini: 100,000 output tokens

exception openai_structured.errors.TokenParameterError

Raised when both max_output_tokens and max_completion_tokens are used. These parameters are mutually exclusive as they control the same functionality. Only one should be used in a request.

Example:

try:
    client.complete(
        "gpt-4o",
        max_output_tokens=100,
        max_completion_tokens=100
    )
except TokenParameterError as e:
    print(f"Token error: {e}")
    # Output:
    # "Cannot specify both 'max_output_tokens' and 'max_completion_tokens' parameters.
    # These parameters are mutually exclusive as they control the same functionality.
    # Choose one:
    # - max_output_tokens (recommended)
    # - max_completion_tokens (legacy)"

exception openai_structured.errors.ModelNotSupportedError

Raised when a model is not supported by the client.

Example:

try:
    registry.get_capabilities("unsupported-model")
except ModelNotSupportedError as e:
    print(f"Model error: {e}")
    # Output:
    # "Model 'unsupported-model' is not supported.
    # Available models:
    # - Dated models: gpt-4o-2024-08-06, o1-2024-12-17
    # - Aliases: gpt-4o, o1
    # Note: For dated models, use format: base-YYYY-MM-DD (e.g. gpt-4o-2024-08-06)"

exception openai_structured.errors.VersionTooOldError

Raised when a model version is older than the minimum supported version.

Example:

try:
    registry.get_capabilities("gpt-4o-2024-07-01")
except VersionTooOldError as e:
    print(f"Version error: {e}")
    # Output:
    # "Model 'gpt-4o-2024-07-01' version 2024-07-01 is too old.
    # Minimum supported version: 2024-08-06
    # Note: Use the alias 'gpt-4o' to always get the latest version"

exception openai_structured.errors.InvalidDateError

Raised when a model version has invalid date components.

Example:

try:
    registry.get_capabilities("gpt-4o-2024-13-01")
except InvalidDateError as e:
    print(f"Date error: {e}")
    # Output:
    # "Invalid date format in model version: Month must be between 1 and 12
    # Use format: YYYY-MM-DD (e.g. 2024-08-06)"

exception openai_structured.errors.OpenAIClientError

Base exception for client-side errors. Used for various validation and parameter errors.

Example:

try:
    capabilities.validate_parameter("reasoning_effort", "invalid")
except OpenAIClientError as e:
    print(f"Parameter error: {e}")
    # Output:
    # "Invalid value 'invalid' for parameter 'reasoning_effort'.
    # Description: Controls the model's reasoning depth.
    # Allowed values: low, medium, high"

Error Handling Examples

Here are comprehensive examples of handling different error scenarios:

Basic Error Recovery

from openai_structured import (
    APIResponseError, StreamBufferError, StreamInterruptedError,
    StreamParseError, ValidationError, ModelNotSupportedError,
    StreamBuffer
)
from openai_structured.errors import TokenLimitError
from openai import APIError, RateLimitError, APITimeoutError

async def process_with_basic_recovery():
    stream_config = StreamConfig(
        max_buffer_size=1024 * 1024,  # 1MB
        cleanup_threshold=512 * 1024   # 512KB
    )
    buffer = StreamBuffer(config=stream_config)

    try:
        async for chunk in async_openai_structured_stream(
            model="gpt-4o",
            output_schema=OutputSchema,
            system_prompt="Analyze this",
            user_prompt="Sample text",
            stream_config=stream_config
        ):
            process_chunk(chunk)

    except ModelNotSupportedError as e:
        # Handle model compatibility issues
        print(f"Model not supported: {e}")
        print("Available models: gpt-4o, gpt-4o-mini, o1")

    except ValidationError as e:
        # Handle schema validation failures
        print(f"Schema validation failed: {e}")
        print("Fields with errors:", e.errors())

    except StreamBufferError as e:
        # Handle buffer-related issues
        print(f"Buffer error: {e}")
        if hasattr(e, '_cleanup_stats'):
            print("Cleanup attempts:", e._cleanup_stats['attempts'])
            print("Last buffer size:", e._cleanup_stats['bytes_before'])

    except StreamParseError as e:
        # Handle JSON parsing issues
        print(f"Parse error after {e.attempts} attempts")
        print(f"Last error: {e.last_error}")

    except APIResponseError as e:
        # Handle API response issues with detailed info
        print(f"API Response Error (ID: {e.response_id})")
        print(f"Response content: {e.content}")

Advanced Error Recovery

from typing import Optional, Dict, Any
import asyncio
from tenacity import retry, stop_after_attempt, wait_exponential

class ErrorHandler:
    def __init__(self, max_retries: int = 3):
        self.max_retries = max_retries
        self.current_attempt = 0
        self.last_error: Optional[Exception] = None
        self.cleanup_stats: Dict[str, Any] = {}

    async def process_with_retry(self):
        while self.current_attempt < self.max_retries:
            try:
                async for chunk in async_openai_structured_stream(
                    client=client,
                    model="gpt-4o",
                    output_schema=OutputSchema,
                    system_prompt="Analyze this",
                    user_prompt="Sample text",
                    timeout=30.0
                ):
                    await self.process_chunk(chunk)
                break  # Success, exit loop

            except (StreamBufferError, ValidationError) as e:
                # Don't retry these errors
                self.log_error("Permanent error, not retrying", e)
                raise

            except StreamInterruptedError as e:
                # Retry with exponential backoff
                await self.handle_interrupted_stream(e)

            except APITimeoutError:
                # Retry with increased timeout
                await self.handle_timeout()

            except RateLimitError:
                # Retry with increased wait time
                await self.handle_rate_limit()

            except APIResponseError as e:
                # Log detailed response info and retry
                await self.handle_api_response_error(e)

            except Exception as e:
                # Unexpected error
                self.log_error("Unexpected error", e)
                raise

            self.current_attempt += 1

        if self.last_error:
            raise self.last_error

    async def handle_interrupted_stream(self, error: StreamInterruptedError):
        self.last_error = error
        wait_time = min(2 ** self.current_attempt, 30)  # Max 30 seconds
        self.log_error(f"Stream interrupted, retrying in {wait_time}s", error)
        await asyncio.sleep(wait_time)

    async def handle_timeout(self):
        new_timeout = 30 * (self.current_attempt + 1)  # Increase timeout
        self.log_error(f"Timeout, retrying with {new_timeout}s timeout")
        # Update client timeout for next attempt

    async def handle_rate_limit(self):
        wait_time = 30 * (self.current_attempt + 1)  # Increase wait time
        self.log_error(f"Rate limited, waiting {wait_time}s")
        await asyncio.sleep(wait_time)

    async def handle_api_response_error(self, error: APIResponseError):
        self.last_error = error
        self.log_error(
            f"API error (ID: {error.response_id})",
            f"Content: {error.content}"
        )
        await asyncio.sleep(5)  # Brief wait before retry

    def log_error(self, message: str, error: Optional[Exception] = None):
        print(f"Attempt {self.current_attempt + 1}/{self.max_retries}: {message}")
        if error:
            print(f"Error details: {error}")

# Usage
handler = ErrorHandler(max_retries=3)
await handler.process_with_retry()

These examples demonstrate:

Different error handling strategies: - Simple error catching and reporting - Sophisticated retry logic with exponential backoff - Error-specific handling and recovery
Proper resource cleanup using finally
Detailed error information extraction: - Response IDs from APIResponseError - Cleanup statistics from StreamBufferError - Parse attempt counts from StreamParseError
Advanced retry mechanisms: - Rate limit handling with increasing delays - Timeout handling with increasing timeouts - Stream interruption recovery
Structured error logging and monitoring

Example Usage

Basic Streaming

from openai_structured import async_openai_structured_stream, StreamConfig
from openai_structured.errors import StreamBufferError, ValidationError

async def process_stream():
    try:
        async for chunk in async_openai_structured_stream(
            client=client,
            model="gpt-4o-2024-08-06",
            output_schema=OutputSchema,
            system_prompt="Analyze this text",
            user_prompt="Sample text to analyze",
            stream_config=StreamConfig(
                max_buffer_size=1024 * 1024
            )
        ):
            print(chunk)
    except ValueError as e:
        if "token limit" in str(e).lower():
            print(f"Token limit exceeded: {e}")
        else:
            raise
    except StreamBufferError as e:
        print(f"Buffer error: {e}")
    except ValidationError as e:
        print(f"Validation error: {e}")

Error Recovery

from openai_structured.errors import StreamInterruptedError
import asyncio

async def process_with_retry(max_retries=3):
    last_error = None
    for attempt in range(max_retries):
        try:
            async for chunk in async_openai_structured_stream(...):
                process_chunk(chunk)
            break
        except StreamInterruptedError as e:
            last_error = e
            if attempt == max_retries - 1:
                raise
            await asyncio.sleep(1)
        except (StreamBufferError, ValidationError) as e:
            # Don't retry these errors
            raise

    if last_error:
        raise last_error

Resource Management

async def process_with_timeout():
    try:
        async for chunk in async_openai_structured_stream(
            messages=[...],
            schema={...},
            timeout=30.0
        ):
            process_chunk(chunk)
    except asyncio.TimeoutError:
        print("Operation timed out")
    finally:
        cleanup_resources()

Schema Validation

from openai_structured.errors import ValidationError

schema = {
    "type": "object",
    "properties": {
        "name": {
            "type": "string",
            "pattern": "^[A-Za-z]+$"
        },
        "age": {
            "type": "integer",
            "minimum": 0,
            "maximum": 150
        }
    },
    "required": ["name", "age"]
}

try:
    async for chunk in async_openai_structured_stream(
        messages=[...],
        schema=schema,
        validate_schema=True
    ):
        process_chunk(chunk)
except ValidationError as e:
    print(f"Validation failed: {e}")

Schema Validation

from openai_structured.errors import ValidationError

schema = {
    "type": "object",
    "properties": {
        "name": {
            "type": "string",
            "pattern": "^[A-Za-z]+$"
        },
        "age": {
            "type": "integer",
            "minimum": 0,
            "maximum": 150
        }
    },
    "required": ["name", "age"]
}

try:
    async for chunk in async_openai_structured_stream(
        messages=[...],
        schema=schema,
        validate_schema=True
    ):
        process_chunk(chunk)
except ValidationError as e:
    print(f"Validation failed: {e}")

Error Handling

The library raises the following exceptions:

StreamBufferError: Raised when the buffer size exceeds the configured maximum.
StreamInterruptedError: Raised when the stream is interrupted before completion.
StreamParseError: Raised when the stream content cannot be parsed as valid JSON.
ValidationError: Raised when the response does not match the provided schema.
APIError: Raised when the OpenAI API returns an error.
ValueError: Raised in several cases:
- When token limits are exceeded (input too long or output limit exceeded)
- When an invalid model version is provided
- When schema validation fails

Example error handling:

try:
    async for chunk in async_openai_structured_stream(
        client=client,
        model="gpt-4o-2024-08-06",
        output_schema=OutputSchema,
        system_prompt="Analyze this text",
        user_prompt="Sample text to analyze",
    ):
        process_chunk(chunk)
except ValueError as e:
    if "token limit" in str(e).lower():
        print(f"Token limit exceeded: {e}")
        print("Consider reducing input size or using a model with larger context")
    else:
        raise
except StreamBufferError as e:
    print(f"Buffer overflow: {e}")
except StreamInterruptedError as e:
    print(f"Stream interrupted: {e}")
except ValidationError as e:
    print(f"Validation error: {e}")
except APIError as e:
    print(f"API error: {e}")
finally:
    await client.close()

Logging Events

The library provides structured logging through the on_log callback:

class openai_structured.errors.LogEvent

Structured logging event.

Parameters:

type – Event type (e.g., “buffer.size”, “stream.start”, “error”)
data – Event data (sensitive information automatically redacted)

Security:: The library automatically redacts sensitive information in logs: - API keys and tokens - Authentication headers - Other security-sensitive fields This protection applies to all logging events, including errors and API responses.

Common event types:

buffer.size: Buffer size changes
stream.start: Stream creation
stream.end: Stream completion
stream.chunk: Chunk received
cleanup.stats: Buffer cleanup statistics
error: Error details (sensitive data redacted)
parse.attempt: Parse attempt details
validation: Schema validation results

Example logging implementation:

import logging
logger = logging.getLogger(__name__)

async def log_callback(event: LogEvent, level: str):
    # All events are automatically redacted for security
    if event.type == "error":
        logger.error("Error: %s", event.data, exc_info=True)  # API keys and auth data redacted
    elif event.type == "buffer.size":
        logger.info("Buffer size: %d bytes", event.data["size"])
    elif event.type == "cleanup.stats":
        logger.debug("Cleanup stats: %s", event.data)
    else:
        logger.debug("Event %s: %s", event.type, event.data)

async for chunk in async_openai_structured_stream(
    model="gpt-4o-2024-08-06",
    output_schema=OutputSchema,
    system_prompt="Analyze this text",
    user_prompt="Sample text to analyze",
    on_log=log_callback
):
    process_chunk(chunk)

Data Processing Features

The template engine includes powerful data processing capabilities for analyzing and transforming structured data.

Data Transformation

# Sort items by a key
{{ items|sort_by('timestamp') }}

# Group items by category
{% set grouped = items|group_by('category') %}
{% for category, items in grouped.items() %}
    {{ category }}: {{ items|length }} items
{% endfor %}

# Filter items
{{ items|filter_by('status', 'active') }}

# Extract values
{{ items|pluck('name') }}

# Get unique values
{{ items|unique }}

# Count frequencies
{{ items|frequency }}

Aggregation Functions

# Basic aggregation
{% set stats = data|aggregate('value') %}
Count: {{ stats.count }}
Sum: {{ stats.sum }}
Average: {{ stats.avg }}
Min: {{ stats.min }}
Max: {{ stats.max }}

# Aggregate nested data
{% set user_stats = users|aggregate('age') %}
Average age: {{ user_stats.avg }}

Data Analysis

# Generate data summary
{% set summary = summarize(data) %}
Total records: {{ summary.total_records }}
{% for field, stats in summary.fields.items() %}
    {{ field }}:
    - Type: {{ stats.type }}
    - Unique values: {{ stats.unique_values }}
    - Null count: {{ stats.null_count }}
{% endfor %}

# Create pivot tables
{% set pivot = pivot_table(data, index='category', values='amount', aggfunc='sum') %}
{{ pivot|dict_to_table }}

Table Formatting

# Basic table
{{ table(['Name', 'Age'], [['Alice', 25], ['Bob', 30]]) }}

# Aligned table
{{ align_table(['Name', 'Age'], [['Alice', 25], ['Bob', 30]], ['left', 'right']) }}

# Convert dict to table
{{ stats|dict_to_table }}

# Convert list to table
{{ users|list_to_table(headers=['Name', 'Age']) }}

# Auto-format any data structure
{{ data|auto_table }}

Examples

Here are some practical examples combining multiple features:

# Analyze user activity by category
{% set user_activity = data|group_by('category') %}
{% for category, items in user_activity.items() %}
    Category: {{ category }}
    {{ items|aggregate('duration')|dict_to_table }}
{% endfor %}

# Generate summary report
{% set stats = data|aggregate('value') %}
{% set distribution = data|pluck('category')|frequency %}

Summary Statistics:
{{ stats|dict_to_table }}

Category Distribution:
{{ distribution|dict_to_table }}

# Create detailed pivot analysis
{% set pivot_data = pivot_table(data,
                              index='category',
                              values='amount',
                              aggfunc='mean') %}
Average Amount by Category:
{{ pivot_data|dict_to_table }}

Testing

The library provides utilities for testing applications that use openai-structured.

Stream Testing

When testing streaming functionality, you should test both the iteration mechanism and content processing:

# Test streaming functionality
results = list(openai_structured_stream(
    client=client,  # Use a real client with test credentials
    model="gpt-4o",
    output_schema=SimpleMessage,
    user_prompt="test"
))

# Verify results
assert len(results) > 0
for result in results:
    assert isinstance(result, SimpleMessage)

Error Handling

Test error scenarios by configuring your client with invalid credentials or using network conditions that would trigger errors:

from openai_structured.errors import StreamInterruptedError

# Test with invalid API key
client = OpenAI(api_key="invalid-key")

with pytest.raises(StreamInterruptedError):
    list(openai_structured_stream(
        client=client,
        model="gpt-4o",
        output_schema=SimpleMessage,
        user_prompt="test"
    ))

Async Testing

For async code, use pytest-asyncio and test both successful and error cases:

@pytest.mark.asyncio
async def test_async_stream():
    client = AsyncOpenAI()  # Configure with test credentials

    results = []
    async for result in async_openai_structured_stream(
        client=client,
        model="gpt-4o",
        output_schema=SimpleMessage,
        user_prompt="test"
    ):
        results.append(result)

    assert len(results) > 0
    for result in results:
        assert isinstance(result, SimpleMessage)

Note

o1 and o3 models only support a limited set of parameters:

max_completion_tokens
reasoning_effort

Attempting to use other parameters (temperature, top_p, etc.) will raise an OpenAIClientError.

Model Support

The library supports the following OpenAI models with structured output:

Production Models

gpt-4o-2024-08-06
- Full JSON schema support
- 128K context window
- 16K output tokens
- Supports streaming
gpt-4o-mini-2024-07-18
- 128K context window
- 16K output tokens
- Supports streaming
o1-2024-12-17
- 200K context window
- 100K output tokens
- Limited parameter support
- Does not support streaming
o3-mini-2025-01-31
- 200K context window
- 100K output tokens
- Limited parameter support
- Supports streaming

Limited Parameter Support

o1 and o3 models only support the following parameters:

max_completion_tokens
reasoning_effort

Attempting to use other parameters (temperature, top_p, etc.) will raise an OpenAIClientError.

Streaming Support

Different models have varying streaming support:

# o1-2024-12-17 does not support streaming
try:
    async for response in async_openai_structured_stream(
        model="o1-2024-12-17",  # Will raise 400 error
        stream=True
    ):
        process_response(response)
except OpenAIClientError as e:
    # Error: "Unsupported value: 'stream' does not support true with this model"
    handle_error(e)

# o3 main model does not support streaming
try:
    async for response in async_openai_structured_stream(
        model="o3",  # Will raise 400 error
        stream=True
    ):
        process_response(response)
except OpenAIClientError as e:
    # Error: "The main o3 model does not support streaming"
    handle_error(e)

# o3-mini and o3-mini-high support streaming
async for response in async_openai_structured_stream(
    model="o3-mini",  # Works correctly
    stream=True
):
    process_response(response)

Exceptions

exception openai_structured.errors.OpenAIClientError

Base exception for client-side errors. Raised in several cases:

When attempting to use unsupported parameters with o1/o3 models
When model version is not supported
When validation fails