Introduction
The Arize Python SDK v8 is a comprehensive client library for building, evaluating, and monitoring AI applications. Whether you’re developing LLM-powered applications or traditional ML models, this SDK provides the tools you need for complete observability and continuous improvement.
Arize Platform:
- Arize AX - Enterprise AI engineering platform with embedded AI Copilot
- Phoenix - Open-source tracing and evaluation framework
- OpenInference - Instrumentation for LLM applications
Key Features
- Tracing - Trace your LLM application’s runtime using OpenTelemetry-based instrumentation
- Evaluation - Leverage LLMs to benchmark your application’s performance
- Datasets - Create versioned datasets for experimentation, evaluation, and fine-tuning
- Experiments - Track and evaluate changes to prompts, models, and retrieval
- Playground (Coming Soon) - Optimize prompts, compare models, and replay traced LLM calls
- Prompt Management (Coming Soon) - Manage prompt changes with version control and experimentation
Installation
Install the base package with pip install arize.
Optional Dependencies
The following optional extras provide specialized functionality.
Note: The otel extra installs the arize-otel package, which is also available as a standalone package. If you only need auto-instrumentation without the full SDK, install arize-otel directly.
| Extra | Install Command | What It Provides |
|---|---|---|
| otel | pip install arize[otel] | OpenTelemetry auto-instrumentation package (arize-otel) for automatic tracing |
| embeddings | pip install arize[embeddings] | Automatic embedding generation for NLP, CV, and structured data (Pillow, datasets, tokenizers, torch, transformers) |
| mimic | pip install arize[mimic] | MIMIC explainer for model interpretability |
Getting Started
The ArizeClient is the recommended entry point for all SDK operations. It exposes a sub-client for each area of the platform:
- client.datasets - Dataset management
- client.experiments - Experiment tracking and evaluation
- client.models - Traditional ML model logging
- client.projects - Project management
- client.spans - LLM tracing and span operations
- etc.
Configuration Options
Configure the client with constructor parameters or environment variables. Each configuration parameter follows this resolution order:
- Constructor parameter (highest priority)
- Environment variable
- Built-in default (lowest priority)
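The resolution order above can be sketched in a few lines of plain Python. This is only an illustration of the precedence rule, not the SDK's own code: resolve_setting is a hypothetical helper, and the example borrows the ARIZE_LOG_LEVEL variable and its INFO default from the Logging section.

```python
import os


def resolve_setting(explicit, env_var, default):
    """Return the explicit value, else the environment variable, else the default.

    Hypothetical helper: it mirrors the documented precedence but is not
    part of the Arize SDK.
    """
    if explicit is not None:
        return explicit
    env_value = os.environ.get(env_var)
    if env_value is not None:
        return env_value
    return default


# Environment variable set, no constructor argument: the env value is used.
os.environ["ARIZE_LOG_LEVEL"] = "DEBUG"
from_env = resolve_setting(None, "ARIZE_LOG_LEVEL", "INFO")

# Constructor argument given: it takes priority over the environment.
from_arg = resolve_setting("WARNING", "ARIZE_LOG_LEVEL", "INFO")

# Neither given: the built-in default applies.
del os.environ["ARIZE_LOG_LEVEL"]
from_default = resolve_setting(None, "ARIZE_LOG_LEVEL", "INFO")
```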
Basic Configuration
Authentication
Authenticate using API keys obtained from the Arize Platform. The API key is required for all SDK operations and can be provided via constructor parameter or environment variable. If not provided, the SDK will raise a MissingAPIKeyError.
Defaults:
- api_key - required (no default)
Region
Specify the Arize region you want to interact with (e.g., US_CENTRAL_1A, EU_WEST_1A). When a region is specified, it overrides the individual host settings for all endpoints (API, OTLP, and Flight), so all endpoints can be configured at once for a specific region. This option is mutually exclusive with single_host/single_port and base_domain.
Defaults:
- region - Region.UNSET (no region-based override)
| Region Identifier | Cloud Provider | API Host | OTLP Host | Flight Host |
|---|---|---|---|---|
| CA_CENTRAL_1A | GCP | api.ca-central-1a.arize.com | otlp.ca-central-1a.arize.com | flight.ca-central-1a.arize.com |
| EU_WEST_1A | GCP | api.eu-west-1a.arize.com | otlp.eu-west-1a.arize.com | flight.eu-west-1a.arize.com |
| US_CENTRAL_1A | GCP | api.us-central-1a.arize.com | otlp.us-central-1a.arize.com | flight.us-central-1a.arize.com |
| US_EAST_1B | AWS | api.us-east-1b.arize.com | otlp.us-east-1b.arize.com | flight.us-east-1b.arize.com |
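The hosts in the table follow a regular naming pattern: the region identifier is lowercased, underscores become hyphens, and the result is placed between the service prefix and arize.com. A minimal sketch of that pattern follows; hosts_for_region is a hypothetical helper, and the SDK may well store these hosts explicitly rather than derive them.

```python
def hosts_for_region(region_id: str) -> dict:
    """Derive the API, OTLP, and Flight hosts for a region identifier.

    Hypothetical helper that only reproduces the naming pattern visible
    in the table above; it is not an SDK function.
    """
    slug = region_id.lower().replace("_", "-")
    return {
        service: f"{service}.{slug}.arize.com"
        for service in ("api", "otlp", "flight")
    }


eu_hosts = hosts_for_region("EU_WEST_1A")
```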
Logging
Control the SDK’s internal logging behavior. Configure the logging level to adjust verbosity, enable structured JSON logs for machine parsing, or disable logging entirely. SDK logs provide visibility into operations like API calls, caching, and error conditions.
Defaults:
- ARIZE_LOG_ENABLE - true
- ARIZE_LOG_LEVEL - INFO
- ARIZE_LOG_STRUCTURED - false
Caching
The SDK caches large datasets locally to speed up experiment iteration. When enabled, datasets are stored in Parquet format in the cache directory, reducing download time for repeated access. The arize_directory parameter specifies where the SDK stores cache files, logs, and other persistent data. Cache files are stored in {arize_directory}/cache/.
Defaults:
- enable_caching - True
- arize_directory - ~/.arize
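To inspect what has been cached, the directory layout described above can be walked with pathlib. The sketch below is an assumption-laden illustration: cache_dir and cached_parquet_files are hypothetical helpers, the expansion of ~ is assumed, and the layout inside the cache directory is not documented here, so the search is recursive.

```python
from pathlib import Path


def cache_dir(arize_directory: str = "~/.arize") -> Path:
    """Return {arize_directory}/cache with a leading ~ expanded (assumed behavior)."""
    return Path(arize_directory).expanduser() / "cache"


def cached_parquet_files(arize_directory: str = "~/.arize") -> list:
    """List Parquet files under the cache directory, or [] if it does not exist."""
    directory = cache_dir(arize_directory)
    if not directory.is_dir():
        return []
    return sorted(directory.rglob("*.parquet"))
```

Calling cached_parquet_files() with no arguments looks under the documented default, ~/.arize/cache.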
Advanced Configuration
Configure advanced SDK settings for custom deployments, performance tuning, and specific networking requirements.
Endpoint Override Mutual Exclusivity: The SDK provides three mutually exclusive ways to override endpoint locations: region, single_host/single_port, and base_domain. Specifying more than one will raise a MultipleEndpointOverridesError. If none are specified, individual per-endpoint host/port settings are used.
Custom Endpoints
Override default endpoint locations for custom deployments, on-premise installations, or non-standard environments. The SDK uses three types of endpoints: API (REST operations), OTLP (OpenTelemetry tracing), and Flight (bulk data transfers via gRPC).
Defaults:
- api_host - api.arize.com
- api_scheme - https
- otlp_host - otlp.arize.com
- otlp_scheme - https
- flight_host - flight.arize.com
- flight_port - 443
- flight_scheme - grpc+tls
Single Endpoint Override
Use a single host and port for all SDK endpoints (API, OTLP, and Flight). This is a convenience option for environments where all services are behind a single load balancer or proxy. This option is mutually exclusive with region and base_domain.
Defaults:
- single_host - "" (not set)
- single_port - 0 (not set)
Private Connect Override
Use a base domain to automatically generate endpoint hosts for Private Connect setups. When specified, the SDK generates hosts as api.<base_domain>, otlp.<base_domain>, and flight.<base_domain>. This is the recommended approach for Private Connect deployments where all services share a common base domain.
This option is mutually exclusive with region and single_host/single_port.
Defaults:
- base_domain - "" (not set)
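The three override mechanisms and their mutual exclusivity can be summarized in a short sketch. Everything here is illustrative: resolve_hosts and Endpoints are hypothetical names, the MultipleEndpointOverridesError class is a local stand-in for the SDK error of the same name, ports and schemes are left out, and the region branch simply reuses the host naming pattern from the region table.

```python
from dataclasses import dataclass


class MultipleEndpointOverridesError(ValueError):
    """Local stand-in for the SDK error of the same name."""


@dataclass
class Endpoints:
    api_host: str
    otlp_host: str
    flight_host: str


def resolve_hosts(region="", single_host="", base_domain=""):
    """Pick endpoint hosts from at most one override (hypothetical helper)."""
    overrides = {
        "region": region,
        "single_host": single_host,
        "base_domain": base_domain,
    }
    chosen = [name for name, value in overrides.items() if value]
    if len(chosen) > 1:
        raise MultipleEndpointOverridesError(
            "only one endpoint override may be set, got: " + ", ".join(chosen)
        )
    if base_domain:
        # Private Connect: one host per service under a shared base domain.
        return Endpoints(
            f"api.{base_domain}", f"otlp.{base_domain}", f"flight.{base_domain}"
        )
    if single_host:
        # Single endpoint: every service goes through the same host.
        return Endpoints(single_host, single_host, single_host)
    if region:
        # Region: same naming pattern as the hosts in the region table.
        slug = region.lower().replace("_", "-")
        return Endpoints(
            f"api.{slug}.arize.com",
            f"otlp.{slug}.arize.com",
            f"flight.{slug}.arize.com",
        )
    # No override: fall back to the documented per-endpoint defaults.
    return Endpoints("api.arize.com", "otlp.arize.com", "flight.arize.com")
```

For example, resolve_hosts(base_domain="example.internal") yields api.example.internal, otlp.example.internal, and flight.example.internal, while passing both region and base_domain raises the error.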
TLS Verification
Control TLS certificate verification for HTTP requests. Disable verification only in trusted development environments with self-signed certificates or when behind corporate proxies with certificate inspection. Always keep verification enabled in production.
Defaults:
- request_verify - True
Payload Limits
Configure maximum payload sizes for HTTP requests and Arrow data processing. Increase these limits if working with very large datasets or reduce them to catch oversized requests earlier.
Defaults:
- max_http_payload_size_mb - 100
- pyarrow_max_chunksize - 10000
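To make the two limits concrete, here is a rough sketch of what a chunk-size cap and a payload-size check do. It uses plain lists and JSON rather than Arrow, and it assumes that pyarrow_max_chunksize counts rows per chunk; chunk_rows, payload_size_mb, and check_payload are hypothetical helpers, not SDK functions.

```python
import json

MAX_HTTP_PAYLOAD_MB = 100  # documented default for max_http_payload_size_mb
MAX_CHUNKSIZE = 10_000     # documented default for pyarrow_max_chunksize


def chunk_rows(rows, max_chunksize=MAX_CHUNKSIZE):
    """Yield consecutive slices of rows, each at most max_chunksize long."""
    for start in range(0, len(rows), max_chunksize):
        yield rows[start:start + max_chunksize]


def payload_size_mb(payload) -> float:
    """Size of the JSON-encoded payload in mebibytes."""
    return len(json.dumps(payload).encode("utf-8")) / (1024 * 1024)


def check_payload(payload, limit_mb=MAX_HTTP_PAYLOAD_MB) -> float:
    """Return the payload size, or raise ValueError if it exceeds the limit."""
    size = payload_size_mb(payload)
    if size > limit_mb:
        raise ValueError(f"payload is {size:.1f} MB, limit is {limit_mb} MB")
    return size


chunk_lengths = [len(chunk) for chunk in chunk_rows(list(range(25_000)))]
```

With the default chunk size, 25,000 rows split into chunks of 10,000, 10,000, and 5,000. Lowering limit_mb makes check_payload reject oversized requests earlier, which is the trade-off the paragraph above describes.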
Streaming
Configure concurrent processing for streaming operations like ML model logging. Adjust worker threads and queue size to optimize throughput for your workload.
Defaults:
- stream_max_workers - 8
- stream_max_queue_bound - 5000
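The interplay between a worker count and a queue bound can be illustrated with the standard library. This is a generic bounded producer/consumer sketch, not the SDK's streaming implementation: process_stream and handler are hypothetical, and only the default values 8 and 5000 come from the section above.

```python
import queue
import threading

_STOP = object()  # sentinel telling a worker to exit


def process_stream(records, handler, max_workers=8, max_queue_bound=5000):
    """Run handler over records with a fixed worker pool and a bounded queue.

    The producer blocks on put() once max_queue_bound items are waiting,
    which keeps memory use flat when workers are slower than the producer.
    """
    work = queue.Queue(maxsize=max_queue_bound)
    results, errors = [], []
    lock = threading.Lock()

    def worker():
        while True:
            item = work.get()
            if item is _STOP:
                return
            try:
                outcome = handler(item)
            except Exception as exc:  # keep the worker alive on handler errors
                with lock:
                    errors.append(exc)
            else:
                with lock:
                    results.append(outcome)

    threads = [threading.Thread(target=worker) for _ in range(max_workers)]
    for thread in threads:
        thread.start()
    for record in records:
        work.put(record)  # blocks while the queue is full
    for _ in threads:
        work.put(_STOP)   # one sentinel per worker
    for thread in threads:
        thread.join()
    return results, errors
```

More workers raise throughput when handler is I/O-bound; a smaller queue bound applies backpressure sooner. Results arrive in completion order, not input order.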
Transport Options
The SDK intelligently selects the best transport method based on payload size:
- HTTP/REST: Default for smaller payloads, compatible with all environments
- gRPC + Arrow Flight: Automatically used for large datasets, experiments, and bulk operations
  - 10-100x faster for large datasets
  - Efficient binary serialization
  - Minimal memory overhead
Response Objects
All SDK API responses are structured Pydantic models that provide type safety, validation, and IDE autocomplete support. Response objects offer convenient methods for data access, conversion, and exploration.
Response Types
The SDK returns two main types of responses:
- List Responses - Return collections with pagination metadata
Field Introspection
Explore available fields on any response object using model_fields.
Data Conversion
Convert response objects to different formats for further processing:
- Dictionary Format - Access as Python dict
Pagination
List responses include pagination metadata for fetching additional pages.
Pre-Release API Warnings
Pre-release APIs (ALPHA and BETA) are actively evolving based on user feedback. BETA endpoints are mostly stable, with rare breaking changes; ALPHA endpoints are experimental, and breaking changes are expected.
For detailed information about API version stages, stability guarantees, and recommendations, see API Version Stages in the REST API reference.