Overview

The Experiments API lets you design, run, and analyze experiments that test different prompts, models, or configurations, enabling data-driven decision-making and systematic improvement of your AI applications.

Key Features

  • Experiment Design: Create structured experiments with multiple variants
  • A/B Testing: Compare different prompts, models, or configurations
  • Statistical Analysis: Evaluate whether differences between variants are statistically significant
  • Performance Tracking: Monitor key metrics and outcomes
  • Result Analysis: Review detailed insights and recommendations

Quick Start

from keywordsai import KeywordsAI

client = KeywordsAI(api_key="your-api-key")

# Create a new experiment
experiment = client.experiments.create(
    name="Greeting Prompt A/B Test",
    description="Testing formal vs casual greeting styles",
    variants=[
        {"name": "formal", "prompt_id": "prompt_123"},
        {"name": "casual", "prompt_id": "prompt_456"}
    ]
)

# Start the experiment
client.experiments.start(experiment_id=experiment['id'])

# Get experiment results
results = client.experiments.get_results(experiment_id=experiment['id'])

Available Methods

Synchronous Methods

  • create() - Create a new experiment
  • list() - List experiments with filtering
  • get() - Retrieve a specific experiment
  • update() - Update experiment configuration
  • delete() - Delete an experiment
  • start() - Start running an experiment
  • stop() - Stop a running experiment
  • get_results() - Get experiment results and analysis
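
A minimal sketch combining several of these methods. The filter and update parameters shown (status, description) are illustrative assumptions, not confirmed signatures; see the individual method pages for exact parameters.

from keywordsai import KeywordsAI

client = KeywordsAI(api_key="your-api-key")

# List running experiments (status filter is an assumed parameter)
running = client.experiments.list(status="running")

# Retrieve one experiment and update its configuration
experiment = client.experiments.get(experiment_id="exp_123")
client.experiments.update(
    experiment_id="exp_123",
    description="Updated description"  # assumed updatable field
)

# Stop it if it is currently running
client.experiments.stop(experiment_id="exp_123")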

Asynchronous Methods

All methods are also available in asynchronous versions using AsyncKeywordsAI.
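
A minimal async sketch, assuming AsyncKeywordsAI mirrors the synchronous client's method names and that its methods are awaitable:

import asyncio
from keywordsai import AsyncKeywordsAI

async def main():
    client = AsyncKeywordsAI(api_key="your-api-key")

    # Same method names as the synchronous client, awaited
    experiment = await client.experiments.create(
        name="Greeting Prompt A/B Test",
        variants=[
            {"name": "formal", "prompt_id": "prompt_123"},
            {"name": "casual", "prompt_id": "prompt_456"}
        ]
    )
    await client.experiments.start(experiment_id=experiment['id'])

asyncio.run(main())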

Experiment Structure

An experiment typically contains:
  • id: Unique identifier
  • name: Human-readable name
  • description: Experiment description
  • variants: List of experiment variants to test
  • metrics: Key metrics to track
  • status: Current status (draft, running, completed, stopped)
  • traffic_split: How traffic is distributed between variants
  • start_date: When the experiment started
  • end_date: When the experiment ended
  • results: Statistical results and analysis
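
Put together, a fetched experiment might look like the dict below. The concrete values, metric names, and traffic_split format are illustrative assumptions based on the field list above, not a documented schema:

experiment = {
    "id": "exp_123",
    "name": "Greeting Prompt A/B Test",
    "description": "Testing formal vs casual greeting styles",
    "variants": [
        {"name": "formal", "prompt_id": "prompt_123"},
        {"name": "casual", "prompt_id": "prompt_456"}
    ],
    "metrics": ["response_quality", "latency"],       # assumed metric names
    "status": "running",                              # draft | running | completed | stopped
    "traffic_split": {"formal": 0.5, "casual": 0.5},  # assumed format
    "start_date": "2024-01-01T00:00:00Z",
    "end_date": None,                                 # unset until the experiment ends
    "results": None                                   # populated once analysis is available
}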

Experiment Lifecycle

  1. Design: Create experiment with variants and metrics
  2. Configure: Set traffic split and success criteria
  3. Start: Begin collecting data
  4. Monitor: Track progress and early results
  5. Analyze: Review statistical significance
  6. Conclude: Stop experiment and implement winner
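
The whole lifecycle maps onto the methods listed earlier. In this sketch, passing traffic_split to create() is an assumption (configuration may happen elsewhere), and the shape of the results object is not documented here:

# `client` as created in the Quick Start

# 1-2. Design and configure
experiment = client.experiments.create(
    name="Greeting Prompt A/B Test",
    variants=[
        {"name": "formal", "prompt_id": "prompt_123"},
        {"name": "casual", "prompt_id": "prompt_456"}
    ],
    traffic_split={"formal": 0.5, "casual": 0.5}  # assumed create() parameter
)

# 3. Start collecting data
client.experiments.start(experiment_id=experiment['id'])

# 4-5. Monitor progress and review statistical significance;
# inspect the returned object before relying on specific fields
results = client.experiments.get_results(experiment_id=experiment['id'])
print(results)

# 6. Conclude: stop the experiment and roll out the winning variant
client.experiments.stop(experiment_id=experiment['id'])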

Common Use Cases

  • Prompt Optimization: Test different prompt variations
  • Model Comparison: Compare different AI models (see the sketch after this list)
  • Feature Testing: Test new features or configurations
  • Performance Optimization: Optimize for specific metrics
  • User Experience: Test different interaction patterns
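
As one illustration of model comparison, variants could point at the same prompt but different models. The model key below is an assumption; the examples above only show prompt_id, so check the variant schema before relying on it:

# `client` as created in the Quick Start
experiment = client.experiments.create(
    name="Model Comparison",
    description="Same prompt served by two different models",
    variants=[
        {"name": "model_a", "prompt_id": "prompt_123", "model": "gpt-4o"},  # "model" is an assumed field
        {"name": "model_b", "prompt_id": "prompt_123", "model": "claude-3-5-sonnet"}
    ]
)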

Best Practices

  • Define clear success metrics before starting
  • Ensure sufficient sample size for statistical significance (see the sizing sketch after this list)
  • Run experiments for an appropriate duration
  • Avoid running multiple simultaneous experiments on the same traffic
  • Document experiment hypotheses and learnings
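
The sample-size practice can be checked before launch. This standalone sketch (plain Python, not part of the SDK) estimates the per-variant sample size needed to detect a lift between two conversion rates with a two-sided two-proportion z-test:

import math
from statistics import NormalDist

def sample_size_per_variant(p_control, p_variant, alpha=0.05, power=0.8):
    """Approximate per-variant n for a two-sided two-proportion z-test."""
    z_alpha = NormalDist().inv_cdf(1 - alpha / 2)  # e.g. 1.96 for alpha = 0.05
    z_beta = NormalDist().inv_cdf(power)           # e.g. 0.84 for power = 0.8
    p_bar = (p_control + p_variant) / 2
    numerator = (z_alpha * math.sqrt(2 * p_bar * (1 - p_bar))
                 + z_beta * math.sqrt(p_control * (1 - p_control)
                                      + p_variant * (1 - p_variant))) ** 2
    return math.ceil(numerator / (p_control - p_variant) ** 2)

# Detecting a lift from a 10% to a 12% success rate needs
# roughly 3,800+ samples per variant
print(sample_size_per_variant(0.10, 0.12))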

Error Handling

API calls can fail (for example, on validation, authentication, or network errors), so wrap them in try/except:

try:
    experiment = client.experiments.create(
        name="Test Experiment",
        variants=[{"name": "control", "prompt_id": "prompt_123"}]
    )
except Exception as e:
    # Catching Exception broadly here; narrow this to the SDK's
    # specific exception types if they are documented
    print(f"Error creating experiment: {e}")

Next Steps