A comprehensive benchmarking tool for API performance testing with two modes:
- Comparison Mode: Compare latency between direct OpenAI API calls and Portkey proxy calls
- Load Test Mode: Dedicated load testing for Portkey endpoints
This tool helps you understand Portkey performance characteristics and optimize your API usage patterns.
This benchmark tool:
- Compares response times between OpenAI direct API and Portkey proxy (comparison mode)
- Performs dedicated load testing for Portkey endpoints (load test mode)
- Measures network latency overhead and processing times
- Tests performance under concurrent load with configurable parameters
- Provides detailed statistical analysis and reports
- Identifies OpenAI processing time vs. network latency breakdown
- Node.js (version 14 or higher)
- npm or yarn package manager
- Valid OpenAI API key
- Valid Portkey API key
- Install dependencies:

  ```bash
  npm install
  ```

- Configure your API keys: edit `config.json` and replace the placeholder values:

  ```json
  {
    "portkeyApiKey": "your-actual-portkey-api-key",
    "portkeyProviderSlug": "@your-provider-slug-from-model-catalog",
    "openaiApiKey": "sk-your-actual-openai-api-key"
  }
  ```

  The provider slug should be prefixed with `@`. If your Portkey API key has a default config attached that contains a virtual key, you can skip `portkeyProviderSlug`.

- Run the benchmark:

  ```bash
  npm start
  ```

  or

  ```bash
  node benchmark.js
  ```
The `config.json` file contains all benchmarking parameters:

- `mode`: Test mode to run (default: `"comparison"`)
  - `"comparison"`: Compare OpenAI vs Portkey latency (requires both API keys)
  - `"loadtest"`: Load test Portkey only (requires only the Portkey API key)
- `openaiApiKey`: Your OpenAI API key (required for comparison mode)
- `portkeyApiKey`: Your Portkey API key (required for both modes)
- `portkeyBaseURL`: Portkey API endpoint (default: `"https://api.portkey.ai/v1"`)
- `portkeyConfigId`: Optional Portkey configuration ID
- `model`: OpenAI model to test (default: `"gpt-3.5-turbo"`)
- `concurrency`: Number of concurrent requests (default: 5)
- `maxRequests`: Maximum number of requests to send (default: 100)
- `testDuration`: Maximum test duration in seconds (default: 60)
- `maxTokens`: Maximum tokens per response (default: 150)
- `temperature`: Model temperature setting (default: 0.7)
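For reference, a complete `config.json` might look like this (values are the documented defaults; the API keys are placeholders, and `prompt` uses the string format described below):

```json
{
  "mode": "comparison",
  "openaiApiKey": "sk-your-actual-openai-api-key",
  "portkeyApiKey": "your-actual-portkey-api-key",
  "portkeyBaseURL": "https://api.portkey.ai/v1",
  "model": "gpt-3.5-turbo",
  "concurrency": 5,
  "maxRequests": 100,
  "testDuration": 60,
  "maxTokens": 150,
  "temperature": 0.7,
  "prompt": "Your prompt text here"
}
```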
⚠️ Important for Accurate Results: We strongly recommend running at least 500 requests to get statistically meaningful results and avoid skewed measurements. Small sample sizes can lead to unreliable latency comparisons due to network variability and cold start effects.
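For example, to collect 500 samples, raise `maxRequests` and give the run enough headroom in `testDuration` (the 300 seconds here is only an illustration; tune it to your throughput):

```json
{
  "maxRequests": 500,
  "testDuration": 300
}
```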
The prompt can be either:

- String format:

  ```json
  "Your prompt text here"
  ```

- Message array format:

  ```json
  [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Your question here"}
  ]
  ```
Your existing `config.json` already has all the necessary fields. Just change the `mode` field to switch between test types:

```jsonc
{
  "mode": "loadtest",
  // ... rest of your existing config
  // Only portkeyApiKey is required for this mode
}
```

```jsonc
{
  "mode": "comparison",
  // ... rest of your existing config
  // Both portkeyApiKey and openaiApiKey are required
}
```

💡 Tip: You can also omit the `mode` field entirely to default to comparison mode.
Both modes are run with the same command:

```bash
node benchmark.js
```

Simply edit your `config.json` file and change the `mode` field:

**Load Test Mode**

- Edit `config.json` and set `"mode": "loadtest"`
- Ensure `portkeyApiKey` is configured
- Run: `node benchmark.js`

**Comparison Mode**

- Edit `config.json` and set `"mode": "comparison"` (or omit `mode` for the default)
- Ensure both `portkeyApiKey` and `openaiApiKey` are configured
- Run: `node benchmark.js`
You can also point the script at a custom config file:

```bash
node benchmark.js custom-config.json
```

The benchmark provides real-time progress updates:
- Preflight test results
- Worker progress with request timings
- Success/failure rates for each provider
- Periodic progress summaries
At the end of a comparison-mode run, a summary report like this is printed:

```
📊 PORTKEY LATENCY BENCHMARK REPORT
==================================================

📋 TEST SUMMARY:
   Total Request Pairs: 500
   Test Duration: 2023-12-01T10:30:00.000Z

📈 PERFORMANCE COMPARISON:
┌──────────────────────┬──────────────┬──────────────┬─────────────┐
│ Metric               │ OpenAI       │ Portkey      │ Difference  │
├──────────────────────┼──────────────┼──────────────┼─────────────┤
│ Avg Total Time       │ 1247.20ms    │ 1267.30ms    │ +20.10ms    │
│ Avg Network Latency  │ 182.50ms     │ 202.40ms     │ +19.90ms    │
│ Success Rate         │ 100%         │ 100%         │ -           │
│ Median Time          │ 1198.00ms    │ 1219.00ms    │ +21.00ms    │
│ P95 Time             │ 1785.00ms    │ 1809.00ms    │ +24.00ms    │
│ P99 Time             │ 2087.00ms    │ 2115.00ms    │ +28.00ms    │
└──────────────────────┴──────────────┴──────────────┴─────────────┘

🎯 KEY INSIGHTS:
• Portkey adds an average of 20.10ms latency (1.6% increase)
• Network latency difference: 19.90ms
• Success rate difference: 0.0%
```
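The percentage in the first insight is computed against the OpenAI baseline: 20.10 ms / 1247.20 ms ≈ 1.6%.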
Detailed JSON reports are saved to the results/ directory with:
- Complete configuration used
- Raw timing data for all requests
- Statistical summaries
- Success/failure details
- Total Time: End-to-end request time including network and processing
- OpenAI Processing Time: Time spent by OpenAI servers processing the request
- Network Latency: Time spent in network transmission (Total - Processing)
- Success Rate: Percentage of requests that completed successfully
- P95/P99: 95th and 99th percentile response times (worst-case scenarios)
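As a minimal sketch of how this breakdown can be computed (assuming the provider returns the `openai-processing-ms` response header, as OpenAI's API does; this is not necessarily the exact code `benchmark.js` uses):

```js
// Minimal sketch: split one request's end-to-end time into provider
// processing vs. network latency. Assumes Node 18+ (global fetch) and the
// `openai-processing-ms` header; benchmark.js may differ in detail.
async function timeRequest() {
  const start = Date.now();
  const response = await fetch("https://api.openai.com/v1/chat/completions", {
    method: "POST",
    headers: {
      Authorization: `Bearer ${process.env.OPENAI_API_KEY}`,
      "Content-Type": "application/json",
    },
    body: JSON.stringify({
      model: "gpt-3.5-turbo",
      messages: [{ role: "user", content: "Say hello." }],
      max_tokens: 5,
    }),
  });
  await response.json(); // wait for the full body so timing is end-to-end

  const totalTime = Date.now() - start; // Total Time
  const header = response.headers.get("openai-processing-ms");
  const processing = header !== null ? Number(header) : null; // OpenAI Processing Time
  const networkLatency =
    processing !== null ? totalTime - processing : null; // Network Latency = Total - Processing
  return { totalTime, processing, networkLatency };
}

timeRequest().then(console.log).catch(console.error);
```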
"Both OpenAI and Portkey preflight tests failed"
- Check your API keys are valid and properly formatted
- Verify network connectivity
- Ensure API quotas are not exceeded
"Processing time extraction failed"
- Some OpenAI responses may not include processing time headers
- The benchmark will continue but network latency calculations will be limited
"Config file not found"
- Ensure `config.json` exists in the same directory
- Use the correct path if specifying a custom config file
High failure rates
- Check API rate limits and quotas
- Reduce concurrency if hitting rate limits
- Verify the model specified is available
For detailed header information and debugging:
- Check console output during preflight testing
- Review the raw results in the generated JSON reports
- Headers and processing times are logged during preflight tests
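For example, a quick way to skim a saved report from Node (the file name and top-level structure here are illustrative; open an actual file under `results/` to see its exact shape):

```js
// Load a generated report and list its top-level sections.
// The path below is illustrative; use a real file from results/.
const fs = require("fs");

const report = JSON.parse(
  fs.readFileSync("results/benchmark-report.json", "utf8")
);
console.log(Object.keys(report));
```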
```
benchmark/
├── benchmark.js       # Main benchmark script
├── config.json        # Configuration file (supports all modes)
├── package.json       # Node.js dependencies
├── package-lock.json  # Dependency lock file
├── results/           # Generated benchmark reports
└── README.md          # This file
```
- Fork the repository
- Create a feature branch
- Make your changes
- Test thoroughly with different configurations
- Submit a pull request
MIT License - see package.json for details.
- API Costs: This tool makes real API calls that will consume your OpenAI/Portkey credits
- Rate Limits: Be mindful of API rate limits when setting high concurrency
- Fair Testing: Use identical prompts and settings for accurate comparisons
- Processing Time: Accuracy depends on API providers returning processing time headers