How to Monitor OpenAI API Downtime with Portkey AI
Table of Contents
- Introduction & Architecture
- Prerequisites & Setup
- Core Implementation: Step-by-Step
- Configuration & Production Optimization
- Advanced Tips & Edge Cases (Deep Dive)
- Results & Next Steps
Introduction & Architecture
In recent years, large language models (LLMs) have become a cornerstone of artificial intelligence research and commercial applications. Organizations like OpenAI, which developed the GPT family of LLMs, have seen significant growth in their user base and model usage. However, as reliance on these services grows, so does the need for reliable monitoring tools to ensure service availability and performance.
Portkey AI's Downtime Monitor is a free tool designed specifically for tracking API uptime and latencies for various OpenAI models and other LLM providers. This tutorial will guide you through setting up and using Portkey AI's Downtime Monitor to monitor the reliability of OpenAI APIs, which are crucial for applications relying on real-time language processing capabilities.
The architecture behind this monitoring solution involves periodically querying the OpenAI API endpoints and recording response times and availability status. By leveraging Python scripts and libraries like requests, we can efficiently gather data and visualize it using tools such as Grafana or custom dashboards. This setup ensures that any disruptions in service are quickly identified, allowing for timely interventions to maintain application stability.
As of April 10, 2026, OpenAI's models have seen significant adoption, including millions of downloads of its open-weight releases on platforms like HuggingFace (5,801,451 downloads for gpt-oss-20b and 3,572,271 for gpt-oss-120b). This widespread use underscores the importance of robust monitoring solutions to ensure these models remain accessible and performant.
Prerequisites & Setup
To set up Portkey AI's Downtime Monitor, you will need a Python environment with specific dependencies. The following steps outline how to install necessary packages and configure your development environment:
- Python Environment: Ensure you have Python 3.8 or higher installed on your system.
- Dependencies:
- requests: For making HTTP requests to the OpenAI API.
- pandas: To handle data in a tabular format for analysis.
- matplotlib and seaborn: For creating visualizations of downtime statistics.
The choice of these dependencies is driven by their widespread use, reliability, and ease of integration with Python scripts. Additionally, they provide robust support for handling large datasets and generating insightful visual representations.
# Complete installation commands
pip install requests pandas matplotlib seaborn
Core Implementation: Step-by-Step
Step 1: Initialize the Monitoring Script
First, create a Python script to initialize the monitoring process. This involves setting up basic configurations such as API keys, endpoint URLs, and logging mechanisms.
import os
from datetime import datetime
import requests
import pandas as pd

# Configuration variables
OPENAI_API_KEY = 'your_openai_api_key'
ENDPOINT_URL = 'https://api.openai.com/v1/chat/completions'

def initialize_monitoring():
    """
    Initialize the monitoring process by setting up configurations and logging.
    """
    # Ensure the API key is set
    if not OPENAI_API_KEY:
        raise ValueError("OpenAI API Key must be provided.")
    # Log initialization time
    print(f"Monitoring initialized at {datetime.now()}")
Step 2: Query the OpenAI API
Next, implement a function to query the OpenAI API and record response times. This involves making HTTP requests using requests and measuring the duration of each request.
def query_openai_api():
    """
    Query the OpenAI API endpoint and measure response time.

    Returns:
        dict: Response data from the API call.
    """
    headers = {
        'Authorization': f'Bearer {OPENAI_API_KEY}',
        'Content-Type': 'application/json'
    }
    payload = {
        'model': 'gpt-3.5-turbo',
        'messages': [{'role': 'user', 'content': 'Hello'}],
        'max_tokens': 5
    }
    start_time = datetime.now()
    try:
        response = requests.post(ENDPOINT_URL, json=payload, headers=headers, timeout=30)
        response.raise_for_status()  # Raise an error for bad status codes
        end_time = datetime.now()
        return {
            'status_code': response.status_code,
            'response_time_ms': (end_time - start_time).total_seconds() * 1000,
            'data': response.json()
        }
    except requests.RequestException as e:
        print(f"Request failed with error: {e}")
        return {'error': str(e)}
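The function above times the request with datetime.now(), which is adequate for coarse latency but can jump if the system clock changes. A monotonic clock avoids that; a minimal sketch of the same measurement pattern with time.perf_counter (timing a trivial local function here rather than a live API call, so the helper name is illustrative):

```python
import time

def timed_call(fn, *args, **kwargs):
    """Run fn and return (result, elapsed milliseconds) using a monotonic clock."""
    start = time.perf_counter()
    result = fn(*args, **kwargs)
    elapsed_ms = (time.perf_counter() - start) * 1000
    return result, elapsed_ms

# Example: time a local stand-in instead of a live HTTP request
result, ms = timed_call(sum, range(1000))
print(result)  # 499500
```

The same wrapper could be dropped around the requests.post call if wall-clock jumps become a concern in long-running monitors.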
Step 3: Log and Analyze Response Data
After querying the API, log the results to a CSV file for later analysis. This involves appending each response's details to a DataFrame and saving it as a CSV.
def log_response_data(response):
    """
    Log the response data from the OpenAI API call.

    Args:
        response (dict): Response data dictionary.
    """
    log_exists = os.path.exists('api_downtime_log.csv')
    df = pd.DataFrame([response])
    # Write the header only on the first append so the CSV stays parseable
    df.to_csv('api_downtime_log.csv', mode='a', header=not log_exists, index=False)
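Once the log accumulates rows, the matplotlib dependency from the setup step can chart latency over time. A minimal sketch using synthetic data in place of the CSV (the column names match the log fields above; the figure file name is illustrative):

```python
import matplotlib
matplotlib.use('Agg')  # render without a display, e.g. on a server
import matplotlib.pyplot as plt
import pandas as pd

# Synthetic stand-in for rows read from api_downtime_log.csv
df = pd.DataFrame({
    'status_code': [200, 200, 500, 200],
    'response_time_ms': [210.5, 198.2, 30000.0, 225.9],
})

plt.figure(figsize=(8, 4))
plt.plot(df.index, df['response_time_ms'], marker='o')
plt.xlabel('Check #')
plt.ylabel('Response time (ms)')
plt.title('OpenAI API response times')
plt.savefig('latency.png')
```

In production you would read the real CSV with pd.read_csv and likely plot against a timestamp column instead of the row index.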
Step 4: Schedule Periodic Monitoring
To ensure continuous monitoring, use a scheduling library like schedule to run the monitoring script at regular intervals.
import schedule
import time

def job():
    """
    Job function that runs periodically to monitor API availability.
    """
    response = query_openai_api()
    log_response_data(response)

# Schedule the job every 5 minutes
schedule.every(5).minutes.do(job)

# Keep the scheduler running
while True:
    schedule.run_pending()
    time.sleep(1)
Configuration & Production Optimization
To take this monitoring script from a local development environment to production, several configurations and optimizations are necessary:
- Environment Variables: Store sensitive information like API keys in environment variables rather than hardcoding them.
- Batch Processing: Optimize the script for batch processing by querying multiple endpoints simultaneously or using asynchronous requests with libraries like aiohttp.
- Error Handling & Retries: Implement robust error handling and retry mechanisms to handle transient failures gracefully.
import os
# Load API key from environment variable
OPENAI_API_KEY = os.getenv('OPENAI_API_KEY', 'your_openai_api_key')
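For the retries bullet above, a common pattern is exponential backoff around the failing call. A generic sketch (the helper name and parameters are illustrative assumptions, demonstrated on a flaky local function rather than a live endpoint):

```python
import time

def with_retries(fn, max_attempts=3, base_delay=0.01):
    """Call fn, retrying with exponential backoff on any exception."""
    for attempt in range(max_attempts):
        try:
            return fn()
        except Exception:
            if attempt == max_attempts - 1:
                raise  # out of attempts: surface the last error
            time.sleep(base_delay * (2 ** attempt))

# Flaky stand-in: fails twice, then succeeds on the third call
calls = {'n': 0}
def flaky():
    calls['n'] += 1
    if calls['n'] < 3:
        raise RuntimeError('transient failure')
    return 'ok'

print(with_retries(flaky))  # ok
```

Wrapping query_openai_api in such a helper would smooth over transient network blips without masking persistent outages, which still raise after the final attempt.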
Advanced Tips & Edge Cases (Deep Dive)
Error Handling
Implement comprehensive error handling to manage various failure scenarios, such as network timeouts or rate limit exceedances.
def query_openai_api():
    try:
        # Request logic from Step 2
        response = requests.post(ENDPOINT_URL, json=payload, headers=headers, timeout=30)
        response.raise_for_status()
    except requests.Timeout:
        print("Request timed out.")
    except requests.TooManyRedirects:
        print("Too many redirects occurred.")
    except requests.RequestException as e:
        print(f"Request failed: {e}")
Security Risks
Be cautious of security risks such as prompt injection attacks, where malicious inputs could be used to manipulate the API's behavior. Validate and sanitize all input data before sending it to the OpenAI API.
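One simple mitigation is to validate and bound user-supplied prompt text before it reaches the API. A minimal sketch (the rules here, stripping non-printable characters and capping length, are illustrative assumptions, not a complete defense against prompt injection):

```python
def sanitize_prompt(text, max_len=500):
    """Strip non-printable characters and cap prompt length before sending."""
    cleaned = ''.join(ch for ch in text if ch.isprintable() or ch in '\n\t')
    return cleaned[:max_len]

print(sanitize_prompt('Hello\x00 world'))  # Hello world
```

Real deployments would layer this with allow-lists, structured message roles, and output filtering, since length caps alone do not stop adversarial instructions.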
Results & Next Steps
By following this tutorial, you have set up a robust monitoring solution for tracking the availability and performance of OpenAI APIs. This setup provides real-time insights into potential disruptions, ensuring that your applications relying on these services remain stable and responsive.
For further enhancements:
- Integrate with alerting systems like Slack or PagerDuty to notify teams immediately when downtime is detected.
- Expand monitoring to cover additional LLM providers beyond OpenAI for a comprehensive overview of service reliability in the AI industry.
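As a first analysis step, the logged status codes can be summarized into an uptime percentage with pandas. A sketch over synthetic log rows (the column name matches the log fields defined earlier):

```python
import pandas as pd

# Synthetic stand-in for rows read from api_downtime_log.csv
df = pd.DataFrame({'status_code': [200, 200, 200, 500, 200, 429, 200, 200]})

# Fraction of checks that returned HTTP 200, expressed as a percentage
uptime_pct = (df['status_code'] == 200).mean() * 100
print(f"Uptime: {uptime_pct:.1f}%")  # Uptime: 75.0%
```

The same boolean-mask approach extends naturally to per-hour grouping or per-provider breakdowns once more columns are logged.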