gpt-3.5-turbo-16k

Common Name: GPT-3.5 Turbo 16k

OpenAI

Released on Nov 6, 2023 12:00 AMSupportedTool Invocation

Compare Try in Chat

GPT-3.5 Turbo variant with extended 16K token context window for longer conversations and documents.

Specifications

Context

16.4K

Maximum Output

4.1K

Inputtext

Outputtext

Performance (7-day Average)

Collecting…

Pricing

Input$3.30/MTokens

Output$4.40/MTokens

Batch Input$1.65/MTokens

Batch Output$2.20/MTokens

Availability Trend (24h)

Performance Metrics (24h)

Similar Models

GPT Realtime

$4.40/$17.60/M

ctx32Kmax4Kavail—tps—

InOut

This is our first general-availability realtime model, capable of responding to audio and text inputs in realtime over WebRTC, WebSocket, or SIP connections.

GPT Realtime

$4.40/$17.60/M

ctx32Kmax4Kavail—tps—

InOut

This is our first general-availability realtime model, capable of responding to audio and text inputs in realtime over WebRTC, WebSocket, or SIP connections.

GPT-3.5 Turbo Instruct

$1.65/$2.20/M

ctx16Kmax4Kavail—tps—

InOut

GPT-3.5 model optimized for single-turn instruction following via completion API endpoint.

GPT-3.5 Turbo Instruct (0914)

$1.65/$2.20/M

ctx16Kmax4Kavail—tps—

InOutCap

September 2023 snapshot of GPT-3.5 Turbo Instruct for legacy completion API use cases.