gpt-3.5-turbo-16k
Common Name: GPT-3.5 Turbo 16k
GPT-3.5 Turbo variant with an extended 16K-token context window for longer conversations and documents.
Specifications
Context: 16.4K
Maximum Output: 4.1K
Input: text
Output: text
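As a rough illustration of how these two limits interact, the sketch below computes how many completion tokens remain for a given prompt size. It assumes the context window is shared between prompt and completion, which this page does not state, and it uses the rounded figures shown above (16.4K and 4.1K) rather than exact limits. The helper name `max_completion_tokens` is hypothetical.

```python
# Rounded values taken from the Specifications list; the exact limits
# may differ slightly, so treat these constants as approximations.
CONTEXT_WINDOW = 16_400  # "16.4K" context
MAX_OUTPUT = 4_100       # "4.1K" maximum output


def max_completion_tokens(prompt_tokens: int) -> int:
    """Return the completion budget left after the prompt (hypothetical helper).

    Assumes prompt and completion share one context window.
    """
    remaining = CONTEXT_WINDOW - prompt_tokens
    # The budget is capped by the output limit and can never be negative.
    return max(0, min(MAX_OUTPUT, remaining))


if __name__ == "__main__":
    for prompt in (1_000, 15_000, 20_000):
        print(prompt, max_completion_tokens(prompt))
```

Under these assumptions, short prompts are limited by the output cap, while prompts near the context size leave little or no room for a reply.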
Pricing
Input: $3.30 / MTokens
Output: $4.40 / MTokens
Batch Input: $1.65 / MTokens
Batch Output: $2.20 / MTokens
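The per-token rates above translate into a per-request cost with simple arithmetic. The following is a minimal sketch, assuming "MTokens" means one million tokens and that input and output tokens are billed independently at the listed rates. The names `PRICES_PER_MTOKEN` and `estimate_cost` are hypothetical and not part of any provider API.

```python
# USD per million tokens, copied from the Pricing list.
PRICES_PER_MTOKEN = {
    "standard": {"input": 3.30, "output": 4.40},
    "batch": {"input": 1.65, "output": 2.20},
}


def estimate_cost(input_tokens: int, output_tokens: int, tier: str = "standard") -> float:
    """Estimate the USD cost of one request (hypothetical helper)."""
    rates = PRICES_PER_MTOKEN[tier]
    total = input_tokens * rates["input"] + output_tokens * rates["output"]
    # Rates are quoted per million tokens, so scale back down.
    return total / 1_000_000


if __name__ == "__main__":
    # Example: a 12,000-token prompt with a 1,000-token reply.
    print(f"standard: ${estimate_cost(12_000, 1_000):.4f}")
    print(f"batch:    ${estimate_cost(12_000, 1_000, tier='batch'):.4f}")
```

With the listed rates, the batch tier costs half as much as the standard tier for the same token counts.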
Similar Models
Pricing: $4.40 input / $17.60 output per MTokens
Context: 32K, Maximum Output: 4K
This is our first general-availability realtime model, capable of responding to audio and text inputs in realtime over WebRTC, WebSocket, or SIP connections.
Pricing: $1.65 input / $2.20 output per MTokens
Context: 16K, Maximum Output: 4K
GPT-3.5 model optimized for single-turn instruction following via the completion API endpoint.
Pricing: $1.65 input / $2.20 output per MTokens
Context: 16K, Maximum Output: 4K
September 2023 snapshot of GPT-3.5 Turbo Instruct for legacy completion API use cases.