Flex Mode Half-Price Billing Feature Launched

OhMyGPT platform now offers Flex mode half-price billing feature, providing you with a more flexible cost optimization solution.

Feature Introduction

Flex mode is a flexible billing mode that offers lower usage costs by trading off some response priority. When calling the Chat.Completions API, simply set the service_tier parameter in the request body to flex to enjoy 50% off the regular price.

Supported Models

Models currently supporting Flex mode include:

GPT-5 Series

  • gpt-5
  • gpt-5-2025-08-07
  • gpt-5-mini
  • gpt-5-mini-2025-08-07
  • gpt-5-nano
  • gpt-5-nano-2025-08-07

O Series Models

  • o3
  • o3-2025-04-16
  • o4-mini
  • o4-mini-2025-04-16

How to Use

When calling the Chat.Completions API, add the service_tier parameter to the request body:

Important Notes

  • In Flex mode, requests may experience slight response delays during peak hours
  • Billing is charged at 50% of actual usage
  • Suitable for batch processing tasks with lower real-time requirements

Technical Support

If you have any questions, please contact us through:

  • Email: [email protected]
  • Online Feedback: Feedback button in the top right corner of the platform
OhMyGPT