Flex Mode Half-Price Billing Feature Launched
OhMyGPT platform now offers Flex mode half-price billing feature, providing you with a more flexible cost optimization solution.
Feature Introduction
Flex mode is a flexible billing mode that offers lower usage costs by trading off some response priority. When calling the Chat.Completions API, simply set the service_tier parameter in the request body to flex to enjoy 50% off the regular price.
Supported Models
Models currently supporting Flex mode include:
GPT-5 Series
- gpt-5
- gpt-5-2025-08-07
- gpt-5-mini
- gpt-5-mini-2025-08-07
- gpt-5-nano
- gpt-5-nano-2025-08-07
O Series Models
- o3
- o3-2025-04-16
- o4-mini
- o4-mini-2025-04-16
How to Use
When calling the Chat.Completions API, add the service_tier parameter to the request body:
Important Notes
- In Flex mode, requests may experience slight response delays during peak hours
- Billing is charged at 50% of actual usage
- Suitable for batch processing tasks with lower real-time requirements
Technical Support
If you have any questions, please contact us through:
- Email: [email protected]
- Online Feedback: Feedback button in the top right corner of the platform