All of a sudden o3 in the “Legacy models” section is thinking for a VERY long time for me for every prompt?
By Holidays in Europe / October 22, 2025
Sudden Increase in Processing Time for o3 in the “Legacy Models” Section: An Unusual Behavior
Recently, users have observed unexpected behavior from the o3 model, which is listed under the “Legacy Models” section. They report that o3 takes an unusually long time (over 10 minutes) to respond to each prompt, a significant deviation from its typical reasoning time.
Background Context
Under normal circumstances, o3 responds within a reasonable timeframe, especially when the chat context remains below 10,000 tokens. After GPT-5 launched, o3 often reasoned faster, or at least maintained consistent response times. Recent experience suggests a reversal: o3 now reasons much more slowly than before, even on prompts it previously handled efficiently.
What Has Changed?
- Model Usage Post-GPT-5 Launch: After GPT-5’s release, many users shifted primarily to newer models for better performance. Consequently, legacy models such as o3 are used less frequently.
- Unexpected Behavior: Instead of maintaining its usual responsiveness, o3 now exhibits prolonged reasoning times across multiple prompts, some exceeding 10 minutes per task.
- Scope of Prompts: Users have tested varied prompts, but the delay persists regardless of prompt complexity, which suggests a systemic issue rather than one caused by the input.
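If you want numbers to back up the "delay persists regardless of prompt complexity" observation, a small timing log can help. The sketch below is only an illustration: `send_prompt` is a hypothetical callable standing in for however you reach the model (it is not an OpenAI function), and the 600-second threshold simply mirrors the "over 10 minutes" figure reported above.

```python
import statistics
import time
from typing import Callable, Dict, List


def time_prompts(send_prompt: Callable[[str], str], prompts: List[str]) -> Dict[str, float]:
    """Return the wall-clock seconds each prompt took to answer.

    `send_prompt` is a hypothetical placeholder: pass in your own function
    that submits a prompt and blocks until the reply arrives.
    """
    timings: Dict[str, float] = {}
    for prompt in prompts:
        start = time.perf_counter()
        send_prompt(prompt)
        timings[prompt] = time.perf_counter() - start
    return timings


def summarize(timings: Dict[str, float], slow_threshold_s: float = 600.0) -> dict:
    """Summarize the timings; the 600 s default is an assumed '10 minute' cutoff."""
    durations = list(timings.values())
    return {
        "count": len(durations),
        "median_s": statistics.median(durations),
        "max_s": max(durations),
        "slow_count": sum(1 for d in durations if d > slow_threshold_s),
    }
```

Running `summarize(time_prompts(my_client, ["short question", "long analysis task"]))` with your own `my_client` gives a median and a count of slow replies. If simple and complex prompts are equally slow, that supports the systemic explanation, and the figures are useful detail to include in a bug report.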
Potential Causes
While specific causes are yet to be confirmed, some hypotheses include:
- Backend Infrastructure Changes: Updates or maintenance in OpenAI’s backend could impact legacy model performance.
- Model Processing Prioritization: Shifts in resource allocation might deprioritize older models, leading to longer response times.
- Rate Limiting or Quota Management: Increased demand or quota restrictions could cause delays.
- Technical Bugs: Glitches or bugs introduced during platform updates can sometimes lead to performance degradation.
Implications for Users
This slowdown significantly impacts productivity and user experience, especially for those relying on o3 for quick, reliable outputs. Users should monitor response times and adjust their workflows accordingly, for example by switching to another model for time-sensitive tasks.
Next Steps
If you are experiencing similar issues:
- Check for Official Announcements: Review OpenAI’s status page or community forums for any ongoing incidents or updates.
- Report the Issue: Sharing detailed feedback with OpenAI can help diagnose and resolve