I used Kimi Code all day yesterday. I woke up and started again after 10 hours, and after 4 messages I got an error: LLM provider error: Error code: 429 - {'error': {'message': "We're receiving too many requests at the moment. Please wait a moment and try again.", 'type': 'rate_limit_reached_error'}}
My weekly limit is at 7%. My daily limit is at 3%.
I just can't understand why I am hitting a limit today and not yesterday.
From my understanding, the 429 error you're seeing, with your usage at only 7% weekly and 3% daily, is likely due to hitting a concurrency limit rather than exhausting your token budget.
With the recent shift from request-based to token-based billing in Kimi Code’s Membership Benefits, it appears the system now enforces concurrent request throttling that operates independently of your daily/weekly usage percentages shown in the UI.
My guess is that yesterday your usage pattern was more staggered, whereas today’s first few messages might have been sent in rapid succession, potentially triggering the concurrent request threshold even though your actual token budget is still healthy.
I think the confusion comes from the generic error message ("We're receiving too many requests...") not clearly distinguishing between:
Quota exhaustion (running out of tokens)
Concurrency limiting (too many simultaneous or rapid-fire requests)
The Kimi Code team is working on updating the "Kimi Code Membership Benefits" page in the Kimi Code Docs to clarify these distinct limit types. For now, if you encounter this again, try adding brief pauses between requests. That may keep you from tripping the concurrency guardrails while your token budget is still well within limits.
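If you call the API from your own script rather than through the Kimi Code client, the "brief pauses" idea can be sketched as a retry with exponential backoff. This is only a minimal sketch under stated assumptions: `RateLimitError` and `call_with_backoff` are hypothetical names I made up for illustration, not part of any Kimi SDK, and `send` stands for whatever function actually issues your request. The delay values are arbitrary, since the real concurrency thresholds are not documented.

```python
import random
import time


class RateLimitError(Exception):
    """Hypothetical stand-in for whatever exception your client raises on HTTP 429."""


def call_with_backoff(send, max_retries=5, base_delay=1.0, max_delay=30.0,
                      sleep=time.sleep):
    """Call send(); on RateLimitError, pause and retry with exponential backoff.

    send        -- zero-argument callable that performs one request (assumption)
    max_retries -- number of retries after the first failed attempt
    base_delay  -- first pause in seconds; doubles on each retry up to max_delay
    sleep       -- injectable so the pauses can be observed or skipped in tests
    """
    for attempt in range(max_retries + 1):
        try:
            return send()
        except RateLimitError:
            if attempt == max_retries:
                raise  # out of retries: surface the 429 to the caller
            delay = min(max_delay, base_delay * 2 ** attempt)
            # Add up to 50% random jitter so retries do not fire in lockstep.
            sleep(delay + random.uniform(0, delay / 2))
```

You would wrap each request as `call_with_backoff(lambda: my_request(prompt))`, where `my_request` is your own function that raises `RateLimitError` when the provider answers with a 429. With the defaults, the pauses grow from about 1 second to about 2, 4, 8 and 16 seconds before giving up.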