Handling High Traffic


After just one day, the bot saw an unexpected influx of users. When I analyzed the server, I found very high response times (up to 10 seconds). This is because the LLM I'm using can handle at most 5 concurrent calls.
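One way to stay within that limit of 5 concurrent calls is to queue extra requests behind a semaphore instead of sending them all at once. The following is only a minimal sketch, not the bot's actual code: `fake_llm_call`, `LimitedLLM`, and `demo` are hypothetical names, and the real LLM request is replaced by a short sleep.

```python
import asyncio

MAX_CONCURRENT_CALLS = 5  # the concurrency limit mentioned above


async def fake_llm_call(prompt: str) -> str:
    """Hypothetical stand-in for the real LLM request."""
    await asyncio.sleep(0.01)  # simulate network and inference latency
    return f"reply to {prompt}"


class LimitedLLM:
    """Lets at most `limit` calls run at once; the rest wait their turn."""

    def __init__(self, limit: int = MAX_CONCURRENT_CALLS) -> None:
        self._semaphore = asyncio.Semaphore(limit)
        self.in_flight = 0        # calls currently running
        self.peak_in_flight = 0   # highest value in_flight ever reached

    async def ask(self, prompt: str) -> str:
        # Waits here whenever `limit` calls are already in progress.
        async with self._semaphore:
            self.in_flight += 1
            self.peak_in_flight = max(self.peak_in_flight, self.in_flight)
            try:
                return await fake_llm_call(prompt)
            finally:
                self.in_flight -= 1


async def demo(n_requests: int = 20):
    """Fire many requests at once and report the peak concurrency."""
    llm = LimitedLLM()
    replies = await asyncio.gather(
        *(llm.ask(f"msg {i}") for i in range(n_requests))
    )
    return replies, llm.peak_in_flight
```

Calling `asyncio.run(demo(20))` returns all 20 replies while never running more than 5 calls at the same time. This does not make any single call faster; it only keeps the backend from being overloaded, so waiting requests are delayed rather than dropped.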

I'm considering usage limits for free users so that paying customers receive unlimited, responsive service. Free users would also have the option to enter their own API key to get full access at no charge.
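As a rough illustration of how such a limit could work, here is a minimal sketch of a per-day message quota. Everything in it is an assumption made for the example: the `User` and `UsageLimiter` names, the in-memory counter, and the figure of 20 messages per day are hypothetical, and nothing here is decided yet.

```python
from dataclasses import dataclass, field
from datetime import date
from typing import Optional

FREE_DAILY_LIMIT = 20  # hypothetical number, not a decided value


@dataclass
class User:
    user_id: str
    is_paying: bool = False
    own_api_key: Optional[str] = None  # set when the user supplies a key


@dataclass
class UsageLimiter:
    daily_limit: int = FREE_DAILY_LIMIT
    # Maps (user_id, day) to messages used. Kept in memory for the sketch;
    # a real bot would persist this and prune old days.
    _counts: dict = field(default_factory=dict)

    def allow(self, user: User, today: Optional[date] = None) -> bool:
        """Return True if this message may be sent to the LLM."""
        # Paying users and users with their own API key are never limited.
        if user.is_paying or user.own_api_key:
            return True
        today = today or date.today()
        key = (user.user_id, today)
        used = self._counts.get(key, 0)
        if used >= self.daily_limit:
            return False
        self._counts[key] = used + 1
        return True
```

The bot would call `limiter.allow(user)` before each LLM request and reply with a "limit reached" message when it returns `False`. Counting per calendar day keeps the logic simple; a rolling window would be fairer but needs timestamps for each message.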

For the time being, I apologize for any inconvenience. If the bot doesn't respond, please try again later.

