# Rate Limiting and Sleep Mode

### Rate limiting

By default stack `create chat session` & `chat`  API has the following rate limit.

* An application can handle 40 requests per minute
* A user can make 4 requests per minute

These limits are added to make sure the application works smoothly for all users.

### Sleep Mode

To use the resources efficiently GenAI Stack apps automatically go into sleep mode if it's not used for the last 60 mins.

If the app is in sleep mode you will see the following UI:

<figure><img src="/files/TQEsUPKJqk2n4rgfmlMR" alt=""><figcaption><p>sleep mode</p></figcaption></figure>

Click on the W**ake Up** button to activate the app.

**Note**: If you want to use the GenAI Stack API contact us at `tech@aiplanet.com` to increase the rate limit & to remove the sleep mode.


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://docs.aiplanet.com/genai-stack-1/rest-apis/rate-limiting-and-sleep-mode.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
