UltimumAI

How to reduce AI chat costs in UltimumAI


The price of one AI response is:
amount the AI had to read + size of the generated response

Every time you send a message, the AI reads the whole chat from the beginning - not just the last message. That costs!

The longer the chat, the more expensive new AI responses become!
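
To see how quickly this adds up, here is a minimal sketch of the formula above. The sizes are abstract units rather than real prices, and the function names `response_cost` and `chat_costs` are made up for this illustration; real providers may also price reading and writing differently.

```python
def response_cost(history_sizes, new_message_size, reply_size):
    """Cost of one response: everything the AI reads plus what it writes."""
    amount_read = sum(history_sizes) + new_message_size
    return amount_read + reply_size


def chat_costs(turns):
    """turns is a list of (message_size, reply_size) pairs.

    Returns the cost of each response, assuming the whole chat so far
    is re-read on every turn.
    """
    history = []
    costs = []
    for message_size, reply_size in turns:
        costs.append(response_cost(history, message_size, reply_size))
        # Both the message and the reply become part of the chat history.
        history.extend([message_size, reply_size])
    return costs


# Four identical exchanges in ONE chat: each response costs more than the last.
print(chat_costs([(100, 200)] * 4))  # [300, 600, 900, 1200]
```

The same four exchanges in four fresh chats would cost 300 each (1200 in total) instead of 3000.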

Here are some money-saving tips:

Start new chats often ✂️

Start a new chat instead of extending the old one. If you must keep a long chat going, use the tactics described below.

Enable auto caching for Claude models 💾

  • If the time gap between your messages is shorter than 5 minutes, caching reduces reading costs by 90%.
  • If the gap is longer than that, caching increases costs instead!
  • The first message you send will always be slightly more expensive (because it creates the cache).
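
The trade-off in the bullets above can be sketched roughly like this. It is a simplified model, not UltimumAI's actual billing: the 90% discount and the 5-minute window come from this article, while the 25% surcharge for creating the cache and the name `read_cost` are assumptions chosen for illustration. It also treats the whole context as either cached or not, which is cruder than what real providers do.

```python
CACHE_WINDOW_SECONDS = 5 * 60  # from the article: gaps under 5 minutes hit the cache
CACHE_READ_FACTOR = 0.10       # from the article: reading costs drop by 90%
CACHE_WRITE_FACTOR = 1.25      # ASSUMPTION: illustrative surcharge for creating the cache


def read_cost(context_size, seconds_since_last_message, caching=True):
    """Reading cost of one message in abstract units.

    Pass None as seconds_since_last_message for the first message in a chat.
    """
    if not caching:
        return float(context_size)
    cache_is_warm = (
        seconds_since_last_message is not None
        and seconds_since_last_message < CACHE_WINDOW_SECONDS
    )
    if cache_is_warm:
        return context_size * CACHE_READ_FACTOR
    # First message, or the cache expired: the cache has to be (re)created.
    return context_size * CACHE_WRITE_FACTOR


quick_reply = read_cost(1000, seconds_since_last_message=60)
slow_reply = read_cost(1000, seconds_since_last_message=20 * 60)
no_caching = read_cost(1000, seconds_since_last_message=20 * 60, caching=False)
print(quick_reply < no_caching < slow_reply)  # True
```

In other words, caching pays off when you reply quickly and costs extra when you take long breaks between messages.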

Enable reduced memory 🧠

Often it's enough for the model to look at only the last few messages instead of the entire conversation. This lets a conversation go on indefinitely without each new response costing more.
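
The idea behind reduced memory can be sketched as a simple sliding window over the chat. This is an illustration only: the function name `trim_history` and the `keep_last` parameter are hypothetical, and UltimumAI's actual setting may work differently.

```python
def trim_history(messages, keep_last=6):
    """Return only the most recent `keep_last` messages.

    Older messages are simply not sent to the model, so the amount it
    has to read stays roughly constant however long the chat gets.
    """
    if keep_last <= 0:
        return []
    return messages[-keep_last:]


chat = [f"message {i}" for i in range(1, 11)]  # a chat with 10 messages
print(trim_history(chat, keep_last=4))
# ['message 7', 'message 8', 'message 9', 'message 10']
```

The catch is that the model no longer sees anything outside the window, so repeat any important detail from earlier in the chat when you need it.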

Edit your message instead of sending a new one ✏️

When you're not satisfied with the response you received, edit your message and regenerate the response instead of sending a follow-up. This keeps the conversation from growing.

Use a cheaper AI model 👶

Consider whether you really need the strongest model for a particular task. In UltimumAI, you can change the model mid-conversation.

Additional tips

Maximize use of official apps 💻

Some models can be used for free in official applications. Make the most of them, and jump into UltimumAI when you encounter limitations.

Use English and the Latin alphabet 💬

English uses the fewest tokens and therefore costs the least. For example, Arabic is 3x more expensive. Tell the model to always respond in English, and try your best to write in English too.

Set system instructions ⚙️

Shorter responses cost less. For example, you can write 'respond briefly and directly' or 'respond with code only'.

Ask multiple questions in the same message 🔢

Don't send several messages in a row with a single question in each.
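
A rough comparison shows why. The sketch below uses abstract size units and hypothetical function names, and it assumes the answers come out the same size either way.

```python
def separate_cost(history, question_sizes, answer_size):
    """Each question is its own message, so the chat is re-read every time."""
    total = 0
    for question in question_sizes:
        total += history + question + answer_size  # read chat + question, write answer
        history += question + answer_size          # the chat grows after every exchange
    return total


def batched_cost(history, question_sizes, answer_size):
    """All questions go in one message, so the chat is read only once."""
    return history + sum(question_sizes) + answer_size * len(question_sizes)


# A chat that already holds 1000 units, three questions of 20 units each,
# and answers of 100 units each.
print(separate_cost(1000, [20, 20, 20], 100))  # 3720
print(batched_cost(1000, [20, 20, 20], 100))   # 1360
```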

Create a conversation summary 📝

Ask a cheap or free model to summarize the conversation. Then paste that summary into a new conversation and continue there.

Turn off unnecessary tools 🛠️

Even if the model does not use a certain tool, the message price will be higher just because it is enabled.
