Recommend
POST /v1/recommend — Minima recalls similar past task → model → outcome records, aggregates empirical success rates, and returns the cheapest model expected to clear your quality bar. Zero added latency to your actual LLM call.
Recommend
POST /v1/recommend — Minima recalls similar past task → model → outcome records, aggregates empirical success rates, and returns the cheapest model expected to clear your quality bar. Zero added latency to your actual LLM call.
Run it yourself
Minima hands back a recommended_model and a recommendation_id. You run the model in your own stack — Minima never proxies, executes, or rewrites.
Feed back
POST /v1/feedback — report the outcome, quality score, and realized tokens. Minima writes the outcome to memory, reinforces the exact neighbors that drove the pick, and promotes durable lessons.
Improve automatically
Every feedback call makes the next recommendation sharper. The cost basis climbs from flat estimates → observed median → rescaled per-request as history accumulates.
task → model → outcome history in your Mubit instance, with namespace / user_id sub-scoping inside it. POST /v1/recommend ──▶ you run the model ──▶ POST /v1/feedback (recall + rank) (your stack) (write outcome, reinforce) ▲ │ └────────────── picks get sharper ──────────────────┘