Question 1

What is freelm?

Accepted Answer

freelm is an open-source, always-up LLM client and API gateway for Python and Node.js. It pools free-tier LLM providers like OpenRouter, Gemini, NIM, Groq, Cerebras, and Mistral behind a single OpenAI-compatible interface, offering automatic failover and dynamic model discovery.

Question 2

How does the automatic failover work?

Accepted Answer

When a provider hits a rate limit (429) or fails (5xx), freelm intercepts the error before throwing an exception. It trips a circuit breaker for that API key and instantly retries the request using the next available key or provider in your pool.

Question 3

Which routing strategies does freelm support?

Accepted Answer

freelm supports four routing strategies: 'priority' (strict ordering), 'round_robin' (even load distribution), 'quota_aware' (prioritizing keys with the most remaining daily/RPM limits), and 'latency' (routing to the fastest provider).

Question 4

Is freelm actually free to use?

Accepted Answer

Yes. The freelm package is free (MIT-licensed). It routes your requests to the free tiers of supported providers so you don't need a credit card. Your total throughput scales with the combined free quotas of the API keys you provide.

Question 5

Does it support the OpenAI SDK format?

Accepted Answer

Yes, both the Python and Node.js versions provide a drop-in OpenAI shim. You can import OpenAI from the freelm compatibility module and use your existing chat completion and streaming code without changes.

Question 6

Do I need API keys for all supported providers?

Accepted Answer

No. You only need to supply the keys you have. freelm automatically detects which keys are available in your environment variables and builds its routing pool dynamically.

One API. Six Free Providers. Zero Downtime.

Enterprise reliability. Free tier pricing.

Multi-Provider Pooling

Zero-Downtime Failover

Drop-in OpenAI Shim

Native Streaming

Live Model Discovery

Smart Circuit Breakers

Frequently Asked Questions