Trendshift - Ask AI

base on A list of free LLM inference resources accessible via API.  # Free LLM API resources This lists various services that provide free access or credits towards API-based LLM usage. > [!NOTE] > Please don't abuse these services, else we might lose them. > [!WARNING] > This list explicitly excludes any services that are not legitimate (eg reverse engineers an existing chatbot) - [Free Providers](#free-providers) - [OpenRouter](#openrouter) - [Google AI Studio](#google-ai-studio) - [NVIDIA NIM](#nvidia-nim) - [Mistral (La Plateforme)](#mistral-la-plateforme) - [Mistral (Codestral)](#mistral-codestral) - [HuggingFace Inference Providers](#huggingface-inference-providers) - [Vercel AI Gateway](#vercel-ai-gateway) - [Cerebras](#cerebras) - [Groq](#groq) - [Together (Free)](#together-free) - [Cohere](#cohere) - [GitHub Models](#github-models) - [Cloudflare Workers AI](#cloudflare-workers-ai) - [Google Cloud Vertex AI](#google-cloud-vertex-ai) - [Providers with trial credits](#providers-with-trial-credits) - [Together](#together) - [Fireworks](#fireworks) - [Baseten](#baseten) - [Nebius](#nebius) - [Novita](#novita) - [AI21](#ai21) - [Upstage](#upstage) - [NLP Cloud](#nlp-cloud) - [Alibaba Cloud (International) Model Studio](#alibaba-cloud-international-model-studio) - [Modal](#modal) - [Inference.net](#inferencenet) - [nCompass](#ncompass) - [Hyperbolic](#hyperbolic) - [SambaNova Cloud](#sambanova-cloud) - [Scaleway Generative APIs](#scaleway-generative-apis) ## Free Providers ### [OpenRouter](https://openrouter.ai) **Limits:** [20 requests/minute<br>50 requests/day<br>1000 requests/day with $10 lifetime topup](https://openrouter.ai/docs/api-reference/limits) Models share a common quota. - [DeepCoder 14B Preview](https://openrouter.ai/agentica-org/deepcoder-14b-preview:free) - [DeepHermes 3 Llama 3 8B Preview](https://openrouter.ai/nousresearch/deephermes-3-llama-3-8b-preview:free) - [DeepSeek R1](https://openrouter.ai/deepseek/deepseek-r1:free) - [DeepSeek R1 Distill Llama 70B](https://openrouter.ai/deepseek/deepseek-r1-distill-llama-70b:free) - [DeepSeek R1 Distill Qwen 14B](https://openrouter.ai/deepseek/deepseek-r1-distill-qwen-14b:free) - [DeepSeek V3 0324](https://openrouter.ai/deepseek/deepseek-chat-v3-0324:free) - [Dolphin 3.0 Mistral 24B](https://openrouter.ai/cognitivecomputations/dolphin3.0-mistral-24b:free) - [Dolphin 3.0 R1 Mistral 24B](https://openrouter.ai/cognitivecomputations/dolphin3.0-r1-mistral-24b:free) - [Gemma 2 9B Instruct](https://openrouter.ai/google/gemma-2-9b-it:free) - [Gemma 3 12B Instruct](https://openrouter.ai/google/gemma-3-12b-it:free) - [Gemma 3 27B Instruct](https://openrouter.ai/google/gemma-3-27b-it:free) - [Gemma 3 4B Instruct](https://openrouter.ai/google/gemma-3-4b-it:free) - [Kimi VL A3B Thinking](https://openrouter.ai/moonshotai/kimi-vl-a3b-thinking:free) - [Llama 3.1 405B Instruct](https://openrouter.ai/meta-llama/llama-3.1-405b-instruct:free) - [Llama 3.1 Nemotron Ultra 253B v1](https://openrouter.ai/nvidia/llama-3.1-nemotron-ultra-253b-v1:free) - [Llama 3.2 3B Instruct](https://openrouter.ai/meta-llama/llama-3.2-3b-instruct:free) - [Llama 3.3 70B Instruct](https://openrouter.ai/meta-llama/llama-3.3-70b-instruct:free) - [Llama 4 Maverick](https://openrouter.ai/meta-llama/llama-4-maverick:free) - [Llama 4 Scout](https://openrouter.ai/meta-llama/llama-4-scout:free) - [Mistral 7B Instruct](https://openrouter.ai/mistralai/mistral-7b-instruct:free) - [Mistral Nemo](https://openrouter.ai/mistralai/mistral-nemo:free) - [Mistral Small 24B Instruct 2501](https://openrouter.ai/mistralai/mistral-small-24b-instruct-2501:free) - [Mistral Small 3.1 24B Instruct](https://openrouter.ai/mistralai/mistral-small-3.1-24b-instruct:free) - [QwQ 32B ArliAI RpR v1](https://openrouter.ai/arliai/qwq-32b-arliai-rpr-v1:free) - [Qwen 2.5 72B Instruct](https://openrouter.ai/qwen/qwen-2.5-72b-instruct:free) - [Qwen 2.5 VL 32B Instruct](https://openrouter.ai/qwen/qwen2.5-vl-32b-instruct:free) - [Qwen QwQ 32B](https://openrouter.ai/qwen/qwq-32b:free) - [Qwen2.5 Coder 32B Instruct](https://openrouter.ai/qwen/qwen-2.5-coder-32b-instruct:free) - [Qwen2.5 VL 72B Instruct](https://openrouter.ai/qwen/qwen2.5-vl-72b-instruct:free) - [Reka Flash 3](https://openrouter.ai/rekaai/reka-flash-3:free) - [Shisa V2 Llama 3.3 70B](https://openrouter.ai/shisa-ai/shisa-v2-llama3.3-70b:free) - [cognitivecomputations/dolphin-mistral-24b-venice-edition:free](https://openrouter.ai/cognitivecomputations/dolphin-mistral-24b-venice-edition:free) - [deepseek/deepseek-chat-v3.1:free](https://openrouter.ai/deepseek/deepseek-chat-v3.1:free) - [deepseek/deepseek-r1-0528-qwen3-8b:free](https://openrouter.ai/deepseek/deepseek-r1-0528-qwen3-8b:free) - [deepseek/deepseek-r1-0528:free](https://openrouter.ai/deepseek/deepseek-r1-0528:free) - [google/gemma-3n-e2b-it:free](https://openrouter.ai/google/gemma-3n-e2b-it:free) - [google/gemma-3n-e4b-it:free](https://openrouter.ai/google/gemma-3n-e4b-it:free) - [meta-llama/llama-3.3-8b-instruct:free](https://openrouter.ai/meta-llama/llama-3.3-8b-instruct:free) - [microsoft/mai-ds-r1:free](https://openrouter.ai/microsoft/mai-ds-r1:free) - [mistralai/devstral-small-2505:free](https://openrouter.ai/mistralai/devstral-small-2505:free) - [mistralai/mistral-small-3.2-24b-instruct:free](https://openrouter.ai/mistralai/mistral-small-3.2-24b-instruct:free) - [moonshotai/kimi-dev-72b:free](https://openrouter.ai/moonshotai/kimi-dev-72b:free) - [moonshotai/kimi-k2:free](https://openrouter.ai/moonshotai/kimi-k2:free) - [nvidia/nemotron-nano-9b-v2:free](https://openrouter.ai/nvidia/nemotron-nano-9b-v2:free) - [openai/gpt-oss-120b:free](https://openrouter.ai/openai/gpt-oss-120b:free) - [openai/gpt-oss-20b:free](https://openrouter.ai/openai/gpt-oss-20b:free) - [qwen/qwen3-14b:free](https://openrouter.ai/qwen/qwen3-14b:free) - [qwen/qwen3-235b-a22b:free](https://openrouter.ai/qwen/qwen3-235b-a22b:free) - [qwen/qwen3-30b-a3b:free](https://openrouter.ai/qwen/qwen3-30b-a3b:free) - [qwen/qwen3-4b:free](https://openrouter.ai/qwen/qwen3-4b:free) - [qwen/qwen3-8b:free](https://openrouter.ai/qwen/qwen3-8b:free) - [qwen/qwen3-coder:free](https://openrouter.ai/qwen/qwen3-coder:free) - [tencent/hunyuan-a13b-instruct:free](https://openrouter.ai/tencent/hunyuan-a13b-instruct:free) - [tngtech/deepseek-r1t-chimera:free](https://openrouter.ai/tngtech/deepseek-r1t-chimera:free) - [tngtech/deepseek-r1t2-chimera:free](https://openrouter.ai/tngtech/deepseek-r1t2-chimera:free) - [z-ai/glm-4.5-air:free](https://openrouter.ai/z-ai/glm-4.5-air:free) ### [Google AI Studio](https://aistudio.google.com) Data is used for training when used outside of the UK/CH/EEA/EU. <table><thead><tr><th>Model Name</th><th>Model Limits</th></tr></thead><tbody> <tr><td>Gemini 2.5 Pro</td><td>3,000,000 tokens/day<br>125,000 tokens/minute<br>50 requests/day<br>2 requests/minute</td></tr> <tr><td>Gemini 2.5 Flash</td><td>250,000 tokens/minute<br>250 requests/day<br>10 requests/minute</td></tr> <tr><td>Gemini 2.5 Flash-Lite</td><td>250,000 tokens/minute<br>1,000 requests/day<br>15 requests/minute</td></tr> <tr><td>Gemini 2.5 Flash Image Preview (Nano Banana)</td><td></td></tr> <tr><td>Gemini 2.0 Flash</td><td>1,000,000 tokens/minute<br>200 requests/day<br>15 requests/minute</td></tr> <tr><td>Gemini 2.0 Flash-Lite</td><td>1,000,000 tokens/minute<br>200 requests/day<br>30 requests/minute</td></tr> <tr><td>Gemini 2.0 Flash (Experimental)</td><td>250,000 tokens/minute<br>50 requests/day<br>10 requests/minute</td></tr> <tr><td>Gemini 1.5 Flash</td><td>250,000 tokens/minute<br>50 requests/day<br>15 requests/minute</td></tr> <tr><td>Gemini 1.5 Flash-8B</td><td>250,000 tokens/minute<br>50 requests/day<br>15 requests/minute</td></tr> <tr><td>LearnLM 2.0 Flash (Experimental)</td><td>1,500 requests/day<br>15 requests/minute</td></tr> <tr><td>Gemma 3 27B Instruct</td><td>15,000 tokens/minute<br>14,400 requests/day<br>30 requests/minute</td></tr> <tr><td>Gemma 3 12B Instruct</td><td>15,000 tokens/minute<br>14,400 requests/day<br>30 requests/minute</td></tr> <tr><td>Gemma 3 4B Instruct</td><td>15,000 tokens/minute<br>14,400 requests/day<br>30 requests/minute</td></tr> <tr><td>Gemma 3 1B Instruct</td><td>15,000 tokens/minute<br>14,400 requests/day<br>30 requests/minute</td></tr> </tbody></table> ### [NVIDIA NIM](https://build.nvidia.com/explore/discover) Phone number verification required. Models tend to be context window limited. **Limits:** 40 requests/minute - [Various open models](https://build.nvidia.com/models) ### [Mistral (La Plateforme)](https://console.mistral.ai/) * Free tier (Experiment plan) requires opting into data training * Requires phone number verification. **Limits (per-model):** 1 request/second, 500,000 tokens/minute, 1,000,000,000 tokens/month - [Open and Proprietary Mistral models](https://docs.mistral.ai/getting-started/models/models_overview/) ### [Mistral (Codestral)](https://codestral.mistral.ai/) * Currently free to use * Monthly subscription based * Requires phone number verification **Limits:** 30 requests/minute, 2,000 requests/day - Codestral ### [HuggingFace Inference Providers](https://huggingface.co/docs/inference-providers/en/index) HuggingFace Serverless Inference limited to models smaller than 10GB. Some popular models are supported even if they exceed 10GB. **Limits:** [$0.10/month in credits](https://huggingface.co/docs/inference-providers/en/pricing) - Various open models across supported providers ### [Vercel AI Gateway](https://vercel.com/docs/ai-gateway) Routes to various supported providers. **Limits:** [$5/month](https://vercel.com/docs/ai-gateway/pricing) ### [Cerebras](https://cloud.cerebras.ai/) <table><thead><tr><th>Model Name</th><th>Model Limits</th></tr></thead><tbody> <tr><td>gpt-oss-120b</td><td>30 requests/minute<br>60,000 tokens/minute<br>900 requests/hour<br>1,000,000 tokens/hour<br>14,400 requests/day<br>1,000,000 tokens/day</td></tr> <tr><td>Qwen 3 235B A22B Instruct</td><td>30 requests/minute<br>60,000 tokens/minute<br>900 requests/hour<br>1,000,000 tokens/hour<br>14,400 requests/day<br>1,000,000 tokens/day</td></tr> <tr><td>Qwen 3 235B A22B Thinking</td><td>30 requests/minute<br>60,000 tokens/minute<br>900 requests/hour<br>1,000,000 tokens/hour<br>14,400 requests/day<br>1,000,000 tokens/day</td></tr> <tr><td>Qwen 3 Coder 480B</td><td>10 requests/minute<br>150,000 tokens/minute<br>100 requests/hour<br>1,000,000 tokens/hour<br>100 requests/day<br>1,000,000 tokens/day</td></tr> <tr><td>Llama 3.3 70B</td><td>30 requests/minute<br>64,000 tokens/minute<br>900 requests/hour<br>1,000,000 tokens/hour<br>14,400 requests/day<br>1,000,000 tokens/day</td></tr> <tr><td>Qwen 3 32B</td><td>30 requests/minute<br>64,000 tokens/minute<br>900 requests/hour<br>1,000,000 tokens/hour<br>14,400 requests/day<br>1,000,000 tokens/day</td></tr> <tr><td>Llama 3.1 8B</td><td>30 requests/minute<br>60,000 tokens/minute<br>900 requests/hour<br>1,000,000 tokens/hour<br>14,400 requests/day<br>1,000,000 tokens/day</td></tr> <tr><td>Llama 4 Scout</td><td>30 requests/minute<br>60,000 tokens/minute<br>900 requests/hour<br>1,000,000 tokens/hour<br>14,400 requests/day<br>1,000,000 tokens/day</td></tr> <tr><td>Llama 4 Maverick</td><td>30 requests/minute<br>60,000 tokens/minute<br>900 requests/hour<br>1,000,000 tokens/hour<br>14,400 requests/day<br>1,000,000 tokens/day</td></tr> </tbody></table> ### [Groq](https://console.groq.com) <table><thead><tr><th>Model Name</th><th>Model Limits</th></tr></thead><tbody> <tr><td>Allam 2 7B</td><td>7,000 requests/day<br>6,000 tokens/minute</td></tr> <tr><td>DeepSeek R1 Distill Llama 70B</td><td>1,000 requests/day<br>6,000 tokens/minute</td></tr> <tr><td>Gemma 2 9B Instruct</td><td>14,400 requests/day<br>15,000 tokens/minute</td></tr> <tr><td>Llama 3.1 8B</td><td>14,400 requests/day<br>6,000 tokens/minute</td></tr> <tr><td>Llama 3.3 70B</td><td>1,000 requests/day<br>12,000 tokens/minute</td></tr> <tr><td>Llama 4 Maverick 17B 128E Instruct</td><td>1,000 requests/day<br>6,000 tokens/minute</td></tr> <tr><td>Llama 4 Scout Instruct</td><td>1,000 requests/day<br>30,000 tokens/minute</td></tr> <tr><td>Whisper Large v3</td><td>7,200 audio-seconds/minute<br>2,000 requests/day</td></tr> <tr><td>Whisper Large v3 Turbo</td><td>7,200 audio-seconds/minute<br>2,000 requests/day</td></tr> <tr><td>groq/compound</td><td>250 requests/day<br>70,000 tokens/minute</td></tr> <tr><td>groq/compound-mini</td><td>250 requests/day<br>70,000 tokens/minute</td></tr> <tr><td>meta-llama/llama-guard-4-12b</td><td>14,400 requests/day<br>15,000 tokens/minute</td></tr> <tr><td>meta-llama/llama-prompt-guard-2-22m</td><td></td></tr> <tr><td>meta-llama/llama-prompt-guard-2-86m</td><td></td></tr> <tr><td>moonshotai/kimi-k2-instruct</td><td>1,000 requests/day<br>10,000 tokens/minute</td></tr> <tr><td>moonshotai/kimi-k2-instruct-0905</td><td>1,000 requests/day<br>10,000 tokens/minute</td></tr> <tr><td>openai/gpt-oss-120b</td><td>1,000 requests/day<br>8,000 tokens/minute</td></tr> <tr><td>openai/gpt-oss-20b</td><td>1,000 requests/day<br>8,000 tokens/minute</td></tr> <tr><td>qwen/qwen3-32b</td><td>1,000 requests/day<br>6,000 tokens/minute</td></tr> </tbody></table> ### [Together (Free)](https://together.ai) **Limits:** Up to 60 requests/minute - [Llama 3.3 70B Instruct](https://together.ai/models/llama-3-3-70b-free) - [DeepSeek R1 Distil Llama 70B](https://together.ai/models/deepseek-r1-distilled-llama-70b-free) ### [Cohere](https://cohere.com) **Limits:** [20 requests/minute<br>1,000 requests/month](https://docs.cohere.com/docs/rate-limits) Models share a common quota. - Command-A - Command-R7B - Command-R+ - Command-R - Aya Expanse 8B - Aya Expanse 32B - Aya Vision 8B - Aya Vision 32B ### [GitHub Models](https://github.com/marketplace/models) Extremely restrictive input/output token limits. **Limits:** [Dependent on Copilot subscription tier (Free/Pro/Pro+/Business/Enterprise)](https://docs.github.com/en/github-models/prototyping-with-ai-models#rate-limits) - AI21 Jamba 1.5 Large - AI21 Jamba 1.5 Mini - Codestral 25.01 - Cohere Command A - Cohere Command R 08-2024 - Cohere Command R+ 08-2024 - Cohere Embed v3 English - Cohere Embed v3 Multilingual - DeepSeek-R1 - DeepSeek-R1-0528 - DeepSeek-V3-0324 - Grok 3 - Grok 3 Mini - JAIS 30b Chat - Llama 4 Maverick 17B 128E Instruct FP8 - Llama 4 Scout 17B 16E Instruct - Llama-3.2-11B-Vision-Instruct - Llama-3.2-90B-Vision-Instruct - Llama-3.3-70B-Instruct - MAI-DS-R1 - Meta-Llama-3.1-405B-Instruct - Meta-Llama-3.1-8B-Instruct - Ministral 3B - Mistral Large 24.11 - Mistral Medium 3 (25.05) - Mistral Nemo - Mistral Small 3.1 - OpenAI GPT-4.1 - OpenAI GPT-4.1-mini - OpenAI GPT-4.1-nano - OpenAI GPT-4o - OpenAI GPT-4o mini - OpenAI Text Embedding 3 (large) - OpenAI Text Embedding 3 (small) - OpenAI gpt-5 - OpenAI gpt-5-chat (preview) - OpenAI gpt-5-mini - OpenAI gpt-5-nano - OpenAI o1 - OpenAI o1-mini - OpenAI o1-preview - OpenAI o3 - OpenAI o3-mini - OpenAI o4-mini - Phi-4 - Phi-4-mini-instruct - Phi-4-mini-reasoning - Phi-4-multimodal-instruct - Phi-4-reasoning ### [Cloudflare Workers AI](https://developers.cloudflare.com/workers-ai) **Limits:** [10,000 neurons/day](https://developers.cloudflare.com/workers-ai/platform/pricing/#free-allocation) - @cf/openai/gpt-oss-120b - @cf/openai/gpt-oss-20b - DeepSeek R1 Distill Qwen 32B - Deepseek Coder 6.7B Base (AWQ) - Deepseek Coder 6.7B Instruct (AWQ) - Deepseek Math 7B Instruct - Discolm German 7B v1 (AWQ) - Falcom 7B Instruct - Gemma 2B Instruct (LoRA) - Gemma 3 12B Instruct - Gemma 7B Instruct - Gemma 7B Instruct (LoRA) - Hermes 2 Pro Mistral 7B - Llama 2 13B Chat (AWQ) - Llama 2 7B Chat (FP16) - Llama 2 7B Chat (INT8) - Llama 2 7B Chat (LoRA) - Llama 3 8B Instruct - Llama 3 8B Instruct - Llama 3 8B Instruct (AWQ) - Llama 3.1 8B Instruct (AWQ) - Llama 3.1 8B Instruct (FP8) - Llama 3.2 11B Vision Instruct - Llama 3.2 1B Instruct - Llama 3.2 3B Instruct - Llama 3.3 70B Instruct (FP8) - Llama 4 Scout Instruct - Llama Guard 3 8B - LlamaGuard 7B (AWQ) - Mistral 7B Instruct v0.1 - Mistral 7B Instruct v0.1 (AWQ) - Mistral 7B Instruct v0.2 - Mistral 7B Instruct v0.2 (LoRA) - Mistral Small 3.1 24B Instruct - Neural Chat 7B v3.1 (AWQ) - OpenChat 3.5 0106 - OpenHermes 2.5 Mistral 7B (AWQ) - Phi-2 - Qwen 1.5 0.5B Chat - Qwen 1.5 1.8B Chat - Qwen 1.5 14B Chat (AWQ) - Qwen 1.5 7B Chat (AWQ) - Qwen 2.5 Coder 32B Instruct - Qwen QwQ 32B - SQLCoder 7B 2 - Starling LM 7B Beta - TinyLlama 1.1B Chat v1.0 - Una Cybertron 7B v2 (BF16) - Zephyr 7B Beta (AWQ) ### [Google Cloud Vertex AI](https://console.cloud.google.com/vertex-ai/model-garden) Very stringent payment verification for Google Cloud. <table><thead><tr><th>Model Name</th><th>Model Limits</th></tr></thead><tbody> <tr><td><a href="https://console.cloud.google.com/vertex-ai/publishers/meta/model-garden/llama-3-2-90b-vision-instruct-maas" target="_blank">Llama 3.2 90B Vision Instruct</a></td><td>30 requests/minute<br>Free during preview</td></tr> <tr><td><a href="https://console.cloud.google.com/vertex-ai/publishers/meta/model-garden/llama-3-1-405b-instruct-maas" target="_blank">Llama 3.1 70B Instruct</a></td><td>60 requests/minute<br>Free during preview</td></tr> <tr><td><a href="https://console.cloud.google.com/vertex-ai/publishers/meta/model-garden/llama-3-1-405b-instruct-maas" target="_blank">Llama 3.1 8B Instruct</a></td><td>60 requests/minute<br>Free during preview</td></tr> </tbody></table> ## Providers with trial credits ### [Together](https://together.ai) **Credits:** $1 when you add a payment method **Models:** [Various open models](https://together.ai/models) ### [Fireworks](https://fireworks.ai/) **Credits:** $1 **Models:** [Various open models](https://fireworks.ai/models) ### [Baseten](https://app.baseten.co/) **Credits:** $30 **Models:** [Any supported model - pay by compute time](https://www.baseten.co/library/) ### [Nebius](https://studio.nebius.com/) **Credits:** $1 **Models:** [Various open models](https://studio.nebius.ai/models) ### [Novita](https://novita.ai/?ref=ytblmjc&utm_source=affiliate) **Credits:** $0.5 for 1 year **Models:** [Various open models](https://novita.ai/models) ### [AI21](https://studio.ai21.com/) **Credits:** $10 for 3 months **Models:** Jamba family of models ### [Upstage](https://console.upstage.ai/) **Credits:** $10 for 3 months **Models:** Solar Pro/Mini ### [NLP Cloud](https://nlpcloud.com/home) **Credits:** $15 **Requirements:** Phone number verification **Models:** Various open models ### [Alibaba Cloud (International) Model Studio](https://bailian.console.alibabacloud.com/) **Credits:** 1 million tokens/model **Models:** [Various open and proprietary Qwen models](https://www.alibabacloud.com/en/product/modelstudio) ### [Modal](https://modal.com) **Credits:** $5/month upon sign up, $30/month with payment method added **Models:** Any supported model - pay by compute time ### [Inference.net](https://inference.net) **Credits:** $1, $25 on responding to email survey **Models:** Various open models ### [nCompass](https://ncompass.tech) **Credits:** $1 **Models:** Various open models ### [Hyperbolic](https://app.hyperbolic.xyz/) **Credits:** $1 **Models:** - DeepSeek V3 - DeepSeek V3 0324 - Hermes 3 Llama 3.1 70B - Llama 3 70B Instruct - Llama 3.1 405B Base - Llama 3.1 405B Base (FP8) - Llama 3.1 405B Instruct - Llama 3.1 70B Instruct - Llama 3.1 8B Instruct - Llama 3.2 3B Instruct - Llama 3.3 70B Instruct - Pixtral 12B (2409) - Qwen QwQ 32B - Qwen QwQ 32B Preview - Qwen2.5 72B Instruct - Qwen2.5 Coder 32B Instruct - Qwen2.5 VL 72B Instruct - Qwen2.5 VL 7B Instruct - openai/gpt-oss-120b - openai/gpt-oss-120b-turbo - openai/gpt-oss-20b - qwen/qwen3-235b-a22b-instruct-2507-fp8 - qwen/qwen3-coder-480b-a35b-instruct-fp8 - qwen/qwen3-next-80b-a3b-instruct - qwen/qwen3-next-80b-a3b-thinking ### [SambaNova Cloud](https://cloud.sambanova.ai/) **Credits:** $5 for 3 months **Models:** - E5-Mistral-7B-Instruct - Llama 3.1 405B - Llama 3.1 8B - Llama 3.2 1B - Llama 3.2 3B - Llama 3.3 70B - Llama-4-Maverick-17B-128E-Instruct - Llama-4-Scout-17B-16E-Instruct - Llama-Guard-3-8B - Qwen/QwQ-32B - Qwen/Qwen2-Audio-7B-Instruct - Qwen/Qwen3-32B - deepseek-ai/DeepSeek-R1 - deepseek-ai/DeepSeek-R1-0528 - deepseek-ai/DeepSeek-R1-Distill-Llama-70B - deepseek-ai/DeepSeek-V3-0324 - deepseek-ai/DeepSeek-V3.1 - test-model ### [Scaleway Generative APIs](https://console.scaleway.com/generative-api/models) **Credits:** 1,000,000 free tokens **Models:** - BGE-Multilingual-Gemma2 - DeepSeek R1 Distill Llama 70B - Gemma 3 27B Instruct - Llama 3.1 8B Instruct - Llama 3.3 70B Instruct - Mistral Nemo 2407 - Mistral Small 3.1 24B Instruct 2503 - Pixtral 12B (2409) - Qwen2.5 Coder 32B Instruct - devstral-small-2505 - gpt-oss-120b - mistral-small-3.2-24b-instruct-2506 - qwen3-235b-a22b-instruct-2507 - qwen3-coder-30b-a3b-instruct - voxtral-small-24b-2507 ", Assign "at most 3 tags" to the expected json: {"id":"14565","tags":[]} "only from the tags list I provide: [{"id":77,"name":"3d"},{"id":89,"name":"agent"},{"id":17,"name":"ai"},{"id":54,"name":"algorithm"},{"id":24,"name":"api"},{"id":44,"name":"authentication"},{"id":3,"name":"aws"},{"id":27,"name":"backend"},{"id":60,"name":"benchmark"},{"id":72,"name":"best-practices"},{"id":39,"name":"bitcoin"},{"id":37,"name":"blockchain"},{"id":1,"name":"blog"},{"id":45,"name":"bundler"},{"id":58,"name":"cache"},{"id":21,"name":"chat"},{"id":49,"name":"cicd"},{"id":4,"name":"cli"},{"id":64,"name":"cloud-native"},{"id":48,"name":"cms"},{"id":61,"name":"compiler"},{"id":68,"name":"containerization"},{"id":92,"name":"crm"},{"id":34,"name":"data"},{"id":47,"name":"database"},{"id":8,"name":"declarative-gui "},{"id":9,"name":"deploy-tool"},{"id":53,"name":"desktop-app"},{"id":6,"name":"dev-exp-lib"},{"id":59,"name":"dev-tool"},{"id":13,"name":"ecommerce"},{"id":26,"name":"editor"},{"id":66,"name":"emulator"},{"id":62,"name":"filesystem"},{"id":80,"name":"finance"},{"id":15,"name":"firmware"},{"id":73,"name":"for-fun"},{"id":2,"name":"framework"},{"id":11,"name":"frontend"},{"id":22,"name":"game"},{"id":81,"name":"game-engine "},{"id":23,"name":"graphql"},{"id":84,"name":"gui"},{"id":91,"name":"http"},{"id":5,"name":"http-client"},{"id":51,"name":"iac"},{"id":30,"name":"ide"},{"id":78,"name":"iot"},{"id":40,"name":"json"},{"id":83,"name":"julian"},{"id":38,"name":"k8s"},{"id":31,"name":"language"},{"id":10,"name":"learning-resource"},{"id":33,"name":"lib"},{"id":41,"name":"linter"},{"id":28,"name":"lms"},{"id":16,"name":"logging"},{"id":76,"name":"low-code"},{"id":90,"name":"message-queue"},{"id":42,"name":"mobile-app"},{"id":18,"name":"monitoring"},{"id":36,"name":"networking"},{"id":7,"name":"node-version"},{"id":55,"name":"nosql"},{"id":57,"name":"observability"},{"id":46,"name":"orm"},{"id":52,"name":"os"},{"id":14,"name":"parser"},{"id":74,"name":"react"},{"id":82,"name":"real-time"},{"id":56,"name":"robot"},{"id":65,"name":"runtime"},{"id":32,"name":"sdk"},{"id":71,"name":"search"},{"id":63,"name":"secrets"},{"id":25,"name":"security"},{"id":85,"name":"server"},{"id":86,"name":"serverless"},{"id":70,"name":"storage"},{"id":75,"name":"system-design"},{"id":79,"name":"terminal"},{"id":29,"name":"testing"},{"id":12,"name":"ui"},{"id":50,"name":"ux"},{"id":88,"name":"video"},{"id":20,"name":"web-app"},{"id":35,"name":"web-server"},{"id":43,"name":"webassembly"},{"id":69,"name":"workflow"},{"id":87,"name":"yaml"}]" returns me the "expected json"

AI prompts