One API, multiple capabilities. Chat, embeddings, image generation, and speech. All running on GPUs in Groningen.

Models

GET /v1/models

Lists all available models. Returns model IDs and their capabilities.

Chat models

Text generation and conversation. Use with /v1/chat/completions.

Qwen 3.6 35B (MoE, 3B active). Fast interactive model for conversations, analysis, and text generation.

Recommended for most use cases.

Gemma 4 31B (Dense). Deep analysis model with strong reasoning. Best for complex tasks.

Convert text to vectors for search, similarity, and RAG. Use with /v1/embeddings.

BGE-M3 multilingual embeddings. State-of-the-art for retrieval, supports 100+ languages including Dutch.

Generate images from text. Use with /v1/images/generations.

FLUX Schnell. Fast generation (~2s per image) for rapid iteration and prototyping.

FLUX Dev. Higher quality output with more detail and better prompt adherence.

Transcription with speaker diarization. Use with /v1/audio/diarize.

WhisperX with speaker diarization. Transcribes audio and identifies who said what. Supports Dutch and 90+ other languages.

Use short aliases or full model names interchangeably.

Need a different model? We can deploy additional models on request. Contact us at support@appelon.ai