GPT-Image-2 API is now available
Featured AI models
Trending now
GPT-Image-2
OpenAI
GPT-Image-2 is OpenAI's next-generation image model for stronger photorealism, cleaner image editing, and sharper in-image text rendering.
Seedance 2.0
ByteDance
Seedance 2.0 is ByteDance's multimodal video model supporting text-to-video, first-and-last-frames, and omni-reference modes. APIXO exclusive: unlimited concurrency, real-person portrait support, and hidden capabilities.
Seedance 2.0 Fast
ByteDance
Seedance 2.0 Fast is the speed-optimized variant of ByteDance's multimodal video model. It supports text-to-video, first-and-last-frames, and omni-reference modes with lower per-second pricing and the same APIXO-exclusive capabilities.
Nano Banana
Gemini 2.5 Flash Image Preview (aka Nano Banana) is an advanced AI model excelling in natural language-driven image generation and editing. It produces hyper-realistic, physics-aware visuals with seamless style transformations.
Nano Banana Pro
Nano Banana Pro (Gemini 3 Pro Image) is Google's AGI-level image generation model with reasoning capabilities, native 4K output, Search Grounding for real-time data integration, near-perfect text rendering, and superior spatial awareness.
Wan 2.5
Alibaba
Wan 2.5 is Alibaba's advanced video generation model featuring one-pass audio/video synchronization, multilingual support, and cost-effective production. Creates fully synchronized videos with voiceover and lip-sync from a single prompt.
Large language models are now in the main catalog
Pricing for Claude, OpenAI, and Gemini now lives in the same model hub, so users can switch between creative models, token pricing, and the pricing center without leaving the product area.
New releases
A stable, release-oriented view of the latest additions worth checking first.
GPT-Image-2
OpenAI
GPT-Image-2 is OpenAI's next-generation image model for stronger photorealism, cleaner image editing, and sharper in-image text rendering.
Hailuo 2.3
Hailuo
Hailuo 2.3 is MiniMax's async video model with standard and pro modes for text-to-video and image-to-video generation. Standard mode supports 6s/10s at 768p, while pro mode returns fixed 5s at 1080p.
Hailuo 2.3 Fast
Hailuo
Hailuo 2.3 Fast is MiniMax's speed-optimized image-to-video model with standard and pro modes. Standard supports 6s/10s at 768p, while pro returns fixed 6s output at 1080p.
Veo 3.1 Extend
Veo 3.1 Extend continues existing Veo 3.1 tasks with new prompts and mode selection (fast or quality), enabling iterative video continuation workflows from internal task IDs.
Grok Image
xAI
Grok Image is xAI's image generation model for text-to-image and image-to-image workflows with simple aspect-ratio control and async task delivery.
Wan 2.2 Animate
Alibaba
Wan 2.2 Animate is Alibaba's character animation model that combines one source image and one motion video to generate stylized animated output in either animate or replace mode.
Editor's picks
A curated selection of image, video, audio, and multimodal text models that gives new users a direct starting point.
GPT-Image-2
OpenAI
GPT-Image-2 is OpenAI's next-generation image model for stronger photorealism, cleaner image editing, and sharper in-image text rendering.
Nano Banana Pro
Nano Banana Pro (Gemini 3 Pro Image) is Google's AGI-level image generation model with reasoning capabilities, native 4K output, Search Grounding for real-time data integration, near-perfect text rendering, and superior spatial awareness.
Veo 3.1
Google DeepMind’s upgraded AI video model for realistic motion generation, extended clip duration, multi-image reference control, and synchronized audio output in native 1080p.
GPT-Image-1
OpenAI
GPT-Image-1 is OpenAI's advanced multimodal model for high-quality image generation with natural language understanding.
Sora 2 Pro
OpenAI
Sora 2 Pro is OpenAI’s premium video generation model with higher quality output, supporting text-to-video and image-to-video at 720p and 1080p resolutions with flexible durations of 10 or 15 seconds.
Suno V5
Suno
Latest Suno text-to-music model that returns two polished songs per call with faster queues and richer vocals.
Gemini 3 Pro
Gemini 3 Pro is Google's flagship multimodal reasoning model built for long-context chat, tool calling, and structured outputs. It accepts text and media inputs and returns high-quality text responses for production assistants and analytics workflows.