AI 模型市场

探索并集成强大的图像、视频、音频和文本生成 AI 模型。

22 模型

Nano Banana

热门

Google

Gemini 2.5 Flash Image Preview (aka Nano Banana) is an advanced AI model excelling in natural language-driven image generation and editing. It produces hyper-realistic, physics-aware visuals with seamless style transformations.

图像
$0.04-25%$0.03

Nano Banana Pro

热门

Google

Nano Banana Pro (Gemini 3 Pro Image) is Google's AGI-level image generation model with reasoning capabilities, native 4K output, Search Grounding for real-time data integration, near-perfect text rendering, and superior spatial awareness.

图像
$0.15-47%$0.08

Midjourney

热门

midjourney

Midjourney is an advanced AI image generation model known for its artistic and high-quality outputs. It excels at creating stylized, creative images with exceptional detail and aesthetic appeal. Supports both text-to-image and image-to-image generation.

图像
$0.3-50%$0.15

Flux Kontext

热门

Black Forest Labs

Professional-grade image generation with enhanced prompt understanding and superior quality output.

图像
$0.04-25%$0.03

GPT-Image-1

热门

OpenAI

GPT-Image-1 is OpenAI's advanced multimodal model for high-quality image generation with natural language understanding.

图像
$0.04-12%$0.035

Wan 2.5

热门

Alibaba

Wan 2.5 is Alibaba's advanced video generation model featuring one-pass audio/video synchronization, multilingual support, and cost-effective production. Creates fully synchronized videos with voiceover and lip-sync from a single prompt.

视频
$0.5-20%$0.4

Seedance 2.0

新建
热门
即将推出

bytedance

Seedance 2.0 API is ByteDance's upcoming next-generation video model expected to advance audio-visual generation, motion consistency, and camera control for text-to-video and image-to-video workflows.

视频
$0.02

Flux 2

新建
热门

Black Forest Labs

BFL’s latest Pro & Flex pipelines for text-to-image and image-to-image with unified 1K/2K pricing and ~30s generation.

图像
$0.045-22%$0.035

Nano Banana 2

新建

Google

Nano Banana 2 is Google’s high-resolution image generation model with 1K/2K/4K output control, 20,000-character prompts, optional Google Search context, and support for up to 14 reference images.

图像
$0.08-38%$0.05

Gemini 3 Pro

新建

Google

Gemini 3 Pro is Google's flagship multimodal reasoning model built for long-context chat, tool calling, and structured outputs. It accepts text and media inputs and returns high-quality text responses for production assistants and analytics workflows.

文本
输入$1.60·输出$8.40/1Mt

Veo 3.1

Google

Google DeepMind’s upgraded AI video model for realistic motion generation, extended clip duration, multi-image reference control, and synchronized audio output in native 1080p.

视频
$1.20-83%$0.2

Sora 2

OpenAI

Sora 2 is OpenAI’s latest AI video generation model, supporting both text-to-video and image-to-video. It delivers realistic motion, physics consistency, with improved control over style, scene, and aspect ratio—ideal for creative apps and social media content.

视频
$1.00-80%$0.2

Sora 2 Pro

新建

OpenAI

Sora 2 Pro is OpenAI’s premium video generation model with higher quality output, supporting text-to-video and image-to-video at 720p and 1080p resolutions with flexible durations of 10 or 15 seconds.

视频
$3.00-60%$1.20

Seedream 5.0

新建

seedream

Seedream 5.0 is ByteDance's next-generation AI image model with real-time web search, controllable editing, and logical reasoning. It supports text-to-image and image-to-image with 2K/3K resolution, multiple aspect ratios, and up to 14 reference images.

图像
$0.035-9%$0.032

Seedream 4.5

seedream

Seedream 4.5 is a powerful text-to-image and image-to-image AI model delivering high-quality image generation with support for 2K and 4K resolutions.

图像
$0.04

Seedance 1.5 Pro

新建

bytedance

Seedance 1.5 Pro is ByteDance's per-second video model for fast text-to-video and image-to-video generation with 480p/720p output, optional sound, aspect ratio control, and fixed-lens camera stability.

视频
$0.012-17%$0.01

Vidu Q3

新建

vidu

Vidu Q3 is a per-second video generation model that combines standard and Turbo text-to-video plus image-to-video workflows in one API. It supports single-image animation, first-and-last-frame transitions, optional sound and BGM, and output up to 1080p.

视频
$0.04-10%$0.036

Kling 2.1

新建

kling

Kling 2.1 is Kuaishou's multi-tier video model with Standard, Pro, and Master modes for image-to-video and text-to-video creation. It supports 5-10 second clips, optional tail images for Pro, and aspect ratio control for Master text-to-video.

视频
$0.25-20%$0.2

Kling 2.5 Turbo Pro

新建

kling

Kling 2.5 Turbo Pro is Kuaishou's high-speed video model for text-to-video and image-to-video creation. It supports 5-10 second clips, optional tail-frame images, aspect ratio control for text-to-video, plus negative prompts and CFG scale guidance.

视频
$0.35-14%$0.3

Kling 2.6

新建

kling

Kling 2.6 is Kuaishou's native audio-visual video model that generates video, speech, sound effects, and ambience in one pass. It supports text-to-audio-visual and image-to-audio-visual creation with Chinese and English voice generation and up to 10-second clips.

视频
$0.35-14%$0.3

LTX-2 19B

新建

Lightricks

LTX-2 19B is Lightricks' open-source 19B diffusion transformer for cinematic video generation. It supports text-to-video and image-to-video workflows, LoRA conditioning, and high-fidelity outputs up to 1080p in the API.

视频
$0.012

Suno V5

新建

suno

Latest Suno text-to-music model that returns two polished songs per call with faster queues and richer vocals.

音频
专属$0.12

更多模型即将推出

我们正在持续扩展 API 服务,每周都会上线新模型。

support@apixo.ai申请新模型