Sail is a max-efficiency inference provider. We serve open-source models at massive scale and prioritize throughput over latency. Use Sail to build agents that tackle big tasks with minimal human involvement.

Get started

Quickstart

Make your first API request!

Explore the API

API reference

Browse the full Sail Responses API reference.