Together AI
The AI Native Cloud for building and running generative AI models.
Models Available
100+
Base Inference Rate
$0.90 / 1M tokens
About Together AI
Together AI is a comprehensive cloud platform designed for generative AI. It offers a full-stack solution that includes the Together API for running inference on a wide array of open-source and custom models, with a focus on high performance and low cost. The platform also provides services for fine-tuning models with your own data, allowing for customization and improved accuracy for specific tasks. For larger-scale needs, Together AI offers Bare Metal and GPU Clusters, providing dedicated compute resources for training and hosting private models. Their ecosystem is built around a commitment to open-source AI, aiming to provide the essential infrastructure for developers and enterprises to build the next generation of AI applications.
Core Platform Services
Together Api
Serverless API for inference, fine-tuning, and embeddings for over 100 open-source models.
Together Gpu Clusters
Rent scalable, on-demand GPU clusters for training and inference workloads.
Together Bare Metal
Lease dedicated servers with high-performance GPUs like NVIDIA H100 for maximum control.
Supported Model Types
Language Models
Access to models like Llama, Mixtral, Gemma, and more for chat, instruction-following, and completion.
Code Models
Utilize specialized models for code generation and completion.
Image Models
Support for text-to-image models like Stable Diffusion.
Embeddings
Create vector embeddings for text, a foundational component for retrieval-augmented generation (RAG).