How does Turbopuffer's query billing work?

Turbopuffer bills queries based on the total logical size of the namespace being searched, not just the data your query touches. This means a top-10 query against a 100 GB namespace is billed at the full 100 GB rate. Volume discounts kick in at the 32 GB and 128 GB thresholds, with marginal discounts of 80% and 96% respectively. Namespace pinning, introduced in April 2026, offers an alternative GB-hours billing model for frequently queried namespaces.

What is the minimum cost to use Turbopuffer?

The minimum monthly spend is $64 on the Launch plan. There is no free tier. This platform fee covers access to all database features including vector search, full-text search, hybrid search, and multi-tenancy. Usage charges for storage, queries, and writes are billed on top of this minimum.

How does Turbopuffer storage pricing compare to Pinecone?

Turbopuffer stores vectors on object storage (S3) at approximately $0.02/GB. Pinecone Serverless uses SSD-backed storage at $0.33/GB -- roughly 16x more expensive. This gap is most significant for large datasets with cold access patterns. Cursor reported a 95% cost reduction after migrating from Pinecone to Turbopuffer for their code search embeddings.

Does Turbopuffer offer HIPAA compliance?

HIPAA-ready BAA is available on the Scale plan ($256/month) and the Enterprise plan. The Launch plan at $64/month does not include HIPAA readiness. If your application handles protected health information, you will need to start at the Scale tier at minimum.

What are the performance trade-offs of Turbopuffer's pricing model?

Turbopuffer achieves low storage costs by keeping data on object storage (S3) and caching hot data on NVMe SSDs and RAM. Warm queries hit cache at sub-10ms p50 latency. Cold queries against uncached namespaces can take 300-500ms or up to 4 seconds at p99. You effectively trade latency consistency for cost savings -- ideal for bursty workloads but problematic for always-on, latency-sensitive applications.

Turbopuffer Pricing (2026): Serverless Tier Guide

Pricing information was last verified on April 25, 2026. Pricing may have changed. Visit Turbopuffer for current pricing.

Pricing Overview

Turbopuffer is a serverless vector and full-text search database built on object storage, and its pricing reflects that architecture. Every plan combines a fixed monthly platform fee with usage-based charges for storage, queries, and writes. The Launch plan starts at $64/month, Scale at $256/month, and Enterprise requires a custom quote. What makes Turbopuffer's model distinctive is that storage costs run around $0.02/GB thanks to S3-backed object storage -- roughly 16x cheaper than SSD-first competitors like Pinecone at $0.33/GB. In February 2026, Turbopuffer slashed query pricing by up to 94%, dropping the base queried-data rate from $5/PB to $1/PB with aggressive volume discounts. For teams with large, infrequently accessed vector datasets, this architecture delivers genuine cost savings that compound at scale. The catch: you pay a minimum monthly commitment on every tier, and query costs depend on namespace size rather than the data your query actually touches.

Plan Comparison

Turbopuffer offers three platform tiers. All tiers include the full database feature set (vector search, full-text search, hybrid search, metadata filtering, and multi-tenancy). The tiers differ primarily in compliance, support, and deployment options.

Feature	Launch ($64/mo)	Scale ($256/mo)	Enterprise (Custom)
All Database Features	Yes	Yes	Yes
Multi-Tenancy	Yes	Yes	Yes
Single-Tenancy / BYOC	No	No	Yes
SOC2 Report & GDPR-ready DPA	Yes	Yes	Yes
HIPAA-ready BAA	No	Yes	Yes
Single Sign-On (SSO)	No	Yes	Yes
CMEK (Per Namespace)	No	No	Yes
Private Networking	No	No	Yes
Support	Community Slack & Email	Private Slack, 8-5 Hours	Private Slack, 24/7, SLA
Uptime SLA	No	No	99.95%

On top of the platform fee, usage charges apply across all tiers: storage is billed per logical GB, queries are billed per GB of namespace data queried (with volume discounts at the 32 GB and 128 GB thresholds), and writes are billed per GB written with batch discounts up to 50%. Namespace pinning, introduced in April 2026, offers an alternative billing model where pinned namespaces are charged in GB-hours instead of per-query pricing.

Hidden Costs and Considerations

The biggest pricing trap with Turbopuffer is how query billing works. You are billed based on the total size of the namespace searched, not the data your query actually reads. For multi-tenant deployments with uneven tenant sizes, one large namespace can inflate query costs dramatically. One real-world benchmark showed estimates of $220/month ballooning to $1,000+/month due to this mechanic. Cold-start queries against namespaces not in cache can take up to 4 seconds at p99, which carries an implicit cost in user experience. There is no free tier -- the $64/month minimum applies from day one.

Cost Estimates by Team Size

Because Turbopuffer bills on usage (storage volume, query volume, and write throughput) rather than per-seat, team size is less relevant than workload size. Still, we can provide rough estimates based on typical vector search workloads.

Workload Profile	Storage	Query Load	Estimated Monthly Cost
Solo / Prototype (1M vectors, 768-dim, ~3 GB)	~3 GB	Low (<1 QPS)	$64 - $90
Small Team (10M vectors, 768-dim, ~30 GB)	~30 GB	Moderate (~5 QPS)	$90 - $250
Mid-Size (100M vectors, 1536-dim, ~600 GB)	~600 GB	Moderate-High (~40 QPS)	$500 - $2,000

These estimates assume warm namespaces and fairly uniform tenant sizes. Multi-tenant deployments with skewed data distribution will push costs higher, particularly on the query side. At the prototype level, the $64/month platform minimum is the floor -- usage charges add relatively little. At mid-size scale, query costs dominate the bill.

How Turbopuffer Pricing Compares

Turbopuffer's core advantage is storage cost. At $0.02/GB on object storage, it undercuts every SSD-first competitor by an order of magnitude. Pinecone charges $0.33/GB for serverless storage. Qdrant offers a free tier for small workloads but scales to dedicated cluster pricing. ChromaDB provides a free open-source option with cloud plans starting at $5/month.

The tradeoff is clear: Turbopuffer wins on storage-heavy, read-light workloads where most data sits cold. Competitors win when you need a free entry point, guaranteed low latency on every query, or predictable per-unit pricing without namespace-size billing surprises.

Tool	Pricing Model	Starting Price	Best For
Turbopuffer	Platform fee + usage-based	$64/month	Large-scale vector storage with cold/warm access patterns
Pinecone	Usage-based (serverless)	$0 (free tier)	Teams needing consistent sub-10ms latency and a free starting point
Qdrant	Freemium + managed clusters	$0 (free tier)	Open-source-first teams wanting self-hosted or managed flexibility
ChromaDB	Usage-based	$0 (open-source)	Prototyping and lightweight AI applications

Cursor, Notion, and Linear all chose Turbopuffer specifically because their workloads feature massive vector datasets with bursty access patterns -- exactly where the object-storage architecture delivers 10x savings. If your workload is smaller or demands always-hot performance, we recommend starting with Pinecone or Qdrant's free tiers to validate your access patterns before committing to Turbopuffer's $64/month minimum.

Pricing Overview

Plan Comparison

Feature	Launch ($64/mo)	Scale ($256/mo)	Enterprise (Custom)
All Database Features	Yes	Yes	Yes
Multi-Tenancy	Yes	Yes	Yes
Single-Tenancy / BYOC	No	No	Yes
SOC2 Report & GDPR-ready DPA	Yes	Yes	Yes
HIPAA-ready BAA	No	Yes	Yes
Single Sign-On (SSO)	No	Yes	Yes
CMEK (Per Namespace)	No	No	Yes
Private Networking	No	No	Yes
Support	Community Slack & Email	Private Slack, 8-5 Hours	Private Slack, 24/7, SLA
Uptime SLA	No	No	99.95%

Hidden Costs and Considerations

Cost Estimates by Team Size

Workload Profile	Storage	Query Load	Estimated Monthly Cost
Solo / Prototype (1M vectors, 768-dim, ~3 GB)	~3 GB	Low (<1 QPS)	$64 - $90
Small Team (10M vectors, 768-dim, ~30 GB)	~30 GB	Moderate (~5 QPS)	$90 - $250
Mid-Size (100M vectors, 1536-dim, ~600 GB)	~600 GB	Moderate-High (~40 QPS)	$500 - $2,000

How Turbopuffer Pricing Compares

Tool	Pricing Model	Starting Price	Best For
Turbopuffer	Platform fee + usage-based	$64/month	Large-scale vector storage with cold/warm access patterns
Pinecone	Usage-based (serverless)	$0 (free tier)	Teams needing consistent sub-10ms latency and a free starting point
Qdrant	Freemium + managed clusters	$0 (free tier)	Open-source-first teams wanting self-hosted or managed flexibility
ChromaDB	Usage-based	$0 (open-source)	Prototyping and lightweight AI applications

Turbopuffer Pricing in 2026

Launch

Scale

Enterprise

Pricing Overview

Plan Comparison

Hidden Costs and Considerations

Cost Estimates by Team Size

How Turbopuffer Pricing Compares

Turbopuffer Pricing FAQ

Explore More

Comparisons

Related Pricing Guides

Turbopuffer Pricing in 2026

Launch

Scale

Enterprise

Pricing Overview

Plan Comparison

Hidden Costs and Considerations

Cost Estimates by Team Size

How Turbopuffer Pricing Compares

Turbopuffer Pricing FAQ

Explore More

Comparisons

Related Pricing Guides