Amazon Bedrock is a fully managed service that provides access to high-performing foundation models through a single API, along with security, privacy, and responsible AI capabilities.

Pricing Structure

Charges apply for both model inference and customization.

Inference Pricing Plans

  1. On-Demand and Batch: A pay-as-you-go model with no time-based commitments.
  2. Provisioned Throughput: Reserves enough throughput to meet an application's performance requirements in exchange for a time-based commitment.

On-Demand Mode

  • Charges are based on actual usage.
  • No time-based commitments.
  • Supports cross-region inference for some models without additional charges.
  • Pricing is determined by the source region from which the request is made, even when cross-region inference routes it elsewhere.
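
As a rough illustration of usage-based billing, the sketch below estimates an on-demand charge. It assumes a text model billed per 1,000 input and output tokens, which is a common arrangement but is not stated above. The names TokenRates, HYPOTHETICAL_RATES, and estimate_on_demand_cost are invented for this example, and every rate is a placeholder, not a real Bedrock price.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TokenRates:
    """Placeholder on-demand rates in USD per 1,000 tokens (not real prices)."""
    input_per_1k: float
    output_per_1k: float


# Hypothetical rate table keyed by the SOURCE region of the request.
# Under cross-region inference the request may be served elsewhere,
# but the rate is still looked up by where the request originated.
HYPOTHETICAL_RATES = {
    "us-east-1": TokenRates(input_per_1k=0.003, output_per_1k=0.015),
    "eu-west-1": TokenRates(input_per_1k=0.0033, output_per_1k=0.0165),
}


def estimate_on_demand_cost(source_region: str,
                            input_tokens: int,
                            output_tokens: int) -> float:
    """Return the estimated charge in USD for the given token usage."""
    rates = HYPOTHETICAL_RATES[source_region]
    input_cost = (input_tokens / 1000) * rates.input_per_1k
    output_cost = (output_tokens / 1000) * rates.output_per_1k
    return input_cost + output_cost


if __name__ == "__main__":
    # One million input tokens and 200,000 output tokens sent from us-east-1.
    cost = estimate_on_demand_cost("us-east-1", 1_000_000, 200_000)
    print(f"Estimated on-demand charge: ${cost:.2f}")
```

Because there is no commitment, the estimate scales linearly with usage: zero tokens means zero charge.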

Provisioned Throughput Mode

  • Requires purchasing model units for a specific base or custom model.
  • Best suited for large, consistent inference workloads requiring guaranteed throughput.
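
To see why this mode suits large, steady workloads, the following sketch compares a fixed provisioned cost with usage-based billing. It assumes each model unit is billed at a flat hourly rate for the whole commitment period and that on-demand usage can be summarized by a single blended rate per 1,000 tokens. The function names and all figures are hypothetical and chosen only to make the arithmetic easy to follow.

```python
HOURS_PER_MONTH = 730.0  # average hours in a month, used as an approximation


def provisioned_monthly_cost(model_units: int,
                             hourly_rate_per_unit: float,
                             hours: float = HOURS_PER_MONTH) -> float:
    """Fixed monthly cost in USD: paid whether or not the capacity is used."""
    return model_units * hourly_rate_per_unit * hours


def break_even_tokens(provisioned_cost: float,
                      blended_rate_per_1k: float) -> float:
    """Monthly token volume at which on-demand spend equals provisioned_cost."""
    return provisioned_cost / blended_rate_per_1k * 1000


if __name__ == "__main__":
    # Placeholder figures: 2 model units at $20 per unit-hour,
    # versus on-demand at a blended $0.01 per 1,000 tokens.
    fixed = provisioned_monthly_cost(model_units=2, hourly_rate_per_unit=20.0)
    threshold = break_even_tokens(fixed, blended_rate_per_1k=0.01)
    print(f"Provisioned cost per month: ${fixed:,.2f}")
    print(f"Break-even volume: {threshold:,.0f} tokens per month")
```

Below the break-even volume, pay-as-you-go is cheaper under these assumptions; above it, the fixed commitment costs less per token and also guarantees throughput.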

For more details, see the Amazon Bedrock website.