Karthik Sakthivel
Posted on November 6, 2024
What's new at AWS š¢
š° Amazon Bedrock now supports customers to allocate and track on-demand foundation model usage.
š° With this, customers can categorize their GenAI inference costs by department, team, or application using AWS cost allocation tags.
ā ļø What is Amazon Bedrock:
āļø It is a fully managed service that offers a choice of high-performing foundation models from leading AI companies via a single API.
āļø It also provides a broad set of capabilities such as security, privacy, and responsible AI capabilities built in.
š° These capabilities help customer to build tailored applications for multiple use cases across different industries.
š° Importantly it is helping organizations by ensuring customer trust and data governance.
š° You can leverage this feature by creating an application inference profile and tagging it.
ā ļø What is Inference profiles:
āļø These profiles are a resource in Amazon Bedrock that define a model and one or more Regions
āļø Inference profile can route model invocation requests.
ā ļø Types of inference profiles:
1ļøā£ Cross region inference profiles
2ļøā£ Application inference profiles
ā ļø When to use inference profiles:
āļø Track usage metrics
āļø Use tags to monitor costs
āļø Cross-region inference
š Explore more about cross-region inference profiles:
https://aws.amazon.com/blogs/machine-learning/getting-started-with-cross-region-inference-in-amazon-bedrock/
Posted on November 6, 2024
Join Our Newsletter. No Spam, Only the good stuff.
Sign up to receive the latest update from our blog.
Related
November 6, 2024