A company has trained an ML model in Amazon SageMaker.

Amazon Web Services MLA-C01 Full Course Access

Amazon Web Services MLA-C01 View All Questions

Amazon Web Services MLA-C01 Question Answer

A company has trained an ML model in Amazon SageMaker. The company needs to host the model to provide inferences in a production environment.

The model must be highly available and must respond with minimum latency. The size of each request will be between 1 KB and 3 MB. The model will receive unpredictable bursts of requests during the day. The inferences must adapt proportionally to the changes in demand.

How should the company deploy the model into production to meet these requirements?

Create a SageMaker real-time inference endpoint. Configure auto scaling. Configure the endpoint to present the existing model.

Deploy the model on an Amazon Elastic Container Service (Amazon ECS) cluster. Use ECS scheduled scaling that is based on the CPU of the ECS cluster.

Install SageMaker Operator on an Amazon Elastic Kubernetes Service (Amazon EKS) cluster. Deploy the model in Amazon EKS. Set horizontal pod auto scaling to scale replicas based on the memory metric.

Use Spot Instances with a Spot Fleet behind an Application Load Balancer (ALB) for inferences. Use the ALBRequestCountPerTarget metric as the metric for auto scaling.

MLA-C01 PDF/Engine

Printable Format
Value of Money
100% Pass Assurance
Verified Answers
Researched by Industry Experts
Based on Real Exams Scenarios
100% Real Questions

Get 65% Discount on All Products, Use Coupon: "ac4s65"

A healthcare company wants to detect irregularities in patient vital signs that could indicate early...

An ML engineer has a custom container that performs k-fold cross-validation and logs an average...

Summer Sale Special Limited Time 65% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: ac4s65

A company has trained an ML model in Amazon SageMaker.

The Answer Is:

Explanation:

Quick Links