Amazon SageMaker AI Async Inference now supports inline request payloads

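Basic information

Source: blogs_podcasts
Original source: https://aws.amazon.com/blogs/machine-learning/amazon-sagemaker-ai-async-inference-now-supports-inline-request-payloads

Source summary/excerpt

The public display is truncated to at most 800 characters; visit the original source for the full context.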

Today, we’re announcing inline payload support for Amazon SageMaker AI Async Inference. Customers can now send inference payloads directly in the request body of the InvokeEndpointAsync API, removing the need to upload input data to Amazon Simple Storage Service (Amazon S3) before each invocation.
For payloads up to 128,000 bytes, this removes an entire network round-trip, simplifies client-side code, and reduces the operational surface area of asynchronous inference workloads.
In this post, we explain the motivation behind this feature, walk through the customer experience before and after, and show you how to start using inline payloads today.
You can use Amazon SageMaker AI Async Inference to queue inference requests and process them asynchronously.…
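
As a rough illustration of the size threshold described in the excerpt, the sketch below routes a request inline when the payload is at most 128,000 bytes and otherwise falls back to the upload-to-S3 path. The excerpt does not name the request field that carries the inline body, so nothing here calls the real API: `send_inline` and `send_via_s3` are hypothetical callables supplied by the caller, and treating the limit as inclusive is an assumption based on the wording "up to 128,000 bytes".

```python
from typing import Callable

# Limit quoted in the excerpt: inline payloads of up to 128,000 bytes.
# Treating the bound as inclusive is an assumption.
INLINE_PAYLOAD_LIMIT_BYTES = 128_000


def fits_inline(payload: bytes) -> bool:
    """Return True when the payload is within the inline size limit."""
    return len(payload) <= INLINE_PAYLOAD_LIMIT_BYTES


def submit(
    payload: bytes,
    send_inline: Callable[[bytes], str],
    send_via_s3: Callable[[bytes], str],
) -> str:
    """Send small payloads inline and larger ones through an S3 upload.

    Both senders are hypothetical placeholders: each would wrap the
    caller's own InvokeEndpointAsync request and return an identifier
    for the queued invocation.
    """
    if fits_inline(payload):
        return send_inline(payload)
    return send_via_s3(payload)
```

Keeping the size check separate from the transport makes the routing rule easy to test without an endpoint, and lets the real request code be swapped in once the exact request shape is confirmed against the original post.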

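Source notes

Only an excerpt of the public page is currently stored; it does not represent the full original text. Defer to the original source.

This page presents only source evidence that has been hash-bound; it does not include extended inferences based on outdated body text or missing original text.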

Use cases

Cloud native/containers

From first observation to propagation chain