AWS Machine Learning Blog

Artificial Intelligence Official Machine Learning Blog of Amazon Web Services

  • Connect Amazon Bedrock agents to cross-account knowledge bases
    by Kunal Ghosh on November 7, 2025 at 11:14 pm

    Organizations need seamless access to their structured data repositories to power intelligent AI agents. However, when these resources span multiple AWS accounts integration challenges can arise. This post explores a practical solution for connecting Amazon Bedrock agents to knowledge bases in Amazon Redshift clusters residing in different AWS accounts.

  • Democratizing AI: How Thomson Reuters Open Arena supports no-code AI for every professional with Amazon Bedrock
    by Laura Skylaki, Vaibhav Goswami, Ramdev Wudali, Sahar El Khoury on November 7, 2025 at 9:51 pm

    In this blog post, we explore how TR addressed key business use cases with Open Arena, a highly scalable and flexible no-code AI solution powered by Amazon Bedrock and other AWS services such as Amazon OpenSearch Service, Amazon Simple Storage Service (Amazon S3), Amazon DynamoDB, and AWS Lambda. We’ll explain how TR used AWS services to build this solution, including how the architecture was designed, the use cases it solves, and the business profiles that use it.

  • Introducing structured output for Custom Model Import in Amazon Bedrock
    by Manoj Selvakumar on November 7, 2025 at 6:53 pm

    Today, we are excited to announce the addition of structured output to Custom Model Import. Structured output constrains a model’s generation process in real time so that every token it produces conforms to a schema you define. Rather than relying on prompt-engineering tricks or brittle post-processing scripts, you can now generate structured outputs directly at inference time.

  • Transform your MCP architecture: Unite MCP servers through AgentCore Gateway
    by Frank Dallezotte on November 6, 2025 at 5:43 pm

    Earlier this year, we introduced Amazon Bedrock AgentCore Gateway, a fully managed service that serves as a centralized MCP tool server, providing a unified interface where agents can discover, access, and invoke tools. Today, we’re extending support for existing MCP servers as a new target type in AgentCore Gateway. With this capability, you can group multiple task-specific MCP servers aligned to agent goals behind a single, manageable MCP gateway interface. This reduces the operational complexity of maintaining separate gateways, while providing the same centralized tool and authentication management that existed for REST APIs and AWS Lambda functions.

  • How Amazon Search increased ML training twofold using AWS Batch for Amazon SageMaker Training jobs
    by Mona Mona on November 5, 2025 at 5:15 pm

    In this post, we show you how Amazon Search optimized GPU instance utilization by leveraging AWS Batch for SageMaker Training jobs. This managed solution enabled us to orchestrate machine learning (ML) training workloads on GPU-accelerated instance families like P5, P4, and others. We will also provide a step-by-step walkthrough of the use case implementation.

  • Iterate faster with Amazon Bedrock AgentCore Runtime direct code deployment
    by Chaitra Mathur on November 4, 2025 at 6:30 pm

    Amazon Bedrock AgentCore is an agentic platform for building, deploying, and operating effective agents securely at scale. Amazon Bedrock AgentCore Runtime is a fully managed service of Bedrock AgentCore, which provides low latency serverless environments to deploy agents and tools. It provides session isolation, supports multiple agent frameworks including popular open-source frameworks, and handles multimodal

  • How Switchboard, MD automates real-time call transcription in clinical contact centers with Amazon Nova Sonic
    by Tanner Jones on November 3, 2025 at 5:25 pm

    In this post, we examine the specific challenges Switchboard, MD faced with scaling transcription accuracy and cost-effectiveness in clinical environments, their evaluation process for selecting the right transcription solution, and the technical architecture they implemented using Amazon Connect and Amazon Kinesis Video Streams. This post details the impressive results achieved and demonstrates how they were able to use this foundation to automate EMR matching and give healthcare staff more time to focus on patient care.

  • Build reliable AI systems with Automated Reasoning on Amazon Bedrock – Part 1
    by Adewale Akinfaderin on October 31, 2025 at 9:44 pm

    Enterprises in regulated industries often need mathematical certainty that every AI response complies with established policies and domain knowledge. Regulated industries can’t use traditional quality assurance methods that test only a statistical sample of AI outputs and make probabilistic assertions about compliance. When we launched Automated Reasoning checks in Amazon Bedrock Guardrails in preview at

  • Custom Intelligence: Building AI that matches your business DNA
    by Hannah Marlowe on October 31, 2025 at 4:07 pm

    In 2024, we launched the Custom Model Program within the AWS Generative AI Innovation Center to provide comprehensive support throughout every stage of model customization and optimization. Over the past two years, this program has delivered exceptional results by partnering with global enterprises and startups across diverse industries—including legal, financial services, healthcare and life sciences,

  • Clario streamlines clinical trial software configurations using Amazon Bedrock
    by Kim Nguyen, Shyam Banuprakash, on October 31, 2025 at 3:49 pm

    This post builds upon our previous post discussing how Clario developed an AI solution powered by Amazon Bedrock to accelerate clinical trials. Since then, Clario has further enhanced their AI capabilities, focusing on innovative solutions that streamline the generation of software configurations and artifacts for clinical trials while delivering high-quality clinical evidence.

  • Introducing Amazon Bedrock cross-Region inference for Claude Sonnet 4.5 and Haiku 4.5 in Japan and Australia
    by Derrick Choo on October 31, 2025 at 2:45 pm

    こんにちは, G’day. The recent launch of Anthropic’s Claude Sonnet 4.5 and Claude Haiku 4.5, now available on Amazon Bedrock, marks a significant leap forward in generative AI models. These state-of-the-art models excel at complex agentic tasks, coding, and enterprise workloads, offering enhanced capabilities to developers. Along with the new models, we are thrilled to announce that

  • Reduce CAPTCHAs for AI agents browsing the web with Web Bot Auth (Preview) in Amazon Bedrock AgentCore Browser
    by Veda Raman on October 30, 2025 at 9:55 pm

    AI agents need to browse the web on your behalf. When your agent visits a website to gather information, complete a form, or verify data, it encounters the same defenses designed to stop unwanted bots: CAPTCHAs, rate limits, and outright blocks. Today, we are excited to share that AWS has a solution. Amazon Bedrock AgentCore

  • Hosting NVIDIA speech NIM models on Amazon SageMaker AI: Parakeet ASR
    by Curt Lockhart, Francesco Ciannella on October 28, 2025 at 6:09 pm

    In this post, we explore how to deploy NVIDIA’s Parakeet ASR model on Amazon SageMaker AI using asynchronous inference endpoints to create a scalable, cost-effective pipeline for processing large volumes of audio data. The solution combines state-of-the-art speech recognition capabilities with AWS managed services like Lambda, S3, and Bedrock to automatically transcribe audio files and generate intelligent summaries, enabling organizations to unlock valuable insights from customer calls, meeting recordings, and other audio content at scale .

  • Responsible AI design in healthcare and life sciences
    by Tonny Ouma on October 24, 2025 at 5:25 pm

    In this post, we explore the critical design considerations for building responsible AI systems in healthcare and life sciences, focusing on establishing governance mechanisms, transparency artifacts, and security measures that ensure safe and effective generative AI applications. The discussion covers essential policies for mitigating risks like confabulation and bias while promoting trust, accountability, and patient safety throughout the AI development lifecycle.

  • Beyond pilots: A proven framework for scaling AI to production
    by Sri Elaprolu, Sabine Khan, Diego Socolinsky, Andrea Jimenez Fernandez, and Randi Larson on October 24, 2025 at 2:42 pm

    In this post, we explore the Five V’s Framework—a field-tested methodology that has helped 65% of AWS Generative AI Innovation Center customer projects successfully transition from concept to production, with some launching in just 45 days. The framework provides a structured approach through Value, Visualize, Validate, Verify, and Venture phases, shifting focus from “What can AI do?” to “What do we need AI to do?” while ensuring solutions deliver measurable business outcomes and sustainable operational excellence.

  • Generate Gremlin queries using Amazon Bedrock models
    by Rachel Hanspal on October 23, 2025 at 8:57 pm

    In this post, we explore an innovative approach that converts natural language to Gremlin queries using Amazon Bedrock models such as Amazon Nova Pro, helping business analysts and data scientists access graph databases without requiring deep technical expertise. The methodology involves three key steps: extracting graph knowledge, structuring the graph similar to text-to-SQL processing, and generating executable Gremlin queries through an iterative refinement process that achieved 74.17% overall accuracy in testing.

  • Incorporating responsible AI into generative AI project prioritization
    by Randy DeFauw on October 23, 2025 at 8:51 pm

    In this post, we explore how companies can systematically incorporate responsible AI practices into their generative AI project prioritization methodology to better evaluate business value against costs while addressing novel risks like hallucination and regulatory compliance. The post demonstrates through a practical example how conducting upfront responsible AI risk assessments can significantly change project rankings by revealing substantial mitigation work that affects overall project complexity and timeline.

  • Build scalable creative solutions for product teams with Amazon Bedrock
    by Kenneth Walsh on October 22, 2025 at 11:02 pm

    In this post, we explore how product teams can leverage Amazon Bedrock and AWS services to transform their creative workflows through generative AI, enabling rapid content iteration across multiple formats while maintaining brand consistency and compliance. The solution demonstrates how teams can deploy a scalable generative AI application that accelerates everything from product descriptions and marketing copy to visual concepts and video content, significantly reducing time to market while enhancing creative quality.

  • Build a proactive AI cost management system for Amazon Bedrock – Part 2
    by Jason Salcido on October 22, 2025 at 6:58 pm

    In this post, we explore advanced cost monitoring strategies for Amazon Bedrock deployments, introducing granular custom tagging approaches for precise cost allocation and comprehensive reporting mechanisms that build upon the proactive cost management foundation established in Part 1. The solution demonstrates how to implement invocation-level tagging, application inference profiles, and integration with AWS Cost Explorer to create a complete 360-degree view of generative AI usage and expenses.

  • Build a proactive AI cost management system for Amazon Bedrock – Part 1
    by Jason Salcido on October 22, 2025 at 6:58 pm

    In this post, we introduce a comprehensive solution for proactively managing Amazon Bedrock inference costs through a cost sentry mechanism designed to establish and enforce token usage limits, providing organizations with a robust framework for controlling generative AI expenses. The solution uses serverless workflows and native Amazon Bedrock integration to deliver a predictable, cost-effective approach that aligns with organizational financial constraints while preventing runaway costs through leading indicators and real-time budget enforcement.

Share Websitecyber
We are an ethical website cyber security team and we perform security assessments to protect our clients.