Artificial Intelligence Official Machine Learning Blog of Amazon Web Services
- Streamline employee training with an intelligent chatbot powered by Amazon Q Businessby Neha Bhupatiraju on August 19, 2025 at 2:02 pm
In this post, we explore how to design and implement custom plugins for Amazon Q Business to create an intelligent chatbot that streamlines employee training by retrieving answers from training materials. The solution implements secure API access using Amazon Cognito for user authentication and authorization, processes multiple document formats, and includes features like RAG-enhanced responses and email escalation capabilities through custom plugins.
- Create a travel planning agentic workflow with Amazon Novaby Isaac Privitera on August 18, 2025 at 8:30 pm
In this post, we explore how to build a travel planning solution using AI agents. The agent uses Amazon Nova, which offers an optimal balance of performance and cost compared to other commercial LLMs. By combining accurate but cost-efficient Amazon Nova models with LangGraph orchestration capabilities, we create a practical travel assistant that can handle complex planning tasks while keeping operational costs manageable for production deployments.
- Introducing Amazon Bedrock AgentCore Gateway: Transforming enterprise AI agent tool developmentby Dhawalkumar Patel on August 15, 2025 at 6:04 pm
In this post, we discuss Amazon Bedrock AgentCore Gateway, a fully managed service that revolutionizes how enterprises connect AI agents with tools and services by providing a centralized tool server with unified interface for agent-tool communication. The service offers key capabilities including Security Guard, Translation, Composition, Target extensibility, Infrastructure Manager, and Semantic Tool Selection, while implementing sophisticated dual-sided security architecture for both inbound and outbound connections.
- Build a scalable containerized web application on AWS using the MERN stack with Amazon Q Developer – Part 1by Bill Chan on August 15, 2025 at 4:45 pm
In a traditional SDLC, a lot of time is spent in the different phases researching approaches that can deliver on requirements: iterating over design changes, writing, testing and reviewing code, and configuring infrastructure. In this post, you learned about the experience and saw productivity gains you can realize by using Amazon Q Developer as a coding assistant to build a scalable MERN stack web application on AWS.
- Optimizing Salesforce’s model endpoints with Amazon SageMaker AI inference componentsby Rishu Aggarwal on August 15, 2025 at 4:41 pm
In this post, we share how the Salesforce AI Platform team optimized GPU utilization, improved resource efficiency and achieved cost savings using Amazon SageMaker AI, specifically inference components.
- Building a RAG chat-based assistant on Amazon EKS Auto Mode and NVIDIA NIMsby Riccardo Freschi on August 15, 2025 at 3:52 pm
In this post, we demonstrate the implementation of a practical RAG chat-based assistant using a comprehensive stack of modern technologies. The solution uses NVIDIA NIMs for both LLM inference and text embedding services, with the NIM Operator handling their deployment and management. The architecture incorporates Amazon OpenSearch Serverless to store and query high-dimensional vector embeddings for similarity search.
- Introducing Amazon Bedrock AgentCore Identity: Securing agentic AI at scaleby Rahul Sharma on August 15, 2025 at 3:28 pm
In this post, we explore Amazon Bedrock AgentCore Identity, a comprehensive identity and access management service purpose-built for AI agents that enables secure access to AWS resources and third-party tools. The service provides robust identity management features including agent identity directory, agent authorizer, resource credential provider, and resource token vault to help organizations deploy AI agents securely at scale.
- Scalable intelligent document processing using Amazon Bedrock Data Automationby Abdul Navaz on August 14, 2025 at 5:53 pm
In the blog post Scalable intelligent document processing using Amazon Bedrock, we demonstrated how to build a scalable IDP pipeline using Anthropic foundation models on Amazon Bedrock. Although that approach delivered robust performance, the introduction of Amazon Bedrock Data Automation brings a new level of efficiency and flexibility to IDP solutions. This post explores how Amazon Bedrock Data Automation enhances document processing capabilities and streamlines the automation journey.
- Whiteboard to cloud in minutes using Amazon Q, Amazon Bedrock Data Automation, and Model Context Protocolby Wrick Talukdar on August 14, 2025 at 5:31 pm
We’re excited to share the Amazon Bedrock Data Automation Model Context Protocol (MCP) server, for seamless integration between Amazon Q and your enterprise data. In this post, you will learn how to use the Amazon Bedrock Data Automation MCP server to securely integrate with AWS Services, use Bedrock Data Automation operations as callable MCP tools, and build a conversational development experience with Amazon Q.
- Bringing agentic Retrieval Augmented Generation to Amazon Q Businessby Sanjit Misra on August 14, 2025 at 5:14 pm
In this blog post, we explore how Amazon Q Business is transforming enterprise data interaction through Agentic Retrieval Augmented Generation (RAG).
- Empowering students with disabilities: University Startups’ generative AI solution for personalized student pathwaysby Sadia Ahmed on August 14, 2025 at 4:10 pm
University Startups, headquartered in Bethesda, MD, was founded in 2020 to empower high school students to expand their education beyond a traditional curriculum. University Startups is focused on special education and related services in school districts throughout the US. In this post, we explain how University Startups uses generative AI technology on AWS to enable students to design a specific plan for their future either in education or the work force.
- Citations with Amazon Nova understanding modelsby Sunita Koppar on August 14, 2025 at 3:56 pm
In this post, we demonstrate how to prompt Amazon Nova understanding models to cite sources in responses. Further, we will also walk through how we can evaluate the responses (and citations) for accuracy.
- Securely launch and scale your agents and tools on Amazon Bedrock AgentCore Runtimeby Shreyas Subramanian on August 13, 2025 at 9:59 pm
In this post, we explore how Amazon Bedrock AgentCore Runtime simplifies the deployment and management of AI agents.
- PwC and AWS Build Responsible AI with Automated Reasoning on Amazon Bedrockby Nafi Diallo on August 13, 2025 at 7:32 pm
This post presents how AWS and PwC are developing new reasoning checks that combine deep industry expertise with Automated Reasoning checks in Amazon Bedrock Guardrails to support innovation.
- How Amazon scaled Rufus by building multi-node inference using AWS Trainium chips and vLLMby James Park on August 13, 2025 at 5:01 pm
In this post, Amazon shares how they developed a multi-node inference solution for Rufus, their generative AI shopping assistant, using Amazon Trainium chips and vLLM to serve large language models at scale. The solution combines a leader/follower orchestration model, hybrid parallelism strategies, and a multi-node inference unit abstraction layer built on Amazon ECS to deploy models across multiple nodes while maintaining high performance and reliability.
- Build an intelligent financial analysis agent with LangGraph and Strands Agentsby Evan Grenda on August 13, 2025 at 4:32 pm
This post describes an approach of combining three powerful technologies to illustrate an architecture that you can adapt and build upon for your specific financial analysis needs: LangGraph for workflow orchestration, Strands Agents for structured reasoning, and Model Context Protocol (MCP) for tool integration.
- Amazon Bedrock AgentCore Memory: Building context-aware agentsby Akarsha Sehwag on August 13, 2025 at 4:30 pm
In this post, we explore Amazon Bedrock AgentCore Memory, a fully managed service that enables AI agents to maintain both immediate and long-term knowledge, transforming one-off conversations into continuous, evolving relationships between users and AI agents. The service eliminates complex memory infrastructure management while providing full control over what AI agents remember, offering powerful capabilities for maintaining both short-term working memory and long-term intelligent memory across sessions.
- Build a conversational natural language interface for Amazon Athena queries using Amazon Novaby Ravi Kumar on August 13, 2025 at 4:18 pm
In this post, we explore an innovative solution that uses Amazon Bedrock Agents, powered by Amazon Nova Lite, to create a conversational interface for Athena queries. We use AWS Cost and Usage Reports (AWS CUR) as an example, but this solution can be adapted for other databases you query using Athena. This approach democratizes data access while preserving the powerful analytical capabilities of Athena, so you can interact with your data using natural language.
- Train and deploy AI models at trillion-parameter scale with Amazon SageMaker HyperPod support for P6e-GB200 UltraServersby Nathan Arnold on August 12, 2025 at 7:57 pm
In this post, we review the technical specifications of P6e-GB200 UltraServers, discuss their performance benefits, and highlight key use cases. We then walk though how to purchase UltraServer capacity through flexible training plans and get started using UltraServers with SageMaker HyperPod.
- How Indegene’s AI-powered social intelligence for life sciences turns social media conversations into insightsby Rudra Kannemadugu, Shravan K S on August 12, 2025 at 6:41 pm
This post explores how Indegene’s Social Intelligence Solution uses advanced AI to help life sciences companies extract valuable insights from digital healthcare conversations. Built on AWS technology, the solution addresses the growing preference of HCPs for digital channels while overcoming the challenges of analyzing complex medical discussions on a scale.