News

—

News Jul 2026

Proving application resilience on Azure with Chaos Studio

Production outages are inevitable. Network latency spikes, database connections fail, entire availability zones go down—and when they do, your application either handles it gracefully or it doesn’t. Most teams don’t know which until it’s too late. Azure Chaos Studio addresses this by letting you deliberately break things in a controlled way. It’s essentially a testing framework that simulates infrastructure failures before they happen for real, giving you confidence that your application can actually recover from the disasters you’ve designed it to handle.

News Jul 2026

Claude in Microsoft Foundry is now generally available

Microsoft has made Claude, Anthropic’s AI model, generally available through Azure’s AI Foundry platform. This means teams can now access Claude at production scale without building custom infrastructure, with the models running on NVIDIA’s latest Blackwell Ultra GPUs. If you’ve been experimenting with Claude through Anthropic’s API or other providers, this announcement matters because it gives you another deployment path—one that’s tightly integrated with Azure’s broader AI stack and governance tools.

News Jul 2026

How GitHub used secret scanning to reach inbox zero

GitHub recently shared how they tackled a sprawling security problem: 20,000+ secret scanning alerts across 15,000 repositories. The challenge wasn’t just the volume—it was figuring out which alerts actually mattered versus false positives that waste engineering time. Their solution offers a practical playbook for any organization drowning in security noise, and the approach reveals something important about how modern development security actually works at scale.

Secret scanning automatically detects credentials, API keys, and tokens that accidentally get committed to repositories. GitHub’s scanner runs on every push, looking for patterns that match known secret formats—AWS access keys, GitHub tokens, Slack webhooks, and hundreds of other credential types. When the system flags something, it generates an alert. This is powerful: you catch leaks before they leave your codebase. But here’s the problem GitHub faced: when you have thousands of repositories and years of accumulated commits, you also get thousands of alerts, many from old repositories, inactive projects, or credentials that were already rotated months ago. Engineers start ignoring alerts because reviewing them feels like busy work. That’s when your security tool stops being useful.

News Jul 2026

Upgrade Amazon EKS clusters with confidence using Kubernetes version rollbacks

Kubernetes upgrades have traditionally been a nail-biting experience. You schedule maintenance, cross your fingers, and hope nothing breaks in production. AWS just made that process significantly less stressful by introducing Kubernetes version rollbacks for Amazon EKS—a feature that lets you undo a cluster upgrade within seven days if things go wrong. This transforms upgrades from a one-way door into a reversible operation, which is a meaningful shift in how teams approach cluster maintenance.

News Jul 2026

Ship infrastructure faster with CloudFormation and CDK pre-deployment validation on every stack operation

Infrastructure as Code (IaC) has become standard practice, but the feedback loop between writing a template and discovering errors can still be painfully slow. AWS CloudFormation lets you define cloud resources as code using JSON or YAML, while CDK takes it further by letting you write infrastructure in Python, TypeScript, or other languages. The problem? A single syntax error, missing property, or invalid parameter type can derail your deployment—whether you’re deploying directly, using change sets for previews, or running automated deployments through CI/CD pipelines or AI agents. AWS has now extended CloudFormation’s validation capabilities to catch these issues earlier in the development process, before they consume deployment time and developer attention.

News Jul 2026

Accelerate your infrastructure deployments by up to 4x with AWS CloudFormation Express mode

AWS CloudFormation just got faster. The new Express mode can cut deployment times down to seconds instead of minutes, which might not sound like much until you’re iterating on infrastructure changes dozens of times a day. Whether you’re building AI applications that need rapid experimentation or managing DevOps workflows that demand quick feedback loops, this feature addresses a real pain point: waiting for CloudFormation stacks to create or update before you can validate your changes.

News Jun 2026

Previewing GPT-5.6 Sol: a next-generation model

OpenAI has announced GPT-5.6 Sol, a new large language model that represents a meaningful step forward in AI capabilities, particularly for technical domains. If you’re working with AI in production environments, this preview gives us a window into what’s coming and what you should be thinking about now.

So what makes Sol different? The model shows substantial improvements in three areas that directly impact cloud and automation work: coding, scientific reasoning, and cybersecurity analysis. When OpenAI says “stronger capabilities in coding,” they’re not just talking about generating boilerplate—Sol appears to handle complex multi-step problems, debugging logic, and architectural decisions with better accuracy than its predecessors. For those of us writing infrastructure-as-code, Lambda functions, or automation scripts, this means better code generation assistance and fewer hallucinations when asking the model to help with tricky logic. The science and cybersecurity improvements matter too, especially if you’re working with security scanning, threat analysis, or using AI to help parse technical documentation and research papers.

News Jun 2026

How to Generate an SBOM for Container Workflows

If you’re deploying containers at any scale, you’ve probably encountered security audits asking for a complete list of what’s inside your images. That’s where SBOMs come in. A Software Bill of Materials (SBOM) is essentially an inventory of all dependencies, libraries, and packages bundled into your container image. Think of it like a nutrition label for your software—it tells you exactly what ingredients are present. As container security becomes a core requirement for compliance frameworks like SLSA and regulations such as the Executive Order on Cybersecurity, understanding how to generate and integrate SBOMs into your CI/CD pipeline is becoming a table-stakes skill for DevOps and platform engineers.

News Jun 2026

Evaluating performance and efficiency of the GitHub Copilot agentic harness across models and tasks

GitHub recently published findings on their Copilot agentic harness—a framework designed to run AI agents across different models while measuring performance and efficiency. If you’re building AI-assisted workflows or considering which models to use for your development tasks, this is worth understanding. The research essentially answers a practical question many teams face: which combination of model and task setup gives you the best results without burning through your token budget?

News Jun 2026

Spotlight on WG Device Management

Kubernetes has become the go-to orchestration platform for cloud workloads, but it was originally designed for stateless applications that only needed CPU and memory. Today, that’s changing rapidly. As AI models, edge computing, and telecommunications services move to Kubernetes, operators face a new challenge: how do you allocate and manage specialized hardware like GPUs, TPUs, and network interface cards (NICs)? This is where the Kubernetes Device Management working group steps in, developing standards for hardware resource allocation that go far beyond traditional CPU and memory constraints.

News Jun 2026

From insight to action: The next phase of agentic cloud operations

Cloud operations have traditionally worked in cycles: you monitor your environment, get alerts, analyze what’s wrong, and then manually decide what to do next. But what if that entire decision-making loop could happen automatically? Microsoft’s vision of agentic cloud operations moves beyond dashboards and alerts to create systems that don’t just tell you what’s broken—they fix it themselves. This represents a meaningful shift in how we approach cloud management, turning cloud platforms from reactive tools into proactive decision-makers.

News Jun 2026

I automated my job (and it made me a better leader)

There’s a persistent myth in tech leadership that automation is something you delegate to junior engineers. But what if the best person to automate your own workflow is you? A senior leader at GitHub recently shared how they implemented 40 automations across their daily tasks—and the outcome wasn’t burnout prevention, though that was a nice side effect. Instead, it fundamentally changed how they lead their team.

The technical foundation here is straightforward but powerful. These automations likely combine GitHub Actions workflows, API integrations, and cloud-based task runners to eliminate repetitive manual work: automatically summarizing pull requests, routing code reviews to the right people, aggregating metrics from multiple dashboards, and flagging blockers before they become problems. The magic isn’t in any single tool—it’s in the workflow orchestration. When a pull request lands, a workflow can trigger status checks, post summaries to Slack, update tracking systems, and notify stakeholders without human intervention. For Python-savvy engineers, this might mean writing Lambda functions on AWS that trigger on CloudWatch events, or using boto3 to manage resources at scale. The infrastructure is already there; the missing piece is usually just connecting the dots.

News Jun 2026

Run isolated sandboxes with full lifecycle control: AWS Lambda introduces MicroVMs

AWS just announced Lambda MicroVMs, a new compute primitive that shifts how you think about serverless isolation and state management. Instead of sharing kernel resources across functions, each MicroVM gives you a dedicated, lightweight virtual machine with full isolation. This sits somewhere between traditional Lambda containers and full EC2 instances—you get the isolation guarantees of a VM without managing infrastructure or waiting for boot times.

Here’s what makes this technically different: traditional Lambda functions run in a shared sandbox environment within a container, which means the kernel and some system resources are technically shared across invocations (though AWS handles security isolation at the application level). MicroVMs flip this model. Each function gets its own isolated kernel and resource namespace, similar to how you’d think about separate machines. They launch in milliseconds and can maintain state for up to 8 hours, meaning you can pause, resume, and reconnect without losing your session. There’s no need to rebuild state from scratch on every invocation. Practically, this matters because you have explicit control over the VM lifecycle—you decide when to pause, resume, or terminate, rather than relying on Lambda’s default timeout and cleanup model.

News Jun 2026

Accelerate Incident Resolution with PagerDuty and AWS DevOps Agent

Every ops engineer knows the scenario: your phone buzzes at 2 a.m. with a critical alert. Your heart sinks. The notification tells you that something is broken, but not why. You’re now scrambling through CloudWatch logs, SSH-ing into instances, and running diagnostics while your application hemorrhages traffic and your customers watch their requests timeout. This context gap—between detection and understanding—is where SRE teams waste the most time during incidents. AWS and PagerDuty have partnered to close that gap with the AWS DevOps Agent, a tool designed to automatically gather diagnostic data and surface it directly in PagerDuty incidents, cutting mean-time-to-resolution (MTTR) significantly.

News Jun 2026

Feature Flag Orchestration with AWS DevOps Agent and LaunchDarkly

When an outage hits at 2 AM, your team’s response speed determines whether customers experience a five-minute blip or a cascading disaster. Yet many organizations still manage feature flags and incident response separately—meaning engineers waste precious minutes hunting through dashboards, deciding which flags matter, and coordinating manual changes across teams. AWS DevOps Agent paired with LaunchDarkly bridges this gap by automating the connection between your incident response workflows and feature flag management, letting engineers respond to emergencies with a single action instead of a dozen.

News Jun 2026

Supercharge your cloud operations with the Kiro power for AWS DevOps Agent

The 2 AM alert is a rite of passage in cloud engineering. Your phone buzzes. Your service is down or degrading. You stumble out of bed and start the familiar ritual: SSH into the bastion host, grep through CloudWatch logs, check the deployment history, trace through your code to understand what changed. Meanwhile, crucial context is scattered across a dozen browser tabs—your monitoring dashboard, X-Ray traces, infrastructure diagrams, configuration files. By the time you’ve assembled the full picture, you’ve already lost 20 minutes you didn’t have to spare.

News Jun 2026

Announcing Amazon EC2 G7 instances accelerated by NVIDIA RTX PRO 4500 Blackwell Server Edition GPUs

AWS has released Amazon EC2 G7 instances into general availability, marking a significant upgrade for workloads that demand serious GPU power. These instances pack NVIDIA’s RTX PRO 4500 Blackwell Server Edition GPUs—a processor designed specifically for data centers rather than gaming or consumer applications. If you’ve been running inference models, rendering graphics pipelines, or processing large datasets, G7 instances represent a meaningful leap forward in performance-per-dollar compared to their predecessors.

News Jun 2026

Amazon ECS introduces new high-resolution metrics for faster service auto scaling

Container applications need to respond quickly to traffic spikes. If your ECS service waits minutes to scale up during a sudden surge, users experience slowdowns and timeouts. Amazon’s new high-resolution metrics feature addresses this timing challenge by allowing ECS auto scaling policies to react based on metrics collected at one-second intervals instead of the default one-minute intervals. This tighter feedback loop means your containers can scale up faster when demand increases and scale down more efficiently during quiet periods.

News Jun 2026

How we built an internal data analytics agent

GitHub recently shared how they built Qubot, an internal analytics agent that lets employees ask questions about company data using plain language instead of writing SQL queries. It’s a practical example of how AI can reduce friction in data workflows—something that applies far beyond GitHub’s walls.

At its core, Qubot solves a common problem: data exists in databases, but accessing it requires SQL expertise. Not everyone on a team has that skill, and even those who do spend time writing boilerplate queries. GitHub’s approach uses Claude (via Bedrock or similar) to translate natural language questions into SQL queries that run against their internal data warehouse. An employee can ask “How many pull requests were merged last quarter?” and get results without touching a database client. The agent handles schema understanding, query generation, and result formatting—essentially acting as a smart intermediary between human questions and structured data.

News Jun 2026

Production-Ready Autonomous Incident Resolution with AWS DevOps Agent (now GA) and Datadog MCP Server

The partnership between AWS and Datadog has matured into something genuinely useful: a system that can detect, diagnose, and fix infrastructure problems with minimal human intervention. AWS DevOps Agent, now generally available, works alongside Datadog’s Model Context Protocol (MCP) Server to turn monitoring alerts into actionable resolutions. Instead of waiting for on-call engineers to wake up, correlate logs, check configurations, and apply fixes, this integration handles the routine work automatically—and does it in minutes instead of hours.

News Jun 2026

Getting more from each token: How Copilot improves context handling and model routing

If you’ve been using GitHub Copilot, you’ve probably noticed that your credit usage can add up quickly. Every code suggestion, chat conversation, and model inference consumes tokens—the small units that AI models use to process and generate text. GitHub’s latest improvements focus on making those tokens work harder for you, reducing waste and ensuring your credits stretch further while actually improving code quality. This matters because in a world where AI assistance is becoming essential to development workflows, efficiency directly impacts both your wallet and your team’s productivity.

News Jun 2026

Announcing Web Search on Amazon Bedrock AgentCore: Ground your AI agents in current, accurate web knowledge

One of the biggest challenges when deploying AI agents in production is keeping them accurate. Language models have knowledge cutoffs—they don’t know about events after their training data ends. If your customer service agent is answering questions about your latest product launch or an agent needs real-time pricing information, it’s working with stale information. AWS is addressing this with Web Search on Amazon Bedrock AgentCore, a managed capability that lets your agents pull current information directly from the web without you having to build and maintain the infrastructure yourself.

News Jun 2026

Introducing Amazon Bedrock Managed Knowledge Base for faster, more accurate enterprise AI applications

Enterprise AI teams face a familiar pain point: building retrieval-augmented generation (RAG) systems is complex. You need to connect to multiple data sources, parse different file formats, orchestrate embeddings, manage vector databases, and chain everything together—all while keeping your application accurate and performant. AWS’s new Fully Managed Knowledge Bases for Amazon Bedrock aims to eliminate much of this infrastructure work, letting your team focus on what actually matters: delivering business value.

News Jun 2026

Amazon S3 annotations: attach rich, queryable context directly to your objects

Amazon S3 just added a feature called annotations that fundamentally changes how you can work with object metadata at scale. Instead of managing metadata in separate databases or systems, you can now attach up to 1 GB of rich, queryable context directly to S3 objects. For teams building AI agents and automation workflows, this is a practical shift that simplifies data discovery and context management in ways that single-key/value tag systems simply can’t match.

News Jun 2026

GitHub Copilot CLI for Beginners: Overview of common slash commands

GitHub Copilot has expanded beyond code editors into the terminal itself. Copilot CLI brings AI-assisted command suggestions directly to your shell, making it easier to construct complex commands without memorizing syntax or hunting through documentation. For developers working with AWS, building automation scripts, or managing cloud infrastructure from the command line, this tool can significantly reduce friction when dealing with unfamiliar tools or verbose command syntax.

Copilot CLI works by accepting natural language descriptions of what you want to accomplish, then translating them into actual shell commands. When you type a slash command like ?? or git!, you’re signaling to Copilot that you need help. The tool analyzes your input, considers the context of your current shell environment and recent commands, and suggests appropriate commands to run. It understands common cloud CLI tools like the AWS CLI, kubectl, Terraform, and others—meaning you can ask for help with complex flags and arguments without needing to reference man pages. The suggestions appear inline or in a prompt, letting you review before executing.

News Jun 2026

Now available: Amazon EC2 M9g and M9gd instances powered by new AWS Graviton5 processors

AWS has released new EC2 instance types—M9g and M9gd—built on the latest AWS Graviton5 processor. If you’ve been following the evolution of AWS-custom silicon, this is a meaningful step forward. Graviton5 delivers up to 25% better compute performance than its Graviton4 predecessor while maintaining AWS’s focus on energy efficiency. For teams running containerized workloads, microservices, or general-purpose applications, this means more capability per dollar and per watt consumed.

The technical improvement comes from a redesigned processor architecture. Graviton5 increases clock speeds, improves instruction throughput, and enhances memory bandwidth compared to Graviton4. The M-series instances are general-purpose machines—they balance compute, memory, and network resources—making them suitable for a wide range of workloads. The key distinction: M9g instances come with local NVMe SSD storage (the “d” in M9gd), which helps if your application needs fast, temporary storage without making extra API calls to EBS. For Python-based batch jobs, Node.js APIs, or containerized applications using Docker and Kubernetes, these instances fit naturally into existing architectures.

News Jun 2026

Claude Fable 5 available today in Microsoft Foundry: Powering the next era of autonomous agents

Anthropic has released Claude Fable 5, their latest frontier AI model, through Microsoft Foundry—marking a significant milestone in making advanced AI capabilities available to enterprise developers. This release represents a shift toward practical, production-grade autonomous agents that can handle complex workflows without constant human intervention. If you’ve been experimenting with Claude’s API or watching the AI space evolve, this is worth understanding because it affects the tools you’ll be building with over the next year.

News Jun 2026

How we made GitHub Copilot CLI more selective about delegation

GitHub recently shared insights into improving how GitHub Copilot CLI decides when to hand off tasks to other AI agents or tools. The core problem they were solving is surprisingly common: when an AI system can delegate work, it often does so too eagerly, creating unnecessary handoffs that slow things down and introduce failure points. By making Copilot CLI smarter about when to delegate, they reduced overhead while keeping the benefits of task specialization.

News Jun 2026

Making secret scanning more trustworthy: Reducing false positives at scale

Secret scanning is one of those security tools that sounds simple in theory but gets complicated fast. The idea is straightforward: scan your codebase for accidentally committed credentials like API keys, database passwords, or AWS access tokens before they reach production. But here’s the problem that GitHub tackled—these scanners generate tons of false positives. A developer commits a test string that looks like a secret, or includes a placeholder in documentation, and suddenly your security team is flooded with alerts that waste time and erode trust in the tool itself. When people ignore 90% of alerts because they’re noise, they’ll miss the real threats hiding in the remaining 10%.

News Jun 2026

Diagnose EKS Node Issues Faster with AWS DevOps Agent and Custom MCP

When your Kubernetes cluster starts throwing CrashLoopBackOff errors at 3 AM, you don’t want to manually SSH into nodes, grep through logs, and cross-reference timestamps with CloudWatch metrics. AWS DevOps Agent automates exactly this kind of troubleshooting by investigating production incidents autonomously. It can diagnose pod failures, trace configuration changes through AWS CloudTrail audit logs, and correlate metrics with cluster events—all without waking up your on-call engineer. But here’s the catch: it only works well when your troubleshooting data lives in AWS services it knows about. When critical diagnostics are scattered across custom monitoring tools, proprietary observability platforms, or internal systems, DevOps Agent hits a wall.

News Jun 2026

Give GitHub Copilot CLI real code intelligence with language servers

GitHub Copilot CLI has been a useful tool for developers working in terminal environments, offering AI-powered suggestions for commands and code snippets. However, its effectiveness has been limited by how it understands your codebase. Traditionally, Copilot CLI relied on grep searches and basic text parsing to gather context about your project—essentially pattern matching without true code comprehension. GitHub has now addressed this limitation by integrating Language Server Protocol (LSP) support, enabling Copilot CLI to tap into the same sophisticated code analysis that powers modern IDEs like VS Code.

News Jun 2026

From one-off prompts to workflows: How to use custom agents in GitHub Copilot CLI

GitHub Copilot CLI has evolved beyond answering random terminal questions. The latest addition—custom agents—lets you teach Copilot about your specific tech stack, infrastructure patterns, and team processes. Instead of explaining your deployment pipeline every time you ask for help, you can set it up once and have Copilot understand your context automatically. This shift from one-off prompts to repeatable workflows is particularly valuable for teams managing complex cloud environments or standardized deployment procedures.

News Jun 2026

Anthropic Claude Fable 5 on AWS: Mythos-class capabilities with built-in safeguards now available

AWS has quietly expanded what’s possible for enterprises building AI applications by making Claude Fable 5 available through Amazon Bedrock and the Claude Platform on AWS. This release democratizes what Anthropic calls “Mythos-class” AI capabilities—essentially the high-performance reasoning and generation you’d expect from their most advanced model—while maintaining the safety-focused architecture that’s become Anthropic’s calling card. If you’ve been hesitant about deploying sophisticated AI models in regulated environments, this development deserves your attention.

News Jun 2026

Microsoft Build 2026: Building agentic apps with Microsoft Fabric and Microsoft Databases

Microsoft’s latest announcements at Build 2026 center on making it easier to develop AI agents that can autonomously handle business tasks. The company is positioning Microsoft Fabric and Microsoft Databases as a unified foundation for these applications—essentially creating an integrated platform where your data infrastructure, AI models, and application logic live together rather than scattered across separate services. This matters because building effective AI agents requires tight coupling between data access, model intelligence, and real-time decision-making. When these components are fragmented, you’re fighting latency issues, consistency problems, and operational complexity.

News Jun 2026

Announcing Microsoft Discovery general availability and Microsoft Discovery app preview

Microsoft has released Microsoft Discovery as a generally available platform, marking a significant shift in how organizations can build and manage agentic AI workflows. If you’ve been following the AI space, you know that autonomous agents—AI systems that can plan, execute, and adapt without constant human intervention—are becoming increasingly central to enterprise automation. Microsoft Discovery is designed to solve one of the biggest challenges teams face: how to actually build these agents responsibly and at scale, not just experiment with them in isolated prototypes.

News Jun 2026

Try the new console experience in Amazon Bedrock, optimized for Anthropic- and OpenAI-compatible APIs

Amazon Bedrock just rolled out a redesigned console that makes it easier to explore, test, and deploy foundation models without leaving the AWS interface. If you’ve found yourself juggling multiple browser tabs to compare models, copy-paste API documentation, or remember which code snippets work with which services, this update addresses those friction points directly. The new experience is specifically optimized for Anthropic and OpenAI-compatible APIs, meaning whether you’re using Claude, GPT models, or others in the compatible ecosystem, you’ll find a more cohesive workflow.

News Jun 2026

Debug deployment failures faster with the Deployments tab in AWS Elastic Beanstalk

Deployment failures are frustrating. One moment your application is ready to ship, the next you’re hunting through logs trying to figure out what went wrong. Traditionally, when an Elastic Beanstalk deployment fails, you’d wait for it to finish, request a log bundle, download it locally, and then manually search through files like eb-engine.log, cfn-init.log, and platform.log hoping to spot the error. If you’re new to Beanstalk’s logging structure, this process can feel like finding a needle in a haystack. AWS has streamlined this workflow with the Deployments tab in the Elastic Beanstalk console, which surfaces error messages directly without requiring you to dig through bundled logs.

News Jun 2026

Claude Opus 4.8 is now available in Microsoft Foundry

Microsoft has made Claude Opus 4.8, Anthropic’s most advanced reasoning model, available through Azure AI Foundry. This marks another important step in multi-model AI accessibility, giving teams working in the Microsoft ecosystem direct access to a frontier-class LLM without leaving their familiar Azure environment. For organizations already invested in Azure infrastructure, this eliminates friction in model selection and deployment.

From a technical perspective, Azure AI Foundry handles the integration through its managed API endpoints. Rather than managing separate connections to Anthropic’s systems, you authenticate through Azure’s identity layer and make standard REST API calls—the same pattern you’d use for other Azure AI services. This means your existing error handling, rate limiting logic, and monitoring dashboards work seamlessly. The model supports both synchronous requests for real-time applications and batch processing APIs for large-scale workloads, giving you flexibility in how you structure applications.

News Jun 2026

GitHub Copilot app: The agent-native desktop experience

At Microsoft Build 2026, GitHub announced a significant shift in how AI agents integrate into developer workflows. The new GitHub Copilot app represents a move away from browser-based AI assistants toward native desktop experiences designed specifically for autonomous agents. Rather than forcing agents into existing chat interfaces, this approach builds tools that let agents interact with your development environment the way they naturally need to—running commands, accessing files, and integrating with your existing tools without friction.

News Jun 2026

Get started with OpenAI GPT-5.5, GPT-5.4 models, and Codex on Amazon Bedrock

Amazon Bedrock just made OpenAI’s latest frontier models available to everyone. GPT-5.5 and GPT-5.4 are now generally available alongside Codex, OpenAI’s specialized coding agent. If you’ve been waiting to integrate cutting-edge language models into your applications without managing infrastructure yourself, this is worth your attention. Bedrock handles the heavy lifting—you focus on building.

Here’s what’s actually happening under the hood. Bedrock is a managed service that abstracts away model infrastructure. Instead of running your own API calls to OpenAI’s servers, you’re making requests through AWS’s infrastructure with their “high performance inference engine.” This matters because it means lower latency, tighter integration with your AWS environment, and unified billing. You pay per token consumed, not monthly subscriptions. If you’re building a document analysis pipeline in Python, for example, you can now invoke GPT-5.5 via a simple boto3 call, get the response back, and immediately pass it to other AWS services like S3 or Lambda for post-processing—all within a single VPC and with CloudTrail logging every request.

News

Proving application resilience on Azure with Chaos Studio

Claude in Microsoft Foundry is now generally available

How GitHub used secret scanning to reach inbox zero

Upgrade Amazon EKS clusters with confidence using Kubernetes version rollbacks

Ship infrastructure faster with CloudFormation and CDK pre-deployment validation on every stack operation

Accelerate your infrastructure deployments by up to 4x with AWS CloudFormation Express mode

Previewing GPT-5.6 Sol: a next-generation model

How to Generate an SBOM for Container Workflows

Evaluating performance and efficiency of the GitHub Copilot agentic harness across models and tasks

Spotlight on WG Device Management

From insight to action: The next phase of agentic cloud operations

I automated my job (and it made me a better leader)

Run isolated sandboxes with full lifecycle control: AWS Lambda introduces MicroVMs

Accelerate Incident Resolution with PagerDuty and AWS DevOps Agent

Feature Flag Orchestration with AWS DevOps Agent and LaunchDarkly

Supercharge your cloud operations with the Kiro power for AWS DevOps Agent

Announcing Amazon EC2 G7 instances accelerated by NVIDIA RTX PRO 4500 Blackwell Server Edition GPUs

Amazon ECS introduces new high-resolution metrics for faster service auto scaling

How we built an internal data analytics agent

Production-Ready Autonomous Incident Resolution with AWS DevOps Agent (now GA) and Datadog MCP Server

Getting more from each token: How Copilot improves context handling and model routing

Announcing Web Search on Amazon Bedrock AgentCore: Ground your AI agents in current, accurate web knowledge

Introducing Amazon Bedrock Managed Knowledge Base for faster, more accurate enterprise AI applications

Amazon S3 annotations: attach rich, queryable context directly to your objects

GitHub Copilot CLI for Beginners: Overview of common slash commands

Now available: Amazon EC2 M9g and M9gd instances powered by new AWS Graviton5 processors

Claude Fable 5 available today in Microsoft Foundry: Powering the next era of autonomous agents

How we made GitHub Copilot CLI more selective about delegation

Making secret scanning more trustworthy: Reducing false positives at scale

Diagnose EKS Node Issues Faster with AWS DevOps Agent and Custom MCP

Give GitHub Copilot CLI real code intelligence with language servers

From one-off prompts to workflows: How to use custom agents in GitHub Copilot CLI

Anthropic Claude Fable 5 on AWS: Mythos-class capabilities with built-in safeguards now available

Microsoft Build 2026: Building agentic apps with Microsoft Fabric and Microsoft Databases

Announcing Microsoft Discovery general availability and Microsoft Discovery app preview

Try the new console experience in Amazon Bedrock, optimized for Anthropic- and OpenAI-compatible APIs

Debug deployment failures faster with the Deployments tab in AWS Elastic Beanstalk

Claude Opus 4.8 is now available in Microsoft Foundry

GitHub Copilot app: The agent-native desktop experience

Get started with OpenAI GPT-5.5, GPT-5.4 models, and Codex on Amazon Bedrock

Automate root cause analysis across Datadog and Elasticsearch with AWS DevOps Agent

Introducing the next generation of AWS Resilience Hub for generative AI-based SRE resilience journey

Introducing the next generation of Amazon OpenSearch Serverless for building your agentic AI applications

How AWS DevOps Agent uses multi-agent reasoning to find root causes

Powering multi-cluster workloads with seamless cross-cluster networking for Azure Kubernetes Fleet Manager

GitHub recognized as a Leader in the Gartner® Magic Quadrant™ for Enterprise AI Coding Agents for the third year in a row

AWS Weekly Roundup: AWS Transform at 1 year, Claude Platform on AWS, EC2 M3 Ultra Mac instances, and more (May 18, 2026)

Meet Gordon: Docker's AI Agent For Your Entire Container Workflow

Modernizing Excel VBA to Python at Scale with AWS Transform Custom

Announcing AWS CDK Mixins: Composable Abstractions for AWS Resources

Building Self-Extending CLI Tools with Strands Agent

Kubernetes v1.36: Mixed Version Proxy Graduates to Beta

Simplify cross-account and cross-Region stack output references with AWS CloudFormation and CDK's new Fn::GetStackOutput

Custom MCP Catalogs and Profiles: Advancing Enterprise MCP Adoption

Amazon Bedrock introduces new advanced prompt optimization and migration tool

Kubernetes v1.36: Advancing Workload-Aware Scheduling

GitHub Copilot individual plans: Introducing flex allotments in Pro and Pro+, and a new Max plan

Agentic application modernization at scale with Strands and Amazon Transform custom

Amazon Redshift introduces AWS Graviton-based RG instances with an integrated data lake query engine

Kubernetes v1.36: Moving Volume Group Snapshots to GA

AWS Weekly Roundup: Amazon Bedrock AgentCore payments, Agent Toolkit for AWS, and more (May 11, 2026)

Kubernetes v1.36: More Drivers, New Features, and the Next Era of DRA

Building an end-to-end agentic SRE using AWS DevOps Agent

Improving token efficiency in GitHub Agentic Workflows

Kubernetes v1.36: Server-Side Sharded List and Watch

Validating agentic behavior when correct isn't deterministic

The AWS MCP Server is now generally available

Kubernetes v1.36: Declarative Validation Graduates to GA

Modernize your workflows: Amazon WorkSpaces now gives AI agents their own desktop (preview)

Enforcing trust and transparency: Open-sourcing the Azure Integrated HSM

AWS Transform custom: Enterprise Code Modernization with the Learn-Scale-Improve Flywheel

Kubernetes v1.36: Pod-Level Resource Managers (Alpha)

A Virtual Agent team at Docker: How the Coding Agent Sandboxes team uses a fleet of agents to ship faster

OpenAI's GPT-5.5 in Microsoft Foundry: Frontier intelligence on an enterprise ready platform

Kubernetes v1.36: In-Place Vertical Scaling for Pod-Level Resources Graduates to Beta

GitHub Copilot CLI for Beginners: Interactive v. non-interactive mode

Kubernetes v1.36: Tiered Memory Protection with Memory QoS

Top announcements of the What's Next with AWS, 2026

Kubernetes v1.36: Staleness Mitigation and Observability for Controllers