Together AI Showcases Open Agentic Systems at GTC 2026: FlashAttention-4, ThunderAgent, Voice AI, and Production-Grade Inference — Research and Product Updates Highlight Open Source LLMs and AI Factory Capabilities
**Together AI**, as a diamond sponsor of **NVIDIA GTC 2026**, is showcasing its latest research and product innovations at Booth #1213 in San Jose from March 16 to 19. Today’s updates focus on open-source LLMs, voice AI capabilities, production-grade inference, and AI factory infrastructure. Key announcements include **FlashAttention-4** (up to 1.3× faster than cuDNN on NVIDIA Blackwell), the open-source **ThunderAgent** for agentic workloads (delivering a 3.6× throughput improvement), the **ATLAS-2** adaptive learning speculator, and a full-featured voice AI stack supporting real-time speech-to-text and text-to-speech. Together AI demonstrates how enterprises can transition from AI experiments to production deployment in minutes using its GPU clusters and inference platform.

NVIDIA Launches NemoClaw AI Stack at GTC 2026: Single-Command OpenClaw Security Layer, OpenShell Sandbox, Multi-Agent Orchestration — Enterprise-Ready AI Agents with Privacy Guardrails
NVIDIA officially unveiled NemoClaw at GTC 2026 on March 16, an enhanced open-source AI agent stack designed to bring enterprise-grade security, privacy, and multi-agent orchestration to the OpenClaw ecosystem. NemoClaw installs in a single command and combines the OpenClaw agent platform with NVIDIA's Agent Toolkit components, including OpenShell for isolated sandbox execution and AI-Q for building reasoning agents. The stack addresses the critical security concerns that have held back enterprise adoption of autonomous AI agents, offering policy-based guardrails, data privacy controls, and support for NVIDIA's Nemotron open models across deployment environments from RTX PCs to DGX Spark and DGX Station.





