Together AI Showcases Open Agentic Systems at GTC 2026: FlashAttention-4, ThunderAgent, Voice AI, and Production-Grade Inference — Research and Product Updates Highlight Open Source LLMs and AI Factory Capabilities

**Together AI**, as a diamond sponsor of **NVIDIA GTC 2026**, is showcasing its latest research and product innovations at Booth #1213 in San Jose from March 16 to 19. Today’s updates focus on open-source LLMs, voice AI capabilities, production-grade inference, and AI factory infrastructure. Key announcements include **FlashAttention-4** (up to 1.3× faster than cuDNN on NVIDIA Blackwell), the open-source **ThunderAgent** for agentic workloads (delivering a 3.6× throughput improvement), the **ATLAS-2** adaptive learning speculator, and a full-featured voice AI stack supporting real-time speech-to-text and text-to-speech. Together AI demonstrates how enterprises can transition from AI experiments to production deployment in minutes using its GPU clusters and inference platform.

NVIDIA GTC 2026 Opens in San Jose: Jensen Huang Declares AI Is Now Essential Infrastructure — Vera Rubin, NemoClaw, Groq Integration, Physical AI, and the Five-Layer Stack Define the Next Industrial Revolution

NVIDIA GTC 2026 — the world's most closely watched AI conference — has officially opened in San Jose, California, with founder and CEO Jensen Huang delivering a landmark keynote from the SAP Center to 30,000 attendees from 190 countries. Huang declared that AI is no longer an application or a model: "It is essential infrastructure. Every company will use it. Every nation will build it." The conference formally unveils the Vera Rubin platform now in full production, the anticipated NemoClaw open-source enterprise AI agent platform, the $20 billion Groq LPU inference integration, Nemotron 3 Super for agentic reasoning, physical AI leadership across robotics and autonomous systems, and a five-layer AI technology stack that Huang says is powering "one of the largest infrastructure expansions in history."

Jensen Huang GTC 2026 Keynote: NVIDIA Unveils "Physical AI" Architecture — Bridging the Gap Between Digital Intelligence and the Real World

At the NVIDIA GTC 2026 keynote in San Jose, CEO Jensen Huang unveiled the company's comprehensive "Physical AI" architecture, marking a paradigm shift from generative AI to AI that understands and interacts with the physical world. The announcement includes the new Vera Rubin platform, the Cosmos world foundation model family, the Alpamayo reasoning model for autonomous vehicles, and the Newton physics engine. Together, these form NVIDIA's vision for AI that comprehends gravity, friction, and inertia to power the next generation of robotics and autonomous systems.
