Nvidia said its NeMo Microservices are generally available as it aims to enable developers to leverage a "data flywheel" that enables enterprises to scale AI agents.

The general availability of Nvidia NeMo Microservices is a follow-up to GTC 2025 announcements. The idea of the NeMo Microservices portfolio is to automate and scale what Nvidia refers to as AI teammates.

GTC 2025: Nvidia GTC 2025: Six lingering questions | Nvidia launches Blackwell Ultra, Dynamo. outlines roadmap through 2027 | Nvidia launches DGX Spark, DGX Station personal AI supercomputers | Nvidia's model parade: Llama Nemotron, Cosmos additions, Isaac GROOT N1

The components of NeMo Microservices, which will integrate with Nvidia's Triton Inference Server, include:

  • NeMo Curator for data processing.
  • NeMo Customizer for model customization.
  • NeMo Evaluator to pick and evaluate models.
  • NeMo Guardrails for protection.
  • NeMo Retriever for information retrieval.
  • Llama Nemotron, a reasoning large language model.

These components each have a role in taking enterprise data and creating the flywheel to leverage agents. Nvidia's NeMo Microservices are going live with a host of integrators and key enterprise vendors such as SAP and ServiceNow backing them.

NeMo Microservices will be available in Nvidia AI Enterprise, include hybrid cloud deployments and work with multiple models and AI software frameworks. Each microservice operates in its own container and Nvidia said there will be a lot more on deck focused on orchestration and adding components.

In a briefing, Nvidia outlined the following use cases:

  • Amdocs is using telecom genAI agents for a 64% improvement in average handling time with half of calls being resolved on the first call. Amdocs deployments used NeMo Microservices to create agents for billing, sales and network to analyze logs.
  • ServiceNow AI agents across HR, IT and customer support are freeing up 3 million hours of human capacity.
  • Yum is using the Nvidia microservices stack to expand voice AI order taking to 500 restaurants.
  • Accenture, AT&T and Cisco all cited use cases in research, call center agents and software development.