Enterprises and startups can create custom generative AI models with NVIDIA's AI Foundation Models, NeMo, and DGX Cloud.
Image credit: NVIDIA
NVIDIA introduced an AI foundry service for startups and enterprises using Microsoft Azure. This service includes NVIDIA AI Foundation Models, NeMo framework and tools, and DGX Cloud AI supercomputing allowing companies to build and deploy custom AI models, including intelligent search, summarization, and content generation, on Microsoft's cloud platform.
Here is an overview of what NVIDIA is offering:
NVIDIA AI Foundation Models is a curated collection of pre-trained models that "gives developers a running start for bringing custom generative AI to their enterprise applications." The models include Llama 2, Stable Diffusion XL, and Mistral.
NVIDIA NeMo is a cloud-native framework to build, customize, and deploy generative AI models. It includes training and inferencing frameworks, guardrailing toolkits, data curation tools, and pre-trained models.
NVIDIA DGX Cloud is an AI-training-as-a-service platform, offering a serverless experience that’s optimized for generative AI.
“Enterprises need custom models to perform specialized skills trained on the proprietary DNA of their company — their data,” said Jensen Huang, CEO of NVIDIA. “NVIDIA’s AI foundry service combines our generative AI model technologies, LLM training expertise and giant-scale AI factory. We built this in Microsoft Azure so enterprises worldwide can connect their custom model with Microsoft’s world-leading cloud services.”
Using NVIDIA's AI foundry service, enterprises can customize models for generative AI-powered apps and then use retrieval-augmented generation (RAG) to connect the models with their data. RAG is a technique "for enhancing the accuracy and reliability of generative AI models with facts fetched from external sources."
The first to use the service are SAP SE, the developer of business software, Amdocs, a company specializing in software and services for communications, and the media supplier Getty Images.
They, along with other NVIDIA foundry service users, can choose from several AI Foundation Models, including NVIDIA Nemotron-3 8B, a new tool with 8 billion parameters offering versions tuned for different use cases.
In addition, NVIDIA DGX Cloud AI is now available on Azure Marketplace. "It features instances customers can rent, scaling to thousands of NVIDIA Tensor Core GPUs, and comes with NVIDIA AI Enterprise software, including NeMo, to speed LLM customization."
You may find these articles interesting