Hugging Face Introduces Inference-as-a-Service with NVIDIA NIM for AI Developers

July 30, 2024

in Blockchain

Reading Time: 2 mins read

Tether to Invest in Quantoz for MiCAR-Compliant Stablecoin Launch

November 19, 2024

MARA Holdings Announces 0 Million Offering of Convertible Senior Notes Due 2030

MARA Holdings Announces $850 Million Offering of Convertible Senior Notes Due 2030

November 19, 2024

Timothy Morano
Jul 30, 2024 06:37

Hugging Face and NVIDIA collaborate to offer Inference-as-a-Service, enhancing AI model efficiency and accessibility for developers.

Hugging Face, a leading AI community platform, is now offering developers Inference-as-a-Service powered by NVIDIA’s NIM microservices, according to NVIDIA Blog. The service aims to boost token efficiency by up to five times with popular AI models and provide immediate access to NVIDIA DGX Cloud.

Enhanced AI Model Efficiency

This new service, announced at the SIGGRAPH conference, allows developers to rapidly deploy leading large language models, including the Llama 3 family and Mistral AI models. These models are optimized using NVIDIA NIM microservices running on NVIDIA DGX Cloud.

Developers can prototype with open-source AI models hosted on the Hugging Face Hub and deploy them in production seamlessly. Enterprise Hub users can leverage serverless inference for increased flexibility, minimal infrastructure overhead, and optimized performance.

Streamlined AI Development

The Inference-as-a-Service complements the existing Train on DGX Cloud service, which is already available on Hugging Face. This integration provides developers with a centralized hub to compare various open-source models, experiment, test, and deploy cutting-edge models on NVIDIA-accelerated infrastructure.

The tools are easily accessible through the “Train” and “Deploy” drop-down menus on Hugging Face model cards, enabling users to get started with just a few clicks.

NVIDIA NIM Microservices

NVIDIA NIM is a collection of AI microservices, including NVIDIA AI foundation models and open-source community models, optimized for inference using industry-standard APIs. NIM offers higher efficiency in processing tokens, improving the efficiency of the underlying NVIDIA DGX Cloud infrastructure and increasing the speed of critical AI applications.

For example, the 70-billion-parameter version of Llama 3 delivers up to 5x higher throughput when accessed as a NIM compared to off-the-shelf deployment on NVIDIA H100 Tensor Core GPU-powered systems.

Accessible AI Acceleration

The NVIDIA DGX Cloud platform is purpose-built for generative AI, offering developers easy access to reliable accelerated computing infrastructure. This platform supports every step of AI development, from prototype to production, without requiring long-term AI infrastructure commitments.

Hugging Face’s Inference-as-a-Service on NVIDIA DGX Cloud, powered by NIM microservices, offers easy access to compute resources optimized for AI deployment. This enables users to experiment with the latest AI models in an enterprise-grade environment.

More Announcements at SIGGRAPH

At the SIGGRAPH conference, NVIDIA also introduced generative AI models and NIM microservices for the OpenUSD framework. This aims to accelerate developers’ abilities to build highly accurate virtual worlds for the next evolution of AI.

For more information, visit the official NVIDIA Blog.

Image source: Shutterstock

Credit: Source link

Hugging Face Introduces Inference-as-a-Service with NVIDIA NIM for AI Developers

Tether to Invest in Quantoz for MiCAR-Compliant Stablecoin Launch

MARA Holdings Announces $850 Million Offering of Convertible Senior Notes Due 2030

Related Posts

Tether to Invest in Quantoz for MiCAR-Compliant Stablecoin Launch

MARA Holdings Announces $850 Million Offering of Convertible Senior Notes Due 2030

Google and NVIDIA Collaborate to Enhance Quantum Processing Unit Development

NVIDIA Expands Quantum Computing Horizons with AI Supercomputing

NVIDIA Unveils Omniverse Blueprints for Real-Time Physics Digital Twins

Ripple Chief Says Company Can Operate Without XRP, Calls SEC’s Crypto Lawsuit ‘Ironic’

Coinbase Approved for Public Listing Amid Reported $100,000,000,000 Valuation

Pro-Bitcoin Trade Group Signals Fresh Push for Mainstream Crypto

Crypto market’s slump liquidates $1.4 billion off the trading board

Ripple Chief Says Company Can Operate Without XRP, Calls SEC’s Crypto Lawsuit ‘Ironic’

Coinbase Approved for Public Listing Amid Reported $100,000,000,000 Valuation

Pro-Bitcoin Trade Group Signals Fresh Push for Mainstream Crypto

Crypto market’s slump liquidates $1.4 billion off the trading board

Paxos to Acquire Membrane Finance in Strategic Move to Make USD-Backed Stablecoins MiCA Compliant

IBIT options trading volume surges to $446M in opening hours, $1.6B by mid-day

Betting on Armageddon? Polymarket Users Wager on Nuclear Detonation in 2024

Missed Dogecoin’s Recent High? Flockerz ICO Hits $2.3M, Analyst Predicts it Will 10x

Topics to Cover!

What’s New Here!

Newsletter

Hugging Face Introduces Inference-as-a-Service with NVIDIA NIM for AI Developers

Related articles

Enhanced AI Model Efficiency

Streamlined AI Development

NVIDIA NIM Microservices

Accessible AI Acceleration

More Announcements at SIGGRAPH

Related Posts

Topics to Cover!

What’s New Here!

Newsletter