Microsoft Bing Visual Search Enhanced by NVIDIA's Accelerated Libraries

Microsoft Bing Visual Search Enhanced by NVIDIA’s Accelerated Libraries

Hong Kong Academy of Finance Opens Applications for 2025 Financial Leaders Programme

November 5, 2024

Binance Adds New USDⓈ-M Perpetual Contracts to Futures Copy Trading

November 5, 2024

Tony Kim
Oct 08, 2024 06:23

Microsoft Bing Visual Search achieves a 5.13x speedup using NVIDIA’s TensorRT, CV-CUDA, and nvImageCodec, enhancing efficiency and reducing costs.

Microsoft Bing Visual Search, a tool enabling users worldwide to search using photographs, has been significantly optimized through a collaboration with NVIDIA, resulting in a remarkable performance boost. According to NVIDIA Technical Blog, the integration of NVIDIA’s TensorRT, CV-CUDA, and nvImageCodec into Bing’s TuringMM visual embedding model has led to a 5.13x increase in throughput for offline indexing pipelines, reducing both energy consumption and costs.

Multimodal AI and Visual Search

Multimodal AI technologies, like Microsoft’s TuringMM, are essential for applications that require seamless interaction between different data types such as text and images. A popular model for joint image-text understanding is CLIP, which uses a dual encoder architecture to process hundreds of millions of image-caption pairs. These advanced models are critical for tasks such as text-based visual search, zero-shot image classification, and image captioning.

Optimization Efforts

The optimization of Bing’s visual embedding pipeline was achieved by leveraging NVIDIA’s GPU acceleration technologies. The effort focused on enhancing the performance of the TuringMM pipeline by using NVIDIA’s TensorRT for model execution, which improved the efficiency of computationally expensive layers in transformer architectures. Additionally, the use of nvImageCodec and CV-CUDA accelerated the image decoding and preprocessing stages, leading to a significant reduction in latency for image processing tasks.

Implementation and Results

Prior to optimization, Bing’s visual embedding model operated on a GPU server cluster that handled inference tasks for various deep learning services across Microsoft. The original implementation, using ONNXRuntime with CUDA Execution Provider, faced bottlenecks due to image decoding processes handled by OpenCV. By integrating NVIDIA’s libraries, the pipeline’s throughput increased from 88 queries per second (QPS) to 452 QPS, showcasing a 5.14x speedup.

These enhancements not only improved processing speed but also reduced the computational load on CPUs by offloading tasks to GPUs, thus maximizing power efficiency. The NVIDIA TensorRT contributed most to the performance gains, while the nvImageCodec and CV-CUDA libraries added an additional 27% improvement.

Conclusion

The successful optimization of Microsoft Bing Visual Search highlights the potential of NVIDIA’s accelerated libraries in enhancing AI-driven applications. The collaboration demonstrates how GPU resources can be effectively utilized to accelerate deep learning and image processing workloads, even when baseline systems already employ GPU acceleration. These advancements pave the way for more efficient and responsive visual search capabilities, benefiting both users and service providers.

For more detailed insights into the optimization process, visit the original NVIDIA Technical Blog.

Image source: Shutterstock

Credit: Source link

Microsoft Bing Visual Search Enhanced by NVIDIA’s Accelerated Libraries

Hong Kong Academy of Finance Opens Applications for 2025 Financial Leaders Programme

Binance Adds New USDⓈ-M Perpetual Contracts to Futures Copy Trading

Related Posts

Hong Kong Academy of Finance Opens Applications for 2025 Financial Leaders Programme

Binance Adds New USDⓈ-M Perpetual Contracts to Futures Copy Trading

a16z Crypto Advocates for Comprehensive U.S. Crypto Policy Framework

Understanding DApps: A Key Component of the Web3 Ecosystem

Global Dollar Network Launched to Boost Stablecoin Adoption

Ripple Chief Says Company Can Operate Without XRP, Calls SEC’s Crypto Lawsuit ‘Ironic’

Coinbase Approved for Public Listing Amid Reported $100,000,000,000 Valuation

Pro-Bitcoin Trade Group Signals Fresh Push for Mainstream Crypto

Crypto market’s slump liquidates $1.4 billion off the trading board

Ripple Chief Says Company Can Operate Without XRP, Calls SEC’s Crypto Lawsuit ‘Ironic’

Coinbase Approved for Public Listing Amid Reported $100,000,000,000 Valuation

Pro-Bitcoin Trade Group Signals Fresh Push for Mainstream Crypto

Crypto market’s slump liquidates $1.4 billion off the trading board

Kraken Completes 2024 Proof of Reserves, Verifying Over $21.5 Billion in Client Assets

Risk-To-Reward on Ethereum Looking ‘Too Good To Pass Up’ According to Crypto Analyst – Here’s Why

Colorado Resident Loses $6,000 in Bitcoin to Phone Scammer

Hong Kong Academy of Finance Opens Applications for 2025 Financial Leaders Programme

Topics to Cover!

What’s New Here!

Newsletter

Microsoft Bing Visual Search Enhanced by NVIDIA’s Accelerated Libraries

Related articles

Multimodal AI and Visual Search

Optimization Efforts

Implementation and Results

Conclusion

Related Posts

Topics to Cover!

What’s New Here!

Newsletter