NVIDIA Technical Blog | News and tutorials for developers, data scientists, and IT admins

Generative AI

Deploy Multilingual LLMs with NVIDIA NIM
Cybersecurity

Advancing Security for Large Language Models with NVIDIA GPUs and Edgeless Systems
Top Stories

AI Brain Implant Restores Bilingual Communication for Stroke Survivor
Generative AI

Leverage the Latest Open Models for Synthetic Data Generation with NVIDIA Nemotron-4 340B
Generative AI

End-to-End Driving at Scale with Hydra-MDP

Recent

Jul 12, 2024

Train Generative AI Models More Efficiently with New NVIDIA Megatron-Core Functionalities

First introduced in 2019, NVIDIA Megatron-LM sparked a wave of innovation in the AI community, enabling researchers and developers to use the underpinnings of...

11 MIN READ

Jul 12, 2024

Event: WeAreDevelopers World Congress 2024

Join NVIDIA at WeAreDevelopers July 17-19 to learn how accelerated computing tools powered by GPUs are shaping the future.

1 MIN READ

Jul 12, 2024

Boosting Mathematical Optimization Performance and Energy Efficiency on the NVIDIA Grace CPU

Mathematical optimization is a powerful tool that enables businesses and people to make smarter decisions and reach any number of goals—from improving...

4 MIN READ

An illustration showing a securit alert.

Jul 11, 2024

Defending AI Model Files from Unauthorized Access with Canaries

As AI models grow in capability and cost of creation, and hold more sensitive or proprietary data, securing them at rest is increasingly important....

6 MIN READ

Jul 11, 2024

Optimize AI Model Performance and Maintain Data Privacy with Hybrid RAG

The rapidly evolving field of generative AI is focused on building neural networks that can create realistic content such as text, images, audio, and synthetic...

7 MIN READ

Jul 11, 2024

Next Generation of FlashAttention

NVIDIA is excited to collaborate with Colfax, Together.ai, Meta, and Princeton University on their recent achievement to exploit the Hopper GPU architecture and...

1 MIN READ

Jul 11, 2024

Training Sim-to-Real Transferable Robotic Assembly Skills over Diverse Geometries

Most objects in home and industrial settings consist of multiple parts that must be assembled. While human workers typically perform assembly, in certain...

10 MIN READ

Jul 11, 2024

Spotlight: Siemens Energy Accelerates Power Grid Asset Simulation 10,000x Using NVIDIA Modulus

The world’s energy system is increasingly complex and distributed due to increasing renewable energy generation, decentralization of energy resources, and...

9 MIN READ

Jul 10, 2024

Customizing NVIDIA NIMs for Domain-Specific Needs with NVIDIA NeMo

Large language models (LLMs) adopted for specific enterprise applications most often benefit from model customization. Enterprises need to tailor ‌LLMs for...

11 MIN READ

A GIF showing the creation of a building image with diffusion models.

Jul 10, 2024

Understanding Diffusion Models: An Essential Guide for AEC Professionals

Generative AI, the ability of algorithms to process various types of inputs—such as text, images, audio, video, and code—and generate new content, is...

13 MIN READ

Decorative image of a computer screen with characters and symbols streaming through it.

Jul 10, 2024

Curating Non-English Datasets for LLM Training with NVIDIA NeMo Curator

Data curation plays a crucial role in the development of effective and fair large language models (LLMs). High-quality, diverse training data directly...

12 MIN READ

Jul 10, 2024

Enhance Multi-Camera Tracking Accuracy by Fine-Tuning AI Models with Synthetic Data

Large-scale, use–case-specific synthetic data has become increasingly important in real-world computer vision and AI workflows. That’s because digital twins...

14 MIN READ

Generative AI

See all

Jul 12, 2024

Train Generative AI Models More Efficiently with New NVIDIA Megatron-Core Functionalities

First introduced in 2019, NVIDIA Megatron-LM sparked a wave of innovation in the AI community, enabling researchers and developers to use the underpinnings of...

11 MIN READ

Jul 12, 2024

Event: WeAreDevelopers World Congress 2024

Join NVIDIA at WeAreDevelopers July 17-19 to learn how accelerated computing tools powered by GPUs are shaping the future.

1 MIN READ

Jul 11, 2024

Defending AI Model Files from Unauthorized Access with Canaries

As AI models grow in capability and cost of creation, and hold more sensitive or proprietary data, securing them at rest is increasingly important....

6 MIN READ

Jul 11, 2024

Optimize AI Model Performance and Maintain Data Privacy with Hybrid RAG

The rapidly evolving field of generative AI is focused on building neural networks that can create realistic content such as text, images, audio, and synthetic...

7 MIN READ

Jul 11, 2024

Next Generation of FlashAttention

NVIDIA is excited to collaborate with Colfax, Together.ai, Meta, and Princeton University on their recent achievement to exploit the Hopper GPU architecture and...

1 MIN READ

Jul 10, 2024

Customizing NVIDIA NIMs for Domain-Specific Needs with NVIDIA NeMo

Large language models (LLMs) adopted for specific enterprise applications most often benefit from model customization. Enterprises need to tailor ‌LLMs for...

11 MIN READ

Jul 10, 2024

Understanding Diffusion Models: An Essential Guide for AEC Professionals

Generative AI, the ability of algorithms to process various types of inputs—such as text, images, audio, video, and code—and generate new content, is...

13 MIN READ

Jul 10, 2024

Curating Non-English Datasets for LLM Training with NVIDIA NeMo Curator

Data curation plays a crucial role in the development of effective and fair large language models (LLMs). High-quality, diverse training data directly...

12 MIN READ

Jul 08, 2024

Deploy Multilingual LLMs with NVIDIA NIM

Multilingual large language models (LLMs) are increasingly important for enterprises operating in today's globalized business landscape. As businesses expand...

9 MIN READ

Jul 03, 2024

Power Advanced Coding Capabilities with Deepseek Code LLM

Deepseek Coder v2, available as an NVIDIA NIM microservice, enhances project-level coding and infilling tasks.

1 MIN READ

Jul 02, 2024

Addressing Hallucinations in Speech Synthesis LLMs with the NVIDIA NeMo T5-TTS Model

NVIDIA NeMo has released the T5-TTS model, a significant advancement in text-to-speech (TTS) technology. Based on large language models (LLMs), T5-TTS produces...

4 MIN READ

Jul 02, 2024

Achieving High Mixtral 8x7B Performance with NVIDIA H100 Tensor Core GPUs and TensorRT-LLM

As large language models (LLMs) continue to grow in size and complexity, the performance requirements for serving them quickly and cost-effectively continue to...

9 MIN READ

AI Foundation Models

See all

Jun 28, 2024

Transforming Financial Analysis with NVIDIA NIM

In financial services, portfolio managers and research analysts diligently sift through vast amounts of data to gain a competitive edge in investments. Making...

13 MIN READ

Jun 24, 2024

Addressing Medical Imaging Limitations with Synthetic Data Generation

Synthetic data in medical imaging offers numerous benefits, including the ability to augment datasets with diverse and realistic images where real data is...

9 MIN READ

Jun 10, 2024

Introducing SDXL-Lightning: New Lightning-Fast Model on NVIDIA API Catalog

Create high-resolution images with remarkable efficiency with the Advanced text-to-image generation model, SDXL-Lightning, available and optimized now on the...

1 MIN READ

Jun 10, 2024

SOLAR-10.7B: Optimized Model Tailored Instruction Following, Reasoning, and Mathematical Tasks

Enhance efficiency and performance in instruction-based NLP tasks with SOLAR-10.7B, especially in following instructions, reasoning, and mathematical tasks.

1 MIN READ

Jun 03, 2024

Breeze-7B: LLM Specialized for Traditional Chinese

The model demonstrates strong performance for tasks such as Q&A, multi-round chat, and summarization in both traditional Chinese and English.

1 MIN READ

Jun 03, 2024

BGE-M3: Advanced Multilingual Text Retrieval Model

Experience the versatile embedding model designed for multilingual, multi-functional, and multi-granularity text retrieval tasks, excelling in dense,...

1 MIN READ

May 30, 2024

Convert Natural Language to Code with CodeGemma

Experience the advanced LLM API for code generation, completion, mathematical reasoning, and instruction following with free cloud credits.

1 MIN READ

May 14, 2024

Generate Text Responses from Visual and Text Inputs with Google's New PaliGemma Model

With free NVIDIA cloud credits, you can start testing the model at scale on the API Catalog.

1 MIN READ

May 13, 2024

Regional LLMs SEA-LION and SeaLLM Serve Languages and Cultures of Southeast Asia

At the recent World Governments Summit in Dubai, NVIDIA CEO Jensen Huang emphasized the importance of sovereign AI, which refers to a nation’s capability to...

3 MIN READ

Apr 30, 2024

Leverage Mixture of Experts-Based DBRX for Superior LLM Performance on Diverse Tasks

This week’s model release features DBRX, a state-of-the-art large language model (LLM) developed by Databricks. With demonstrated strength in programming and...

3 MIN READ

Apr 26, 2024

New LLM: Snowflake Arctic Model for SQL and Code Generation

Large language models (LLMs) have revolutionized natural language processing (NLP) in recent years, enabling a wide range of applications such as text...

3 MIN READ

Apr 22, 2024

Mistral Large and Mixtral 8x22B LLMs Now Powered by NVIDIA NIM and NVIDIA API

This week’s model release features two new NVIDIA AI Foundation models, Mistral Large and Mixtral 8x22B, both developed by Mistral AI. These cutting-edge...

4 MIN READ

Simulation / Modeling / Design

See all

Jul 12, 2024

Boosting Mathematical Optimization Performance and Energy Efficiency on the NVIDIA Grace CPU

Mathematical optimization is a powerful tool that enables businesses and people to make smarter decisions and reach any number of goals—from improving...

4 MIN READ

Jul 11, 2024

Training Sim-to-Real Transferable Robotic Assembly Skills over Diverse Geometries

Most objects in home and industrial settings consist of multiple parts that must be assembled. While human workers typically perform assembly, in certain...

10 MIN READ

Jul 11, 2024

Spotlight: Siemens Energy Accelerates Power Grid Asset Simulation 10,000x Using NVIDIA Modulus

The world’s energy system is increasingly complex and distributed due to increasing renewable energy generation, decentralization of energy resources, and...

9 MIN READ

Jul 10, 2024

Understanding Diffusion Models: An Essential Guide for AEC Professionals

Generative AI, the ability of algorithms to process various types of inputs—such as text, images, audio, video, and code—and generate new content, is...

13 MIN READ

Jul 10, 2024

Enhance Multi-Camera Tracking Accuracy by Fine-Tuning AI Models with Synthetic Data

Large-scale, use–case-specific synthetic data has become increasingly important in real-world computer vision and AI workflows. That’s because digital twins...

14 MIN READ

Jul 03, 2024

Just Released: cuDSS 0.3.0

cuDSS (Preview) is an accelerated direct sparse solver. It now supports multi-GPU multi-node platforms, and introduces a hybrid memory mode.

1 MIN READ

Jul 02, 2024

Checkpointing CUDA Applications with CRIU

Checkpoint and restore functionality for CUDA is exposed through a command-line utility called cuda-checkpoint. This utility can be used to transparently...

7 MIN READ

Abstract image with three different illustrations representing HPC applications.

Jun 28, 2024

Explainer: What Is High-Performance Computing?

High-performance computing (HPC) is the art and science of using groups of cutting-edge computer systems to perform complex simulations, computations, and data...

1 MIN READ

Image of a factory floor with loading equipment and a person with a clipboard.

Jun 26, 2024

Transforming Microsoft XLS and PPT Files into a Factory Digital Twin with OpenUSD

SyncTwin GmbH, a company that builds software to optimize production, intralogistics, and assembly, is on a mission to unlock industrial digital twins for small...

7 MIN READ

Jun 24, 2024

Addressing Medical Imaging Limitations with Synthetic Data Generation

Synthetic data in medical imaging offers numerous benefits, including the ability to augment datasets with diverse and realistic images where real data is...

9 MIN READ

Decorative image of avatars working in different office locations.

Jun 24, 2024

Real-Time Vision AI From Digital Twins to Cloud-Native Deployment with NVIDIA Metropolis Microservices and NVIDIA Isaac Sim

As vision AI complexity increases, streamlined deployment solutions are crucial to optimizing spaces and processes. NVIDIA accelerates development, turning...

12 MIN READ

Decorative image of light fields in green, purple, and blue.

Jun 18, 2024

Runtime Fatbin Creation Using the NVIDIA CUDA Toolkit 12.4 Compiler

CUDA Toolkit 12.4 introduced a new nvFatbin library for creating fatbins at runtime. Fatbins, otherwise known as NVIDIA device code fat binaries, are containers...

11 MIN READ

Robotics

See all

Jul 11, 2024

Training Sim-to-Real Transferable Robotic Assembly Skills over Diverse Geometries

Most objects in home and industrial settings consist of multiple parts that must be assembled. While human workers typically perform assembly, in certain...

10 MIN READ

Jul 11, 2024

Spotlight: Siemens Energy Accelerates Power Grid Asset Simulation 10,000x Using NVIDIA Modulus

The world’s energy system is increasingly complex and distributed due to increasing renewable energy generation, decentralization of energy resources, and...

9 MIN READ

Jul 10, 2024

Enhance Multi-Camera Tracking Accuracy by Fine-Tuning AI Models with Synthetic Data

Large-scale, use–case-specific synthetic data has become increasingly important in real-world computer vision and AI workflows. That’s because digital twins...

14 MIN READ

Jun 25, 2024

AI-Enhanced Navigation Charts Safer Waters for Massive Ships

Maritime startup Orca AI is pioneering safety at sea with its AI-powered navigation system, which provides real-time video processing to help crews make...

5 MIN READ

Jun 24, 2024

Real-Time Vision AI From Digital Twins to Cloud-Native Deployment with NVIDIA Metropolis Microservices and NVIDIA Isaac Sim

As vision AI complexity increases, streamlined deployment solutions are crucial to optimizing spaces and processes. NVIDIA accelerates development, turning...

12 MIN READ

Jun 17, 2024

Closing the Sim-to-Real Gap: Training Spot Quadruped Locomotion with NVIDIA Isaac Lab

Developing effective locomotion policies for quadrupeds poses significant challenges in robotics due to the complex dynamics involved. Training quadrupeds to...

12 MIN READ

Jun 17, 2024

Supercharge Robotics Workflows with AI and Simulation Using NVIDIA Isaac Sim 4.0 and NVIDIA Isaac Lab

The era of AI robots powered by physical AI has arrived. Physical AI models understand their environments and autonomously complete complex tasks in the...

11 MIN READ

Jun 14, 2024

Level Up Your Skills with Five New NVIDIA Technical Courses

With AI introducing an unprecedented pace of technological innovation, staying ahead means keeping your skills up to date. The NVIDIA Developer Program gives...

4 MIN READ

Image of a robotic arm lifting a package.

Jun 13, 2024

Build OpenUSD Applications for the Cloud with NVIDIA Omniverse Kit 106 Milestone Release

NVIDIA Omniverse is a platform that enables you to build applications for complex 3D and industrial digitalization workflows based on Universal Scene...

5 MIN READ

Jun 05, 2024

Build a Zero-Copy AI Sensor Processing Pipeline with OpenCV in NVIDIA Holoscan SDK

NVIDIA Holoscan is the NVIDIA domain-agnostic multimodal real-time AI sensor processing platform that delivers the foundation for developers to build their...

6 MIN READ

Jun 04, 2024

Power Cloud-Native Microservices at the Edge with NVIDIA JetPack 6.0, Now GA

NVIDIA JetPack SDK powers NVIDIA Jetson modules, offering a comprehensive solution for building end-to-end accelerated AI applications. JetPack 6 expands the...

12 MIN READ

Decorative image of workflows in a line.

Jun 02, 2024

Optimize Processes for Large Spaces with the Multi-Camera Tracking Workflow

Large areas like warehouses, factories, stadiums, and airports are typically monitored by hundreds of cameras to improve safety and optimize operations....

11 MIN READ

Computer Vision / Video Analytics

See all

Jul 10, 2024

Enhance Multi-Camera Tracking Accuracy by Fine-Tuning AI Models with Synthetic Data

Large-scale, use–case-specific synthetic data has become increasingly important in real-world computer vision and AI workflows. That’s because digital twins...

14 MIN READ

Jun 28, 2024

Introducing DoRA, a High-Performing Alternative to LoRA for Fine-Tuning

Full fine-tuning (FT) is commonly employed to tailor general pretrained models for specific downstream tasks. To reduce the training cost, parameter-efficient...

6 MIN READ

Jun 26, 2024

Improving Video Quality with the NVIDIA Video Codec SDK 12.2 for HEVC

NVIDIA Video Codec SDK provides a comprehensive set of APIs for hardware-accelerated video encode and decode on Windows and Linux. The 12.2 release improves...

7 MIN READ

Jun 26, 2024

Transforming Microsoft XLS and PPT Files into a Factory Digital Twin with OpenUSD

SyncTwin GmbH, a company that builds software to optimize production, intralogistics, and assembly, is on a mission to unlock industrial digital twins for small...

7 MIN READ

Jun 25, 2024

AI-Enhanced Navigation Charts Safer Waters for Massive Ships

Maritime startup Orca AI is pioneering safety at sea with its AI-powered navigation system, which provides real-time video processing to help crews make...

5 MIN READ

Jun 24, 2024

Addressing Medical Imaging Limitations with Synthetic Data Generation

Synthetic data in medical imaging offers numerous benefits, including the ability to augment datasets with diverse and realistic images where real data is...

9 MIN READ

Jun 24, 2024

Real-Time Vision AI From Digital Twins to Cloud-Native Deployment with NVIDIA Metropolis Microservices and NVIDIA Isaac Sim

As vision AI complexity increases, streamlined deployment solutions are crucial to optimizing spaces and processes. NVIDIA accelerates development, turning...

12 MIN READ

Jun 18, 2024

Generate Traffic Insights Using YOLOv8 and NVIDIA JetPack 6.0

Intelligent Transportation Systems (ITS) applications are becoming increasingly valuable and prevalent in modern urban environments. The benefits of using ITS...

11 MIN READ

Jun 17, 2024

Supercharge Robotics Workflows with AI and Simulation Using NVIDIA Isaac Sim 4.0 and NVIDIA Isaac Lab

The era of AI robots powered by physical AI has arrived. Physical AI models understand their environments and autonomously complete complex tasks in the...

11 MIN READ

Jun 06, 2024

MediaTek Integrates NVIDIA TAO Toolkit for IoT Edge AI Development

MediaTek is teaming with NVIDIA to integrate NVIDIA TAO training and pretrained models into its development workflow, bringing advanced AI and visual perception...

1 MIN READ

Jun 05, 2024

Build a Zero-Copy AI Sensor Processing Pipeline with OpenCV in NVIDIA Holoscan SDK

NVIDIA Holoscan is the NVIDIA domain-agnostic multimodal real-time AI sensor processing platform that delivers the foundation for developers to build their...

6 MIN READ

Jun 04, 2024

Power Cloud-Native Microservices at the Edge with NVIDIA JetPack 6.0, Now GA

NVIDIA JetPack SDK powers NVIDIA Jetson modules, offering a comprehensive solution for building end-to-end accelerated AI applications. JetPack 6 expands the...

12 MIN READ

Data Science

See all

Jul 12, 2024

Event: WeAreDevelopers World Congress 2024

Join NVIDIA at WeAreDevelopers July 17-19 to learn how accelerated computing tools powered by GPUs are shaping the future.

1 MIN READ

Jul 12, 2024

Boosting Mathematical Optimization Performance and Energy Efficiency on the NVIDIA Grace CPU

Mathematical optimization is a powerful tool that enables businesses and people to make smarter decisions and reach any number of goals—from improving...

4 MIN READ

Jul 11, 2024

Defending AI Model Files from Unauthorized Access with Canaries

As AI models grow in capability and cost of creation, and hold more sensitive or proprietary data, securing them at rest is increasingly important....

6 MIN READ

Jul 11, 2024

Optimize AI Model Performance and Maintain Data Privacy with Hybrid RAG

The rapidly evolving field of generative AI is focused on building neural networks that can create realistic content such as text, images, audio, and synthetic...

7 MIN READ

Jul 09, 2024

Just Released: nvmath-python

nvmath-python is an open-source Python library that provides high performance access to the core mathematical operations in the NVIDIA Math Libraries. Available...

1 MIN READ

Two b&w images of a woman in a hat, one image in a higher resolution.

Jul 05, 2024

Explainer: What Is K-Means?

K-means is a clustering algorithm—one of the simplest and most popular unsupervised machine learning (ML) algorithms for data scientists.

1 MIN READ

Jul 03, 2024

Maximize GPU performance with Near-Real-Time Usage Stats on NVDashboard v0.10

At NVIDIA GTC 2024, the RAPIDS team demonstrated new features on NVDashboard v0.10 a dashboard that runs on JupyterLab, for monitoring GPU usage to help...

6 MIN READ

Jul 02, 2024

Checkpointing CUDA Applications with CRIU

Checkpoint and restore functionality for CUDA is exposed through a command-line utility called cuda-checkpoint. This utility can be used to transparently...

7 MIN READ

Jul 01, 2024

How Cutting-Edge Computer Chips are Speeding Up the AI Revolution

Featured in Nature, this post delves into how GPUs and other advanced technologies are meeting the computational challenges posed by AI.

1 MIN READ

Jun 28, 2024

Federated XGBoost Made Practical and Productive with NVIDIA FLARE

XGBoost is a highly effective and scalable machine learning algorithm widely employed for regression, classification, and ranking tasks. Building on the...

6 MIN READ

Jun 27, 2024

Secure LLM Tokenizers to Maintain Application Integrity

This post is part of the NVIDIA AI Red Team’s continuing vulnerability and technique research. Use the concepts presented to responsibly assess and increase...

6 MIN READ

Jun 26, 2024

Transforming Microsoft XLS and PPT Files into a Factory Digital Twin with OpenUSD

SyncTwin GmbH, a company that builds software to optimize production, intralogistics, and assembly, is on a mission to unlock industrial digital twins for small...

7 MIN READ

Content Creation / Rendering

See all

Jul 10, 2024

Understanding Diffusion Models: An Essential Guide for AEC Professionals

Generative AI, the ability of algorithms to process various types of inputs—such as text, images, audio, video, and code—and generate new content, is...

13 MIN READ

Jun 26, 2024

Improving Video Quality with the NVIDIA Video Codec SDK 12.2 for HEVC

NVIDIA Video Codec SDK provides a comprehensive set of APIs for hardware-accelerated video encode and decode on Windows and Linux. The 12.2 release improves...

7 MIN READ

Jun 13, 2024

Build OpenUSD Applications for the Cloud with NVIDIA Omniverse Kit 106 Milestone Release

NVIDIA Omniverse is a platform that enables you to build applications for complex 3D and industrial digitalization workflows based on Universal Scene...

5 MIN READ

Jun 10, 2024

Reallusion Brings Digital Characters to Life with NVIDIA AI

In today's digital age, creating realistic animated characters is crucial for filmmakers, game developers, and content creators looking to bring their visions...

6 MIN READ

Comparison of 1080p and 4K RTX VSR and HDR.

Jun 06, 2024

Enhancing Low-Resolution SDR Video with the NVIDIA RTX Video SDK

NVIDIA RTX Video is a collection of AI video enhancements that improve the visual quality of lower-quality video. RTX Video Super Resolution was announced...

2 MIN READ

Jun 04, 2024

Build Lifelike Digital Humans with NVIDIA ACE, Now Generally Available

NVIDIA ACE—a suite of technologies bringing digital humans to life with generative AI—is now generally available for developers. Packaged as NVIDIA NIMs,...

5 MIN READ

May 31, 2024

How to Train an Object Detection Model for Visual Inspection with Synthetic Data

AI is rapidly changing industrial visual inspection. In a factory setting, visual inspection is used for many issues, including detecting defects and missing or...

8 MIN READ

May 16, 2024

Webinar: Path Traced Visuals in Unreal Engine

Integrate RTX into your own game and understand what ReSTIR means for the future of real-time lighting in this May 21 webinar.

1 MIN READ

Three reflective green spheres hovering above three white platforms on a neutral background.

Apr 29, 2024

GPU-Powered Windows 365 Cloud PCs with NVIDIA RTX Virtual Workstation for High-End Graphics Workloads

Professional workflows have become more complex with the increased demand for graphics-intensive scenarios. From regular office applications to demanding...

7 MIN READ

Apr 26, 2024

Enhance Text-to-Image Fine-Tuning with DRaFT+, Now Part of NVIDIA NeMo

Text-to-image diffusion models have been established as a powerful method for high-fidelity image generation based on given text. Nevertheless, diffusion models...

10 MIN READ

Apr 11, 2024

New Video Series: OpenUSD for Developers

Universal Scene Description, also called OpenUSD or USD, is an open and extensible framework for creating, editing, querying, rendering, collaborating, and...

3 MIN READ

Decorative collage of media images superimposed on data center mockup.

Apr 09, 2024

Next-Generation Live Media Apps on Repurposable Clusters with NVIDIA Holoscan for Media

NVIDIA Holoscan for Media is now available to all developers looking to build next-generation live media applications on fully repurposable clusters. ...

4 MIN READ

Conversational AI

See all

Jul 12, 2024

Train Generative AI Models More Efficiently with New NVIDIA Megatron-Core Functionalities

First introduced in 2019, NVIDIA Megatron-LM sparked a wave of innovation in the AI community, enabling researchers and developers to use the underpinnings of...

11 MIN READ

Jul 02, 2024

Addressing Hallucinations in Speech Synthesis LLMs with the NVIDIA NeMo T5-TTS Model

NVIDIA NeMo has released the T5-TTS model, a significant advancement in text-to-speech (TTS) technology. Based on large language models (LLMs), T5-TTS produces...

4 MIN READ

Jun 28, 2024

Introducing DoRA, a High-Performing Alternative to LoRA for Fine-Tuning

Full fine-tuning (FT) is commonly employed to tailor general pretrained models for specific downstream tasks. To reduce the training cost, parameter-efficient...

6 MIN READ

Jun 26, 2024

Generate High-Quality, Context-Aware Responses for Chatbots and Search Engines with Llama 3-ChatQA

Experience and test Llama3-ChatQA models at scale with performance optimized NVIDIA NIM inference microservice using the NVIDIA API catalog.

1 MIN READ

Jun 20, 2024

AI Brain Implant Restores Bilingual Communication for Stroke Survivor

Scientists have enabled a stroke survivor, who is unable to speak, to communicate in both Spanish and English by training a neuroprosthesis implant to decode...

3 MIN READ

Jun 12, 2024

Introducing Grouped GEMM APIs in cuBLAS and More Performance Updates

The latest release of NVIDIA cuBLAS library, version 12.5, continues to deliver functionality and performance to deep learning (DL) and high-performance...

7 MIN READ

An illustration representing an embedding model.

Jun 10, 2024

NVIDIA Text Embedding Model Tops MTEB Leaderboard

The latest embedding model from NVIDIA—NV-Embed—set a new record for embedding accuracy with a score of 69.32 on the Massive Text Embedding Benchmark...

6 MIN READ

Jun 04, 2024

Build Lifelike Digital Humans with NVIDIA ACE, Now Generally Available

NVIDIA ACE—a suite of technologies bringing digital humans to life with generative AI—is now generally available for developers. Packaged as NVIDIA NIMs,...

5 MIN READ

Jun 02, 2024

Streamline Development of AI-Powered Apps with NVIDIA RTX AI Toolkit for Windows RTX PCs

NVIDIA today launched the NVIDIA RTX AI Toolkit, a collection of tools and SDKs for Windows application developers to customize, optimize, and deploy AI models...

8 MIN READ

An illustration representing NeMo Guardrails.

May 31, 2024

Building Safer LLM Apps with LangChain Templates and NVIDIA NeMo Guardrails

An easily deployable reference architecture can help developers get to production faster with custom LLM use cases. LangChain Templates are a new way of...

7 MIN READ

Stylized image of a smartphone chat with a young woman smiling off to one side.

May 30, 2024

Personalized Learning with Gipi, NVIDIA TensortRT-LLM, and AI Foundation Models

Over 1.2B people are actively learning new languages, with over 500M learners on digital learning platforms such as Duolingo. At the same time, a significant...

6 MIN READ

May 29, 2024

Generative AI Agents Developer Contest: Top Tips for Getting Started

Join our contest that runs through June 17 and showcase your innovation using cutting-edge generative AI-powered applications using NVIDIA and LangChain...

3 MIN READ

Edge Computing

See all

Jul 03, 2024

Powering the Future of AI-Enabled Medical Devices with NVIDIA Holoscan and RTI Connext

The demand for real-time insights and autonomous decision-making is growing across industries, and healthcare and medical devices are no exception. Relying on...

8 MIN READ

Jun 28, 2024

Introducing DoRA, a High-Performing Alternative to LoRA for Fine-Tuning

Full fine-tuning (FT) is commonly employed to tailor general pretrained models for specific downstream tasks. To reduce the training cost, parameter-efficient...

6 MIN READ

Jun 25, 2024

AI-Enhanced Navigation Charts Safer Waters for Massive Ships

Maritime startup Orca AI is pioneering safety at sea with its AI-powered navigation system, which provides real-time video processing to help crews make...

5 MIN READ

Jun 18, 2024

Generate Traffic Insights Using YOLOv8 and NVIDIA JetPack 6.0

Intelligent Transportation Systems (ITS) applications are becoming increasingly valuable and prevalent in modern urban environments. The benefits of using ITS...

11 MIN READ

Decorative image of TensorRT workflow on a black background.

Jun 11, 2024

Maximum Performance and Minimum Footprint for AI Apps with NVIDIA TensorRT Weight-Stripped Engines

NVIDIA TensorRT, an established inference library for data centers, has rapidly emerged as a desirable inference backend for NVIDIA GeForce RTX and NVIDIA RTX...

8 MIN READ

Jun 06, 2024

MediaTek Integrates NVIDIA TAO Toolkit for IoT Edge AI Development

MediaTek is teaming with NVIDIA to integrate NVIDIA TAO training and pretrained models into its development workflow, bringing advanced AI and visual perception...

1 MIN READ

Jun 05, 2024

Build a Zero-Copy AI Sensor Processing Pipeline with OpenCV in NVIDIA Holoscan SDK

NVIDIA Holoscan is the NVIDIA domain-agnostic multimodal real-time AI sensor processing platform that delivers the foundation for developers to build their...

6 MIN READ

Jun 04, 2024

Power Cloud-Native Microservices at the Edge with NVIDIA JetPack 6.0, Now GA

NVIDIA JetPack SDK powers NVIDIA Jetson modules, offering a comprehensive solution for building end-to-end accelerated AI applications. JetPack 6 expands the...

12 MIN READ

Decorative image of green icons on a black screen behind IGX hardware.

Jun 02, 2024

Production-Ready, Enterprise-Grade Software on NVIDIA IGX Platform, Support for NVIDIA RTX 6000 ADA, and More

Real-time AI at the edge is crucial for medical, industrial, and scientific computing because these mission-critical applications require immediate data...

6 MIN READ

May 22, 2024

Enhancing AI Cloud Data Centers and NVIDIA Spectrum-X with NVIDIA DOCA 2.7

The NVIDIA DOCA acceleration framework empowers developers with extensive libraries, drivers, and APIs to create high-performance applications and services for...

10 MIN READ

May 20, 2024

Supercharge Generative AI Development with Firebase Genkit, Optimized by NVIDIA RTX GPUs

At Google I/O 2024, Google announced Firebase Genkit, a new open-source framework for developers to add generative AI to web and mobile applications using...

4 MIN READ

May 14, 2024

NVIDIA DeepStream 7.0 Milestone Release for Next-Gen Vision AI Development

NVIDIA DeepStream is a powerful SDK that unlocks GPU-accelerated building blocks to build end-to-end vision AI pipelines. With more than 40+ plugins available...

11 MIN READ

Data Center / Cloud

See all

Jul 12, 2024

Event: WeAreDevelopers World Congress 2024

Join NVIDIA at WeAreDevelopers July 17-19 to learn how accelerated computing tools powered by GPUs are shaping the future.

1 MIN READ

Jul 09, 2024

Just Released: nvmath-python

nvmath-python is an open-source Python library that provides high performance access to the core mathematical operations in the NVIDIA Math Libraries. Available...

1 MIN READ

Jul 03, 2024

Just Released: cuDSS 0.3.0

cuDSS (Preview) is an accelerated direct sparse solver. It now supports multi-GPU multi-node platforms, and introduces a hybrid memory mode.

1 MIN READ

Jul 02, 2024

Checkpointing CUDA Applications with CRIU

Checkpoint and restore functionality for CUDA is exposed through a command-line utility called cuda-checkpoint. This utility can be used to transparently...

7 MIN READ

Jul 01, 2024

How Cutting-Edge Computer Chips are Speeding Up the AI Revolution

Featured in Nature, this post delves into how GPUs and other advanced technologies are meeting the computational challenges posed by AI.

1 MIN READ

Jun 24, 2024

Real-Time Vision AI From Digital Twins to Cloud-Native Deployment with NVIDIA Metropolis Microservices and NVIDIA Isaac Sim

As vision AI complexity increases, streamlined deployment solutions are crucial to optimizing spaces and processes. NVIDIA accelerates development, turning...

12 MIN READ

Jun 24, 2024

Exploring SONiC on NVIDIA Air

Testing out networking infrastructure and building working PoCs for a new environment can be tricky at best and downright dreadful at worst. You may run into...

6 MIN READ

Jun 17, 2024

Video: Talk to Your Supply Chain Data Using NVIDIA NIM

NVIDIA operates one of the largest and most complex supply chains in the world. The supercomputers we build connect tens of thousands of NVIDIA GPUs with...

2 MIN READ

A windmill and solar panel illustration.

Jun 14, 2024

Explainer: What Is Power Efficiency?

Power efficiency refers to a compute resource’s ability to convert electrical power into useful work with minimal waste or loss. It’s typically measured in...

1 MIN READ

Jun 13, 2024

Unlocking GPU-Accelerated RDMA with NVIDIA DOCA GPUNetIO

NVIDIA DOCA GPUNetIO is a library within the NVIDIA DOCA SDK, specifically designed for real-time inline GPU packet processing. It combines technologies like...

11 MIN READ

Jun 12, 2024

Demystifying AI Inference Deployments for Trillion Parameter Large Language Models

AI is transforming every industry, addressing grand human scientific challenges such as precision drug discovery and the development of autonomous vehicles, as...

14 MIN READ

Jun 12, 2024

NVIDIA Sets New Generative AI Performance and Scale Records in MLPerf Training v4.0

Generative AI models have a variety of uses, such as helping write computer code, crafting stories, composing music, generating images, producing videos, and...

11 MIN READ

Deploy Multilingual LLMs with NVIDIA NIM

Advancing Security for Large Language Models with NVIDIA GPUs and Edgeless Systems

AI Brain Implant Restores Bilingual Communication for Stroke Survivor

Leverage the Latest Open Models for Synthetic Data Generation with NVIDIA Nemotron-4 340B

End-to-End Driving at Scale with Hydra-MDP

Recent

Train Generative AI Models More Efficiently with New NVIDIA Megatron-Core Functionalities

Event: WeAreDevelopers World Congress 2024

Boosting Mathematical Optimization Performance and Energy Efficiency on the NVIDIA Grace CPU

Defending AI Model Files from Unauthorized Access with Canaries

Optimize AI Model Performance and Maintain Data Privacy with Hybrid RAG

Next Generation of FlashAttention

Training Sim-to-Real Transferable Robotic Assembly Skills over Diverse Geometries

Spotlight: Siemens Energy Accelerates Power Grid Asset Simulation 10,000x Using NVIDIA Modulus

Customizing NVIDIA NIMs for Domain-Specific Needs with NVIDIA NeMo

Understanding Diffusion Models: An Essential Guide for AEC Professionals

Curating Non-English Datasets for LLM Training with NVIDIA NeMo Curator

Enhance Multi-Camera Tracking Accuracy by Fine-Tuning AI Models with Synthetic Data

Generative AI

Train Generative AI Models More Efficiently with New NVIDIA Megatron-Core Functionalities

Event: WeAreDevelopers World Congress 2024

Defending AI Model Files from Unauthorized Access with Canaries

Optimize AI Model Performance and Maintain Data Privacy with Hybrid RAG

Next Generation of FlashAttention

Customizing NVIDIA NIMs for Domain-Specific Needs with NVIDIA NeMo

Understanding Diffusion Models: An Essential Guide for AEC Professionals

Curating Non-English Datasets for LLM Training with NVIDIA NeMo Curator

Deploy Multilingual LLMs with NVIDIA NIM

Power Advanced Coding Capabilities with Deepseek Code LLM

Addressing Hallucinations in Speech Synthesis LLMs with the NVIDIA NeMo T5-TTS Model

Achieving High Mixtral 8x7B Performance with NVIDIA H100 Tensor Core GPUs and TensorRT-LLM

AI Foundation Models

Transforming Financial Analysis with NVIDIA NIM

Addressing Medical Imaging Limitations with Synthetic Data Generation

Introducing SDXL-Lightning: New Lightning-Fast Model on NVIDIA API Catalog

SOLAR-10.7B: Optimized Model Tailored Instruction Following, Reasoning, and Mathematical Tasks

Breeze-7B: LLM Specialized for Traditional Chinese

BGE-M3: Advanced Multilingual Text Retrieval Model

Convert Natural Language to Code with CodeGemma

Generate Text Responses from Visual and Text Inputs with Google's New PaliGemma Model

Regional LLMs SEA-LION and SeaLLM Serve Languages and Cultures of Southeast Asia

Leverage Mixture of Experts-Based DBRX for Superior LLM Performance on Diverse Tasks

New LLM: Snowflake Arctic Model for SQL and Code Generation

Mistral Large and Mixtral 8x22B LLMs Now Powered by NVIDIA NIM and NVIDIA API

Simulation / Modeling / Design

Boosting Mathematical Optimization Performance and Energy Efficiency on the NVIDIA Grace CPU

Training Sim-to-Real Transferable Robotic Assembly Skills over Diverse Geometries

Spotlight: Siemens Energy Accelerates Power Grid Asset Simulation 10,000x Using NVIDIA Modulus

Understanding Diffusion Models: An Essential Guide for AEC Professionals

Enhance Multi-Camera Tracking Accuracy by Fine-Tuning AI Models with Synthetic Data

Just Released: cuDSS 0.3.0

Checkpointing CUDA Applications with CRIU

Explainer: What Is High-Performance Computing?

Transforming Microsoft XLS and PPT Files into a Factory Digital Twin with OpenUSD

Addressing Medical Imaging Limitations with Synthetic Data Generation

Real-Time Vision AI From Digital Twins to Cloud-Native Deployment with NVIDIA Metropolis Microservices and NVIDIA Isaac Sim

Runtime Fatbin Creation Using the NVIDIA CUDA Toolkit 12.4 Compiler

Robotics

Training Sim-to-Real Transferable Robotic Assembly Skills over Diverse Geometries

Spotlight: Siemens Energy Accelerates Power Grid Asset Simulation 10,000x Using NVIDIA Modulus

Enhance Multi-Camera Tracking Accuracy by Fine-Tuning AI Models with Synthetic Data

AI-Enhanced Navigation Charts Safer Waters for Massive Ships

Real-Time Vision AI From Digital Twins to Cloud-Native Deployment with NVIDIA Metropolis Microservices and NVIDIA Isaac Sim

Closing the Sim-to-Real Gap: Training Spot Quadruped Locomotion with NVIDIA Isaac Lab

Supercharge Robotics Workflows with AI and Simulation Using NVIDIA Isaac Sim 4.0 and NVIDIA Isaac Lab

Level Up Your Skills with Five New NVIDIA Technical Courses

Build OpenUSD Applications for the Cloud with NVIDIA Omniverse Kit 106 Milestone Release

Build a Zero-Copy AI Sensor Processing Pipeline with OpenCV in NVIDIA Holoscan SDK

Power Cloud-Native Microservices at the Edge with NVIDIA JetPack 6.0, Now GA

Optimize Processes for Large Spaces with the Multi-Camera Tracking Workflow

Computer Vision / Video Analytics

Enhance Multi-Camera Tracking Accuracy by Fine-Tuning AI Models with Synthetic Data

Introducing DoRA, a High-Performing Alternative to LoRA for Fine-Tuning

Improving Video Quality with the NVIDIA Video Codec SDK 12.2 for HEVC

Transforming Microsoft XLS and PPT Files into a Factory Digital Twin with OpenUSD

AI-Enhanced Navigation Charts Safer Waters for Massive Ships

Addressing Medical Imaging Limitations with Synthetic Data Generation

Real-Time Vision AI From Digital Twins to Cloud-Native Deployment with NVIDIA Metropolis Microservices and NVIDIA Isaac Sim

Generate Traffic Insights Using YOLOv8 and NVIDIA JetPack 6.0

Supercharge Robotics Workflows with AI and Simulation Using NVIDIA Isaac Sim 4.0 and NVIDIA Isaac Lab