Author: Anjali Shah | NVIDIA Technical Blog

Anjali Shah

Anjali Shah is a senior deep learning scientist at NVIDIA within the Developer Advocate Engineering group helping clients build generative AI solutions. Early in her career, as a software engineer, she built mission-critical platforms for the world's leading financial services firms. She then spent several years in the healthcare sector, architecting and implementing large scale healthcare (EHR) systems. Before joining NVIDIA, she spent several years at a leading tech company, working across different industries helping clients build innovative data and AI solutions. She has a Ph.D. in biomedical informatics and applied statistics and an M.S. and B.S. in computer science and engineering.

Posts by Anjali Shah

Generative AI Jan 16, 2025

Anjali Shah

Posts by Anjali Shah

Introducing New KV Cache Reuse Optimizations in NVIDIA TensorRT-LLM

Boost Llama 3.3 70B Inference Throughput 3x with NVIDIA TensorRT-LLM Speculative Decoding

NVIDIA TensorRT-LLM Now Accelerates Encoder-Decoder Models with In-Flight Batching

TensorRT-LLM Speculative Decoding Boosts Inference Throughput by up to 3.6x

Llama 3.2 Full-Stack Optimizations Unlock High Performance on NVIDIA GPUs

Deploying Accelerated Llama 3.2 from the Edge to the Cloud