🎉 Excited to share our latest work on extracting better features from diffusion models, co-led by Stefan Baumann and Kolja Bauer.

✨ Diffusion models are amazing at learning world representations. Their features power many tasks:
• Semantic correspondence
• Depth estimation
• Semantic segmentation

🤔 But have you ever wondered why we extract diffusion features from noisy images? Doesn't that destroy valuable information? We show that it does - and that it forces you to find the right noise hyperparameters for every downstream task. We thought there had to be a better way. And there is.

🚀 With just 30 minutes of task-agnostic finetuning on a single GPU, we eliminate the need for noisy inputs.
✅ No noise
✅ No timestep tuning
✅ Better features, better performance across many tasks

We make code and cleaned 🧹 weights available for SD 1.5 and SD 2.1. Have a look now! ⬇️
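For context on the "noisy features" pipeline the post refers to, here is a minimal sketch of standard DIFT-style feature extraction with diffusers. The model ID, hooked UNet block, and timestep are illustrative assumptions, not the exact setup from the paper; CleanDIFT's finetuned weights are meant to remove the noise injection and timestep choice entirely.

```python
# Minimal sketch of standard DIFT-style feature extraction (the noisy-input
# baseline the post critiques). Model ID, hooked layer, and timestep t are
# illustrative choices, not the paper's exact configuration.
import torch
from diffusers import StableDiffusionPipeline, DDPMScheduler

pipe = StableDiffusionPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5", torch_dtype=torch.float16
).to("cuda")
unet, vae = pipe.unet, pipe.vae
scheduler = DDPMScheduler.from_config(pipe.scheduler.config)

features = {}
unet.mid_block.register_forward_hook(
    lambda module, args, output: features.update(mid=output)  # cache activations
)

@torch.no_grad()
def noisy_dift_features(image, prompt_embeds, t=261):
    """image: (1, 3, H, W) tensor in [-1, 1]; t is a per-task hyperparameter."""
    latents = vae.encode(image.half().cuda()).latent_dist.mode()
    latents = latents * vae.config.scaling_factor
    # The standard recipe perturbs the clean latents before the UNet pass,
    # which is exactly the information loss the post points out.
    noise = torch.randn_like(latents)
    timestep = torch.tensor([t], device=latents.device)
    noisy_latents = scheduler.add_noise(latents, noise, timestep)
    unet(noisy_latents, timestep, encoder_hidden_states=prompt_embeds)
    return features["mid"]
```

Here `prompt_embeds` would typically be an empty-prompt embedding from the pipeline's text encoder. The post's claim is that after their short task-agnostic finetune, the noise injection and per-task timestep search above are no longer needed.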
📝 Paper: https://arxiv.org/abs/2412.03439
💻 Code: https://github.com/CompVis/cleandift
🤗 Hugging Face: https://huggingface.co/CompVis/cleandift