A lightweight, self-hosted friendly RSS aggregator and reader
Deep Learning-based Image Fusion: A Survey
Use Autodesk Fusion 360 on Linux
A tool to snap pixels to a perfect grid
Multi-Modal Neural Networks for Semantic Search, based on Mid-Fusion
Multi-Scale Fusion of Locally-Global Descriptors for Place Recognition
Director, Screenwriter, Producer, and Video Generator All-in-One
TorchMultimodal is a PyTorch library
A guide for how to use your smartphone to code anywhere at anytime
Must Reading Papers, Research Library, Open-Source Code
Programmer's guide about how to cook at home
Multimodal-Driven Architecture for Customized Video Generation
Fast inference engine for Transformer models
3D reconstruction software
Advanced techniques for RAG systems
The Compute Library is a set of computer vision and machine learning
Robust robotic localization and mapping
Forward and reverse mode automatic differentiation primitives
Burn is a new comprehensive dynamic Deep Learning Framework
Parallel solvers for sparse linear systems featuring multigrid methods
Marrying Grounding DINO with Segment Anything & Stable Diffusion
An open source Vagrant configuration for developing with WordPress
Foundational Models for State-of-the-Art Speech and Text Translation
Implementation of Make-A-Video, new SOTA text to video generator
VMZ: Model Zoo for Video Modeling