arctanbell

arctanbell

2 followers · 13 following

DeepSeek-VL2 Public
Forked from deepseek-ai/DeepSeek-VL2

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Python MIT License Updated Jan 16, 2025
mem0 Public
Forked from mem0ai/mem0

The Memory layer for your AI apps

Python Apache License 2.0 Updated Dec 8, 2024
SplatFormer Public
Forked from ChenYutongTHU/SplatFormer

SplatFormer: Point Transformer for Robust 3D Gaussian Splatting

Python Updated Nov 26, 2024
wvp-GB28181-pro Public
Forked from 648540858/wvp-GB28181-pro

WEB VIDEO PLATFORM是一个基于GB28181-2016标准实现的网络视频平台，支持NAT穿透，支持海康、大华、宇视等品牌的IPC、NVR、DVR接入。支持国标级联，支持rtsp/rtmp等视频流转发到国标平台，支持rtsp/rtmp等推流转发到国标平台。

Java MIT License Updated Oct 17, 2024
LLaVA-NeXT Public
Forked from LLaVA-VL/LLaVA-NeXT

Python Apache License 2.0 Updated Oct 16, 2024
ShareGPT4Video Public
Forked from ShareGPT4Omni/ShareGPT4Video

[NeurIPS 2024] An official implementation of ShareGPT4Video: Improving Video Understanding and Generation with Better Captions

Python Updated Oct 9, 2024
LLaMA-Omni Public
Forked from ictnlp/LLaMA-Omni

LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.

Python Apache License 2.0 Updated Sep 23, 2024
SlowFast Public
Forked from facebookresearch/SlowFast

PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.

Python Apache License 2.0 Updated Aug 13, 2024
pipecat Public
Forked from pipecat-ai/pipecat

Open Source framework for voice and multimodal conversational AI

Python BSD 2-Clause "Simplified" License Updated Aug 12, 2024
CosyVoice Public
Forked from FunAudioLLM/CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python Apache License 2.0 Updated Aug 8, 2024
VastGaussian Public
Forked from kangpeilun/VastGaussian

This is an unofficial Implementation

C++ Apache License 2.0 Updated Jul 28, 2024
LW-DETR Public
Forked from Atten4Vis/LW-DETR

This repository is an official implementation of the paper "LW-DETR: A Transformer Replacement to YOLO for Real-Time Detection".

Python Apache License 2.0 Updated Jul 25, 2024
2d-gaussian-splatting Public
Forked from hbb1/2d-gaussian-splatting

[SIGGRAPH'24] 2D Gaussian Splatting for Geometrically Accurate Radiance Fields

Python Other Updated Jun 5, 2024
GaussianPro Public
Forked from kcheng1021/GaussianPro

[ICML2024] Official code for GaussianPro: 3D Gaussian Splatting with Progressive Propagation

Python MIT License Updated May 31, 2024
FunASR Public
Forked from modelscope/FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python Other Updated May 28, 2024
discocal Public
Forked from chaehyeonsong/discocal

C++ MIT License Updated May 7, 2024
gaussian-splatting Public
Forked from graphdeco-inria/gaussian-splatting

Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"

Python Other Updated May 6, 2024
AISP Public
Forked from mv-lab/AISP

AI Image SIgnal Processing and Computational Photography - Bokeh Rendering , Reversed ISP Challenge, Model-Based Image Signal Processors via Learnable Dictionaries. Official repo for NTIRE and AIM …

Jupyter Notebook Updated Apr 20, 2024
Person_reID_baseline_pytorch Public
Forked from layumi/Person_reID_baseline_pytorch

⛹️ Pytorch ReID: A tiny, friendly, strong pytorch implement of person re-id / vehicle re-id baseline. Tutorial 👉https://github.com/layumi/Person_reID_baseline_pytorch/tree/master/tutorial

Python MIT License Updated Apr 15, 2024
yolo_tracking Public
Forked from mikel-brostrom/boxmot

BoxMOT: pluggable SOTA tracking modules for segmentation, object detection and pose estimation models

Python GNU Affero General Public License v3.0 Updated Apr 3, 2024
projectaria_tools Public
Forked from facebookresearch/projectaria_tools

projectaria_tools is an C++/Python open-source toolkit to interact with Project Aria data

C++ Apache License 2.0 Updated Mar 29, 2024
co-tracker Public
Forked from facebookresearch/co-tracker

CoTracker is a model for tracking any point (pixel) on a video.

Jupyter Notebook Other Updated Mar 28, 2024
sherpa-onnx Public
Forked from k2-fsa/sherpa-onnx

Speech-to-text, text-to-speech, and speaker recongition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, x86_64 servers, webs…

C++ Apache License 2.0 Updated Feb 28, 2024
espnet Public
Forked from espnet/espnet

End-to-End Speech Processing Toolkit

Python Apache License 2.0 Updated Feb 27, 2024
Depth-Anything Public
Forked from LiheYoung/Depth-Anything

Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation

Python Apache License 2.0 Updated Feb 21, 2024
edge-tts Public
Forked from rany2/edge-tts

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

Python GNU General Public License v3.0 Updated Feb 16, 2024
build-openwrt Public
Forked from topak47/build-openwrt

利用Actions在线云编译openwrt固件，适合官方源码，lede，lienol和immortalwrt源码，支持X86，电视盒子等众多设备！

Shell GNU General Public License v2.0 Updated Feb 4, 2024
act-plus-plus Public
Forked from MarkFzp/act-plus-plus

Imitation Learning algorithms with Co-traing for Mobile ALOHA: ACT, Diffusion Policy, VINN

Python MIT License Updated Jan 4, 2024
mobile-aloha Public
Forked from MarkFzp/mobile-aloha

Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation

Jupyter Notebook MIT License Updated Jan 3, 2024
sound_distance_estimation Public
Forked from sakshamsingh1/sound_distance_estimation

Official implementation of "sound distance estimation" WASPAA 23

Python Updated Dec 31, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

arctanbell

Block or report arctanbell

DeepSeek-VL2 Public

mem0 Public

SplatFormer Public

wvp-GB28181-pro Public

LLaVA-NeXT Public

ShareGPT4Video Public

LLaMA-Omni Public

SlowFast Public

pipecat Public

CosyVoice Public

VastGaussian Public

LW-DETR Public

2d-gaussian-splatting Public

GaussianPro Public

FunASR Public

discocal Public

gaussian-splatting Public

AISP Public

Person_reID_baseline_pytorch Public

yolo_tracking Public

projectaria_tools Public

co-tracker Public

sherpa-onnx Public

espnet Public

Depth-Anything Public

edge-tts Public

build-openwrt Public

act-plus-plus Public

mobile-aloha Public

sound_distance_estimation Public