MemGen-V: Incentivizing Visual Reasoning with Generative Latent Memory

Motivation

Memory and reasoning are two crucial components of self-evolving AI. Traditional works have explored their potential for LLM agents or VLMs seperately without considering their subtle connection. Recently, MemGen has discovered that memory and reasoning are not discrete but rather interweave in the context of LLM agents. However, few works investigate the memory mechanisms in the context of visual intelligence. In this repo, we want to ask: Can memory benefit visual understanding by incentivizing visual reasoning?

🙏 Acknowledgment

This work is implemented based on VLM-R1 and MemGen. We greatly appreciate their valuable contributions to the community.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
assets		assets
run_scripts		run_scripts
src		src
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
setup.sh		setup.sh
test.py		test.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MemGen-V: Incentivizing Visual Reasoning with Generative Latent Memory

Motivation

🙏 Acknowledgment

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

License

pILLOW-1/MemGen-V

Folders and files

Latest commit

History

Repository files navigation

MemGen-V: Incentivizing Visual Reasoning with Generative Latent Memory

Motivation

🙏 Acknowledgment

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages