Skip to content

A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)

License

Notifications You must be signed in to change notification settings

facebookresearch/mmf

Folders and files

NameName
Last commit message
Last commit date

Latest commit

223b137 · Feb 3, 2025
Jan 27, 2023
Apr 5, 2023
Oct 14, 2021
Feb 3, 2025
Nov 15, 2024
Dec 20, 2022
Feb 3, 2025
Nov 15, 2024
Oct 29, 2024
May 8, 2020
May 8, 2020
Mar 9, 2021
Jan 27, 2023
May 8, 2020
Aug 19, 2020
Dec 14, 2021
Jul 11, 2023
Oct 19, 2021
Nov 15, 2021
Apr 5, 2023
Apr 5, 2021

Repository files navigation


MMF is a modular framework for vision and language multimodal research from Facebook AI Research. MMF contains reference implementations of state-of-the-art vision and language models and has powered multiple research projects at Facebook AI Research. See full list of project inside or built on MMF here.

MMF is powered by PyTorch, allows distributed training and is un-opinionated, scalable and fast. Use MMF to bootstrap for your next vision and language multimodal research project by following the installation instructions. Take a look at list of MMF features here.

MMF also acts as starter codebase for challenges around vision and language datasets (The Hateful Memes, TextVQA, TextCaps and VQA challenges). MMF was formerly known as Pythia. The next video shows an overview of how datasets and models work inside MMF. Checkout MMF's video overview.

Installation

Follow installation instructions in the documentation.

Documentation

Learn more about MMF here.

Citation

If you use MMF in your work or use any models published in MMF, please cite:

@misc{singh2020mmf,
  author =       {Singh, Amanpreet and Goswami, Vedanuj and Natarajan, Vivek and Jiang, Yu and Chen, Xinlei and Shah, Meet and
                 Rohrbach, Marcus and Batra, Dhruv and Parikh, Devi},
  title =        {MMF: A multimodal framework for vision and language research},
  howpublished = {\url{https://github.com/facebookresearch/mmf}},
  year =         {2020}
}

License

MMF is licensed under BSD license available in LICENSE file