Paper Reading Group - Session 11
Vision Language Models: Connecting Image Encoders to LLMs
- Presenter: I Putu Agi Karasugi
- Presentation Material: PPT
- Paper: https://arxiv.org/abs/2301.12597
- Code: https://github.com/salesforce/LAVIS/tree/main/projects/blip2