Paper Reading Group - Session 11

Vision Language Models: Connecting Image Encoders to LLMs

Presenter: I Putu Agi Karasugi
Presentation Material: PPT
Paper: https://arxiv.org/abs/2301.12597
Code: https://github.com/salesforce/LAVIS/tree/main/projects/blip2