From bb138b01eb37d73109ddf2cada85e112d8990477 Mon Sep 17 00:00:00 2001 From: Wenbo Hu Date: Thu, 24 Aug 2023 17:47:17 -0700 Subject: [PATCH] Update README.md --- README.md | 6 +++++- 1 file changed, 5 insertions(+), 1 deletion(-) diff --git a/README.md b/README.md index c6a026c..652110e 100644 --- a/README.md +++ b/README.md @@ -449,7 +449,11 @@ The goal of vision-language instruction tuning is to train a model that can effe - InstructBLIP: Towards General-purpose Vision-Language Models with Instruction Tuning [[paper]](https://arxiv.org/abs/2305.06500) [[code]](https://github.com/salesforce/LAVIS/tree/main/projects/instructblip) - `arXiv 2023/05` + `arXiv 2023/05` + +- BLIVA: A Simple Multimodal LLM for Better Handling of Text-Rich Visual Questions [[paper]](https://arxiv.org/abs/2308.09936) [[code]](https://github.com/mlpc-ucsd/BLIVA) + + `arXiv 2023/08`