mirror of https://github.com/Vision-CAIR/MiniGPT-4.git (synced 2025-04-05 02:20:47 +00:00)
Update README_MINIGPTv2_FINETUNE.md
parent 50df66e81e, commit fa19bc09f2
@ -1,26 +1,10 @@
## Download the COCO captions, RefCOCO, RefCOCO+, RefCOCOg, Visual Genome, TextCaps, LLaVA, GQA, AOK-VQA, OK-VQA, OCR-VQA, filtered Flickr-30k, multi-task conversation, and Unnatural instruction datasets

After downloading all of them, organize the data as follows in `./playground/data`:

```
├── coco
│   └── train2017
├── gqa
│   └── images
├── ocr_vqa
│   └── images
├── textvqa
│   └── train_images
└── vg
    ├── VG_100K
    └── VG_100K_2
```
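
To confirm the layout before fine-tuning, a small check along these lines may help. It is only a sketch: the list of sub-directories is copied from the tree above, and the helper name `missing_dirs` is hypothetical, not part of the MiniGPT-4 code base.

```python
from pathlib import Path

# Expected sub-directories, copied from the directory tree above.
EXPECTED_DIRS = [
    "coco/train2017",
    "gqa/images",
    "ocr_vqa/images",
    "textvqa/train_images",
    "vg/VG_100K",
    "vg/VG_100K_2",
]


def missing_dirs(root="./playground/data"):
    """Return the expected sub-directories that do not exist under root.

    Hypothetical helper for illustration; not part of the repository.
    """
    root_path = Path(root)
    return [d for d in EXPECTED_DIRS if not (root_path / d).is_dir()]


if __name__ == "__main__":
    missing = missing_dirs()
    if missing:
        print("Missing directories:")
        for d in missing:
            print("  " + d)
    else:
        print("All expected directories are present.")
```

Running the script from the repository root lists any folder that still needs to be downloaded or moved into place.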

### COCO captions
- [train2017](http://images.cocodataset.org/zips/train2017.zip)
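
If a scripted download is preferred, the following sketch fetches and unpacks the archive with the Python standard library. It assumes the zip unpacks into a `train2017/` folder and that `./playground/data/coco` is the intended target, as the directory tree suggests; the helper name `download_and_extract` is hypothetical.

```python
import urllib.request
import zipfile
from pathlib import Path

# URL taken from the link above.
COCO_TRAIN_URL = "http://images.cocodataset.org/zips/train2017.zip"


def download_and_extract(url, dest_dir):
    """Download a zip archive from url and unpack it into dest_dir.

    Hypothetical helper for illustration; not part of the repository.
    """
    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)
    # Store the archive under its original file name inside dest_dir.
    archive = dest / url.rsplit("/", 1)[-1]
    if not archive.exists():  # skip the download if the archive is already there
        urllib.request.urlretrieve(url, str(archive))
    with zipfile.ZipFile(archive) as zf:
        zf.extractall(dest)
    return dest


# Example call (a very large download, so it is left commented out):
# download_and_extract(COCO_TRAIN_URL, "./playground/data/coco")
```

The archive is kept next to the extracted files so that a re-run does not download it again; delete it afterwards if disk space matters.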

### Visual Genome
- [part1](https://cs.stanford.edu/people/rak248/VG_100K_2/images.zip), [part2](https://cs.stanford.edu/people/rak248/VG_100K_2/images2.zip)

### TextCaps

@ -70,12 +54,6 @@ Similarly, set **ann_path** in all the following configs to the above folder (Lo

### LLaVA

### TextVQA
Images and question-answer pairs will be loaded during evaluation.

### GQA
Images and question-answer pairs will be loaded during evaluation.

### OK-VQA

### AOK-VQA