MiniGPT-4/MiniGPTv2_Train .md at eb560920e00a356a0b3ba90e5bf9736f8998441b

mirror of https://github.com/Vision-CAIR/MiniGPT-4.git synced 2025-04-05 02:20:47 +00:00

junchen14 89878d661e update dataset readme

2023-10-25 07:52:44 +03:00

Finetune of MiniGPT-4

The training of MiniGPT-4 contains two alignment stages.

1. First pretraining stage

In the first pretraining stage, the model is trained on image-text pairs from the Laion and CC datasets to align the vision and language models. To download and prepare the datasets, please check our first stage dataset preparation instruction. After the first stage, the visual features are mapped into a form that the language model can understand. In our experiments, we use 4 A100 GPUs. You can change the save path in the config file train_configs/minigpt4_stage1_pretrain.yaml. To launch the first stage training, run the following command, replacing NUM_GPU with the number of GPUs to use:

torchrun --nproc-per-node NUM_GPU train.py --cfg-path train_configs/minigpt4_stage1_pretrain.yaml
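
As a minimal sketch of how the placeholder might be filled in, the snippet below builds the launch command for the 4-GPU setup mentioned above and prints it for inspection before anything is started. The variable names NUM_GPU, CFG_PATH, and CMD are illustrative only and are not part of the repository; adjust the GPU count to match your machine.

```shell
# Assumption: one training process per GPU; 4 matches the A100 setup described above.
NUM_GPU=4

# Config file named in this README; the save path is set inside this file.
CFG_PATH=train_configs/minigpt4_stage1_pretrain.yaml

# Assemble the launch command as a string so it can be checked first (dry run).
CMD="torchrun --nproc-per-node $NUM_GPU train.py --cfg-path $CFG_PATH"
echo "$CMD"

# To actually start training, run the printed command from the repository root,
# for example by uncommenting the next line:
# $CMD
```

Printing the command first makes it easy to confirm the GPU count and config path before committing the machine to a long training run.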

A MiniGPT-4 checkpoint with only stage one training can be downloaded here (13B) or here (7B). Compared to the model after stage two, this checkpoint frequently generates incomplete and repeated sentences.
