Question on CogVLM image input resolution at inference time. #372

aldoz-mila · 2024-02-05T22:19:51Z

aldoz-mila
Feb 5, 2024

Hi all, I am currently playing with the CogVLM17b model (using this checkpoint: https://huggingface.co/THUDM/cogvlm-chat-hf). What is the resizing resolution of the vision encoder? The ending resolution during the training is 490 × 490 according to the paper, I assume this is also the image resolution used during inference? Thanks!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Question on CogVLM image input resolution at inference time. #372

{{title}}

Replies: 0 comments

Select a reply

Question on CogVLM image input resolution at inference time. #372

aldoz-mila Feb 5, 2024

Replies: 0 comments

aldoz-mila
Feb 5, 2024