-
Notifications
You must be signed in to change notification settings - Fork 248
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
OFA on customised task e.g. OK-VQA #76
Comments
For organizing the TSV file, please refer to the reply in this issue #56 (see the 2nd & 3rd paragraphs). For the creation of |
Hi @yangapku, Thank you for your reply! Are the answer candidates for |
Hi, in fact the 3,129 answer candidates do not covers all the appeared answers in the original VQAv2 dataset. It's just a common practice in the VQAv2 challenge to limit in these relatively frequent answers. We only keep the training and validation samples containing these answers. The label-ids are assigned randomly without specific rules. |
Is there a rule to choose the 'frequent answer candidates' for a customised datasets if not all appeared answers are included? Also at testing/inference are the generated answers also restricted to only the answers in the |
Hi, I would recommend to refer to previous works for OK-VQA on how to determine the proper size of the frequent candidate set. In our work, we just exactly followed the practice of previous works on VQAv2. During inference, the generated answers are restricted to only the answers in the |
Thank you for the clarification! |
Hi, thanks for the awesome work!
I'd like to fine-tune OFA on OK-VQA, I have been trying to follow the instructions of VQA assuming they are similar, but I have raw image input (and question) , how do I convert to what OFA understands? Do I need to follow the format of the example tsv file? Is
trainval_ans2label.pkl
required (if so how do I generate it)?What are the steps to take to extend OFA on OK-VQA?
Thank you in advance for your help!
The text was updated successfully, but these errors were encountered: