机器视觉 论文笔记:Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering 2024-07-09 852