MuRAG Multimodal Retrieval-Augmented Generator for Open Question Answering over Images and Text Wenhu Chen Hexiang Hu Xi Chen Pat Verga William W. Cohen
MuRAG:MultimodalRetrieval-AugmentedGeneratorforOpenQuestionAnsweringoverImagesandTextWenhuChen,HexiangHu,XiChen,PatVerga,WilliamW.CohenGoogleResearch{wenhuchen,hexiang,patverga,wcohen}@google.comAbstractWhilelanguageModelsstoreamassiveamountofworldknowledgeimplicitlyintheirparameters,evenverylargemo...
2025-05-02
3.64MB 13 页 0
0
10玖币