PolCLIP: A Unified Image-Text Word Sense Disambiguation Model via Generating Multimodal Complementary Representations
-
Updated
Mar 30, 2024 - Jupyter Notebook
PolCLIP: A Unified Image-Text Word Sense Disambiguation Model via Generating Multimodal Complementary Representations
Add a description, image, and links to the multimodal-wsd topic page so that developers can more easily learn about it.
To associate your repository with the multimodal-wsd topic, visit your repo's landing page and select "manage topics."