avatar
Big Data Science
@bdscience
27.03.2024 20:59
💡📊Training data used in ComCLIP
CLIP (Contrastive Language-Image Pre-Training) is a neural network developed by OpenAI to perform visual as well as language comprehension tasks. The algorithms aim to understand the relationship between text and images.
ComCLIP is an improved version of CLIP for matching text and graphic representations. ComCLIP can mitigate spurious correlations introduced by pre-trained CLIP models and dynamically estimate the importance of each component. Experiments were conducted on four datasets for compositional matching between images and texts.
These datasets are publicly available on the Internet and can be found at this link
GitHub
GitHub - eric-ai-lab/ComCLIP: Official implementation and dataset for the NAACL 2024 paper "ComCLIP: Training-Free Compositional Image and Text Matching"
Official implementation and dataset for the NAACL 2024 paper "ComCLIP: Training-Free Compositional Image and Text Matching" - eric-ai-lab/ComCLIP
1.8K

Обсуждение 0

Обсуждение не доступно в веб-версии. Чтобы написать комментарий, перейдите в приложение Telegram.

Обсудить в Telegram

Big Data Science

3.6K
Big Data Science channel gathers together all interesting facts about Data Science.
For cooperation: a.chernobrovov@gmail.com
💼 — https://t.me/bds_job — channel about Data Science jobs and career
💻 — https://t.me/bdscience_ru — Big Data Science [RU]
Открыть в Telegram