We obtain non-confusing textual embeddings of a concept by fine-tuning CLIP, contrasting the concept with the over-segmented visual regions of other concepts.
Source
Collected at
2024/06/04 08:55
Linked materials
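The quoted sentence does not say which loss is used, so the following is only a rough sketch of one way such a contrast could be set up. It assumes an InfoNCE-style objective over precomputed embeddings: the text embedding of a concept is pulled toward region embeddings of that concept and pushed away from over-segmented regions of other concepts. The function names, the temperature value, and the use of plain Python lists in place of real CLIP features are all illustrative assumptions, not details taken from the source.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def concept_region_loss(text_emb, own_regions, other_regions, temperature=0.07):
    """Hypothetical InfoNCE-style loss for one concept.

    text_emb      : text embedding of the concept (assumed to come from CLIP)
    own_regions   : region embeddings that belong to the concept (positives)
    other_regions : over-segmented region embeddings of other concepts (negatives)
    temperature   : softmax temperature (0.07 is an assumed default)
    """
    # Negatives are shared across all positives of this concept.
    neg_sum = sum(
        math.exp(cosine(text_emb, r) / temperature) for r in other_regions
    )
    losses = []
    for region in own_regions:
        pos = math.exp(cosine(text_emb, region) / temperature)
        # Low when the text embedding matches its own region far better
        # than any region of another concept.
        losses.append(-math.log(pos / (pos + neg_sum)))
    return sum(losses) / len(losses)


# Toy 2-D embeddings: a well-separated concept vs. a confused one.
separated = concept_region_loss([1.0, 0.0], [[1.0, 0.0]], [[0.0, 1.0]])
confused = concept_region_loss([0.0, 1.0], [[1.0, 0.0]], [[0.0, 1.0]])
print(separated < confused)
```

In an actual fine-tuning setup this scalar would be computed on CLIP text and image features and backpropagated through the text encoder. That part is omitted here because the note gives no details about it.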