Computer Science > Computer Vision and Pattern Recognition

arXiv:2402.18411 (cs)

[Submitted on 28 Feb 2024 (v1), last revised 24 Mar 2024 (this version, v2)]

Title: Unsupervised Cross-Domain Image Retrieval via Prototypical Optimal Transport

Authors: Bin Li, Ye Shi, Qian Yu, Jingya Wang

Abstract: Unsupervised cross-domain image retrieval (UCIR) aims to retrieve images sharing the same category across diverse domains without relying on labeled data. Prior approaches have typically decomposed the UCIR problem into two distinct tasks: intra-domain representation learning and cross-domain feature alignment. However, these segregated strategies overlook the potential synergies between these tasks. This paper introduces ProtoOT, a novel Optimal Transport formulation explicitly tailored for UCIR, which integrates intra-domain feature representation learning and cross-domain alignment into a unified framework. ProtoOT leverages the strengths of the K-means clustering method to effectively manage distribution imbalances inherent in UCIR. By utilizing K-means for generating initial prototypes and approximating class marginal distributions, we modify the constraints in Optimal Transport accordingly, significantly enhancing its performance in UCIR scenarios. Furthermore, we incorporate contrastive learning into the ProtoOT framework to further improve representation learning. This encourages local semantic consistency among features with similar semantics, while also explicitly enforcing separation between features and unmatched prototypes, thereby enhancing global discriminativeness. ProtoOT surpasses existing state-of-the-art methods by a notable margin across benchmark datasets. Notably, on DomainNet, ProtoOT achieves an average P@200 enhancement of 24.44%, and on Office-Home, it demonstrates a P@15 improvement of 12.12%. Code is available at this https URL.
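
The abstract's central idea, using K-means both to initialise prototypes and to estimate the class marginals that constrain the transport plan, can be illustrated with a small sketch. This is not the authors' ProtoOT implementation: it uses plain entropic (Sinkhorn) optimal transport on synthetic, imbalanced features, and the function names `kmeans` and `sinkhorn`, the cosine cost, the regularisation strength `eps`, and the cluster sizes are all assumptions made for illustration.

```python
import numpy as np


def kmeans(x, k, iters=20, seed=0):
    """Plain Lloyd's K-means on the rows of x; returns (centroids, labels)."""
    rng = np.random.default_rng(seed)
    centroids = x[rng.choice(len(x), size=k, replace=False)]  # fancy indexing copies
    for _ in range(iters):
        dist = ((x[:, None, :] - centroids[None, :, :]) ** 2).sum(-1)
        labels = dist.argmin(1)
        for j in range(k):
            if np.any(labels == j):  # an empty cluster keeps its old centroid
                centroids[j] = x[labels == j].mean(0)
    # Recompute labels so they match the final centroids.
    labels = ((x[:, None, :] - centroids[None, :, :]) ** 2).sum(-1).argmin(1)
    return centroids, labels


def sinkhorn(cost, a, b, eps=0.05, iters=200):
    """Entropic OT: plan with column sums b and (approximately) row sums a."""
    kernel = np.exp(-cost / eps)
    u = np.ones_like(a)
    v = np.ones_like(b)
    for _ in range(iters):
        u = a / (kernel @ v)      # scale rows towards marginal a
        v = b / (kernel.T @ u)    # scale columns to marginal b
    return u[:, None] * kernel * v[None, :]


# Synthetic, imbalanced "features": three clusters of 60, 30 and 10 points.
rng = np.random.default_rng(0)
sizes = [60, 30, 10]
centers = 3.0 * rng.normal(size=(3, 8))
feats = np.concatenate(
    [c + 0.3 * rng.normal(size=(m, 8)) for c, m in zip(centers, sizes)]
)
feats /= np.linalg.norm(feats, axis=1, keepdims=True)

# K-means supplies the initial prototypes and the cluster assignments.
protos, labels = kmeans(feats, k=3)
protos /= np.linalg.norm(protos, axis=1, keepdims=True)

n = len(feats)
a = np.full(n, 1.0 / n)                    # each feature carries equal mass
b = np.bincount(labels, minlength=3) / n   # prototype marginal from K-means sizes
cost = 1.0 - feats @ protos.T              # cosine distance (an assumed cost)

plan = sinkhorn(cost, a, b)
assignments = plan.argmax(axis=1)          # hard prototype assignment per feature
```

Replacing `b` with a uniform vector would force every prototype to receive the same total mass, which is the balanced constraint that the abstract says is modified to cope with imbalanced distributions. The contrastive learning component described in the abstract is not sketched here.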

Comments:	Accepted by AAAI 2024
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2402.18411 [cs.CV]
	(or arXiv:2402.18411v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2402.18411

Submission history

From: Bin Li
[v1] Wed, 28 Feb 2024 15:31:45 UTC (876 KB)
[v2] Sun, 24 Mar 2024 12:04:11 UTC (875 KB)
