Abstract: This study proposes the Chinese Fine Alignment in Contrastive Language-Image Pre-training (CFA-CLIP), a fine-grained feature alignment model based on Chinese Contrastive Language-Image ...