Mismatch paper's approach and README pretrain command

I found that the pretraining phase from this code is a bit different from what I understand about the paper. According to 2 images below, only the Image modality is intra-contrastive with the aid of a semantic module.
<img width="1569" height="753" alt="Image" src="https://github.com/user-attachments/assets/8d9af0c8-1567-410b-a497-a0c23bad6c9b" />
<img width="1266" height="530" alt="Image" src="https://github.com/user-attachments/assets/f020dbb3-c58c-435b-8fc9-c58b2fa1f535" />

However, the recommended pretraining command in README says the differ with both `--separate_text` and `--separte_image` are activated. If I understand the paper correctly, only `--separate_image` should be used.
https://github.com/sarahESL/AlignCLIP/blob/a18e8058ce67c4b5490f2ab903fd887b0dc8fb03/README.md?plain=1#L26

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Mismatch paper's approach and README pretrain command #6

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Mismatch paper's approach and README pretrain command #6

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions