Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Remove pad_with_end_token argument in CLIPTokenizer. #2051

Closed

Conversation

james77777778
Copy link
Collaborator

@james77777778 james77777778 commented Jan 18, 2025

Related to #2018

I have verified that CLIPTokenizer should always use end_token_id as pad_token_id in both CLIP and SD3.
I’m not sure why I initially implemented pad_with_end_token arg. It might be related to how the internal SD3 code was implemented.

CLIPPreprocessor is still needed for both CLIP and SD3. When I implemented it, I didn’t have a clear idea about adding a task class for CLIP.
@mattdangerw @divyashreepathihalli WDYT?

The script to verify the outputs:

import keras
import numpy as np
import transformers

import keras_hub

text = "a cat sitting on the table"

tokenizer = keras_hub.models.Tokenizer.from_preset("clip_vit_base_patch32")
preprocessor = keras_hub.models.CLIPPreprocessor(tokenizer, sequence_length=16)
keras_results = preprocessor(text)
print(keras_results)

tokenizer = transformers.CLIPTokenizerFast.from_pretrained(
    "openai/clip-vit-base-patch32"
)
transformers_results = tokenizer(text, padding="max_length", max_length=16)
print(transformers_results)

np.testing.assert_allclose(
    keras.ops.convert_to_numpy(keras_results["token_ids"]),
    transformers_results["input_ids"],
)
np.testing.assert_allclose(
    keras.ops.convert_to_numpy(keras_results["padding_mask"]),
    transformers_results["attention_mask"],
)

@james77777778 james77777778 added the kokoro:force-run Runs Tests on GPU label Jan 18, 2025
@kokoro-team kokoro-team removed the kokoro:force-run Runs Tests on GPU label Jan 18, 2025
@james77777778 james77777778 force-pushed the clean-up-clip-tokenizer branch from 9871bb1 to 8be7564 Compare January 18, 2025 16:58
@james77777778 james77777778 added the kokoro:force-run Runs Tests on GPU label Jan 18, 2025
@kokoro-team kokoro-team removed the kokoro:force-run Runs Tests on GPU label Jan 18, 2025
@james77777778 james77777778 marked this pull request as draft January 19, 2025 05:45
@james77777778
Copy link
Collaborator Author

I think the original impl is correct.

  • OpenAI/CLIP: Use the end token as the padding token
  • OpenCLIP: Use 0 (the token for !) as the padding token

Therefore, we still need the pad_with_end_token argument.

This PR should be closed now.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants