-
Notifications
You must be signed in to change notification settings - Fork 200
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[WWB]: Add ImageText-to-Image pipeline validation #1373
[WWB]: Add ImageText-to-Image pipeline validation #1373
Conversation
is image to image sensitive to random |
the problem came from the difference in the resolutions of generated images by HF and GenAI libs. |
I am getting good accuracy convergence for big models, e.g. SD-XL but cannot make tests working with GenAI for any dummy model (I tried several). Waiting for fixes from OV. |
…nvino.genai into ak/wwb_inpainting
@ilya-lavrenov, CI is passed with non-dummy model. |
prompt, | ||
image=image_data, | ||
num_inference_steps=num_inference_steps, | ||
strength=0.8, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
do you run optimum / diffusers with the same strength
value?
I just don't see where reference image2timage is called..
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Good point. It is 0.8 by default in Diffusers but it is better to set it explicitly.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Actually, it is already set to 0.8 as well in the default_gen_image_fn
of im2im pipeline.
@@ -65,6 +67,7 @@ def test_image_model_types(model_id, model_type, backend): | |||
@pytest.mark.parametrize( | |||
("model_id", "model_type"), | |||
[ | |||
("dreamlike-art/dreamlike-anime-1.0", "image-to-image"), |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
maybe echarlaix/tiny-random-latent-consistency
can work here instead of non-working echarlaix/tiny-random-stable-diffusion-xl
?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I tried several of them including the one you mentioned and they do not work due the known bug.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
do you need torch model in these tests or openvino only enough?
We have several preconverted models here https://huggingface.co/collections/OpenVINO/image-generation-67697d9952fb1eee4a252aa8 (I requested dreamlike models too, but current status is waiting approval)
among trained but still small is https://huggingface.co/segmind/tiny-sd
but not sure that used in model scheduler currently supported
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
also for speedup testing, you can expose num_infeence_steps parameter in cli (I suppose even with streigh=0.8, it runs 40 steps
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
do you need torch model in these tests or openvino only enough? We have several preconverted models here https://huggingface.co/collections/OpenVINO/image-generation-67697d9952fb1eee4a252aa8 (I requested dreamlike models too, but current status is waiting approval)
among trained but still small is https://huggingface.co/segmind/tiny-sd but not sure that used in model scheduler currently supported
Yes, I need a Torch model and I tried "segmind/tiny-sd" but it didn't work OOB.
And, yes num_infeence_steps
is already exposed and I am planning to use it in the test to reduce the test time if I don't find a dummy model that works.
CVS-159223