Add giga embeddings #1741
I would appreciate the metadata addition, but otherwise it looks good. Of course, let's wait until we have had a look at the differences in scores.
    use_instructions=True,
)
Can we add the training data annotation as well? (We are going through the models and adding that; see #1561.)
They haven't published a report yet, so I don't know anything about the training dataset.
# to passage prompts won't be applied to passages
if (
    not self.apply_instruction_to_passages
    and prompt_type == PromptType.passage
    and task.metadata.type == "s2p"
):
    instruction = None
Why?
Similar to jasper and nv-embed, this model doesn't use a prompt for passages. I think it would be helpful to add this to the base class.
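For illustration, the passage rule discussed in this thread can be sketched in isolation. This is a minimal sketch, not the code in this PR: `PromptType` here is a local stand-in for the mteb enum, and `resolve_instruction`, `task_type`, and the default value of `apply_instruction_to_passages` are hypothetical names and choices introduced only for the example.

```python
from enum import Enum
from typing import Optional


class PromptType(str, Enum):
    """Local stand-in for mteb's PromptType (assumption: only these two roles matter here)."""

    query = "query"
    passage = "passage"


def resolve_instruction(
    instruction: Optional[str],
    prompt_type: Optional[PromptType],
    task_type: str,
    apply_instruction_to_passages: bool = True,
) -> Optional[str]:
    """Return the instruction to prepend, or None to encode the text without one.

    Hypothetical helper mirroring the condition in the diff: the instruction is
    dropped only for passages of "s2p" tasks when the flag is switched off.
    """
    if (
        not apply_instruction_to_passages
        and prompt_type == PromptType.passage
        and task_type == "s2p"
    ):
        return None
    return instruction


# Example instruction text, made up for this sketch.
prompt = "Given a question, retrieve relevant passages"

# Queries keep the instruction even when passages are exempt.
query_instruction = resolve_instruction(
    prompt, PromptType.query, "s2p", apply_instruction_to_passages=False
)

# Passages of an s2p task are encoded without an instruction.
passage_instruction = resolve_instruction(
    prompt, PromptType.passage, "s2p", apply_instruction_to_passages=False
)
```

Keeping the check behind a flag means models that do prompt their passages are unaffected unless they opt out, which is why it seems reasonable to host it in the base class.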
Added InstructSentenceTransformerWrapper to use SentenceTransformer models with instructions.
Ref embeddings-benchmark/results#77
@ekolodin My results are a bit higher. Could you rerun your results using this implementation, or provide your implementation? My code for run
Checklist
make test
make lint
Adding a model checklist
mteb.get_model(model_name, revision) and mteb.get_model_meta(model_name, revision)