Following the conversation in the Discord channel, I'm planning to begin working on Retrieval-Augmented Generation (RAG). I plan to use FAISS as the vector database and PyPDF to extract text from PDF files.
Please let me know if these new dependencies are ok.
I am also testing image-description (image-to-text) models so that descriptions of the images in a PDF can be included with the text extracted by PyPDF.
For the embeddings, I am planning to use mixedbread-ai/mxbai-embed-large-v1, based on its model size and MTEB performance. Please let me know if another model is preferred.
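To make the proposal concrete, here is a minimal sketch of how these pieces could fit together. It assumes the `pypdf`, `faiss-cpu`, and `sentence-transformers` packages are installed. The function names, the character-based chunking, the chunk sizes, and the choice of a flat inner-product index are illustrative assumptions, not decisions taken from the project.

```python
from typing import List, Tuple


def chunk_text(text: str, size: int = 500, overlap: int = 50) -> List[str]:
    """Split text into overlapping character windows (sizes are illustrative)."""
    if not 0 <= overlap < size:
        raise ValueError("overlap must be >= 0 and smaller than size")
    step = size - overlap
    chunks = [text[i:i + size] for i in range(0, len(text), step)]
    # Drop windows that contain only whitespace.
    return [c for c in chunks if c.strip()]


def extract_pdf_text(path: str) -> str:
    """Concatenate the text of every page in a PDF using PyPDF."""
    from pypdf import PdfReader  # imported lazily: only needed for PDF input

    reader = PdfReader(path)
    return "\n".join(page.extract_text() or "" for page in reader.pages)


def build_index(chunks: List[str], model):
    """Embed the chunks and store them in a flat FAISS inner-product index."""
    import faiss  # imported lazily: only needed once an index is built

    if not chunks:
        raise ValueError("no text chunks to index")
    # With L2-normalized vectors, inner product equals cosine similarity.
    emb = model.encode(chunks, normalize_embeddings=True).astype("float32")
    index = faiss.IndexFlatIP(emb.shape[1])
    index.add(emb)
    return index


def search(query: str, chunks: List[str], index, model, k: int = 3) -> List[Tuple[str, float]]:
    """Return up to k (chunk, score) pairs most similar to the query."""
    # Note: the model card recommends a query prompt for retrieval;
    # it is omitted here to keep the sketch short.
    q = model.encode([query], normalize_embeddings=True).astype("float32")
    scores, ids = index.search(q, min(k, len(chunks)))
    return [(chunks[i], float(s)) for i, s in zip(ids[0], scores[0]) if i != -1]


def demo(pdf_path: str, question: str) -> None:
    """End-to-end run: PDF -> chunks -> embeddings -> FAISS -> top matches."""
    from sentence_transformers import SentenceTransformer

    model = SentenceTransformer("mixedbread-ai/mxbai-embed-large-v1")
    chunks = chunk_text(extract_pdf_text(pdf_path))
    index = build_index(chunks, model)
    for text, score in search(question, chunks, index, model):
        print(f"{score:.3f}  {text[:80]!r}")
```

Calling `demo("example.pdf", "What is this document about?")` with a PDF of your own (the file name is a placeholder) prints the best-matching chunks with their similarity scores. A flat index does exact search and is likely enough for a small document set; an approximate FAISS index would only matter at much larger scale.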
Hey @4binas that sounds great! I am totally fine with your choices and whenever you have a first version, feel free to open a pull request and then we can discuss any details for merging it into the existing project. If you like, please keep me up to date and if you have any questions about the current code base, just let me know.