I've been optimizing my RAG implementation over the last few weeks, and it currently has this structure:
vector_store -> embedding -> reranker -> llm
I noticed much better results when I made a slight change to the context that gets built after the reranker...
https://winsa2.blogspot.com/p/context-engineering-is-art-form.html
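
For anyone unsure which step I mean by "the context that is created after the reranker", here is a rough sketch of where it sits. This is not the actual change (that's in the link), just an illustration. The `Chunk` type, the `build_context` function, and the `max_chars` budget are all made-up names for this example, not from any library.

```python
from dataclasses import dataclass


@dataclass
class Chunk:
    """Hypothetical container for one reranked passage."""
    text: str
    source: str
    score: float  # reranker relevance score, higher is better


def build_context(chunks, max_chars=2000):
    """Pack reranked chunks into one context string for the LLM prompt.

    Chunks are ordered by score (best first), labeled with an index and
    their source, and added until the character budget would be exceeded.
    The budget counts only the chunk blocks, not the blank-line separators.
    """
    ranked = sorted(chunks, key=lambda c: c.score, reverse=True)
    parts = []
    used = 0
    for i, chunk in enumerate(ranked, start=1):
        block = f"[{i}] (source: {chunk.source})\n{chunk.text}"
        if used + len(block) > max_chars:
            break  # stop at the first chunk that does not fit
        parts.append(block)
        used += len(block)
    return "\n\n".join(parts)
```

Things like ordering, labeling, and how much you pack in are the kind of knobs that live in this step, so it's a cheap place to experiment without touching retrieval or the reranker.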