Evaluating modular RAG with reasoning models