Retrieval-Augmented Generation (RAG) effectively grounds LLM outputs in external knowledge, but does not model the runtime context, such as user identity, session state, or domain constraints, on ...