ARM: Alignment-oriented Retrieval Method

An optimal retrieval system performs both information and structure alignment

Information alignment: In the example above, both Document A and Table D address concepts in the user query and should therefore be retrieved. However, since A and D do not contain overlapping information, they cannot be connected together to form a complete answer to the query.
Structure alignment: Tables B and C, although not directly referenced in the user question, are necessary to bridge the connection between A and D, and therefore should also be retrieved.

What are existing solutions?

Standard RAG with query decomposition: While these methods excel at aligning information, they fall short in capturing structural alignment, as the bridging objects needed may not be explicitly referenced in the user query.
React-style agentic systems: These methods iteratively explore semantically relevant information but do not explicitly take into account the available data objects and how they are organized. Additionally, they often lack convergence guarantees, which results in significant inefficiency.

What is our solution?

We propose alignment-oriented method that enables an efficient and comprehensive retrieval for complex questions by aligning the question and its decomposition with existing data objects and their organization.
Specifically, we evaluate not only the semantic similarity between the query and each object but also the compatibility between objects themselves. Using these scores, we generate multiple alignment drafts that account for both informational and structural alignment, and ultimately select only the objects from these drafts that the model is most confident in.
Although this paper focuses on tables and documents, our framework can be readily extended to other modalities (e.g., images), provided that a compatibility score can be computed.

How does ARM perform?

We evaluated ARM on three complex retrieval tasks involving both passages and tables, and found that it substantially outperforms traditional RAG systems—with or without query decomposition—as well as ReAct-style agent-based frameworks. ARM delivers superior retrieval and downstream performance while also significantly improving efficiency and reducing monetary costs.

Can we Retrieve Everything All at Once?
ARM: An Alignment-Oriented LLM-based Retrieval Method