Datasets API¶
Loaders for standard Temporal RAG benchmarks.
TEMPO (Primary)¶
A dataset for temporal retrieval and reasoning across multiple domains (History, Politics, Finance, etc.).
TimeQA (Reasoning)¶
Question Answering dataset requiring explicit temporal constraint satisfaction.
SituatedQA¶
Dataset where answers change depending on the temporal context (e.g., "Who represents District 5?").
TimeBench¶
Complex temporal reasoning benchmark involving reasoning over timelines.
Complex TempQA¶
Multi-hop queries requiring synthesis of multiple temporal facts.
Dataset Structure¶
All loaders return a list of TemporalQuery objects: