Demand-driven navigation (DDN) refers to identifying and locating objects based on implicit user needs, although in dynamic and uncertain scenarios where the locations of objects are unknown. Traditional data-driven methods rely on pre-collected data for model training and decision-making, which limits their ability to generalize in unseen scenarios. In this paper, we propose CogDDN, a framework that emulates the human attentional mechanism by selectively focusing on key objects crucial for fulfilling user demands. CogDDN incorporates a dual-process decision-making module, comprising a Heuristic Process (System-I) for fast and efficient decision-making, with an Analytic Process (System-II) that analyzes past errors, accumulates them in a knowledge base and continuously improves its performance. Chain of Thought (CoT) reasoning is employed to strengthen the decision-making process. Extensive closed-loop evaluations on the AI2Thor simulator with the ProcThor dataset demonstrate that CogDDN outperforms single-view camera-only methods by 15%, showing significant improvements in navigation accuracy and adaptability.
@misc{huang2025cogddncognitivedemanddrivennavigation,
title={CogDDN: A Cognitive Demand-Driven Navigation with Decision Optimization and Dual-Process Thinking},
author={Yuehao Huang and Liang Liu and Shuangming Lei and Yukai Ma and Hao Su and Jianbiao Mei and Pengxiang Zhao and Yaqing Gu and Yong Liu and Jiajun Lv},
year={2025},
eprint={2507.11334},
archivePrefix={arXiv},
primaryClass={cs.AI},
url={https://arxiv.org/abs/2507.11334},
}