Large language models as virtual experts?: Evaluating AHP-based criteria weighting performance for solar power plant site selection

Multi-Criteria Decision Making (MCDM) methods integrated with Geographic Information Systems (GIS) are essential for spatial planning, particularly in renewable energy development. The Analytic Hierarchy Process (AHP) traditionally relies on expert elicitation for criteria weighting, which can be co...

Full description

Saved in:
Bibliographic Details
Main Authors: Memduhoğlu, Abdulkadir (Author) , Fulman, Nir (Author) , Polat, Nizar (Author) , Ataş, Tacettin (Author)
Format: Article (Journal)
Language:English
Published: 1 March 2026
In: Expert systems with applications
Year: 2026, Volume: 299, Pages: 1-17
ISSN:1873-6793
DOI:10.1016/j.eswa.2025.130171
Online Access:Verlag, lizenzpflichtig, Volltext: https://doi.org/10.1016/j.eswa.2025.130171
Verlag, lizenzpflichtig, Volltext: https://www.sciencedirect.com/science/article/pii/S0957417425037868
Get full text
Author Notes:Abdulkadir Memduhoğlu, Nir Fulman, Nizar Polat, Tacettin Ataş
Description
Summary:Multi-Criteria Decision Making (MCDM) methods integrated with Geographic Information Systems (GIS) are essential for spatial planning, particularly in renewable energy development. The Analytic Hierarchy Process (AHP) traditionally relies on expert elicitation for criteria weighting, which can be costly, time-consuming, and inconsistent. This study develops a framework to evaluate Large Language Models (LLMs) as virtual experts for spatial MCDM, addressing the need for accessible and reliable decision-support alternatives in urban and environmental systems. We empirically compared six contemporary LLMs with a panel of ten regional domain experts in determining AHP criteria weights for solar power plant site selection. The framework tests multiple prompting strategies, assesses architectural performance, examines stability across iterations, and identifies biases. Fourteen spatially explicit criteria were used, spanning resource potential, topography, environmental constraints, and infrastructure connectivity. Optimally configured LLMs achieve strong correlations with expert consensus (r = 0.838), approaching typical inter-expert agreement levels, though systematic biases in specific criterion categories reveal important limitations in expert simulation. Expert Panel Role-Based prompting outperformed Chain-of-Thought and Minimal Context methods. Stability analysis revealed significant reliability differences across architectures, while systematic biases included underestimation of topographical factors and overemphasis on infrastructure connectivity. The framework provides empirical performance benchmarks and identifies model-specific optimization needs and bias correction protocols. This research establishes a controlled testbed for exploring AI decision-making patterns rather than replacing expert processes in high-stakes energy infrastructure decisions.
Item Description:Ahead of print
Gesehen am 03.02.2026
Physical Description:Online Resource
ISSN:1873-6793
DOI:10.1016/j.eswa.2025.130171