Enhancing Geo-localization for Crowdsourced Flood Imagery via LLM-Guided Attention
ArXiv cs.AI
arXiv:2512.11811v3
Abstract: Crowdsourced social media imagery provides real-time visual evidence of urban flooding but often lacks reliable geographic metadata for emergency response. Existing Visual Place Recognition (VPR) models struggle to geo-localize these images due to cross-source domain shifts and visual distortions. We present VPR-AttLLM, a model-agnostic framework integrating the semantic reasoning and geospatial knowledge of Large Language Models (LLMs) i