《Using social media images for building function classification》
打印
- 作者
- Eike Jens Hoffmann;Karam Abdulahhad;Xiao Xiang Zhu
- 来源
- CITIES,Vol.134,Issue1,Article 104107
- 语言
- 英文
- 关键字
- Social media image analysis;Big data analytics;Building function classification;Urban land use
- 作者单位
- Technical University of Munich, Chair of Data Science in Earth Observation, Arcisstraße 21, Munich 80333, Germany;German Aerospace Center, Remote Sensing Technology Institute, Muenchener Straße 20, Weßling 82234, Germany;Technical University of Munich, Chair of Data Science in Earth Observation, Arcisstraße 21, Munich 80333, Germany;German Aerospace Center, Remote Sensing Technology Institute, Muenchener Straße 20, Weßling 82234, Germany
- 摘要
- Urban land use on a building instance level is crucial geo-information for many applications yet challenging to obtain. Steet-level images are highly suited to predict building functions as the building façades provide clear hints. Social media image platforms contain billions of images, including but not limited to street perspectives. This study proposes a filtering pipeline to yield high-quality, ground-level imagery from large-scale social media image datasets to cope with this issue. The pipeline ensures all resulting images have complete and valid geotags with a compass direction to relate image content and spatial objects.We analyze our method on a culturally diverse social media dataset from Flickr with more than 28 million images from 42 cities worldwide. The obtained dataset is then evaluated in the context of a building function classification task with three classes: Commercial, residential, and other. Fine-tuned state-of-the-art architectures yield F1 scores of up to 0.51 on the filtered images. Our analysis shows that the quality of the labels from OpenStreetMap limits the performance. Human-validated labels increase the F1 score by 0.2. Therefore, we consider these labels weak and publish the resulting images from our pipeline and the depicted buildings as a weakly labeled dataset.