Scientists at Mines Paris-PSL University in France have created a dataset of aerial images, segmentation masks, and installation metadata for rooftop PV systems. They conceived the dataset to set up installation registries by extracting small-scale PV metadata from overhead imagery.
“Our dataset provides ground truth installation masks for 13303 images from Google Earth and 7686 images from the French national institute of geographical and forestry information (IGN),” the researchers said, noting that the metadata includes installed power, surface, tilt, and azimuth angles. “To address architectural differences, researchers can either use the coarse-grained location included in our dataset or use our dataset in conjunction with other training datasets that mapped different areas.”
The dataset provides thumbnails with a resolution of 400 × 400 pixels centered around the locations of PV systems. The thumbnails are based on the geolocation of a PV database for deep learning known as Base de données apprentissage profond PV (BDAPPV).
The user initially performs image classification by clicking on an image if it depicts a PV system and then annotators delineate the PV panels on the images.
“Once we generate our PV panels polygons, we match them with the installations’ metadata reported in the BDPV dataset.” the French researchers said, adding that this step involves internal consistency, unique matching, and external consistency.
The file containing PV installations’ metadata and the notebooks used to generate the masks are available in two separate datasets. The dataset provides installation metadata for more than 28,000 installations and segmentation masks for 13,000 installations.
“Dataset applications include end-to-end PV registry construction, robust PV installations mapping, and analysis of crowdsourced datasets,” the scientists said.
They described the dataset in “A crowdsourced dataset of aerial images with annotated solar photovoltaic arrays and installation metadata,” which was recently published in Scientific Data.
“To the best of our knowledge, it is the first time a training dataset contains PV panel images, ground truth labels, and installation metadata,” they concluded.
This content is protected by copyright and may not be reused. If you want to cooperate with us and would like to reuse some of our content, please contact: editors@pv-magazine.com.
By submitting this form you agree to pv magazine using your data for the purposes of publishing your comment.
Your personal data will only be disclosed or otherwise transmitted to third parties for the purposes of spam filtering or if this is necessary for technical maintenance of the website. Any other transfer to third parties will not take place unless this is justified on the basis of applicable data protection regulations or if pv magazine is legally obliged to do so.
You may revoke this consent at any time with effect for the future, in which case your personal data will be deleted immediately. Otherwise, your data will be deleted if pv magazine has processed your request or the purpose of data storage is fulfilled.
Further information on data privacy can be found in our Data Protection Policy.