DFC2022#
- class torchgeo.datasets.DFC2022(root='data', split='train', transforms=None, checksum=False)[source]#
Bases:
NonGeoDatasetDFC2022 dataset.
The DFC2022 dataset is used as a benchmark dataset for the 2022 IEEE GRSS Data Fusion Contest and extends the MiniFrance dataset for semi-supervised semantic segmentation. The dataset consists of a train set containing labeled and unlabeled imagery and an unlabeled validation set. The dataset can be downloaded from the IEEEDataPort DFC2022 website.
Dataset features:
RGB aerial images at 0.5 m per pixel spatial resolution (~2,000x2,0000 px)
DEMs at 1 m per pixel spatial resolution (~1,000x1,0000 px)
Masks at 0.5 m per pixel spatial resolution (~2,000x2,0000 px)
16 land use/land cover categories
Images collected from the IGN BD ORTHO database
DEMs collected from the IGN RGE ALTI database
Labels collected from the UrbanAtlas 2012 database
Data collected from 19 regions in France
Dataset format:
images are three-channel geotiffs
DEMS are single-channel geotiffs
masks are single-channel geotiffs with the pixel values represent the class
Dataset classes:
No information
Urban fabric
Industrial, commercial, public, military, private and transport units
Mine, dump and construction sites
Artificial non-agricultural vegetated areas
Arable land (annual crops)
Permanent crops
Pastures
Complex and mixed cultivation patterns
Orchards at the fringe of urban classes
Forests
Herbaceous vegetation associations
Open spaces with little or no vegetation
Wetlands
Water
Clouds and Shadows
If you use this dataset in your research, please cite the following paper:
Added in version 0.3.
- __init__(root='data', split='train', transforms=None, checksum=False)[source]#
Initialize a new DFC2022 dataset instance.
- Parameters:
root (str | PathLike[str]) – root directory where dataset can be found
split (str) – one of “train” or “test”
transforms (Callable[[dict[str, Any]], dict[str, Any]] | None) – a function/transform that takes input sample and its target as entry and returns a transformed version
checksum (bool) – if True, check the MD5 of the downloaded files (may be slow)
- Raises:
AssertionError – if
splitis invalidDatasetNotFoundError – If dataset is not found.
- __len__()[source]#
Return the number of data points in the dataset.
- Returns:
length of the dataset
- Return type:
- __annotate_func__()#
The type of the None singleton.