DFC2022#

class torchgeo.datasets.DFC2022(root='data', split='train', transforms=None, checksum=False)[source]#

Bases: NonGeoDataset

DFC2022 dataset.

The DFC2022 dataset is used as a benchmark dataset for the 2022 IEEE GRSS Data Fusion Contest and extends the MiniFrance dataset for semi-supervised semantic segmentation. The dataset consists of a train set containing labeled and unlabeled imagery and an unlabeled validation set. The dataset can be downloaded from the IEEEDataPort DFC2022 website.

Dataset features:

RGB aerial images at 0.5 m per pixel spatial resolution (~2,000x2,0000 px)
DEMs at 1 m per pixel spatial resolution (~1,000x1,0000 px)
Masks at 0.5 m per pixel spatial resolution (~2,000x2,0000 px)
16 land use/land cover categories
Images collected from the IGN BD ORTHO database
DEMs collected from the IGN RGE ALTI database
Labels collected from the UrbanAtlas 2012 database
Data collected from 19 regions in France

Dataset format:

images are three-channel geotiffs
DEMS are single-channel geotiffs
masks are single-channel geotiffs with the pixel values represent the class

Dataset classes:

No information
Urban fabric
Industrial, commercial, public, military, private and transport units
Mine, dump and construction sites
Artificial non-agricultural vegetated areas
Arable land (annual crops)
Permanent crops
Pastures
Complex and mixed cultivation patterns
Orchards at the fringe of urban classes
Forests
Herbaceous vegetation associations
Open spaces with little or no vegetation
Wetlands
Water
Clouds and Shadows

If you use this dataset in your research, please cite the following paper:

https://doi.org/10.1007/s10994-020-05943-y

Added in version 0.3.

__init__(root='data', split='train', transforms=None, checksum=False)[source]#

Initialize a new DFC2022 dataset instance.

Parameters:

root (str | PathLike[str]) – root directory where dataset can be found
split (str) – one of “train” or “test”
transforms (Callable[[dict[str, Any]], dict[str, Any]] | None) – a function/transform that takes input sample and its target as entry and returns a transformed version
checksum (bool) – if True, check the MD5 of the downloaded files (may be slow)

Raises:

AssertionError – if split is invalid
DatasetNotFoundError – If dataset is not found.

__getitem__(index)[source]#

Return an index within the dataset.

Parameters:: index (int) – index to return
Returns:: data and label at that index
Return type:: dict[str, Any]

__len__()[source]#

Return the number of data points in the dataset.

Returns:: length of the dataset
Return type:: int

plot(sample, show_titles=True, suptitle=None)[source]#

Plot a sample from the dataset.

Parameters:

sample (dict[str, Any]) – a sample returned by __getitem__()
show_titles (bool) – flag indicating whether to show titles above each panel
suptitle (str | None) – optional string to use as a suptitle

Returns:

a matplotlib Figure with the rendered sample

Return type:

Figure

__annotate_func__()#: The type of the None singleton.

DFC2022#

This Page