torchgeo.datamodules#

Geospatial DataModules#

AgriFieldNet#

class torchgeo.datamodules.AgriFieldNetDataModule(batch_size=64, patch_size=256, length=None, num_workers=0, **kwargs)[source]#

Bases: GeoDataModule

LightningDataModule implementation for the AgriFieldNet dataset.

Added in version 0.6.

__init__(batch_size=64, patch_size=256, length=None, num_workers=0, **kwargs)[source]#

Initialize a new AgriFieldNetDataModule instance.

Parameters:

batch_size (int) – Size of each mini-batch.
patch_size (int | tuple[int, int]) – Size of each patch, either size or (height, width).
length (int | None) – Length of each training epoch.
num_workers (int) – Number of workers for parallel data loading.
**kwargs (Any) – Additional keyword arguments passed to AgriFieldNet.

setup(stage)[source]#

Set up datasets.

Parameters:: stage (str) – Either ‘fit’, ‘validate’, ‘test’, or ‘predict’.

Chesapeake Land Cover#

class torchgeo.datamodules.ChesapeakeCVPRDataModule(train_splits, val_splits, test_splits, batch_size=64, patch_size=256, length=None, num_workers=0, class_set=7, use_prior_labels=False, prior_smoothing_constant=0.0001, **kwargs)[source]#

Bases: GeoDataModule

LightningDataModule implementation for the Chesapeake CVPR Land Cover dataset.

Uses the random splits defined per state to partition tiles into train, val, and test sets.

__init__(train_splits, val_splits, test_splits, batch_size=64, patch_size=256, length=None, num_workers=0, class_set=7, use_prior_labels=False, prior_smoothing_constant=0.0001, **kwargs)[source]#

Initialize a new ChesapeakeCVPRDataModule instance.

Parameters:

train_splits (list[str]) – Splits used to train the model, e.g., [“ny-train”].
val_splits (list[str]) – Splits used to validate the model, e.g., [“ny-val”].
test_splits (list[str]) – Splits used to test the model, e.g., [“ny-test”].
batch_size (int) – Size of each mini-batch.
patch_size (int) – Size of each patch, either size or (height, width). Should be a multiple of 32 for most segmentation architectures.
length (int | None) – Length of each training epoch.
num_workers (int) – Number of workers for parallel data loading.
class_set (int) – The high-resolution land cover class set to use (5 or 7).
use_prior_labels (bool) – Flag for using a prior over high-resolution classes instead of the high-resolution labels themselves.
prior_smoothing_constant (float) – Additive smoothing to add when using prior labels.
**kwargs (Any) – Additional keyword arguments passed to ChesapeakeCVPR.

Raises:

AssertionError – If use_prior_labels=True is used with class_set=7.

setup(stage)[source]#

Set up datasets and samplers.

Parameters:: stage (str) – Either ‘fit’, ‘validate’, ‘test’, or ‘predict’.

on_after_batch_transfer(batch, dataloader_idx)[source]#

Apply batch augmentations to the batch after it is transferred to the device.

Parameters:

batch (dict[str, Any]) – A batch of data that needs to be altered or augmented.
dataloader_idx (int) – The index of the dataloader to which the batch belongs.

Returns:

A batch of data.

Return type:

dict[str, Any]

L7 Irish#

class torchgeo.datamodules.L7IrishDataModule(batch_size=1, patch_size=224, length=None, num_workers=0, **kwargs)[source]#

Bases: GeoDataModule

LightningDataModule implementation for the L7 Irish dataset.

Added in version 0.5.

__init__(batch_size=1, patch_size=224, length=None, num_workers=0, **kwargs)[source]#

Initialize a new L7IrishDataModule instance.

Parameters:

batch_size (int) – Size of each mini-batch.
patch_size (int | tuple[int, int]) – Size of each patch, either size or (height, width).
length (int | None) – Length of each training epoch.
num_workers (int) – Number of workers for parallel data loading.
**kwargs (Any) – Additional keyword arguments passed to L7Irish.

setup(stage)[source]#

Set up datasets.

Parameters:: stage (str) – Either ‘fit’, ‘validate’, ‘test’, or ‘predict’.

L8 Biome#

class torchgeo.datamodules.L8BiomeDataModule(batch_size=1, patch_size=224, length=None, num_workers=0, **kwargs)[source]#

Bases: GeoDataModule

LightningDataModule implementation for the L8 Biome dataset.

Added in version 0.5.

__init__(batch_size=1, patch_size=224, length=None, num_workers=0, **kwargs)[source]#

Initialize a new L8BiomeDataModule instance.

Parameters:

batch_size (int) – Size of each mini-batch.
patch_size (int | tuple[int, int]) – Size of each patch, either size or (height, width).
length (int | None) – Length of each training epoch.
num_workers (int) – Number of workers for parallel data loading.
**kwargs (Any) – Additional keyword arguments passed to L8Biome.

setup(stage)[source]#

Set up datasets.

Parameters:: stage (str) – Either ‘fit’, ‘validate’, ‘test’, or ‘predict’.

MMFlood#

class torchgeo.datamodules.MMFloodDataModule(batch_size=32, patch_size=512, length=None, num_workers=0, **kwargs)[source]#

Bases: GeoDataModule

LightningDataModule implementation for the MMFlood dataset.

Added in version 0.7.

__init__(batch_size=32, patch_size=512, length=None, num_workers=0, **kwargs)[source]#

Initialize a new MMFloodDataModule instance.

Parameters:

batch_size (int) – Size of each mini-batch.
patch_size (int | tuple[int, int]) – Size of each patch, either size or (height, width).
length (int | None) – Length of each training epoch.
num_workers (int) – Number of workers for parallel data loading.
**kwargs (Any) – Additional keyword arguments passed to MMFlood.

setup(stage)[source]#

Set up datasets.

Parameters:: stage (str) – Either ‘fit’, ‘validate’, ‘test’, ‘predict’.

NAIP#

class torchgeo.datamodules.NAIPChesapeakeDataModule(batch_size=64, patch_size=256, length=None, num_workers=0, **kwargs)[source]#

Bases: GeoDataModule

LightningDataModule implementation for the NAIP and Chesapeake datasets.

Uses the train/val/test splits from the dataset.

__init__(batch_size=64, patch_size=256, length=None, num_workers=0, **kwargs)[source]#

Initialize a new NAIPChesapeakeDataModule instance.

Parameters:

batch_size (int) – Size of each mini-batch.
patch_size (int | tuple[int, int]) – Size of each patch, either size or (height, width).
length (int | None) – Length of each training epoch.
num_workers (int) – Number of workers for parallel data loading.
**kwargs (Any) – Additional keyword arguments passed to NAIP (prefix keys with naip_) and Chesapeake (prefix keys with chesapeake_).

setup(stage)[source]#

Set up datasets and samplers.

Parameters:: stage (str) – Either ‘fit’, ‘validate’, ‘test’, or ‘predict’.

plot(*args, **kwargs)[source]#

Run NAIP plot method.

Parameters:

*args (Any) – Arguments passed to plot method.
**kwargs (Any) – Keyword arguments passed to plot method.

Returns:

A matplotlib Figure with the image, ground truth, and predictions.

Return type:

Figure

Added in version 0.4.

I/O Bench#

class torchgeo.datamodules.IOBenchDataModule(batch_size=32, patch_size=256, length=None, num_workers=0, **kwargs)[source]#

Bases: GeoDataModule

LightningDataModule implementation for the I/O benchmark dataset.

Added in version 0.6.

__init__(batch_size=32, patch_size=256, length=None, num_workers=0, **kwargs)[source]#

Initialize a new IOBenchDataModule instance.

Parameters:

batch_size (int) – Size of each mini-batch.
patch_size (int | tuple[int, int]) – Size of each patch, either size or (height, width).
length (int | None) – Length of each training epoch.
num_workers (int) – Number of workers for parallel data loading.
**kwargs (Any) – Additional keyword arguments passed to IOBench.

setup(stage)[source]#

Set up datasets.

Parameters:: stage (str) – Either ‘fit’, ‘validate’, ‘test’, or ‘predict’.

Sentinel#

class torchgeo.datamodules.Sentinel2CDLDataModule(batch_size=64, patch_size=64, length=None, num_workers=0, **kwargs)[source]#

Bases: GeoDataModule

LightningDataModule implementation for the Sentinel-2 and CDL datasets.

Added in version 0.6.

__init__(batch_size=64, patch_size=64, length=None, num_workers=0, **kwargs)[source]#

Initialize a new Sentinel2CDLDataModule instance.

Parameters:

batch_size (int) – Size of each mini-batch.
patch_size (int | tuple[int, int]) – Size of each patch, either size or (height, width).
length (int | None) – Length of each training epoch.
num_workers (int) – Number of workers for parallel data loading.
**kwargs (Any) – Additional keyword arguments passed to CDL (prefix keys with cdl_) and Sentinel2 (prefix keys with sentinel2_).

setup(stage)[source]#

Set up datasets and samplers.

Parameters:: stage (str) – Either ‘fit’, ‘validate’, ‘test’, or ‘predict’.

plot(*args, **kwargs)[source]#

Run CDL plot method.

Parameters:

*args (Any) – Arguments passed to plot method.
**kwargs (Any) – Keyword arguments passed to plot method.

Returns:

A matplotlib Figure with the image, ground truth, and predictions.

Return type:

Figure

class torchgeo.datamodules.Sentinel2EuroCropsDataModule(batch_size=64, patch_size=256, length=None, num_workers=0, **kwargs)[source]#

Bases: GeoDataModule

LightningDataModule implementation for the EuroCrops and Sentinel2 datasets.

Uses the train/val/test splits from the dataset.

Added in version 0.6.

__init__(batch_size=64, patch_size=256, length=None, num_workers=0, **kwargs)[source]#

Initialize a new Sentinel2EuroCropsDataModule instance.

Parameters:

batch_size (int) – Size of each mini-batch.
patch_size (int | tuple[int, int]) – Size of each patch, either size or (height, width).
length (int | None) – Length of each training epoch.
num_workers (int) – Number of workers for parallel data loading.
**kwargs (Any) – Additional keyword arguments passed to EuroCrops (prefix keys with eurocrops_) and Sentinel2 (prefix keys with sentinel2_).

setup(stage)[source]#

Set up datasets and samplers.

Parameters:: stage (str) – Either ‘fit’, ‘validate’, ‘test’, or ‘predict’.

plot(*args, **kwargs)[source]#

Run EuroCrops plot method.

Parameters:

*args (Any) – Arguments passed to plot method.
**kwargs (Any) – Keyword arguments passed to plot method.

Returns:

A matplotlib Figure with the image, ground truth, and predictions.

Return type:

Figure

class torchgeo.datamodules.Sentinel2NCCMDataModule(batch_size=64, patch_size=64, length=None, num_workers=0, **kwargs)[source]#

Bases: GeoDataModule

LightningDataModule implementation for the Sentinel-2 and NCCM dataset.

Added in version 0.6.

__init__(batch_size=64, patch_size=64, length=None, num_workers=0, **kwargs)[source]#

Initialize a new Sentinel2NCCMDataModule instance.

Parameters:

batch_size (int) – Size of each mini-batch.
patch_size (int | tuple[int, int]) – Size of each patch, either size or (height, width).
length (int | None) – Length of each training epoch.
num_workers (int) – Number of workers for parallel data loading.
**kwargs (Any) – Additional keyword arguments passed to NCCM (prefix keys with nccm_) and Sentinel2 (prefix keys with sentinel2_).

setup(stage)[source]#

Set up datasets and samplers.

Parameters:: stage (str) – Either ‘fit’, ‘validate’, ‘test’, or ‘predict’.

plot(*args, **kwargs)[source]#

Run NCCM plot method.

Parameters:

*args (Any) – Arguments passed to plot method.
**kwargs (Any) – Keyword arguments passed to plot method.

Returns:

A matplotlib Figure with the image, ground truth, and predictions.

Return type:

Figure

class torchgeo.datamodules.Sentinel2SouthAmericaSoybeanDataModule(batch_size=64, patch_size=64, length=None, num_workers=0, **kwargs)[source]#

Bases: GeoDataModule

LightningDataModule for SouthAmericaSoybean and Sentinel2 datasets.

Added in version 0.6.

__init__(batch_size=64, patch_size=64, length=None, num_workers=0, **kwargs)[source]#

Initialize a new Sentinel2SouthAmericaSoybeanDataModule instance.

Parameters:

batch_size (int) – Size of each mini-batch.
patch_size (int | tuple[int, int]) – Size of each patch, either size or (height, width).
length (int | None) – Length of each training epoch.
num_workers (int) – Number of workers for parallel data loading.
**kwargs (Any) – Additional keyword arguments passed to SouthAmericaSoybean (prefix keys with south_america_soybean_) and Sentinel2 (prefix keys with sentinel2_).

setup(stage)[source]#

Set up datasets and samplers.

Parameters:: stage (str) – Either ‘fit’, ‘validate’, ‘test’, or ‘predict’.

plot(*args, **kwargs)[source]#

Run SouthAmericaSoybean plot method.

Parameters:

*args (Any) – Arguments passed to plot method.
**kwargs (Any) – Keyword arguments passed to plot method.

Returns:

A matplotlib Figure with the image, ground truth, and predictions.

Return type:

Figure

SouthAfricaCropType#

class torchgeo.datamodules.SouthAfricaCropTypeDataModule(batch_size=64, patch_size=16, length=None, num_workers=0, **kwargs)[source]#

Bases: GeoDataModule

LightningDataModule implementation for the SouthAfricaCropType dataset.

Added in version 0.6.

__init__(batch_size=64, patch_size=16, length=None, num_workers=0, **kwargs)[source]#

Initialize a new SouthAfricaCropTypeDataModule instance.

Parameters:

batch_size (int) – Size of each mini-batch.
patch_size (int | tuple[int, int]) – Size of each patch, either size or (height, width).
length (int | None) – Length of each training epoch.
num_workers (int) – Number of workers for parallel data loading.
**kwargs (Any) – Additional keyword arguments passed to SouthAfricaCropType.

setup(stage)[source]#

Set up datasets.

Parameters:: stage (str) – Either ‘fit’, ‘validate’, ‘test’, or ‘predict’.

Non-geospatial DataModules#

BigEarthNet#

class torchgeo.datamodules.BigEarthNetDataModule(batch_size=64, num_workers=0, **kwargs)[source]#

torchgeo.datamodules#

Geospatial DataModules#

AgriFieldNet#

Chesapeake Land Cover#

L7 Irish#

L8 Biome#

MMFlood#

NAIP#

I/O Bench#

Sentinel#

SouthAfricaCropType#

Non-geospatial DataModules#

BigEarthNet#

BRIGHT#

CaBuAr#

CaFFe#

ChaBuD#

Cloud Cover Detection#

COWC#

Deep Globe Land Cover Challenge#

Digital Typhoon#

ETCI2021 Flood Detection#

EuroSAT#

FAIR1M#

Fields Of The World#

FireRisk#

GeoNRW#

GID-15#

HySpecNet-11k#

Inria Aerial Image Labeling#

LandCover.ai#

LEVIR-CD#

LEVIR-CD+#

LoveDA#

NASA Marine Debris#

OSCD#

PASTIS#

PatternNet#

Potsdam#

QuakeSet#

ReforesTree#

RESISC45#

Seasonal Contrast#

SEN12MS#

SKIPP’D#

So2Sat#

Solar Plants Brazil#

SpaceNet#

SSL4EO#

SSL4EO-L Benchmark#

Substation#

SustainBench Crop Yield#

TreeSatAI#

Tropical Cyclone#

UC Merced#

USAVars#

Vaihingen#

VHR-10#

xBD#

Base Classes#

BaseDataModule#

GeoDataModule#

NonGeoDataModule#

Utilities#