FAIR1M#

class torchgeo.datasets.FAIR1M(root='data', split='train', transforms=None, download=False, checksum=False)[source]#

Bases: NonGeoDataset

FAIR1M dataset.

The FAIR1M dataset is a dataset for remote sensing fine-grained oriented object detection.

Dataset features:

  • 15,000+ images with 0.3-0.8 m per pixel resolution (1,000-10,000 px)

  • 1 million object instances

  • 5 object categories, 37 object sub-categories

  • three spectral bands - RGB

  • images taken by Gaofen satellites and Google Earth

Dataset format:

  • images are three-channel tiffs

  • labels are xml files with PASCAL VOC like annotations

Dataset classes:

  1. Passenger Ship

  2. Motorboat

  3. Fishing Boat

  4. Tugboat

  5. other-ship

  6. Engineering Ship

  7. Liquid Cargo Ship

  8. Dry Cargo Ship

  9. Warship

  10. Small Car

  11. Bus

  12. Cargo Truck

  13. Dump Truck

  14. other-vehicle

  15. Van

  16. Trailer

  17. Tractor

  18. Excavator

  19. Truck Tractor

  20. Boeing737

  21. Boeing747

  22. Boeing777

  23. Boeing787

  24. ARJ21

  25. C919

  26. A220

  27. A321

  28. A330

  29. A350

  30. other-airplane

  31. Baseball Field

  32. Basketball Court

  33. Football Field

  34. Tennis Court

  35. Roundabout

  36. Intersection

  37. Bridge

If you use this dataset in your research, please cite the following paper:

Added in version 0.2.

__init__(root='data', split='train', transforms=None, download=False, checksum=False)[source]#

Initialize a new FAIR1M dataset instance.

Parameters:
  • root (str | PathLike[str]) – root directory where dataset can be found

  • split (str) – one of “train”, “val”, or “test”

  • transforms (Callable[[dict[str, Any]], dict[str, Any]] | None) – a function/transform that takes input sample and its target as entry and returns a transformed version

  • download (bool) – if True, download dataset and store it in the root directory

  • checksum (bool) – if True, check the MD5 of the downloaded files (may be slow)

Raises:

Changed in version 0.5: Added split and download parameters.

__getitem__(index)[source]#

Return an index within the dataset.

Parameters:

index (int) – index to return

Returns:

data and label at that index

Return type:

dict[str, Any]

__len__()[source]#

Return the number of data points in the dataset.

Returns:

length of the dataset

Return type:

int

plot(sample, show_titles=True, suptitle=None)[source]#

Plot a sample from the dataset.

Parameters:
  • sample (dict[str, Any]) – a sample returned by __getitem__()

  • show_titles (bool) – flag indicating whether to show titles above each panel

  • suptitle (str | None) – optional string to use as a suptitle

Returns:

a matplotlib Figure with the rendered sample

Return type:

Figure

__annotate_func__()#

The type of the None singleton.