Man pages for torchvision
Models, Datasets and Transformations for Images

base_loaderBase loader
batched_nmsBatched Non-maximum Suppression (NMS)
box_areaBox Area
box_convertBox Convert
box_cxcywh_to_xyxybox_cxcywh_to_xyxy
box_iouBox IoU
box_xywh_to_xyxybox_xywh_to_xyxy
box_xyxy_to_cxcywhbox_xyxy_to_cxcywh
box_xyxy_to_xywhbox_xyxy_to_xywh
caltech_datasetCaltech Datasets
cifar_datasetsCIFAR datasets
clip_boxes_to_imageClip Boxes to Image
coco_caption_datasetCOCO Caption Dataset
coco_detection_datasetCOCO Detection Dataset
coco_polygon_to_maskConvert COCO polygon to mask tensor (Robust Version)
draw_bounding_boxesDraws bounding boxes on image.
draw_keypointsDraws Keypoints
draw_segmentation_masksDraw segmentation masks
eurosat_datasetEuroSAT datasets
fer_datasetFER-2013 Facial Expression Dataset
fgvc_aircraft_datasetFGVC Aircraft Dataset
flickr_caption_datasetFlickr Caption Datasets
flowers102_datasetOxford Flowers 102 Dataset
generalized_box_iouGeneralized Box IoU
image_folder_datasetCreate an image folder dataset
magick_loaderLoad an Image using ImageMagick
mnist_datasetMNIST and Derived Datasets
model_alexnetAlexNet Model Architecture
model_inception_v3Inception v3 model
model_mobilenet_v2Constructs a MobileNetV2 architecture from MobileNetV2:...
model_resnetResNet implementation
model_vggVGG implementation
nmsNon-maximum Suppression (NMS)
oxfordiiitpet_datasetOxford-IIIT Pet Classification Datasets
oxfordiiitpet_segmentation_datasetOxford-IIIT Pet Segmentation Dataset
remove_small_boxesRemove Small Boxes
tensor_image_browseDisplay image tensor
tensor_image_displayDisplay image tensor
tiny_imagenet_datasetTiny ImageNet dataset
transform_adjust_brightnessAdjust the brightness of an image
transform_adjust_contrastAdjust the contrast of an image
transform_adjust_gammaAdjust the gamma of an RGB image
transform_adjust_hueAdjust the hue of an image
transform_adjust_saturationAdjust the color saturation of an image
transform_affineApply affine transformation on an image keeping image center...
transform_center_cropCrops the given image at the center
transform_color_jitterRandomly change the brightness, contrast and saturation of an...
transform_convert_image_dtypeConvert a tensor image to the given 'dtype' and scale the...
transform_cropCrop the given image at specified location and output size
transform_five_cropCrop image into four corners and a central crop
transform_grayscaleConvert image to grayscale
transform_hflipHorizontally flip a PIL Image or Tensor
transform_linear_transformationTransform a tensor image with a square transformation matrix...
transform_normalizeNormalize a tensor image with mean and standard deviation
transform_padPad the given image on all sides with the given "pad" value
transform_perspectivePerspective transformation of an image
transform_random_affineRandom affine transformation of the image keeping center...
transform_random_applyApply a list of transformations randomly with a given...
transform_random_choiceApply single transformation randomly picked from a list
transform_random_cropCrop the given image at a random location
transform_random_erasingRandomly selects a rectangular region in an image and erases...
transform_random_grayscaleRandomly convert image to grayscale with a given probability
transform_random_horizontal_flipHorizontally flip an image randomly with a given probability
transform_random_orderApply a list of transformations in a random order
transform_random_perspectiveRandom perspective transformation of an image with a given...
transform_random_resized_cropCrop image to random size and aspect ratio
transform_random_rotationRotate the image by angle
transform_random_vertical_flipVertically flip an image randomly with a given probability
transform_resizeResize the input image to the given size
transform_resized_cropCrop an image and resize it to a desired size
transform_rgb_to_grayscaleConvert RGB Image Tensor to Grayscale
transform_rotateAngular rotation of an image
transform_ten_cropCrop an image and the flipped image each into four corners...
transform_to_tensorConvert an image to a tensor
transform_vflipVertically flip a PIL Image or Tensor
vision_make_gridA simplified version of torchvision.utils.make_grid
torchvision documentation built on Aug. 8, 2025, 7:27 p.m.