TransformsImage transformation functions |
|
---|---|
Adjust the brightness of an image |
|
Adjust the contrast of an image |
|
Adjust the gamma of an RGB image |
|
Adjust the hue of an image |
|
Adjust the color saturation of an image |
|
Apply affine transformation on an image keeping image center invariant |
|
Crops the given image at the center |
|
Randomly change the brightness, contrast and saturation of an image |
|
Convert a tensor image to the given |
|
Crop the given image at specified location and output size |
|
Crop image into four corners and a central crop |
|
Convert image to grayscale |
|
Horizontally flip a PIL Image or Tensor |
|
Transform a tensor image with a square transformation matrix and a mean_vector computed offline |
|
Normalize a tensor image with mean and standard deviation |
|
Pad the given image on all sides with the given "pad" value |
|
Perspective transformation of an image |
|
Random affine transformation of the image keeping center invariant |
|
Apply a list of transformations randomly with a given probability |
|
Apply single transformation randomly picked from a list |
|
Crop the given image at a random location |
|
Randomly selects a rectangular region in an image and erases its pixel values |
|
Randomly convert image to grayscale with a given probability |
|
Horizontally flip an image randomly with a given probability |
|
Apply a list of transformations in a random order |
|
Random perspective transformation of an image with a given probability |
|
Crop image to random size and aspect ratio |
|
Rotate the image by angle |
|
Vertically flip an image randomly with a given probability |
|
Resize the input image to the given size |
|
Crop an image and resize it to a desired size |
|
Convert RGB Image Tensor to Grayscale |
|
Angular rotation of an image |
|
Crop an image and the flipped image each into four corners and a central crop |
|
Convert an image to a tensor |
|
Vertically flip a PIL Image or Tensor |
|
ModelsModel architectures |
|
Classification models |
|
AlexNet Model Architecture |
|
Inception v3 model |
|
Constructs a MobileNetV2 architecture from MobileNetV2: Inverted Residuals and Linear Bottlenecks. |
|
|
ResNet implementation |
|
VGG implementation |
DatasetsDatasets readily available. All have a |
|
for Image ClassificationDataset having items with “y” for target class identifier. |
|
EMNIST dataset |
|
|
EuroSAT Dataset |
Fashion-MNIST dataset |
|
FGVC Aircraft dataset |
|
Kuzushiji-MNIST |
|
MNIST dataset |
|
QMNIST Dataset |
|
Tiny ImageNet dataset |
|
Cifar datasets |
|
FER-2013 Facial Expression Dataset |
|
Create an image folder dataset |
|
for Object DetectionDataset having items with “y” as a named list of bounding-box and labels for object detection. |
|
Caltech-101 Dataset |
|
Caltech-256 Object Category Dataset |
|
COCO Detection Dataset |
|
for Image captionningDataset having items with “y” as one or multiple captions of the image |
|
COCO Detection Dataset |
|
DisplayingShow images |
|
Draws bounding boxes on image. |
|
Draws Keypoints |
|
Draw segmentation masks |
|
Display image tensor |
|
Display image tensor |
|
Misc |
|
Load an Image using ImageMagick |
|
Base loader |
|
A simplified version of torchvision.utils.make_grid |
|
Batched Non-maximum Suppression (NMS) |
|
Non-maximum Suppression (NMS) |
|
Box Area |
|
Box Convert |
|
box_cxcywh_to_xyxy |
|
Box IoU |
|
box_xywh_to_xyxy |
|
box_xyxy_to_cxcywh |
|
box_xyxy_to_xywh |
|
Clip Boxes to Image |
|
Generalized Box IoU |
|
Remove Small Boxes |