TransformsImage transformation functions |
|
---|---|
Unitary transformation |
|
Adjust the brightness of an image |
|
Adjust the contrast of an image |
|
Adjust the gamma of an RGB image |
|
Adjust the hue of an image |
|
Adjust the color saturation of an image |
|
Apply affine transformation on an image keeping image center invariant |
|
Crops the given image at the center |
|
Convert a tensor image to the given |
|
Crop the given image at specified location and output size |
|
Convert image to grayscale |
|
Horizontally flip a PIL Image or Tensor |
|
Transform a tensor image with a square transformation matrix and a mean_vector computed offline |
|
Normalize a tensor image with mean and standard deviation |
|
Pad the given image on all sides with the given "pad" value |
|
Perspective transformation of an image |
|
Resize the input image to the given size |
|
Convert RGB Image Tensor to Grayscale |
|
Angular rotation of an image |
|
Convert an image to a tensor |
|
Vertically flip a PIL Image or Tensor |
|
Random transformation |
|
Randomly change the brightness, contrast and saturation of an image |
|
Random affine transformation of the image keeping center invariant |
|
Crop the given image at a random location |
|
Randomly selects a rectangular region in an image and erases its pixel values |
|
Randomly convert image to grayscale with a given probability |
|
Horizontally flip an image randomly with a given probability |
|
Random perspective transformation of an image with a given probability |
|
Crop image to random size and aspect ratio |
|
Rotate the image by angle |
|
Vertically flip an image randomly with a given probability |
|
Combining / multiplying transformations |
|
Crop image into four corners and a central crop |
|
Apply a list of transformations randomly with a given probability |
|
Apply single transformation randomly picked from a list |
|
Apply a list of transformations in a random order |
|
Crop an image and resize it to a desired size |
|
Crop an image and the flipped image each into four corners and a central crop |
|
ModelsComputer Vision deep-learning Model architectures |
|
Classification modelsModel providing a output vector of the size of |
|
AlexNet Model Architecture |
|
|
ConvNeXt Implementation |
|
EfficientNet Models |
|
EfficientNetV2 Models |
Inception v3 model |
|
MaxViT Model |
|
MobileNetV2 Model |
|
MobileNetV3 Model |
|
|
ResNet implementation |
|
VGG implementation |
|
Vision Transformer Implementation |
Object detection modelsModel providing an output list including a bounding-boxes vector of the detected object in the image each with format. |
|
|
MTCNN Face Detection Networks |
Semantic segmentation modelsModel providing an output list including a binary mask for each pixel of the image for each covered segmentation class. |
|
DeepLabV3 Models |
|
Fully Convolutional Network for Semantic Segmentation |
|
Other models |
|
DatasetsDatasets readily available. All have a |
|
for Image ClassificationDataset having items with “y” for target class identifier. |
|
Caltech Datasets |
|
CIFAR datasets |
|
|
EuroSAT datasets |
FER-2013 Facial Expression Dataset |
|
FGVC Aircraft Dataset |
|
Oxford Flowers 102 Dataset |
|
Create an image folder dataset |
|
LFW Datasets |
|
|
MNIST and Derived Datasets |
Oxford-IIIT Pet Classification Datasets |
|
Places365 Dataset |
|
Tiny ImageNet dataset |
|
WHOI Plankton Datasets |
|
Coralnet Dataset |
|
for Object DetectionDataset having items with “y” as a named list of bounding-box and labels for object detection. |
|
COCO Detection Dataset |
|
Pascal VOC Datasets |
|
RoboFlow 100 Biology dataset Collection |
|
RoboFlow 100 Damages dataset Collection |
|
RF100 Document Collection Datasets |
|
RoboFlow 100 Infrared dataset Collection |
|
RoboFlow 100 Medical dataset Collection |
|
RoboFlow 100 Underwater dataset Collection |
|
for Image captionningDataset having items with “y” as one or multiple captions of the image |
|
COCO Caption Dataset |
|
Flickr Caption Datasets |
|
for Semantic segmentationDataset having items with “y” as a named list containing a segmentation mask and labels for image segmentation. |
|
Oxford-IIIT Pet Segmentation Dataset |
|
Pascal VOC Datasets |
|
RF100 Peixos Segmentation Dataset |
|
Displaying |
|
Images loadingTools for Images loading |
|
Load an Image using ImageMagick |
|
Base loader |
|
Images visualizationTools for Images manipulation and visualization |
|
Draws bounding boxes on image. |
|
Draws Keypoints |
|
Draw segmentation masks |
|
Display image tensor |
|
Display image tensor |
|
A simplified version of torchvision.utils.make_grid |
|
Misc |
|
ImageNet Class Labels |
|
Batched Non-maximum Suppression (NMS) |
|
Non-maximum Suppression (NMS) |
|
Box Area |
|
Box Convert |
|
box_cxcywh_to_xyxy |
|
Box IoU |
|
box_xywh_to_xyxy |
|
box_xyxy_to_cxcywh |
|
box_xyxy_to_xywh |
|
Clip Boxes to Image |
|
Generalized Box IoU |
|
Remove Small Boxes |