Computer Vision Lesson 5 – Transformations | Dataplexa

Image Transformations

Now that you understand how images are represented in memory, we move to an extremely important topic: image transformations.

Image transformations change the geometry or appearance of an image without changing its actual meaning. They are used everywhere — from mobile camera apps to self-driving cars.

Before computers can analyze images reliably, they often need images to be resized, rotated, aligned, or adjusted. That is exactly what image transformations do.

What Is an Image Transformation?

An image transformation is an operation that maps original pixel locations to new pixel locations.

In simple terms:

The image stays the same conceptually
Pixel positions or values are changed
The structure is adjusted for processing

Transformations can affect:

Size
Orientation
Position
Scale

Why Image Transformations Are Needed

Real-world images are rarely perfect.

Images come in different sizes
Objects may appear rotated or tilted
Cameras may capture images at odd angles

Transformations help us:

Standardize images before analysis
Align images properly
Improve robustness of CV models
Prepare data for machine learning

Common Types of Image Transformations

Most image transformations fall into a few core categories.

Transformation	Purpose
Resizing	Change image dimensions
Scaling	Increase or decrease size proportionally
Translation	Move image left, right, up, or down
Rotation	Rotate image around a point
Flipping	Mirror image horizontally or vertically

Resizing Images

Resizing changes the width and height of an image.

This is one of the most common preprocessing steps in Computer Vision.

Deep learning models require fixed image sizes
Large images consume more memory
Smaller images process faster

However, careless resizing can distort images. That is why aspect ratio must be handled carefully.

Scaling Images

Scaling is similar to resizing, but it focuses on proportional size change.

Instead of specifying exact dimensions, we scale by a factor:

Scale > 1 → image enlarges
Scale < 1 → image shrinks

Scaling preserves shape better than arbitrary resizing.

Translation (Shifting Images)

Translation moves the image along the X or Y axis.

This does not change the image size — only the position of pixels.

Shift left or right
Shift up or down

Translation is useful when objects are not centered and need alignment.

Rotation

Rotation turns an image around a point (usually the center).

Rotation is measured in degrees:

Positive angle → counter-clockwise
Negative angle → clockwise

Rotation introduces empty spaces at corners, which must be handled carefully in real systems.

Flipping Images

Flipping creates a mirror image.

Horizontal flip → left becomes right
Vertical flip → top becomes bottom

Flipping is widely used in:

Face recognition
Pose estimation
Data augmentation

Geometric vs Intensity Transformations

Image transformations can be classified into two broad groups.

Type	Description
Geometric	Change pixel positions (resize, rotate, shift)
Intensity	Change pixel values (brightness, contrast)

This lesson focuses on geometric transformations. Intensity transformations will be covered later.

How Transformations Affect Pixels

After transformation:

Original pixel locations may disappear
New pixel values may be interpolated
Edges may become smoother or distorted

That is why transformation quality matters, especially in medical or autonomous systems.

Real-World Examples

Passport photo resizing
Camera auto-rotation on phones
Aligning satellite images
Normalizing images for AI models

Practice Questions

Q1. What is the main goal of image transformations?

To change the geometry or appearance of an image while preserving its meaning.

Q2. What is the difference between resizing and scaling?

Resizing sets fixed dimensions, while scaling changes size proportionally.

Q3. Which transformation moves images without changing size?

Translation.

Quick Quiz

Q1. Which transformation mirrors an image?

Flipping.

Q2. Which transformations change pixel positions?

Geometric transformations.

Key Takeaways

Image transformations modify image geometry
Resizing and scaling adjust image size
Translation shifts image position
Rotation changes orientation
Flipping creates mirror images

In the next lesson, we will study color spaces — how colors are represented beyond RGB and why it matters in Computer Vision.

← Previous Course Index Next →