Geometric transformations are fundamental operations in image processing and computer graphics that alter the spatial arrangement of pixels in an image. They can be represented efficiently using matrix operations.

1. Translation

Concept: Shifting an image horizontally and/or vertically.
Parameters:
- $t_{x}$ : Translation distance along the x-axis.
- $t_{y}$ : Translation distance along the y-axis.
Transformation Equations: Let $(x, y)$ be the original pixel coordinates and $(x^{'}, y^{'})$ be the translated coordinates. $x^{'} = x + t_{x}$ $y^{'} = y + t_{y}$
Matrix Form (Homogeneous Coordinates): To represent translation in matrix form, we use homogeneous coordinates. A 2D point $(x, y)$ is represented as a 3D vector $(x, y, 1)$ .

$x^{'} y^{'} 1 = 100010 t_{x} t_{y} 1 x y 1$

Where:
- $T = 100010 t_{x} t_{y} 1$ is the translation matrix.

2. Rotation

Concept: Rotating an image around a specific point (usually the origin).
Parameters:
- $θ$ : Rotation angle (in radians or degrees). Positive values usually indicate counterclockwise rotation.
Transformation Equations: Let $(x, y)$ be the original pixel coordinates and $(x^{'}, y^{'})$ be the rotated coordinates. $x^{'} = x cos (θ) - y sin (θ)$ $y^{'} = x sin (θ) + y cos (θ)$
Matrix Form (Homogeneous Coordinates):

$x^{'} y^{'} 1 = cos (θ) sin (θ) 0 - sin (θ) cos (θ) 0 001 x y 1$

Where:
- $R = cos (θ) sin (θ) 0 - sin (θ) cos (θ) 0 001$ is the rotation matrix.

Derivation of Rotation Geometric Transformation

3. Scaling

Concept: Enlarging or shrinking an image along the x and/or y axes.
Parameters:
- $s_{x}$ : Scaling factor along the x-axis.
- $s_{y}$ : Scaling factor along the y-axis.
Transformation Equations: Let $(x, y)$ be the original pixel coordinates and $(x^{'}, y^{'})$ be the scaled coordinates. $x^{'} = s_{x} \cdot x$ $y^{'} = s_{y} \cdot y$
Matrix Form (Homogeneous Coordinates):

$x^{'} y^{'} 1 = s_{x} 00 0 s_{y} 0 001 x y 1$

Where:
- $S = s_{x} 00 0 s_{y} 0 001$ is the scaling matrix.

Combining Transformations

Multiple transformations can be combined by multiplying their corresponding matrices. The order of matrix multiplication matters, as matrix multiplication is not commutative.

For example, to perform a rotation followed by a translation, the combined transformation matrix $M$ would be:

$M = T \cdot R$

Then, to apply this combined transformation to a point:

$x^{'} y^{'} 1 = M x y 1$

Note: In general, to apply a sequence of transformations represented by matrices $M_{1}, M_{2}, ..., M_{n}$ , the combined transformation matrix is:

$M = M_{n} \cdot ... \cdot M_{2} \cdot M_{1}$ (multiply from right to left in the order of operations).

Okay, here are concise digital notes on Euclidean, affine, and projective transformations, emphasizing their mathematical representations:

Other Transformation

1. Euclidean Transformation

Concept: Preserves distances and angles. Includes rotations, translations, and reflections. Also known as a rigid-body transformation or isometry.
Degrees of Freedom: 3 in 2D (one rotation angle, two translation parameters).
General Form: $x^{'} = Rx + t$ Where:
- $x$ , $x^{'}$ : Original and transformed points (2x1 vectors in 2D).
- $R$ : Rotation matrix (2x2 in 2D), orthogonal with determinant 1.
- $t$ : Translation vector (2x1 in 2D).
Matrix Form (Homogeneous Coordinates): $x^{'} y^{'} 1 = [R 0^{T} t 1] x y 1 = r_{11} r_{21} 0 r_{12} r_{22} 0 t_{x} t_{y} 1 x y 1$ Where:
- $r_{11}^{2} + r_{21}^{2} = 1$
- $r_{12}^{2} + r_{22}^{2} = 1$
- $r_{11} r_{12} + r_{21} r_{22} = 0$
- $r_{11} = cos θ, r_{12} = - sin θ, r_{21} = sin θ, r_{22} = cos θ$

2. Affine Transformation

Concept: Preserves parallelism of lines but not necessarily distances and angles. Includes shearing and scaling in addition to Euclidean transformations.
Degrees of Freedom: 6 in 2D.
General Form: $x^{'} = Ax + t$ Where:
- $x$ , $x^{'}$ : Original and transformed points (2x1 vectors in 2D).
- $A$ : Arbitrary 2x2 matrix (not necessarily orthogonal).
- $t$ : Translation vector (2x1 in 2D).
Matrix Form (Homogeneous Coordinates): $x^{'} y^{'} 1 = [A 0^{T} t 1] x y 1 = a_{11} a_{21} 0 a_{12} a_{22} 0 t_{x} t_{y} 1 x y 1$

3. Projective Transformation

Concept: Most general linear transformation of homogeneous coordinates. Preserves straight lines but not necessarily parallelism, lengths, or angles. Used to model perspective projections. Also known as a homography.
Degrees of Freedom: 8 in 2D.
General Form: Cannot be expressed in the simple $x^{'} = Ax + t$ form.
Matrix Form (Homogeneous Coordinates): $x^{'} y^{'} w^{'} = h_{11} h_{21} h_{31} h_{12} h_{22} h_{32} h_{13} h_{23} h_{33} x y 1$ Where:
- $x_{inh}^{'} = x^{'} / w^{'}$
- $y_{inh}^{'} = y^{'} / w^{'}$
- $(x_{inh}^{'}, y_{inh}^{'})$ are inhomogeneous coordinates of the transformed point.
- $H$ is a 3x3 homogeneous matrix defined up to a scale factor (only the ratios of the elements matter).
- We can usually set $h_{33} = 1$ (unless it’s zero).

Summary Table

Transformation	Degrees of Freedom	Preserves	Matrix Form (2D)
Euclidean	3	Distances, angles, parallelism	$[R 0^{T} t 1]$ , $R$ is orthogonal, det( $R$ ) = 1
Affine	6	Parallelism	$[A 0^{T} t 1]$ , $A$ is arbitrary
Projective (Homography)	8	Straight lines	$h_{11} h_{21} h_{31} h_{12} h_{22} h_{32} h_{13} h_{23} h_{33}$ , defined up to scale

2ndSem

Explorer

Geometric Transformations

Other Transformation

Graph View

Backlinks