Lab 2.1 - 2D Projective Geometry and Homography Estimation

Using homography warps to project an image onto tracked points in a scene. Source material from [1, 2].

Overview

In this lab you’ll learn about 2D projective geometry and homography estimation.

Important: You can obtain 2 bonus marks for each section you finish and demo while present during the lab period we introduce the lab.

1. Homogeneous Transforms (20)

A transformatrion matrix, \(M\), can be used to transform 2D points in homogeneous form:

\begin{equation} \left(\begin{array}{l} \displaylines{x’\\ y’\\ w} \end{array}\right) = M \left(\begin{array}{l} \displaylines{x\\ y\\ 1} \end{array}\right) \end{equation}

We define translation, rotation, scale and shear matricies in homogenenous coordinates as follows:

\begin{equation} T=\left[\begin{array}{ccc} \displaylines{ 1 & 0 & t_x\\ 0 & 1 & t_y\\ 0 & 0 & 1 } \end{array}\right] \end{equation}

\begin{equation} R=\left[\begin{array}{ccc} \displaylines{ cos\theta & -sin\theta & 0\\ sin\theta & cos\theta & 0\\ 0 & 0 & 1 } \end{array}\right] \end{equation}

\begin{equation} S=\left[\begin{array}{ccc} \displaylines{ s_x & 0 & 0\\ 0 & s_y & 0\\ 0 & 0 & 1 } \end{array}\right] \end{equation}

\begin{equation} Sh=\left[\begin{array}{ccc} \displaylines{ 1 & sh_x & 0\\ sh_y & 1 & 0\\ 0 & 0 & 1 } \end{array}\right] \end{equation}

These matrices transform the homogeneous points with respect to the coordinate origin. They can be multiplied in any order and order of multiplication determines the order of the transformations performed on the points. For further reading, see Chapter 2.4 of Hartley and Zisserman.

The following code loads the point cloud house.npy and displays it in a 2d scatterplot. Use these points to complete Question 1. You must add the homogeneous coordinate row yourself.

house.txt is also available if you prefer. Use np.loadtxt() with delimiter=',' to load it.

with open('happy.npy', 'rb') as f:
    house = np.load(f, allow_pickle=True)

fig = plt.figure()
ax = fig.add_subplot()
plt.scatter(house[0], house[1], color='gray')
ax.set_aspect('equal', 'box')
plt.title("House")
plt.show()

a) 2D Homogeneous Transformations

Plots of various transformations on the 2D house point cloud.

Create functions that create translation, rotation, scale and shear transformation matricies using the equations provided above. Apply the following transformations to our provided house point cloud and recreate the above figure (with subplots).
1. Original Point Cloud.
2. Translate by (100, -50).
3. Rotate by +90 degrees.
4. Rotate by -90 degrees about centroid.
5. Scale by (0.5, -1).
6. Shear by (0.2, 0).
7. Affine transformation given Affine = np.array([[0.9, -0.2, 10], [0.3, 1.2, 10]]).
8. Homography transformation given Homography = np.array([[1, 0, 0], [0, 1, 0], [0.005, 0, 1]]). Ensure you divide all points by the homogeneous coordinate after applying the transformation.
You will lose marks if your house is not at the same coordinates as our figure.

Deliverables:

Save your figure to include with your submission.
Display the image in your report.

Report Question 1a: What group of transformations do translation, rotation, scale and shear belong to? Name two ways these transformations differ from projective (homography) transforms?

b) Create a composite image

Homography transforms applied to the house to create a simple street image.

Using a homography transform and our provided point cloud, approximately recreate the above image.

Deliverables:

Display the image in your report.
Save your image to include with your submission.

Report Question 1b: Describe how you created this image and the transformations you used.

c) Bonus (5) - Adding your own assets

Getting a point cloud from an image.

import matplotlib.pyplot as plt
import numpy as np
import cv2

def get_point_cloud(image, scale = 1):
    if len(image.shape) > 2:
        image = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)
    (dimy, dimx) = image.shape
    dimx = int(dimx*scale)
    dimy = int(dimy*scale)
    image = cv2.resize(image, (dimx, dimy), interpolation=cv2.INTER_AREA)
    _, image = cv2.threshold(image,0,255,cv2.THRESH_BINARY+cv2.THRESH_OTSU)
    y, x = np.nonzero(image == 0)
    x = x.T
    y = -y.T + dimy
    return x, y


image = cv2.imread("bubble.jpg")
scale = 0.5
image_pts = get_point_cloud(image, scale=scale)
fig = plt.figure()

ax1 = fig.add_subplot(1, 2, 1)
plt.imshow(image)
ax1.set_aspect('equal', 'box')
plt.title("Original")

ax2 = fig.add_subplot(1, 2, 2)
plt.scatter(image_pts[0], image_pts[1], color='gray', s=1, marker='s', linewidths=0)
ax2.set_aspect('equal', 'box')
ax2.set_xlim([0, image.shape[1]*scale])
ax2.set_ylim([0, image.shape[0]*scale])
plt.title("Point Cloud")

plt.show()

Use the above function to generate point clouds of your own assets. Search up coloring page versions of your favorite characters and load them in.
Create a composite image of your own using the transformations you’ve learned. Load in at least two assets and transform them to create a scene.
Note: Large images can create tens of thousands of points. If it hampers performance reduce the number of points using scale.

Deliverables:

Display the image in your report.
Save your image to include with your submission.

Report Question 1c: Describe how you created this image and the transformations you used.

2. 2D Projective Geometry: Points, lines and their incidence (20)

a) Hand Drawing

The homogeneous equation of a line is given as follows: \begin{equation} l^Tx=ax+by+c=0 \end{equation} It holds true for all points that lie on the line.

The following three homogeneous lines display a simplified diagram of a railway track and the horizon.

\[l_1=\left(\begin{array}{c} 10 \\ -1 \\ 10 \end{array}\right) \quad l_2=\left(\begin{array}{c} -10 \\ -1 \\ 10 \end{array}\right) \quad l_H=\left(\begin{array}{c} 0 \\ 1 \\ -10 \end{array}\right)\]

Sketch the drawing labelling all three lines and the position of the vanishing point, the intersection of \(l_1\) and \(l_2\), given by \(l_1\times l_2\).

Deliverables:

Display the sketch in your report (you may use software like paint to draw this sketch or scan a hand sketch; 0 marks will be awarded for plots).
Save the sketch to include with your submission.

b) Construct the midpoint and midline of the football field in the image below.

Save this image to use for question 2b, obtained from [3].

Click on the corners of the field using matplotlib.pyplot.ginput() or your own mouse callback.
Convert the points to homogeneous coordinates.
Compute the diagonal lines \(l_1\) and \(l_2\) (the line between two points is \(l=p_1\times p_2\)).
Compute the midpoint, \(p_m\) using the diagonal lines (the intersection between two lines is \(p=l_1\times l_2\)).
Compute the vanishing point, \(p_\infty\) as the intersection of the two short sides of the field.
Compute the midline using the vanishing point and the mid-point.

Deliverables:

Display the image with overlaid features in your report.
Save the image with overlaid features to include with your submission.

c) Bonus (10) - Construct the midpoint and midline on a video sequence

Place a book with a textured cover on a table and record a video.
Track the four corners of the book using 4 trackers, initialized by mouse clicks. You can use your own tracker or one of openCV’s built in trackers (CSRT is suitable, see resource on other openCV trackers).
Use the tracked positions to draw the midline and midpoint on the book.

Deliverables:

Save the video with overlaid features to include with your submission.

3. Robot Visual Servoing Task Specifications (20)

a) Line-to-line constraint

Line to line constraint for robot visual servoing.

\begin{equation} e_{l2l} = y_1 \cdot (y_3 \times y_4) + y_2 \cdot (y_3 \times y_4) \end{equation}

Create a live implementation of the line-to-line constraint.
Place two box objects in the scene. Initialize the appropriate tracked points with mouse clicks to construct the line to line constraint.
You may use openCV’s built in trackers (CSRT is suitable, see resource on other openCV trackers).
Overlay the value of the constraint and the points/lines we are tracking onto the video (see cv2.putText()).
Save a video with overlaid features. Rotate and move one of the books and bring the lines together.
Optional: normalize the points and lines before computing the constraints to avoid overly large numbers.

Deliverables:

Save the video with overlaid features to include with your submission.
Display one frame of your video in your report.

Report Question 3a: What happens to the value of our constraint when we bring one line to the other?

b) Parallel lines constraint

Parallel lines constraint for robot visual servoing.

The equation for the line to line constraint is:

\begin{equation} e_{par} = (l_1 \times l_2) \times (l_3 \times l_4) \end{equation}

Apply the same process in 3a) to make the parallel lines constraint. A live implementation is mandatory.
The recorded video for this question should include rotations that change the sign of the parallel lines constraint.
This time our constraint is vector valued. Overlay the constraint and the points/lines we are tracking onto the video and verify that its magnitude decreases when the books are near parallel, and its sign changes when the books cross the point of being parallel.

Deliverables:

Save the video with overlaid features to include with your submission.
Display one frame of your video in your report.

Report Question 3b: How can we use these constraints to control a robot?

4. Homography Estimation (40)

Sample images for question 3.

See page 39-49 of these slides
See sample results

a) Basic DLT Function

Write a function get_homography(x_1, x_2, normalization=False) that returns the homography matrix, H, between two images, given the 4 (or more) point correspondences, x_1 and x_2 between two images.
Apply your transformation to the first image to verify your results.
Get the point correspondences through clicking on the same four features in two different images.

Deliverables:

Display the two original images and the transformed image in your report.

Report Question 4a: What could be one real life use of this process?

b) Normalization

Implement a normalization procedure into your algorithm. Translate the points such that their mean is centred at the origin. Scale the points such that the average norm of the distance to the origin is \(\sqrt 2\).
Explore performance of your algorithm with your own pictures.

Deliverables:

Using your own images, display the two originals and the transformed image in your report.

Report Question 4b: After the homography is applied, do the overlapping areas in the two images look exactly the same? Why or why not?

Report Question 4c: Correspondences are not always perfect. What happens if some of the correspondences are incorrect? How can this be handled?

5. Bonus (15) - Augmented Reality

Projecting Hartley and Zisserman onto pieces of paper.

Track planar regions: Identify and track one or more four-point planar regions (e.g., a poster or book cover) in a video.
Compute homography: Calculate the homography for each region.
Project image: Warp an image (or images) onto the tracked surface so that, even as the camera moves, the projection remains fixed to the surface.
Be creative! You can project onto trackable surface. An additional 5 bonus marks will be awarded cool-factor, as judged by the TAs.

Deliverables:

Save the video with overlaid projections to include with your submission.

Submission Details

Include accompanying code used to complete each question. Ensure they are adequately commented.
Ensure all functions are and sections are clearly labeled in your report to match the tasks and deliverables outlined in the lab.
Organize files as follows:
- code/ folder containing all scripts used in the assignment.
- media/ folder for images, videos, and results.
Final submission format: a single zip file named CompVisW25_lab2.1_lastname_firstname.zip containing the above structure.
Your combined report for Lab 2.1 and 2.2 is to be submitted at the end of the topic at a later date. The report contains all media, results, and answers as specified in the instructions above. Ensure your answers are concise and directly address the questions.
Total marks for this lab is 100 for all students. Your lab assignment grade with bonus marks is capped at 130%.