Summary
- Semester: 1
- Points: 6
- Formal Contact: 4 hours per week
- Total Workload: 10 hours per week
The aim of this course is to provide an overview of digital image
processing and computer vision analysis. Both areas are large enough
to justify a course on their own; this 1-semester course covers the
fundamentals of image formation, low-level image processing and
enhancement in both the spatial and Fourier domains, and gives a
flavour of higher level vision tasks by studying some of the
Structure-from-X algorithms, such as Structure from Stereo, and
Structure from Motion. Finally, we briefy cover some aspects
of Shape from Shading and Photometric Stereo.
The course consists of 13 two-hour lectures
and a weekly 2 hour laboratory session. The
assessment for this course is via a 2 hour written examination at the
end of the semester (for 70%) and a portfolio of practical laboratory
exercises done throughout the semester (for 30%) submitted
on Friday 30th of May 2008 (i.e. at the end of
the second last week of the 1st semeter).
Almost all the laboratory work is done using MATLAB.
Unit Coordinator
Dr. Du Huynh
Room 1.22, CSSE Building
Lecturers
* Dr. Du Huynh
Consultation hours: TBA
* Assoc. Prof. R. Kozera
Room 1.10, CSSE Building
Email: ryszard@csse.uwa.edu.au
Consultation hours: Thursdays 13:00-14:00
Lectures
With the absence of a suitable text a considerable amount of
supplementary material has been prepared for the course over
the years by Dr. Peter Kovesi and
Prof. Robyn Owens. This material is not intended to replace the notes presented in
lectures.
Material presented in lectures may include areas not covered by these
supplementary notes.
Note: Not all of the links below under Lectures and Laboratories
are currently pointing to files that exist.
As we progress through the semester, these links will start to point
to the right place. It is your responsibility to constantly
check this website to download the supplementary notes yourself.
Lecture note coverpage ( pdf file )
-
Introduction, image formation, camera models and perspective geometry.
( pdf file )
-
Binary images: thresholding, moments, topology.
( pdf file )
-
Connected components, and morphological operations.
( pdf file )
Sample Matlab code
-
Fourier transform theory
( pdf file )
Supplementary materials:
DFT,
Matlab demo.
Web demo:
demo1.
-
Image enhancement
( pdf file )
Convolution and Correlation
( pdf file )
Some useful demos on the web:
demo1,
demo2.
-
Edge detection: classical methods
( pdf file )
-
Camera calibration
( pdf file )
-
Projective Geometry: Single View Reconstruction
( pdf file )
-
Stereo and structured light
( pdf file )
Self-calibration and fundamental matrix
( pdf file )
-
Motion
( pdf file )
-
(optional) Tracking and the Kalman Filter ( pdf file )
-
Colour
( pdf file )
-
Shape from Shading
-
Photometric Stereo
Lectopia (recording of lectures) ***
Laboratories
Laboratory exercises will be posted progressively over the
semester. Do a 'reload' to ensure you have the most up to date links.
The work done in these lab exercises will form your portfolio of
practical assessment.
Note: The portfolio is due on Friday 30th of May 2008 (week 12) at 12 noon.
Instruction about submission (in the electronic and hard copy)
will be announced here later in the semester.
- Lab Sheet 1 Introduction to grey
scale manipulation.
- Lab Sheet 2 Binary images and
object moments.
- Lab Sheet 3 Morphology.
- Lab Sheet 4 The Fourier
Transform and the convolution theorem.
- Lab Sheet 5 Constructing filters
in the frequency domain.
- Lab Sheet 6 Edge detection and the Hough Transform.
Note that information in the lab sheet on finding local maxima in
the Hough transform has been updated.
- Lab Sheet 7 Camera calibration
- Lab Sheet 8 Stereo reconstruction - The last lab sheet!
How to construct your portfolio.
This document provides an introduction to HTML and also explains how
to save and resize images and plots from MATLAB for incorporation in
your portfolio.
Help
- We want you to help each other (within reason!)
Post questions, look for answers and respond to other people's questions at
Self Help 4240 (DISCUSSION FORUM).
If you have a problem please look here!
Please only email the lecturers regarding issues that are personal to yourself.
- How To... solve some common
problems.
- Frequently Asked Questions.
- An introduction to Linux
- MATLAB lecture
notes for Computing for Engineers and Scientists, our 1st year MATLAB unit. Lectures 4 to 13
will cover all you need to know.
Some useful MATLAB references
Images
Past Exams
Texts
There is no specific text book specified for the course.
The recommended reading for this course includes:
- Image Processing, Analysis, and Machine Vision
by Sonka, Hlavac, and Boyle (ITP, 1999) MPSL 006.37 1999 IMG.
- Introductory Techniques for 3-D Computer Vision
by Emanuele Trucco and Alessandro Verri (Prentice Hall 1998)
MPSL 006.37 1998 INT
- Machine Vision: Theory, Algorithms, Practicalities
by E. R. Davies (Morgan Kaufmann Publications 2005)
- Machine Vision
by Ramesh Jain, Rangachar Kasturi and Brian Schunk (McGraw Hill 1995).
MPSL 006.42 1995 MAC
- Digital Image Processing
by Rafael C Gonzalez and Richard E Woods (Addison Wesley, 1992) MPSL 621.367 1992 DIG.
- Digital Image Processing using MATLAB
by Rafael C Gonzalez, Richard E Woods, and Steven L. Eddins
(Pearson Prentice Hall, c2004) MPSL 621.367 2004 DIG.
- Multiple view geometry in computer vision
by Richard Hartley and Andrew Zisserman (Cambridge University Press, 2004, 2nd Ed)
- A Guided Tour of Computer Vision
by Vishvjit S. Nalwa (Addison Wesley, 1993) MPSL 006.42 1993 GUI.
- Robot Vision
by Berthold Klaus Paul Horn (MIT Press, 1986) MPSL 629.892 1986 ROB.
- The Reconfigured Eye
by William J. Mitchell (MIT Press, 1992) EDFAA 621.367 1992 REC.
Some of the books above have been placed in closed reserve in the
Mathematics and Physical Sciences Library (MPSL).
Digital Image Processing by Gonzalez and Woods is available at
the CoOp Bookshop in the Guild Village and at United Booksellers, shop
5, 126 Broadway. It can be worthwhile comparing prices at these two shops.
Some useful links
Links marked with a * are outside PARNET. You will need to
use the proxy
server to access these sites.
- MATLAB links
- Vision reference material
- Mathematical reference material
- Software and commercial sites
- *Adept Electronic
Solutions is a local machine vision company. They also award the student prize for Computer Vision CITS4240.
- *DDD Dynamic Digital
Depth is a local company that specialises in converting 2D movies
into 3D, and developing various tools for facilitating this process.
- *The Official XV Home Page
- *Intel's
OpenCV library, a very comprehensive open source vision library of C functions.
- *Vision 1 a commercial site with some useful info
|