CV1 Introduction
CV1 Introduction
Computer Vision
-- Introduction
钟凡
[email protected]
课程规划
讲授相对不变的基本原理;( 40% )
调研某个前沿技术方向,并完成报告;( 30% )
每位同学独立完成一个书面报告 ( 不求全,但要有一定深度,有实际场景的测试分析 )
每个方向的所有同学合作完成一个口头报告(综述,技术发展路线和代表性方法,测试分析结果等)
计算机视觉—算法与应用
【作者】 Richard Szeliski
【出 版 社】 清华大学出版社
计算机视觉导论
图像表示
Image File Formats
Vector images ( .ai, .eps, .ps, … )
No aliasing and blur when scaling;
Difficult to be obtained, limited applications in practice;
draw circle
center 0.5, 0.5
radius 0.4
fill-color yellow
stroke-color black
stroke-width 0.05
draw circle
center 0.35, 0.4
radius 0.05
fill-color black
…………
Image File Formats
a
光栅化 (rasterize)
The superiority:
its small size & high quality
JPEG — Joint Photographic Experts Group
16-bit-- capable of
displaying millions
of colors at once
without dithering.
a compression setting
of about 60% will
result in the optimum
balance of quality and
file size .
PNG — Portable Network Graphics
ZIP based lossless compression
Can be transparent (4-channel images)
BMP — Windows Bitmap
Simple uncompressed
Can either be indexed or not
DIB (Device Independent Bitmap) / DDB (Device Dependent
Bitmap)
Image (bitmap) representation
Image
Function defined over 2D domain, f(x, y)
f(x, y) f(x, y)
x
lena
y
Image Representation
Digital Image
x, y, f(x, y) take only discrete values
Formed with finite elements
Each element is called a pixel ( 像素 )
picture elements
image elements
pels
pixels - most widely used
pixel
Image in Memory
2D or 3D array
x y
(0, 0)
(0, 0)
y x
Left Handed Right Handed
Image in Program
struct MyImage
{
int width, height; // 大小
…
……
data 0 step 0
(data0, W, H, step0)
data1 step 0
(data1, w, h, step0)
struct MyImage
Access Pixels {
int width, height;
img.type=CV_8UC3 : 8 位无符号, 3 通道数据 int type;
}
OpenCV
CV=Computer Vision
Created by Intel and maintained by Willow Garage.
Available for C, C++, and Python
Cross-platform: Windows, Linux/Mac, Android, iOS
Open Source and free
Plenty of features : more than 500 functions for image
processing and computer vision
Being actively developed and updated
…
Google for more
OpenCV API Reference
Introduction
core. The Core Functionality
imgproc. Image Processing
highgui. High-level GUI and Media I/O
video. Video Analysis
calib3d. Camera Calibration and 3D Reconstruction
features2d. 2D feature detection and matching
objdetect. Object Detection
ml. Machine Learning
flann. Clustering and Search in Multi-Dimensional Spaces
gpu. GPU-accelerated Computer Vision
stitching. Images stitching