0% found this document useful (0 votes)

8 views23 pages

CV1 Introduction

Uploaded by

spacemankevinh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

8 views23 pages

CV1 Introduction

Uploaded by

spacemankevinh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 23

计算机视觉

Computer Vision

-- Introduction

钟凡
[email protected]
课程规划
 讲授相对不变的基本原理；（ 40% ）

 调研某个前沿技术方向，并完成报告；（ 30% ）
 每位同学独立完成一个书面报告 ( 不求全，但要有一定深度，有实际场景的测试分析 )
 每个方向的所有同学合作完成一个口头报告（综述，技术发展路线和代表性方法，测试分析结果等）

 实验： 10 个小实验 or 一个大作业二选一（ 30% ）

 个人完成
 都指定题目
参考书

 计算机视觉—算法与应用
 【作者】 Richard Szeliski
 【出版社】清华大学出版社
计算机视觉导论
图像表示
Image File Formats
 Vector images （ .ai, .eps, .ps, … ）
 No aliasing and blur when scaling;
 Difficult to be obtained, limited applications in practice;

draw circle
center 0.5, 0.5
radius 0.4
fill-color yellow
stroke-color black
stroke-width 0.05
draw circle
center 0.35, 0.4
radius 0.05
fill-color black
…………
Image File Formats

 Bitmap （ .bmp, .jpg, .png, .gif,… ）

 Easy to get, wide applications;
 Becomes blur and aliased when scaling ；

a
光栅化 (rasterize)

Vector -> Bitmap

GIF — Graphics Interchange Format
 8-bit indexed, can be saved with a maximum of 256 colors;
 having the option to dither– (will mix pixels of two different
available colors to create a suggestion of another color)
 can be animated . transparent.

The superiority:
its small size & high quality
JPEG — Joint Photographic Experts Group
16-bit-- capable of
displaying millions
of colors at once
without dithering.

a compression setting
of about 60% will
result in the optimum
balance of quality and
file size .
PNG — Portable Network Graphics
 ZIP based lossless compression
 Can be transparent (4-channel images)
BMP — Windows Bitmap
 Simple uncompressed
 Can either be indexed or not
 DIB (Device Independent Bitmap) / DDB (Device Dependent
Bitmap)
Image (bitmap) representation
 Image
 Function defined over 2D domain, f(x, y)

f(x, y) f(x, y)
x

lena
y
Image Representation
 Digital Image
 x, y, f(x, y) take only discrete values
 Formed with finite elements
 Each element is called a pixel ( 像素 )
 picture elements
 image elements
 pels
 pixels - most widely used

pixel
Image in Memory
 2D or 3D array

[B,G,R] [B,G,R] … [B,G,R] [B] [B] [B] [B] … [B]

[B,G,R] [B,G,R] … [B,G,R] [B] [B] [B] [B] … [B]
……………………………
[B,G,R] [B,G,R] … [B,G,R] [G] [G] [G] [G] … [G]
……………………………… [G] [G] [G] [G] … [G]
……………………………
[B,G,R] [B,G,R] … [B,G,R]
[R] [R] [R] [R] … [R]
[B,G,R] [B,G,R] … [B,G,R]
[R] [R] [R] [R] … [R]
[B,G,R] [B,G,R] … [B,G,R] ……………………………

交叉存贮 (Interlaced) 顺序存贮 (Sequential)

Image in Memory

 Size (resolution, dpi, width*height, number of pixels)

 Color Space (RGB, CMYK, YUV, Lab, …)
 Channels （ 1 ， 2 ， 3 ， 4 ， gray&color ）
 Bit Depth （ number of bits for each channel, 8bits, 12bits,
……,LDR ＆ HDR ）
 Coordinate system ：

x y
(0, 0)

(0, 0)
y x
Left Handed Right Handed
Image in Program

struct MyImage
{
int width, height; // 大小
…

int type; // 类型，含通道数、位深度信息

/* CV_8UC3 : unsigned char [3]
CV_32SC1 : int [1]
CV_32UC1 : uint [1]
CV_32FC4 : float [4]
*/

void* data; // 图像数据

int step; // 步长（每行所占用的字节数）
};
step ? （ stride, 步长）
 For data-alignment: make each row start from address that are multiple of 4, 8, or 16.

 For representing sub-region (ROI) ：

……

data 0  step 0
(data0, W, H, step0)
data1  step 0
(data1, w, h, step0)
struct MyImage
Access Pixels {
int width, height;
 img.type=CV_8UC3 ： 8 位无符号， 3 通道数据 int type;

uchar* get_pixel(const MyImage &img, int x, int y) void* data;

{ int step;
???????????? };
}

 img.type=CV_32SC3: 32 位带符号， 3 通道数据

int* get_pixel(const MyImage &img, int x, int y)

{
??????????????
}
Access Pixels

 img.type=CV_8UC3 ： 8 位无符号， 3 通道数据

uchar* get_pixel(const MyImage &img, int x, int y)

{
// return (uchar*)img.data+y*img.width*3+x*3;
step != width*nc
return (uchar*)img.data+y*img.step+x*3;
}

 img.type=CV_32SC3: 32 位带符号， 3 通道数据

int* get_pixel(const MyImage &img, int x, int y)

{
// return (int*)( (char*)img.data+y*img.step*4+x*3*4 );
step 始终是字节数
return (int*)( (char*)img.data+y*img.step+x*3*4 );
}
 Scan Pixels
void scan_pixels(uchar *data, int width, int height, int step, int nc)
{
uchar *row=data;
for(int yi=0; yi<height; ++yi, row+=step)
{
uchar *px=row;
for(int xi=0; xi<width; ++xi, px+=nc)
{
// px now address the pixel (xi, yi)
}
}
}

 Scan Pixels in ROI

void scan_roi_pixels(MyImage &img, int x, int y, int roi_width, int roi_height)

{// 通道数 nc=img.nc();
???????????????????????????
}
 Scan Pixels
void scan_pixels(uchar *data, int width, int height, int step, int nc)
{
//…….
}

 Scan Pixels in ROI

void scan_roi_pixels(MyImage &img, int x, int y, int roi_width, int roi_height)

{// 通道数 nc=img.nc();

scan_pixels( get_pixel(img, x, y), roi_width, roi_height, img.step, img.nc() );

}
OpenCV

 CV=Computer Vision
 Created by Intel and maintained by Willow Garage.
 Available for C, C++, and Python
 Cross-platform: Windows, Linux/Mac, Android, iOS
 Open Source and free
 Plenty of features : more than 500 functions for image
processing and computer vision
 Being actively developed and updated
 …
 Google for more
OpenCV API Reference
 Introduction
 core. The Core Functionality
 imgproc. Image Processing
 highgui. High-level GUI and Media I/O
 video. Video Analysis
 calib3d. Camera Calibration and 3D Reconstruction
 features2d. 2D feature detection and matching
 objdetect. Object Detection
 ml. Machine Learning
 flann. Clustering and Search in Multi-Dimensional Spaces
 gpu. GPU-accelerated Computer Vision
 stitching. Images stitching

IP Fundamentals
No ratings yet
IP Fundamentals
41 pages
Image Processing: Robotics Club Summer Camp'12
No ratings yet
Image Processing: Robotics Club Summer Camp'12
28 pages
CV2-Image Processing 1
No ratings yet
CV2-Image Processing 1
117 pages
Computer Graphics Syllabus
No ratings yet
Computer Graphics Syllabus
4 pages
Image Formation Fundamentals: CS308 Data Structures
No ratings yet
Image Formation Fundamentals: CS308 Data Structures
41 pages
Digital Image Processing
No ratings yet
Digital Image Processing
19 pages
Computer Vision CS-6350: Prof. Sukhendu Das Deptt. of Computer Science and Engg., IIT Madras, Chennai - 600036
No ratings yet
Computer Vision CS-6350: Prof. Sukhendu Das Deptt. of Computer Science and Engg., IIT Madras, Chennai - 600036
48 pages
Documentation Image Processing Day 1
No ratings yet
Documentation Image Processing Day 1
11 pages
The Opencv 1.X C Reference Manual: Release 2.3
No ratings yet
The Opencv 1.X C Reference Manual: Release 2.3
284 pages
OpenCV Lections: 3. Mat Class
100% (1)
OpenCV Lections: 3. Mat Class
16 pages
Basic Image Processing
No ratings yet
Basic Image Processing
32 pages
COMPUTER VISION notes
No ratings yet
COMPUTER VISION notes
3 pages
2_
No ratings yet
2_
74 pages
Opencv Beginners
100% (2)
Opencv Beginners
17 pages
Image
No ratings yet
Image
3 pages
Scan Conversion
100% (2)
Scan Conversion
22 pages
Documentation Image Processing Day 1
No ratings yet
Documentation Image Processing Day 1
11 pages
Opencv: Electronics Club, Iitk
No ratings yet
Opencv: Electronics Club, Iitk
27 pages
Ch-Computer Vision
No ratings yet
Ch-Computer Vision
6 pages
ImageProcessingTutorial
No ratings yet
ImageProcessingTutorial
28 pages
Computer Vision
No ratings yet
Computer Vision
15 pages
Computer Vision Class 10 AI Notes CBSE
No ratings yet
Computer Vision Class 10 AI Notes CBSE
8 pages
Hardware Software Codesign Assignment
No ratings yet
Hardware Software Codesign Assignment
7 pages
Digital Image Processing - Lecture Notes
0% (1)
Digital Image Processing - Lecture Notes
32 pages
Introduction To The Opencv Library
No ratings yet
Introduction To The Opencv Library
6 pages
An Introduction To Opencv Using Python With Ubuntu: Krupali Mistry, Avneet Saluja
No ratings yet
An Introduction To Opencv Using Python With Ubuntu: Krupali Mistry, Avneet Saluja
4 pages
106105216
No ratings yet
106105216
1,053 pages
Color Detection Opencv
No ratings yet
Color Detection Opencv
4 pages
Extra credit III
No ratings yet
Extra credit III
12 pages
Appendix Ii. Image Processing Using Opencv Library
No ratings yet
Appendix Ii. Image Processing Using Opencv Library
11 pages
Unit 1 - Cga - 2021
No ratings yet
Unit 1 - Cga - 2021
40 pages
Machine Vision and Image Processing Algorithm - Machine Vision and Image Processing Algorithm Fall 2009 Mario
No ratings yet
Machine Vision and Image Processing Algorithm - Machine Vision and Image Processing Algorithm Fall 2009 Mario
47 pages
Raster Images, Raster Devices AND Pixmap Manipulation: Marc Levoy
No ratings yet
Raster Images, Raster Devices AND Pixmap Manipulation: Marc Levoy
35 pages
Introduction To Programming With OpenCV
No ratings yet
Introduction To Programming With OpenCV
20 pages
Appendix 2 Introduction To Opencv: Speaker: 黃世勳
No ratings yet
Appendix 2 Introduction To Opencv: Speaker: 黃世勳
35 pages
College of Information Science and Engineering. Central South University. Changsha, Hunan, 410083, P.R China
100% (2)
College of Information Science and Engineering. Central South University. Changsha, Hunan, 410083, P.R China
37 pages
Lecture W2abc 2
No ratings yet
Lecture W2abc 2
39 pages
Opencv2refman Py
No ratings yet
Opencv2refman Py
172 pages
Computer Vision
No ratings yet
Computer Vision
19 pages
Lecture1_merged
No ratings yet
Lecture1_merged
182 pages
Basics of OpenCV API
No ratings yet
Basics of OpenCV API
10 pages
Gip Uniy-1 Notes 2018-19
No ratings yet
Gip Uniy-1 Notes 2018-19
47 pages
ch3
No ratings yet
ch3
22 pages
CG Module-1
No ratings yet
CG Module-1
66 pages
Opencv Cheatsheet
No ratings yet
Opencv Cheatsheet
2 pages
Image Reference Guide: Install Pillow
No ratings yet
Image Reference Guide: Install Pillow
4 pages
CS463 Cis - Digital Image Processing
No ratings yet
CS463 Cis - Digital Image Processing
5 pages
2D Graphics: John E. Laird
No ratings yet
2D Graphics: John E. Laird
27 pages
Chapter 1
No ratings yet
Chapter 1
58 pages
CV_SVD_L02_P1_IntroImageProcColor
No ratings yet
CV_SVD_L02_P1_IntroImageProcColor
89 pages
CSE4019 Image - Processing ETH 1 AC41
No ratings yet
CSE4019 Image - Processing ETH 1 AC41
13 pages
Computer Vision
No ratings yet
Computer Vision
29 pages
Coding Assignment C++
No ratings yet
Coding Assignment C++
2 pages
Digital Image Processing: Fundamentals and Applications
From Everand
Digital Image Processing: Fundamentals and Applications
Fouad Sabry
No ratings yet
Vector Graphics Editor: Empowering Visual Creation with Advanced Algorithms
From Everand
Vector Graphics Editor: Empowering Visual Creation with Advanced Algorithms
Fouad Sabry
No ratings yet
Raster Graphics Editor: Transforming Visual Realities: Mastering Raster Graphics Editors in Computer Vision
From Everand
Raster Graphics Editor: Transforming Visual Realities: Mastering Raster Graphics Editors in Computer Vision
Fouad Sabry
No ratings yet
Raster Graphics: Understanding the Foundations of Raster Graphics in Computer Vision
From Everand
Raster Graphics: Understanding the Foundations of Raster Graphics in Computer Vision
Fouad Sabry
No ratings yet
Digital Raster Graphic: Unveiling the Power of Digital Raster Graphics in Computer Vision
From Everand
Digital Raster Graphic: Unveiling the Power of Digital Raster Graphics in Computer Vision
Fouad Sabry
No ratings yet
Volume Rendering: Exploring Visual Realism in Computer Vision
From Everand
Volume Rendering: Exploring Visual Realism in Computer Vision
Fouad Sabry
No ratings yet
Scanline Rendering: Exploring Visual Realism Through Scanline Rendering Techniques
From Everand
Scanline Rendering: Exploring Visual Realism Through Scanline Rendering Techniques
Fouad Sabry
No ratings yet
Estrategia Sr1 Target
No ratings yet
Estrategia Sr1 Target
7 pages
UVM ASSIGNMENT 2 Rahulss
No ratings yet
UVM ASSIGNMENT 2 Rahulss
6 pages
Pt. Amarta Indonesia Makmur: Judul: Safety Talk
No ratings yet
Pt. Amarta Indonesia Makmur: Judul: Safety Talk
5 pages
'Ramesh' 'Ahmedabad' 'Khilan' 'Delhi' 'Kaushik' 'Kota' 'Chaitali' 'Mumbai' 'Hardik' 'Bhopal' 'Komal' 'MP'
No ratings yet
'Ramesh' 'Ahmedabad' 'Khilan' 'Delhi' 'Kaushik' 'Kota' 'Chaitali' 'Mumbai' 'Hardik' 'Bhopal' 'Komal' 'MP'
9 pages
HTML Access Installation 2111
No ratings yet
HTML Access Installation 2111
70 pages
Snowflake · Streamlit
No ratings yet
Snowflake · Streamlit
4 pages
Volinfo
No ratings yet
Volinfo
1 page
YOLICO YD101 ModBus
No ratings yet
YOLICO YD101 ModBus
33 pages
Peter Shirley - Ray Tracing in One Weekend (2016)
No ratings yet
Peter Shirley - Ray Tracing in One Weekend (2016)
38 pages
Experiment Report DIY Spectrometer
No ratings yet
Experiment Report DIY Spectrometer
6 pages
Facebook User?
No ratings yet
Facebook User?
11 pages
Ip Project
No ratings yet
Ip Project
27 pages
Keysight X-Series RELEASE NOTES
No ratings yet
Keysight X-Series RELEASE NOTES
107 pages
Baulkham Hills 2019 2U Prelim Yearly & Solutions
No ratings yet
Baulkham Hills 2019 2U Prelim Yearly & Solutions
20 pages
ET4280 ACN-04 Key Distribution and User Authentication
No ratings yet
ET4280 ACN-04 Key Distribution and User Authentication
35 pages
1738595104120 Class 9 Artificial Intelligence Study Material4147
No ratings yet
1738595104120 Class 9 Artificial Intelligence Study Material4147
144 pages
Ellicium Solutions Mock Test A
No ratings yet
Ellicium Solutions Mock Test A
25 pages
Lasya Priya Capstone
No ratings yet
Lasya Priya Capstone
64 pages
Module 4 - Lecture 5
No ratings yet
Module 4 - Lecture 5
24 pages
Design and Implementation of CNC Machine Remote Monitoring and Controlling System Based On Embedded Internet
No ratings yet
Design and Implementation of CNC Machine Remote Monitoring and Controlling System Based On Embedded Internet
4 pages
Microproject NIS Incomp
No ratings yet
Microproject NIS Incomp
21 pages
EEE Minutes BoS To Dean 2024-2025
No ratings yet
EEE Minutes BoS To Dean 2024-2025
33 pages
3 Computer Memory
No ratings yet
3 Computer Memory
24 pages
Software Release Life Cycle
No ratings yet
Software Release Life Cycle
9 pages
EDR vs. XDR vs. SIEM vs. MDR vs. SOAR
No ratings yet
EDR vs. XDR vs. SIEM vs. MDR vs. SOAR
7 pages
CH 01 Eng S v1.0
No ratings yet
CH 01 Eng S v1.0
37 pages
Cookbook Examples Langchain Gemini LangChain QA Chroma WebLoad - Ipynb at Main Google-Gemini Cookbook
No ratings yet
Cookbook Examples Langchain Gemini LangChain QA Chroma WebLoad - Ipynb at Main Google-Gemini Cookbook
8 pages
Buy ebook Introductory Differential Equations Fourth Edition Martha L. Abell cheap price
100% (1)
Buy ebook Introductory Differential Equations Fourth Edition Martha L. Abell cheap price
82 pages
FDS Answer Bank 11-20
No ratings yet
FDS Answer Bank 11-20
38 pages
Data Communication & Computer Network
No ratings yet
Data Communication & Computer Network
3 pages

CV1 Introduction

Uploaded by

CV1 Introduction

Uploaded by

计算机视觉

 实验： 10 个小实验 or 一个大作业 二选一 （ 30% ）

 Bitmap （ .bmp, .jpg, .png, .gif,… ）

Vector -> Bitmap

[B,G,R] [B,G,R] … [B,G,R] [B] [B] [B] [B] … [B]

交叉存贮 (Interlaced) 顺序存贮 (Sequential)

 Size (resolution, dpi, width*height, number of pixels)

int type; // 类型，含通道数、位深度信息

void* data; // 图像数据

 For representing sub-region (ROI) ：

uchar* get_pixel(const MyImage &img, int x, int y) void* data;

 img.type=CV_32SC3: 32 位带符号， 3 通道数据

int* get_pixel(const MyImage &img, int x, int y)

 img.type=CV_8UC3 ： 8 位无符号， 3 通道数据

uchar* get_pixel(const MyImage &img, int x, int y)

 img.type=CV_32SC3: 32 位带符号， 3 通道数据

int* get_pixel(const MyImage &img, int x, int y)

 Scan Pixels in ROI

void scan_roi_pixels(MyImage &img, int x, int y, int roi_width, int roi_height)

 Scan Pixels in ROI

void scan_roi_pixels(MyImage &img, int x, int y, int roi_width, int roi_height)

scan_pixels( get_pixel(img, x, y), roi_width, roi_height, img.step, img.nc() );

You might also like

 实验： 10 个小实验 or 一个大作业二选一（ 30% ）