深度学习图像处理算法（二）：Canny算子

Canny算子是一种基于多阶段优化的边缘检测方法，旨在以高精度、低误检率提取图像边缘。它综合了噪声抑制、梯度计算和边缘连接等步骤，相较于Sobel算子更为复杂但效果更优。目标：在噪声环境下精确检测图像中的真实边缘。应用：图像分割、轮廓提取、目标检测等。Canny算子是一个精密的边缘检测工具，通过去噪、梯度计算、非极大值抑制和双阈值连接，绘制出清晰完整的边缘线条。它就像一个“边缘雕刻师”，在噪声中精

2501_90215210

2862人浏览 · 2025-03-25 18:37:00

2501_90215210 · 2025-03-25 18:37:00 发布

Canny算子完整笔记：从原理到代码实现

Canny算子是一种经典且高效的边缘检测算法，由John F. Canny于1986年提出，广泛应用于图像处理和计算机视觉。本文将详细介绍Canny算子的原理、步骤、通俗解释，并提供Python代码实现，帮助读者深入理解并掌握这一技术。

一、什么是Canny算子？

Canny算子是一种基于多阶段优化的边缘检测方法，旨在以高精度、低误检率提取图像边缘。它综合了噪声抑制、梯度计算和边缘连接等步骤，相较于Sobel算子更为复杂但效果更优。

目标：在噪声环境下精确检测图像中的真实边缘。
应用：图像分割、轮廓提取、目标检测等。

二、通俗原理解释

1. 边缘是什么？

就像上一节Sobel算子提到的，边缘是图像中亮度变化明显的地方，比如杯子与桌子的交界。Canny算子不仅要找到这些地方，还要确保不被噪声干扰，尽量画出一条“干净”的边缘线。

2. Canny怎么工作？

Canny算子像一个“精致的边缘艺术家”：

去噪：先用高斯模糊“抹平”图像，去掉噪声干扰。
找变化：用梯度计算亮度变化的强度和方向。
筛选：去掉不重要的弱边缘（非极大值抑制）。
连接：用双阈值把边缘连成完整的线条。

它就像在嘈杂的画面中，先擦掉杂点，再用铅笔勾勒出清晰的轮廓。

3. 生活比喻

假设你在画一幅素描：

先用橡皮轻轻擦掉纸上的灰尘（去噪）。
用铅笔描出物体的粗略轮廓（梯度）。
擦掉不明显的线条，只留最强的线（非极大值抑制）。
用细笔把断开的线连接起来（双阈值连接）。

Canny算子就是这样一个从粗糙到精致的过程。

三、数学原理与步骤

Canny算子包含以下五个核心步骤：

1. 高斯模糊（去噪）

噪声会干扰边缘检测，因此先用高斯滤波平滑图像。高斯核公式为：
$\frac{1}{2\pi\sigma^2} e^{-\frac{x^2 + y^2}{2\sigma^2}}$

$σ\sigma$ ：标准差，控制模糊程度。
作用：减少噪声对后续梯度计算的影响。

2. 计算梯度（边缘强度与方向）

与Sobel类似，用卷积核计算水平 ( $G_x$ ) 和垂直 ( $G_y$ ) 梯度：
$G_x = \begin{bmatrix} -1 & 0 & 1 \\ -2 & 0 & 2 \\ -1 & 0 & 1 \end{bmatrix}, \quad G_y = \begin{bmatrix} -1 & -2 & -1 \\ 0 & 0 & 0 \\ 1 & 2 & 1 \end{bmatrix}$

梯度大小： $|\nabla I| = \sqrt{G_x^2 + G_y^2}$
梯度方向： $\theta = \arctan\left(\frac{G_y}{G_x}\right)$
方向通常分为0°、45°、90°、135°四类，用于后续处理。

3. 非极大值抑制

在梯度方向上，比较每个像素的梯度大小，只保留局部最大值，去掉非边缘点。

例如，若方向为水平（0°），比较左右像素，保留最强的。

4. 双阈值检测

设定高阈值 ( $T_h$ ) 和低阈值 ( $T_l$ )：

$∣∇I∣>Th|\nabla I| > T_h$ ：强边缘，直接保留。
$Tl<∣∇I∣<ThT_l < |\nabla I| < T_h$ ：弱边缘，需进一步检查。
$∣∇I∣<Tl|\nabla I| < T_l$ ：非边缘，丢弃。

5. 边缘连接

检查弱边缘像素，若与强边缘相连，则保留，否则丢弃，形成连续边缘。

四、手动计算示例

假设有一块 3 × 3 灰度图像：
$\begin{bmatrix} 10 & 20 & 30 \\ 40 & 50 & 60 \\ 70 & 80 & 90 \end{bmatrix}$

1. 高斯模糊（简化）

用简单高斯核（如 $σ=1\sigma = 1$ 的近似）：
$\frac{1}{16} \begin{bmatrix} 1 & 2 & 1 \\ 2 & 4 & 2 \\ 1 & 2 & 1 \end{bmatrix}$
卷积后（中心像素50为例）：
$\frac{1}{16} (1 \cdot 10 + 2 \cdot 20 + 1 \cdot 30 + 2 \cdot 40 + 4 \cdot 50 + 2 \cdot 60 + 1 \cdot 70 + 2 \cdot 80 + 1 \cdot 90) = 50$

2. 梯度计算

与Sobel示例相同：

$G_x = 80$
$G_y = 240$
$∣∇I∣=802+2402≈252.98|\nabla I| = \sqrt{80^2 + 240^2} \approx 252.98$
$θ=arctan⁡(240/80)≈71.57∘\theta = \arctan(240/80) \approx 71.57^\circ$ （近似45°或90°）。

3. 非极大值抑制（假设）

若方向约90°，比较上下像素，保留局部最大值。

4. 双阈值

设 $T_h = 200$ ， $T_l = 100$ ：

$252.98 > 200$ ：强边缘。

5. 连接

若周围有强边缘相连，则保留。

五、代码实现

以下是使用Python（OpenCV和NumPy）的Canny算子实现。

1. 使用OpenCV的实现

import cv2
import numpy as np

def canny_edge_detection(img, low_threshold=100, high_threshold=200):
    """
    使用Canny算子进行边缘检测
    参数:
        img: 输入图像 (RGB或灰度)
        low_threshold: 低阈值
        high_threshold: 高阈值
    返回:
        edge: 边缘图
    """
    # 如果是RGB图像，先转为灰度
    if len(img.shape) == 3:
        gray = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)
    else:
        gray = img

    # Canny边缘检测
    edge = cv2.Canny(gray, low_threshold, high_threshold)

    return edge

# 测试代码
if __name__ == "__main__":
    # 读取图像
    img_path = "test.jpg"  # 替换为您的图像路径
    img = cv2.imread(img_path)

    # 边缘检测
    edge_img = canny_edge_detection(img, 100, 200)

    # 显示结果
    cv2.imshow("Original", img)
    cv2.imshow("Canny Edge", edge_img)
    cv2.waitKey(0)
    cv2.destroyAllWindows()

    # 保存结果
    cv2.imwrite("canny_edge.jpg", edge_img)

2. 手动实现（简版）

import cv2
import numpy as np

def gaussian_blur(img, sigma=1.0):
    """高斯模糊"""
    ksize = int(6 * sigma + 1) | 1  # 确保奇数
    return cv2.GaussianBlur(img, (ksize, ksize), sigma)

def gradient(img):
    """计算梯度"""
    sobel_x = cv2.Sobel(img, cv2.CV_64F, 1, 0, ksize=3)
    sobel_y = cv2.Sobel(img, cv2.CV_64F, 0, 1, ksize=3)
    magnitude = np.sqrt(sobel_x**2 + sobel_y**2)
    direction = np.arctan2(sobel_y, sobel_x) * 180 / np.pi
    return magnitude, direction

def non_max_suppression(magnitude, direction):
    """非极大值抑制"""
    H, W = magnitude.shape
    out = np.zeros((H, W), dtype=np.float32)
    for i in range(1, H-1):
        for j in range(1, W-1):
            angle = direction[i, j]
            if 0 <= angle < 22.5 or 157.5 <= angle <= 180:
                neighbors = [magnitude[i, j-1], magnitude[i, j+1]]
            elif 22.5 <= angle < 67.5:
                neighbors = [magnitude[i-1, j+1], magnitude[i+1, j-1]]
            elif 67.5 <= angle < 112.5:
                neighbors = [magnitude[i-1, j], magnitude[i+1, j]]
            else:
                neighbors = [magnitude[i-1, j-1], magnitude[i+1, j+1]]
            if magnitude[i, j] >= max(neighbors):
                out[i, j] = magnitude[i, j]
    return out

def double_threshold(img, low, high):
    """双阈值处理"""
    strong = 255
    weak = 75
    out = np.zeros_like(img, dtype=np.uint8)
    strong_i, strong_j = np.where(img >= high)
    weak_i, weak_j = np.where((img >= low) & (img < high))
    out[strong_i, strong_j] = strong
    out[weak_i, weak_j] = weak
    return out

def edge_linking(img):
    """边缘连接"""
    H, W = img.shape
    out = img.copy()
    for i in range(1, H-1):
        for j in range(1, W-1):
            if out[i, j] == 75:  # 弱边缘
                if np.any(out[i-1:i+2, j-1:j+2] == 255):  # 周围有强边缘
                    out[i, j] = 255
                else:
                    out[i, j] = 0
    return out

def canny_manual(img, low=100, high=200):
    """手动实现Canny边缘检测"""
    if len(img.shape) == 3:
        gray = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)
    else:
        gray = img
    # 步骤
    blurred = gaussian_blur(gray)
    mag, dir = gradient(blurred)
    nms = non_max_suppression(mag, dir)
    thresh = double_threshold(nms, low, high)
    edge = edge_linking(thresh)
    return edge

# 测试代码
if __name__ == "__main__":
    img_path = "test.jpg"
    img = cv2.imread(img_path)
    edge_img = canny_manual(img)

    cv2.imshow("Original", img)
    cv2.imshow("Manual Canny Edge", edge_img)
    cv2.waitKey(0)
    cv2.destroyAllWindows()
    cv2.imwrite("manual_canny_edge.jpg", edge_img)