保姆級 Keras 實現 YOLO v3 一
- 一. YOLO v3 總覽
- 二. 特征提取網絡
- 特征提取網絡代碼實現
- 三. 特征融合
- 特征融合代碼實現
- 四. 網絡輸出
- 模型輸出代碼實現
- 五. 網絡模型代碼實現
- 六. 代碼下載
如果要給 YOLO 目標檢測算法一個評價的話, 就是快和準, 現在已經到了 v8, 但是我為什么還要寫 v3 呢? 我覺得 v3 是一個節點, 承上啟下的節點. 它有 v1 和 v2 的影子, 也為后面的其他版本奠定了基礎. 對于教學或者學習 YOLO 是極好的
一. YOLO v3 總覽
如果要給 YOLO v3 一個學習的策略的話, 我覺得從整體到局部比較合適, 我們把 YOLO v3 總結如下
相比于祥細的結構圖, 這樣的三個框就把 YOLO v3 概括完了. 后面我們再將各個部分拆開祥細說明, 這就是從整體到局部的策略
二. 特征提取網絡
這是最容易實現的部分, 因為不會涉及到坐標計算與損失函數之類的東西, 只需要按結構用代碼實現即可, 下面是結構圖, 括號里面的數字是各方塊輸出的 shape
這個也不是祥細的結構圖, 祥細的結構圖還需要將各個方塊展開, 前面的數字是 n 個這樣的 Block 重復, 現在把 Conv Block 展開如下
Residual Block 展開如下
特征圖的尺寸是輸入圖像的 1 32 1 \over 32 321?, 但是并沒有用我們常見的 Pooling 來減小特征圖尺寸, 而是使用步長為 2 的卷積層來實現的, 就是各個 Residual Block 之前的 Conv2D 層
Conv2D(kernel_size = (3, 3), strides = (2, 2), padding = "same")
特征提取網絡代碼實現
因為結構有重復性, 所以可以定義一個函數來重復調用
# 定義 cbl (Conv2D, BatchNormalization, LeakyReLU) 函數
def cbl(inputs, filters, kernel_size):x = keras.layers.Conv2D(filters = filters, kernel_size = kernel_size, strides = (1, 1),padding = "valid" if (1, 1) == kernel_size else "same")(inputs)x = keras.layers.BatchNormalization()(x)x = keras.layers.LeakyReLU(alpha = 0.1)(x)return x
接下來定義 Residual Block
# 定義 residual_block 函數
# filters: 第一個 cbl 的卷積核數量, 第二個 cbl 卷積核數量自動乘 2
# repeats: 模塊重復次數
def residual_block(inputs, filters, repeats):x = inputsfor i in range(repeats):x = cbl(x, filters, kernel_size = (1, 1))x = cbl(x, filters * 2, kernel_size = (3, 3))x = keras.layers.Add()([inputs, x])return x
有了這兩個函數, 就可以定義完整的特征提取網絡 darknet
# 定義 darn_net 函數
def dark_net(inputs = None):x = cbl(inputs, filters = 32, kernel_size = (3, 3))x = keras.layers.Conv2D(filters = 64, kernel_size = (3, 3), strides = (2, 2), padding = "same")(x)x = residual_block(x, filters = 32, repeats = 1)x = keras.layers.Conv2D(filters = 128, kernel_size = (3, 3), strides = (2, 2), padding = "same")(x)x = residual_block(x, filters = 64, repeats = 2)x = keras.layers.Conv2D(filters = 256, kernel_size = (3, 3), strides = (2, 2), padding = "same")(x)# 52 × 52 特征圖x_52 = residual_block(x, filters = 128, repeats = 8)x = keras.layers.Conv2D(filters = 512, kernel_size = (3, 3), strides = (2, 2), padding = "same")(x_52)# 26 × 26 特征圖x_26 = residual_block(x, filters = 256, repeats = 8)x = keras.layers.Conv2D(filters = 1024, kernel_size = (3, 3), strides = (2, 2), padding = "same")(x_26)# 13 × 13 特征圖x_13 = residual_block(x, filters = 512, repeats = 4)return x_13, x_26, x_52
這樣就和前面的結構圖對上了, 函數輸出 x_13, x_26, x_52 三層, 后面特征融合的時候會用到
三. 特征融合
這個也沒有什么大問題, 只需要將上面的 13 × 13 特征圖上采樣放大與 26 × 26 特征圖在最后一個維度拼接, 26 × 26 特征圖上采樣放大與 52 × 52 特征圖在最后一個維度拼接, 如下圖
特征融合代碼實現
特征融合 Conv Block 部分也有很多重復的方塊, 所以可以定義成一個函數方便調用
# 定義 cbl block 函數
# filters: 第一個 block 的卷積核數量, 其他會自動計算
def cbl_block(inputs, filters):x = cbl(inputs, filters, kernel_size = (1, 1))x = cbl(x, filters * 2, kernel_size = (3, 3))x = cbl(x, filters, kernel_size = (1, 1))x = cbl(x, filters * 2, kernel_size = (3, 3))x = cbl(x, filters, kernel_size = (1, 1))return x
總的特征融合函數如下
# 定義 neck 函數
def neck(inputs = None):x_13, x_26, x_52 = inputsfeature = cbl_block(x_13, 512)feature = cbl(feature, filters = 256, kernel_size = (1, 1))feature = keras.layers.UpSampling2D(size = (2, 2), interpolation = "bilinear")(feature)feature = keras.layers.Concatenate(axis = -1)([feature, x_26])x_26 = cbl_block(feature, 256)feature = cbl(x_26, filters = 128, kernel_size = (1, 1))feature = keras.layers.UpSampling2D(size = (2, 2), interpolation = "bilinear")(feature)feature = keras.layers.Concatenate(axis = -1)([feature, x_52])x_52 = cbl_block(feature, 128)return x_13, x_26, x_52
四. 網絡輸出
這部分就更簡單了, 將融合后的特征圖做卷積, 變換到對應的通道數, 因為我要訓練的數據集是 VOC2007, 所以輸出通道數為 75 = (4 + 1 + 20) × 3. 模型結構如下
模型輸出代碼實現
輸出函數如下, 輸入是三個融合后的特征圖
# 定義 head 函數
def head(inputs, filters):x_13, x_26, x_52 = inputsx_13 = cbl(x_13, 1024, kernel_size = (3, 3))x_13 = cbl(x_13, filters, kernel_size = (1, 1))x_26 = cbl(x_26, 512, kernel_size = (3, 3))x_26 = cbl(x_26, filters, kernel_size = (1, 1))x_52 = cbl(x_52, 256, kernel_size = (3, 3))x_52 = cbl(x_52, filters, kernel_size = (1, 1))return x_13, x_26, x_52
五. 網絡模型代碼實現
有了上面的相應的函數之后, 定義完整的模型就變得很簡單了, 由 dark_net, neck, head 三部分構成
# 模型定義
image = keras.layers.Input(shape = (416, 416, 3), name = "input")x_13, x_26, x_52 = dark_net(inputs = image)
x_13, x_26, x_52 = neck([x_13, x_26, x_52])
x_13, x_26, x_52 = head([x_13, x_26, x_52], filters = 75)model = keras.Model(inputs = image,outputs = [x_13, x_26, x_52],name = "yolov3")
model.summary()
Model: "yolov3"
__________________________________________________________________________________________________
Layer (type) Output Shape Param # Connected to
==================================================================================================
input (InputLayer) [(None, 416, 416, 3) 0
__________________________________________________________________________________________________
conv2d (Conv2D) (None, 416, 416, 32) 896 input[0][0]
__________________________________________________________________________________________________
batch_normalization (BatchNorma (None, 416, 416, 32) 128 conv2d[0][0]
__________________________________________________________________________________________________
leaky_re_lu (LeakyReLU) (None, 416, 416, 32) 0 batch_normalization[0][0]
__________________________________________________________________________________________________
conv2d_1 (Conv2D) (None, 208, 208, 64) 18496 leaky_re_lu[0][0]
__________________________________________________________________________________________________
conv2d_2 (Conv2D) (None, 208, 208, 32) 2080 conv2d_1[0][0]
__________________________________________________________________________________________________
batch_normalization_1 (BatchNor (None, 208, 208, 32) 128 conv2d_2[0][0]
__________________________________________________________________________________________________
leaky_re_lu_1 (LeakyReLU) (None, 208, 208, 32) 0 batch_normalization_1[0][0]
__________________________________________________________________________________________________
conv2d_3 (Conv2D) (None, 208, 208, 64) 18496 leaky_re_lu_1[0][0]
__________________________________________________________________________________________________
batch_normalization_2 (BatchNor (None, 208, 208, 64) 256 conv2d_3[0][0]
__________________________________________________________________________________________________
leaky_re_lu_2 (LeakyReLU) (None, 208, 208, 64) 0 batch_normalization_2[0][0]
__________________________________________________________________________________________________
add (Add) (None, 208, 208, 64) 0 conv2d_1[0][0] leaky_re_lu_2[0][0]
__________________________________________________________________________________________________
conv2d_4 (Conv2D) (None, 104, 104, 128 73856 add[0][0]
__________________________________________________________________________________________________
conv2d_5 (Conv2D) (None, 104, 104, 64) 8256 conv2d_4[0][0]
__________________________________________________________________________________________________
batch_normalization_3 (BatchNor (None, 104, 104, 64) 256 conv2d_5[0][0]
__________________________________________________________________________________________________
leaky_re_lu_3 (LeakyReLU) (None, 104, 104, 64) 0 batch_normalization_3[0][0]
__________________________________________________________________________________________________
conv2d_6 (Conv2D) (None, 104, 104, 128 73856 leaky_re_lu_3[0][0]
__________________________________________________________________________________________________
batch_normalization_4 (BatchNor (None, 104, 104, 128 512 conv2d_6[0][0]
__________________________________________________________________________________________________
leaky_re_lu_4 (LeakyReLU) (None, 104, 104, 128 0 batch_normalization_4[0][0]
__________________________________________________________________________________________________
add_1 (Add) (None, 104, 104, 128 0 conv2d_4[0][0] leaky_re_lu_4[0][0]
__________________________________________________________________________________________________
conv2d_7 (Conv2D) (None, 104, 104, 64) 8256 add_1[0][0]
__________________________________________________________________________________________________
batch_normalization_5 (BatchNor (None, 104, 104, 64) 256 conv2d_7[0][0]
__________________________________________________________________________________________________
leaky_re_lu_5 (LeakyReLU) (None, 104, 104, 64) 0 batch_normalization_5[0][0]
__________________________________________________________________________________________________
conv2d_8 (Conv2D) (None, 104, 104, 128 73856 leaky_re_lu_5[0][0]
__________________________________________________________________________________________________
batch_normalization_6 (BatchNor (None, 104, 104, 128 512 conv2d_8[0][0]
__________________________________________________________________________________________________
leaky_re_lu_6 (LeakyReLU) (None, 104, 104, 128 0 batch_normalization_6[0][0]
__________________________________________________________________________________________________
add_2 (Add) (None, 104, 104, 128 0 conv2d_4[0][0] leaky_re_lu_6[0][0]
__________________________________________________________________________________________________
conv2d_9 (Conv2D) (None, 52, 52, 256) 295168 add_2[0][0]
__________________________________________________________________________________________________
conv2d_10 (Conv2D) (None, 52, 52, 128) 32896 conv2d_9[0][0]
__________________________________________________________________________________________________
batch_normalization_7 (BatchNor (None, 52, 52, 128) 512 conv2d_10[0][0]
__________________________________________________________________________________________________
leaky_re_lu_7 (LeakyReLU) (None, 52, 52, 128) 0 batch_normalization_7[0][0]
__________________________________________________________________________________________________
conv2d_11 (Conv2D) (None, 52, 52, 256) 295168 leaky_re_lu_7[0][0]
__________________________________________________________________________________________________
batch_normalization_8 (BatchNor (None, 52, 52, 256) 1024 conv2d_11[0][0]
__________________________________________________________________________________________________
leaky_re_lu_8 (LeakyReLU) (None, 52, 52, 256) 0 batch_normalization_8[0][0]
__________________________________________________________________________________________________
add_3 (Add) (None, 52, 52, 256) 0 conv2d_9[0][0] leaky_re_lu_8[0][0]
__________________________________________________________________________________________________
conv2d_12 (Conv2D) (None, 52, 52, 128) 32896 add_3[0][0]
__________________________________________________________________________________________________
batch_normalization_9 (BatchNor (None, 52, 52, 128) 512 conv2d_12[0][0]
__________________________________________________________________________________________________
leaky_re_lu_9 (LeakyReLU) (None, 52, 52, 128) 0 batch_normalization_9[0][0]
__________________________________________________________________________________________________
conv2d_13 (Conv2D) (None, 52, 52, 256) 295168 leaky_re_lu_9[0][0]
__________________________________________________________________________________________________
batch_normalization_10 (BatchNo (None, 52, 52, 256) 1024 conv2d_13[0][0]
__________________________________________________________________________________________________
leaky_re_lu_10 (LeakyReLU) (None, 52, 52, 256) 0 batch_normalization_10[0][0]
__________________________________________________________________________________________________
add_4 (Add) (None, 52, 52, 256) 0 conv2d_9[0][0] leaky_re_lu_10[0][0]
__________________________________________________________________________________________________
conv2d_14 (Conv2D) (None, 52, 52, 128) 32896 add_4[0][0]
__________________________________________________________________________________________________
batch_normalization_11 (BatchNo (None, 52, 52, 128) 512 conv2d_14[0][0]
__________________________________________________________________________________________________
leaky_re_lu_11 (LeakyReLU) (None, 52, 52, 128) 0 batch_normalization_11[0][0]
__________________________________________________________________________________________________
conv2d_15 (Conv2D) (None, 52, 52, 256) 295168 leaky_re_lu_11[0][0]
__________________________________________________________________________________________________
batch_normalization_12 (BatchNo (None, 52, 52, 256) 1024 conv2d_15[0][0]
__________________________________________________________________________________________________
leaky_re_lu_12 (LeakyReLU) (None, 52, 52, 256) 0 batch_normalization_12[0][0]
__________________________________________________________________________________________________
add_5 (Add) (None, 52, 52, 256) 0 conv2d_9[0][0] leaky_re_lu_12[0][0]
__________________________________________________________________________________________________
conv2d_16 (Conv2D) (None, 52, 52, 128) 32896 add_5[0][0]
__________________________________________________________________________________________________
batch_normalization_13 (BatchNo (None, 52, 52, 128) 512 conv2d_16[0][0]
__________________________________________________________________________________________________
leaky_re_lu_13 (LeakyReLU) (None, 52, 52, 128) 0 batch_normalization_13[0][0]
__________________________________________________________________________________________________
conv2d_17 (Conv2D) (None, 52, 52, 256) 295168 leaky_re_lu_13[0][0]
__________________________________________________________________________________________________
batch_normalization_14 (BatchNo (None, 52, 52, 256) 1024 conv2d_17[0][0]
__________________________________________________________________________________________________
leaky_re_lu_14 (LeakyReLU) (None, 52, 52, 256) 0 batch_normalization_14[0][0]
__________________________________________________________________________________________________
add_6 (Add) (None, 52, 52, 256) 0 conv2d_9[0][0] leaky_re_lu_14[0][0]
__________________________________________________________________________________________________
conv2d_18 (Conv2D) (None, 52, 52, 128) 32896 add_6[0][0]
__________________________________________________________________________________________________
batch_normalization_15 (BatchNo (None, 52, 52, 128) 512 conv2d_18[0][0]
__________________________________________________________________________________________________
leaky_re_lu_15 (LeakyReLU) (None, 52, 52, 128) 0 batch_normalization_15[0][0]
__________________________________________________________________________________________________
conv2d_19 (Conv2D) (None, 52, 52, 256) 295168 leaky_re_lu_15[0][0]
__________________________________________________________________________________________________
batch_normalization_16 (BatchNo (None, 52, 52, 256) 1024 conv2d_19[0][0]
__________________________________________________________________________________________________
leaky_re_lu_16 (LeakyReLU) (None, 52, 52, 256) 0 batch_normalization_16[0][0]
__________________________________________________________________________________________________
add_7 (Add) (None, 52, 52, 256) 0 conv2d_9[0][0] leaky_re_lu_16[0][0]
__________________________________________________________________________________________________
conv2d_20 (Conv2D) (None, 52, 52, 128) 32896 add_7[0][0]
__________________________________________________________________________________________________
batch_normalization_17 (BatchNo (None, 52, 52, 128) 512 conv2d_20[0][0]
__________________________________________________________________________________________________
leaky_re_lu_17 (LeakyReLU) (None, 52, 52, 128) 0 batch_normalization_17[0][0]
__________________________________________________________________________________________________
conv2d_21 (Conv2D) (None, 52, 52, 256) 295168 leaky_re_lu_17[0][0]
__________________________________________________________________________________________________
batch_normalization_18 (BatchNo (None, 52, 52, 256) 1024 conv2d_21[0][0]
__________________________________________________________________________________________________
leaky_re_lu_18 (LeakyReLU) (None, 52, 52, 256) 0 batch_normalization_18[0][0]
__________________________________________________________________________________________________
add_8 (Add) (None, 52, 52, 256) 0 conv2d_9[0][0] leaky_re_lu_18[0][0]
__________________________________________________________________________________________________
conv2d_22 (Conv2D) (None, 52, 52, 128) 32896 add_8[0][0]
__________________________________________________________________________________________________
batch_normalization_19 (BatchNo (None, 52, 52, 128) 512 conv2d_22[0][0]
__________________________________________________________________________________________________
leaky_re_lu_19 (LeakyReLU) (None, 52, 52, 128) 0 batch_normalization_19[0][0]
__________________________________________________________________________________________________
conv2d_23 (Conv2D) (None, 52, 52, 256) 295168 leaky_re_lu_19[0][0]
__________________________________________________________________________________________________
batch_normalization_20 (BatchNo (None, 52, 52, 256) 1024 conv2d_23[0][0]
__________________________________________________________________________________________________
leaky_re_lu_20 (LeakyReLU) (None, 52, 52, 256) 0 batch_normalization_20[0][0]
__________________________________________________________________________________________________
add_9 (Add) (None, 52, 52, 256) 0 conv2d_9[0][0] leaky_re_lu_20[0][0]
__________________________________________________________________________________________________
conv2d_24 (Conv2D) (None, 52, 52, 128) 32896 add_9[0][0]
__________________________________________________________________________________________________
batch_normalization_21 (BatchNo (None, 52, 52, 128) 512 conv2d_24[0][0]
__________________________________________________________________________________________________
leaky_re_lu_21 (LeakyReLU) (None, 52, 52, 128) 0 batch_normalization_21[0][0]
__________________________________________________________________________________________________
conv2d_25 (Conv2D) (None, 52, 52, 256) 295168 leaky_re_lu_21[0][0]
__________________________________________________________________________________________________
batch_normalization_22 (BatchNo (None, 52, 52, 256) 1024 conv2d_25[0][0]
__________________________________________________________________________________________________
leaky_re_lu_22 (LeakyReLU) (None, 52, 52, 256) 0 batch_normalization_22[0][0]
__________________________________________________________________________________________________
add_10 (Add) (None, 52, 52, 256) 0 conv2d_9[0][0] leaky_re_lu_22[0][0]
__________________________________________________________________________________________________
conv2d_26 (Conv2D) (None, 26, 26, 512) 1180160 add_10[0][0]
__________________________________________________________________________________________________
conv2d_27 (Conv2D) (None, 26, 26, 256) 131328 conv2d_26[0][0]
__________________________________________________________________________________________________
batch_normalization_23 (BatchNo (None, 26, 26, 256) 1024 conv2d_27[0][0]
__________________________________________________________________________________________________
leaky_re_lu_23 (LeakyReLU) (None, 26, 26, 256) 0 batch_normalization_23[0][0]
__________________________________________________________________________________________________
conv2d_28 (Conv2D) (None, 26, 26, 512) 1180160 leaky_re_lu_23[0][0]
__________________________________________________________________________________________________
batch_normalization_24 (BatchNo (None, 26, 26, 512) 2048 conv2d_28[0][0]
__________________________________________________________________________________________________
leaky_re_lu_24 (LeakyReLU) (None, 26, 26, 512) 0 batch_normalization_24[0][0]
__________________________________________________________________________________________________
add_11 (Add) (None, 26, 26, 512) 0 conv2d_26[0][0] leaky_re_lu_24[0][0]
__________________________________________________________________________________________________
conv2d_29 (Conv2D) (None, 26, 26, 256) 131328 add_11[0][0]
__________________________________________________________________________________________________
batch_normalization_25 (BatchNo (None, 26, 26, 256) 1024 conv2d_29[0][0]
__________________________________________________________________________________________________
leaky_re_lu_25 (LeakyReLU) (None, 26, 26, 256) 0 batch_normalization_25[0][0]
__________________________________________________________________________________________________
conv2d_30 (Conv2D) (None, 26, 26, 512) 1180160 leaky_re_lu_25[0][0]
__________________________________________________________________________________________________
batch_normalization_26 (BatchNo (None, 26, 26, 512) 2048 conv2d_30[0][0]
__________________________________________________________________________________________________
leaky_re_lu_26 (LeakyReLU) (None, 26, 26, 512) 0 batch_normalization_26[0][0]
__________________________________________________________________________________________________
add_12 (Add) (None, 26, 26, 512) 0 conv2d_26[0][0] leaky_re_lu_26[0][0]
__________________________________________________________________________________________________
conv2d_31 (Conv2D) (None, 26, 26, 256) 131328 add_12[0][0]
__________________________________________________________________________________________________
batch_normalization_27 (BatchNo (None, 26, 26, 256) 1024 conv2d_31[0][0]
__________________________________________________________________________________________________
leaky_re_lu_27 (LeakyReLU) (None, 26, 26, 256) 0 batch_normalization_27[0][0]
__________________________________________________________________________________________________
conv2d_32 (Conv2D) (None, 26, 26, 512) 1180160 leaky_re_lu_27[0][0]
__________________________________________________________________________________________________
batch_normalization_28 (BatchNo (None, 26, 26, 512) 2048 conv2d_32[0][0]
__________________________________________________________________________________________________
leaky_re_lu_28 (LeakyReLU) (None, 26, 26, 512) 0 batch_normalization_28[0][0]
__________________________________________________________________________________________________
add_13 (Add) (None, 26, 26, 512) 0 conv2d_26[0][0] leaky_re_lu_28[0][0]
__________________________________________________________________________________________________
conv2d_33 (Conv2D) (None, 26, 26, 256) 131328 add_13[0][0]
__________________________________________________________________________________________________
batch_normalization_29 (BatchNo (None, 26, 26, 256) 1024 conv2d_33[0][0]
__________________________________________________________________________________________________
leaky_re_lu_29 (LeakyReLU) (None, 26, 26, 256) 0 batch_normalization_29[0][0]
__________________________________________________________________________________________________
conv2d_34 (Conv2D) (None, 26, 26, 512) 1180160 leaky_re_lu_29[0][0]
__________________________________________________________________________________________________
batch_normalization_30 (BatchNo (None, 26, 26, 512) 2048 conv2d_34[0][0]
__________________________________________________________________________________________________
leaky_re_lu_30 (LeakyReLU) (None, 26, 26, 512) 0 batch_normalization_30[0][0]
__________________________________________________________________________________________________
add_14 (Add) (None, 26, 26, 512) 0 conv2d_26[0][0] leaky_re_lu_30[0][0]
__________________________________________________________________________________________________
conv2d_35 (Conv2D) (None, 26, 26, 256) 131328 add_14[0][0]
__________________________________________________________________________________________________
batch_normalization_31 (BatchNo (None, 26, 26, 256) 1024 conv2d_35[0][0]
__________________________________________________________________________________________________
leaky_re_lu_31 (LeakyReLU) (None, 26, 26, 256) 0 batch_normalization_31[0][0]
__________________________________________________________________________________________________
conv2d_36 (Conv2D) (None, 26, 26, 512) 1180160 leaky_re_lu_31[0][0]
__________________________________________________________________________________________________
batch_normalization_32 (BatchNo (None, 26, 26, 512) 2048 conv2d_36[0][0]
__________________________________________________________________________________________________
leaky_re_lu_32 (LeakyReLU) (None, 26, 26, 512) 0 batch_normalization_32[0][0]
__________________________________________________________________________________________________
add_15 (Add) (None, 26, 26, 512) 0 conv2d_26[0][0] leaky_re_lu_32[0][0]
__________________________________________________________________________________________________
conv2d_37 (Conv2D) (None, 26, 26, 256) 131328 add_15[0][0]
__________________________________________________________________________________________________
batch_normalization_33 (BatchNo (None, 26, 26, 256) 1024 conv2d_37[0][0]
__________________________________________________________________________________________________
leaky_re_lu_33 (LeakyReLU) (None, 26, 26, 256) 0 batch_normalization_33[0][0]
__________________________________________________________________________________________________
conv2d_38 (Conv2D) (None, 26, 26, 512) 1180160 leaky_re_lu_33[0][0]
__________________________________________________________________________________________________
batch_normalization_34 (BatchNo (None, 26, 26, 512) 2048 conv2d_38[0][0]
__________________________________________________________________________________________________
leaky_re_lu_34 (LeakyReLU) (None, 26, 26, 512) 0 batch_normalization_34[0][0]
__________________________________________________________________________________________________
add_16 (Add) (None, 26, 26, 512) 0 conv2d_26[0][0] leaky_re_lu_34[0][0]
__________________________________________________________________________________________________
conv2d_39 (Conv2D) (None, 26, 26, 256) 131328 add_16[0][0]
__________________________________________________________________________________________________
batch_normalization_35 (BatchNo (None, 26, 26, 256) 1024 conv2d_39[0][0]
__________________________________________________________________________________________________
leaky_re_lu_35 (LeakyReLU) (None, 26, 26, 256) 0 batch_normalization_35[0][0]
__________________________________________________________________________________________________
conv2d_40 (Conv2D) (None, 26, 26, 512) 1180160 leaky_re_lu_35[0][0]
__________________________________________________________________________________________________
batch_normalization_36 (BatchNo (None, 26, 26, 512) 2048 conv2d_40[0][0]
__________________________________________________________________________________________________
leaky_re_lu_36 (LeakyReLU) (None, 26, 26, 512) 0 batch_normalization_36[0][0]
__________________________________________________________________________________________________
add_17 (Add) (None, 26, 26, 512) 0 conv2d_26[0][0] leaky_re_lu_36[0][0]
__________________________________________________________________________________________________
conv2d_41 (Conv2D) (None, 26, 26, 256) 131328 add_17[0][0]
__________________________________________________________________________________________________
batch_normalization_37 (BatchNo (None, 26, 26, 256) 1024 conv2d_41[0][0]
__________________________________________________________________________________________________
leaky_re_lu_37 (LeakyReLU) (None, 26, 26, 256) 0 batch_normalization_37[0][0]
__________________________________________________________________________________________________
conv2d_42 (Conv2D) (None, 26, 26, 512) 1180160 leaky_re_lu_37[0][0]
__________________________________________________________________________________________________
batch_normalization_38 (BatchNo (None, 26, 26, 512) 2048 conv2d_42[0][0]
__________________________________________________________________________________________________
leaky_re_lu_38 (LeakyReLU) (None, 26, 26, 512) 0 batch_normalization_38[0][0]
__________________________________________________________________________________________________
add_18 (Add) (None, 26, 26, 512) 0 conv2d_26[0][0] leaky_re_lu_38[0][0]
__________________________________________________________________________________________________
conv2d_43 (Conv2D) (None, 13, 13, 1024) 4719616 add_18[0][0]
__________________________________________________________________________________________________
conv2d_44 (Conv2D) (None, 13, 13, 512) 524800 conv2d_43[0][0]
__________________________________________________________________________________________________
batch_normalization_39 (BatchNo (None, 13, 13, 512) 2048 conv2d_44[0][0]
__________________________________________________________________________________________________
leaky_re_lu_39 (LeakyReLU) (None, 13, 13, 512) 0 batch_normalization_39[0][0]
__________________________________________________________________________________________________
conv2d_45 (Conv2D) (None, 13, 13, 1024) 4719616 leaky_re_lu_39[0][0]
__________________________________________________________________________________________________
batch_normalization_40 (BatchNo (None, 13, 13, 1024) 4096 conv2d_45[0][0]
__________________________________________________________________________________________________
leaky_re_lu_40 (LeakyReLU) (None, 13, 13, 1024) 0 batch_normalization_40[0][0]
__________________________________________________________________________________________________
add_19 (Add) (None, 13, 13, 1024) 0 conv2d_43[0][0] leaky_re_lu_40[0][0]
__________________________________________________________________________________________________
conv2d_46 (Conv2D) (None, 13, 13, 512) 524800 add_19[0][0]
__________________________________________________________________________________________________
batch_normalization_41 (BatchNo (None, 13, 13, 512) 2048 conv2d_46[0][0]
__________________________________________________________________________________________________
leaky_re_lu_41 (LeakyReLU) (None, 13, 13, 512) 0 batch_normalization_41[0][0]
__________________________________________________________________________________________________
conv2d_47 (Conv2D) (None, 13, 13, 1024) 4719616 leaky_re_lu_41[0][0]
__________________________________________________________________________________________________
batch_normalization_42 (BatchNo (None, 13, 13, 1024) 4096 conv2d_47[0][0]
__________________________________________________________________________________________________
leaky_re_lu_42 (LeakyReLU) (None, 13, 13, 1024) 0 batch_normalization_42[0][0]
__________________________________________________________________________________________________
add_20 (Add) (None, 13, 13, 1024) 0 conv2d_43[0][0] leaky_re_lu_42[0][0]
__________________________________________________________________________________________________
conv2d_48 (Conv2D) (None, 13, 13, 512) 524800 add_20[0][0]
__________________________________________________________________________________________________
batch_normalization_43 (BatchNo (None, 13, 13, 512) 2048 conv2d_48[0][0]
__________________________________________________________________________________________________
leaky_re_lu_43 (LeakyReLU) (None, 13, 13, 512) 0 batch_normalization_43[0][0]
__________________________________________________________________________________________________
conv2d_49 (Conv2D) (None, 13, 13, 1024) 4719616 leaky_re_lu_43[0][0]
__________________________________________________________________________________________________
batch_normalization_44 (BatchNo (None, 13, 13, 1024) 4096 conv2d_49[0][0]
__________________________________________________________________________________________________
leaky_re_lu_44 (LeakyReLU) (None, 13, 13, 1024) 0 batch_normalization_44[0][0]
__________________________________________________________________________________________________
add_21 (Add) (None, 13, 13, 1024) 0 conv2d_43[0][0] leaky_re_lu_44[0][0]
__________________________________________________________________________________________________
conv2d_50 (Conv2D) (None, 13, 13, 512) 524800 add_21[0][0]
__________________________________________________________________________________________________
batch_normalization_45 (BatchNo (None, 13, 13, 512) 2048 conv2d_50[0][0]
__________________________________________________________________________________________________
leaky_re_lu_45 (LeakyReLU) (None, 13, 13, 512) 0 batch_normalization_45[0][0]
__________________________________________________________________________________________________
conv2d_51 (Conv2D) (None, 13, 13, 1024) 4719616 leaky_re_lu_45[0][0]
__________________________________________________________________________________________________
batch_normalization_46 (BatchNo (None, 13, 13, 1024) 4096 conv2d_51[0][0]
__________________________________________________________________________________________________
leaky_re_lu_46 (LeakyReLU) (None, 13, 13, 1024) 0 batch_normalization_46[0][0]
__________________________________________________________________________________________________
add_22 (Add) (None, 13, 13, 1024) 0 conv2d_43[0][0] leaky_re_lu_46[0][0]
__________________________________________________________________________________________________
conv2d_52 (Conv2D) (None, 13, 13, 512) 524800 add_22[0][0]
__________________________________________________________________________________________________
batch_normalization_47 (BatchNo (None, 13, 13, 512) 2048 conv2d_52[0][0]
__________________________________________________________________________________________________
leaky_re_lu_47 (LeakyReLU) (None, 13, 13, 512) 0 batch_normalization_47[0][0]
__________________________________________________________________________________________________
conv2d_53 (Conv2D) (None, 13, 13, 1024) 4719616 leaky_re_lu_47[0][0]
__________________________________________________________________________________________________
batch_normalization_48 (BatchNo (None, 13, 13, 1024) 4096 conv2d_53[0][0]
__________________________________________________________________________________________________
leaky_re_lu_48 (LeakyReLU) (None, 13, 13, 1024) 0 batch_normalization_48[0][0]
__________________________________________________________________________________________________
conv2d_54 (Conv2D) (None, 13, 13, 512) 524800 leaky_re_lu_48[0][0]
__________________________________________________________________________________________________
batch_normalization_49 (BatchNo (None, 13, 13, 512) 2048 conv2d_54[0][0]
__________________________________________________________________________________________________
leaky_re_lu_49 (LeakyReLU) (None, 13, 13, 512) 0 batch_normalization_49[0][0]
__________________________________________________________________________________________________
conv2d_55 (Conv2D) (None, 13, 13, 1024) 4719616 leaky_re_lu_49[0][0]
__________________________________________________________________________________________________
batch_normalization_50 (BatchNo (None, 13, 13, 1024) 4096 conv2d_55[0][0]
__________________________________________________________________________________________________
leaky_re_lu_50 (LeakyReLU) (None, 13, 13, 1024) 0 batch_normalization_50[0][0]
__________________________________________________________________________________________________
conv2d_56 (Conv2D) (None, 13, 13, 512) 524800 leaky_re_lu_50[0][0]
__________________________________________________________________________________________________
batch_normalization_51 (BatchNo (None, 13, 13, 512) 2048 conv2d_56[0][0]
__________________________________________________________________________________________________
leaky_re_lu_51 (LeakyReLU) (None, 13, 13, 512) 0 batch_normalization_51[0][0]
__________________________________________________________________________________________________
conv2d_57 (Conv2D) (None, 13, 13, 256) 131328 leaky_re_lu_51[0][0]
__________________________________________________________________________________________________
batch_normalization_52 (BatchNo (None, 13, 13, 256) 1024 conv2d_57[0][0]
__________________________________________________________________________________________________
leaky_re_lu_52 (LeakyReLU) (None, 13, 13, 256) 0 batch_normalization_52[0][0]
__________________________________________________________________________________________________
up_sampling2d (UpSampling2D) (None, 26, 26, 256) 0 leaky_re_lu_52[0][0]
__________________________________________________________________________________________________
concatenate (Concatenate) (None, 26, 26, 768) 0 up_sampling2d[0][0] add_18[0][0]
__________________________________________________________________________________________________
conv2d_58 (Conv2D) (None, 26, 26, 256) 196864 concatenate[0][0]
__________________________________________________________________________________________________
batch_normalization_53 (BatchNo (None, 26, 26, 256) 1024 conv2d_58[0][0]
__________________________________________________________________________________________________
leaky_re_lu_53 (LeakyReLU) (None, 26, 26, 256) 0 batch_normalization_53[0][0]
__________________________________________________________________________________________________
conv2d_59 (Conv2D) (None, 26, 26, 512) 1180160 leaky_re_lu_53[0][0]
__________________________________________________________________________________________________
batch_normalization_54 (BatchNo (None, 26, 26, 512) 2048 conv2d_59[0][0]
__________________________________________________________________________________________________
leaky_re_lu_54 (LeakyReLU) (None, 26, 26, 512) 0 batch_normalization_54[0][0]
__________________________________________________________________________________________________
conv2d_60 (Conv2D) (None, 26, 26, 256) 131328 leaky_re_lu_54[0][0]
__________________________________________________________________________________________________
batch_normalization_55 (BatchNo (None, 26, 26, 256) 1024 conv2d_60[0][0]
__________________________________________________________________________________________________
leaky_re_lu_55 (LeakyReLU) (None, 26, 26, 256) 0 batch_normalization_55[0][0]
__________________________________________________________________________________________________
conv2d_61 (Conv2D) (None, 26, 26, 512) 1180160 leaky_re_lu_55[0][0]
__________________________________________________________________________________________________
batch_normalization_56 (BatchNo (None, 26, 26, 512) 2048 conv2d_61[0][0]
__________________________________________________________________________________________________
leaky_re_lu_56 (LeakyReLU) (None, 26, 26, 512) 0 batch_normalization_56[0][0]
__________________________________________________________________________________________________
conv2d_62 (Conv2D) (None, 26, 26, 256) 131328 leaky_re_lu_56[0][0]
__________________________________________________________________________________________________
batch_normalization_57 (BatchNo (None, 26, 26, 256) 1024 conv2d_62[0][0]
__________________________________________________________________________________________________
leaky_re_lu_57 (LeakyReLU) (None, 26, 26, 256) 0 batch_normalization_57[0][0]
__________________________________________________________________________________________________
conv2d_63 (Conv2D) (None, 26, 26, 128) 32896 leaky_re_lu_57[0][0]
__________________________________________________________________________________________________
batch_normalization_58 (BatchNo (None, 26, 26, 128) 512 conv2d_63[0][0]
__________________________________________________________________________________________________
leaky_re_lu_58 (LeakyReLU) (None, 26, 26, 128) 0 batch_normalization_58[0][0]
__________________________________________________________________________________________________
up_sampling2d_1 (UpSampling2D) (None, 52, 52, 128) 0 leaky_re_lu_58[0][0]
__________________________________________________________________________________________________
concatenate_1 (Concatenate) (None, 52, 52, 384) 0 up_sampling2d_1[0][0] add_10[0][0]
__________________________________________________________________________________________________
conv2d_64 (Conv2D) (None, 52, 52, 128) 49280 concatenate_1[0][0]
__________________________________________________________________________________________________
batch_normalization_59 (BatchNo (None, 52, 52, 128) 512 conv2d_64[0][0]
__________________________________________________________________________________________________
leaky_re_lu_59 (LeakyReLU) (None, 52, 52, 128) 0 batch_normalization_59[0][0]
__________________________________________________________________________________________________
conv2d_65 (Conv2D) (None, 52, 52, 256) 295168 leaky_re_lu_59[0][0]
__________________________________________________________________________________________________
batch_normalization_60 (BatchNo (None, 52, 52, 256) 1024 conv2d_65[0][0]
__________________________________________________________________________________________________
leaky_re_lu_60 (LeakyReLU) (None, 52, 52, 256) 0 batch_normalization_60[0][0]
__________________________________________________________________________________________________
conv2d_66 (Conv2D) (None, 52, 52, 128) 32896 leaky_re_lu_60[0][0]
__________________________________________________________________________________________________
batch_normalization_61 (BatchNo (None, 52, 52, 128) 512 conv2d_66[0][0]
__________________________________________________________________________________________________
leaky_re_lu_61 (LeakyReLU) (None, 52, 52, 128) 0 batch_normalization_61[0][0]
__________________________________________________________________________________________________
conv2d_67 (Conv2D) (None, 52, 52, 256) 295168 leaky_re_lu_61[0][0]
__________________________________________________________________________________________________
batch_normalization_62 (BatchNo (None, 52, 52, 256) 1024 conv2d_67[0][0]
__________________________________________________________________________________________________
leaky_re_lu_62 (LeakyReLU) (None, 52, 52, 256) 0 batch_normalization_62[0][0]
__________________________________________________________________________________________________
conv2d_68 (Conv2D) (None, 52, 52, 128) 32896 leaky_re_lu_62[0][0]
__________________________________________________________________________________________________
batch_normalization_63 (BatchNo (None, 52, 52, 128) 512 conv2d_68[0][0]
__________________________________________________________________________________________________
leaky_re_lu_63 (LeakyReLU) (None, 52, 52, 128) 0 batch_normalization_63[0][0]
__________________________________________________________________________________________________
conv2d_69 (Conv2D) (None, 13, 13, 1024) 9438208 add_22[0][0]
__________________________________________________________________________________________________
conv2d_71 (Conv2D) (None, 26, 26, 512) 1180160 leaky_re_lu_57[0][0]
__________________________________________________________________________________________________
conv2d_73 (Conv2D) (None, 52, 52, 256) 295168 leaky_re_lu_63[0][0]
__________________________________________________________________________________________________
batch_normalization_64 (BatchNo (None, 13, 13, 1024) 4096 conv2d_69[0][0]
__________________________________________________________________________________________________
batch_normalization_66 (BatchNo (None, 26, 26, 512) 2048 conv2d_71[0][0]
__________________________________________________________________________________________________
batch_normalization_68 (BatchNo (None, 52, 52, 256) 1024 conv2d_73[0][0]
__________________________________________________________________________________________________
leaky_re_lu_64 (LeakyReLU) (None, 13, 13, 1024) 0 batch_normalization_64[0][0]
__________________________________________________________________________________________________
leaky_re_lu_66 (LeakyReLU) (None, 26, 26, 512) 0 batch_normalization_66[0][0]
__________________________________________________________________________________________________
leaky_re_lu_68 (LeakyReLU) (None, 52, 52, 256) 0 batch_normalization_68[0][0]
__________________________________________________________________________________________________
conv2d_70 (Conv2D) (None, 13, 13, 75) 76875 leaky_re_lu_64[0][0]
__________________________________________________________________________________________________
conv2d_72 (Conv2D) (None, 26, 26, 75) 38475 leaky_re_lu_66[0][0]
__________________________________________________________________________________________________
conv2d_74 (Conv2D) (None, 52, 52, 75) 19275 leaky_re_lu_68[0][0]
__________________________________________________________________________________________________
batch_normalization_65 (BatchNo (None, 13, 13, 75) 300 conv2d_70[0][0]
__________________________________________________________________________________________________
batch_normalization_67 (BatchNo (None, 26, 26, 75) 300 conv2d_72[0][0]
__________________________________________________________________________________________________
batch_normalization_69 (BatchNo (None, 52, 52, 75) 300 conv2d_74[0][0]
__________________________________________________________________________________________________
leaky_re_lu_65 (LeakyReLU) (None, 13, 13, 75) 0 batch_normalization_65[0][0]
__________________________________________________________________________________________________
leaky_re_lu_67 (LeakyReLU) (None, 26, 26, 75) 0 batch_normalization_67[0][0]
__________________________________________________________________________________________________
leaky_re_lu_69 (LeakyReLU) (None, 52, 52, 75) 0 batch_normalization_69[0][0]
==================================================================================================
Total params: 66,416,517
Trainable params: 66,367,427
Non-trainable params: 49,090
__________________________________________________________________________________________________
加上輸入層一共 243 層
六. 代碼下載
示例代碼可下載 Jupyter Notebook 示例代碼
下一篇: 保姆級 Keras 實現 YOLO v3 二