Dissecting the Caffe Source Code: Layer

Latest recommended article published on 2021-10-09 23:36:27


License: CC 4.0 BY-SA

Tags: #caffe #layer


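This article takes a close look at the Layer module of the Caffe framework: how a Layer is constructed, what key functions such as SetUp, Forward, and Backward do, and how LayerParameter is configured. Layer is the basic unit of computation in a neural network, and each kind of network layer is implemented as a derived class.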


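Introduction to Layer

If Caffe is compared to a building, Blobs are the bricks and Layers are the floors built from those bricks. Layers are a key part of a neural network model and the basis of the whole computation. The previous article examined the Blob source code; a Layer takes Blobs as its input and output. In essence, a Layer computes its output from its input, and each layer performs one specific kind of computation, such as convolution, pooling, a nonlinear transform, an inner product, data loading, normalization, or loss computation.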

The figure above shows a typical convolution layer: it takes a bottom blob as input and produces a top blob as output.
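The bottom-to-top relationship can be illustrated with a minimal sketch. It does not use Caffe itself: `ToyBlob` and `ToyReLULayer` are hypothetical stand-ins invented for this example, and a ReLU is used instead of a convolution only to keep the computation short.

```cpp
#include <algorithm>
#include <cassert>
#include <vector>

// Hypothetical stand-in for caffe::Blob: just a flat array of values.
struct ToyBlob {
  std::vector<float> data;
};

// Hypothetical layer that performs one specific computation (ReLU):
// it reads its bottom blob and writes its top blob.
struct ToyReLULayer {
  void Forward(const ToyBlob& bottom, ToyBlob* top) const {
    assert(top != nullptr);
    top->data.resize(bottom.data.size());
    // Elementwise max(x, 0): negative inputs become zero.
    std::transform(bottom.data.begin(), bottom.data.end(), top->data.begin(),
                   [](float x) { return std::max(x, 0.0f); });
  }
};
```

Stacking layers then amounts to passing the top blob of one layer as the bottom blob of the next.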

Official documentation: http://caffe.berkeleyvision.org/tutorial/net_layer_blob.html

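Layer Analysis

Before reading the Layer source code, it helps to understand the layer's parameters. Besides Blobs, a Layer takes many other parameters that configure its behavior.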

LayerParameter

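LayerParameter is defined in src\caffe\proto\caffe.proto in ProtoBuf format; the previous article covered the basic ProtoBuf syntax.

Note that a comment precedes the LayerParameter message. It records how far the field number IDs of this message have been used and which parameter was added most recently, because ProtoBuf requires the IDs within a message to be unique. The original code documents each parameter in detail: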

message LayerParameter {
  optional string name = 1; // the layer name
  optional string type = 2; // the layer type
  repeated string bottom = 3; // the name of each bottom blob (the layer's inputs)
  repeated string top = 4; // the name of each top blob (the layer's outputs)

  // The train / test phase for computation.
  optional Phase phase = 10; // training or testing: TRAIN = 0 is used for training, TEST = 1 for testing

  // The amount of weight to assign each top blob in the objective.
  // Each layer assigns a default value, usually of either 0 or 1,
  // to each top blob.
  repeated float loss_weight = 5; // loss weight of each output top blob

  // Specifies training parameters (multipliers on global learning constants,
  // and the name and other settings used for weight sharing).
  repeated ParamSpec param = 6; // training-specific parameters; see the ParamSpec message

  // The blobs containing the numeric parameters of the layer.
  repeated BlobProto blobs = 7; // the parameter blobs of the layer

  // Specifies whether to backpropagate to each bottom. If unspecified,
  // Caffe will automatically infer whether each input needs backpropagation
  // to compute parameter gradients. If set to true for some inputs,
  // backpropagation to those inputs is forced; if set false for some inputs,
  // backpropagation to those inputs is skipped.
  //
  // The size must be either 0 or equal to the number of bottoms.
  repeated bool propagate_down = 11;

  // Rules controlling whether and when a layer is included in the network,
  // based on the current NetState.  You may specify a non-zero number of rules
  // to include OR exclude, but not both.  If no include or exclude rules are
  // specified, the layer is always included.  If the current NetState meets
  // ANY (i.e., one or more) of the specified rules, the layer is
  // included/excluded.
  repeated NetStateRule include = 8;
  repeated NetStateRule exclude = 9;

  // Parameters for data pre-processing.
  optional TransformationParameter transform_param = 100;

  // Parameters shared by loss layers.
  optional LossParameter loss_param = 101;

  // Layer type-specific parameters.
  //
  // Note: certain layers may have more than one computational engine
  // for their implementation. These layers include an Engine type and
  // engine parameter for selecting the implementation.
  // The default for the engine is set by the ENGINE switch at compile-time.
  optional AccuracyParameter accuracy_param = 102;
  optional ArgMaxParameter argmax_param = 103;
  optional BatchNormParameter batch_norm_param = 139;
  optional BiasParameter bias_param = 141;
  optional ClipParameter clip_param = 148;
  optional ConcatParameter concat_param = 104;
  optional ContrastiveLossParameter contrastive_loss_param = 105;
  optional ConvolutionParameter convolution_param = 106;
  optional CropParameter crop_param = 144;
  optional DataParameter data_param = 107;
  optional DropoutParameter dropout_param = 108;
  optional DummyDataParameter dummy_data_param = 109;
  optional EltwiseParameter eltwise_param = 110;
  optional ELUParameter elu_param = 140;
  optional EmbedParameter embed_param = 137;
  optional ExpParameter exp_param = 111;
  optional FlattenParameter flatten_param = 135;
  optional HDF5DataParameter hdf5_data_param = 112;
  optional HDF5OutputParameter hdf5_output_param = 113;
  optional HingeLossParameter hinge_loss_param = 114;
  optional ImageDataParameter image_data_param = 115;
  optional InfogainLossParameter infogain_loss_param = 116;
  optional InnerProductParameter inner_product_param = 117;
  optional InputParameter input_param = 143;
  optional LogParameter log_param = 134;
  optional LRNParameter lrn_param = 118;
  optional MemoryDataParameter memory_data_param = 119;
  optional MVNParameter mvn_param = 120;
  optional ParameterParameter parameter_param = 145;
  optional PoolingParameter pooling_param = 121;
  optional PowerParameter power_param = 122;
  optional PReLUParameter prelu_param = 131;
  optional PythonParameter python_param = 130;
  optional RecurrentParameter recurrent_param = 146;
  optional ReductionParameter reduction_param = 136;
  optional ReLUParameter relu_param = 123;
  optional ReshapeParameter reshape_param = 133;
  optional ScaleParameter scale_param = 142;
  optional SigmoidParameter sigmoid_param = 124;
  optional SoftmaxParameter softmax_param = 125;
  optional SPPParameter spp_param = 132;
  optional SliceParameter slice_param = 126;
  optional SwishParameter swish_param = 147;
  optional TanHParameter tanh_param = 127;
  optional ThresholdParameter threshold_param = 128;
  optional TileParameter tile_param = 138;
  optional WindowDataParameter window_data_param = 129;
}
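The include/exclude comment in the message above describes a small decision rule, which can be sketched as follows. This is a simplified illustration under stated assumptions, not Caffe's implementation: `ToyPhase`, `ToyRule`, `ToyLayerSpec`, `MeetsAny`, and `LayerIsIncluded` are hypothetical names, and each rule here matches on the phase only, whereas a real NetStateRule carries further constraints.

```cpp
#include <cassert>
#include <vector>

enum class ToyPhase { TRAIN = 0, TEST = 1 };

// Hypothetical, simplified rule: matches when the phase is equal.
struct ToyRule {
  ToyPhase phase;
};

// Hypothetical subset of LayerParameter: only the two rule lists.
struct ToyLayerSpec {
  std::vector<ToyRule> include;
  std::vector<ToyRule> exclude;
};

// True if the current phase meets ANY (one or more) of the rules.
inline bool MeetsAny(const std::vector<ToyRule>& rules, ToyPhase current) {
  for (const ToyRule& rule : rules) {
    if (rule.phase == current) return true;
  }
  return false;
}

inline bool LayerIsIncluded(const ToyLayerSpec& spec, ToyPhase current) {
  // Rules may be given for include OR exclude, but not both.
  assert(spec.include.empty() || spec.exclude.empty());
  if (!spec.include.empty()) return MeetsAny(spec.include, current);
  if (!spec.exclude.empty()) return !MeetsAny(spec.exclude, current);
  return true;  // No rules at all: the layer is always included.
}
```

With this sketch, a spec whose only include rule names `ToyPhase::TRAIN` is kept during training and dropped during testing, which is the usual way to give a net different data layers per phase.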

Class Layer

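Layer is one of the more complex modules in Caffe. The Layer class is the base class of all layers and defines their common interface. Its header file is caffe\include\caffe\layer.hpp and its C++ source file is \src\caffe\layer.cpp.

Most of the interfaces in layer.hpp are virtual functions whose concrete behavior is supplied by the derived classes. The main interface members of class Layer are listed below: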

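Category	Method or member	Description
Constructor and destructor	explicit Layer(const LayerParameter& param)	Explicit constructor taking a LayerParameter
Constructor and destructor	virtual ~Layer()	Virtual destructor; derived classes may provide their own
Setup and LayerParameter functions	void SetUp(const vector<Blob<Dtype>*>& bottom, const vector<Blob<Dtype>*>& top)	Common setup function of a layer, mainly parameter configuration; the bottom blobs are the layer's inputs and the top blobs are its outputs
	virtual void LayerSetUp(const vector<Blob<Dtype>*>& bottom, const vector<Blob<Dtype>*>& top)	Virtual; layer-specific setup, implemented by the derived class
	virtual void Reshape(const vector<Blob<Dtype>*>& bottom, const vector<Blob<Dtype>*>& top)	Virtual; sets the shapes of the top blobs from the shapes of the bottom blobs, implemented by the derived class
	const LayerParameter& layer_param()	Returns the LayerParameter the layer was constructed with, which is stored in the protected member layer_param_
	virtual void ToProto(LayerParameter* param, bool write_diff = false)	Writes the layer's configuration and parameter blobs into param; write_diff controls whether the diffs are written as well
	virtual inline const char* type()	Virtual; returns the layer type, implemented by the derived class
Forward and backward propagation	inline Dtype Forward(const vector<Blob<Dtype>*>& bottom, const vector<Blob<Dtype>*>& top)	Forward pass; bottom is the input and top is the output
	inline void Backward(const vector<Blob<Dtype>*>& top, const vector<bool>& propagate_down, const vector<Blob<Dtype>*>& bottom)	Backward pass; top supplies the error gradients and the gradients for bottom are computed from them; propagate_down is a vector of flags with the same size as bottom that marks whether the gradient is propagated to each bottom blob
	virtual inline bool AllowForceBackward(const int bottom_index)	Virtual; whether forced backpropagation is allowed for the given bottom blob. If AllowForceBackward(i) == false, the force_backward setting is ignored; implemented by the derived class
	inline bool param_propagate_down(const int param_id)	Returns whether the layer computes the gradient with respect to a weight or bias blob; param_id selects which one
	inline void set_param_propagate_down(const int param_id, const bool value)	Sets whether the layer computes the gradient with respect to a weight or bias blob; param_id selects which one
	virtual void Forward_cpu(const vector<Blob<Dtype>*>& bottom, const vector<Blob<Dtype>*>& top)	Virtual; CPU version of the forward pass, implemented by the derived class
	virtual void Forward_gpu(const vector<Blob<Dtype>*>& bottom, const vector<Blob<Dtype>*>& top)	Virtual; GPU version of the forward pass, implemented by the derived class
Loss	inline Dtype loss(const int top_index)	Returns the scalar loss value associated with the top blob at top_index
	inline void set_loss(const int top_index, const Dtype value)	Sets the scalar loss value associated with the top blob at top_index
Blob counts	virtual inline int ExactNumBottomBlobs()	Virtual; returns the exact number of bottom blobs the layer requires, implemented by the derived class
	virtual inline int MinBottomBlobs()	Virtual; returns the minimum number of bottom blobs the layer requires, implemented by the derived class
	virtual inline int MaxBottomBlobs()	Virtual; returns the maximum number of bottom blobs the layer expects, implemented by the derived class
	virtual inline int ExactNumTopBlobs()	Virtual; returns the exact number of top blobs the layer outputs, implemented by the derived class
	virtual inline int MinTopBlobs()	Virtual; returns the minimum number of top blobs the layer expects, implemented by the derived class
	virtual inline int MaxTopBlobs()	Virtual; returns the maximum number of top blobs the layer expects, implemented by the derived class
	virtual inline bool EqualNumBottomTopBlobs()	Virtual; returns true if the number of bottom blobs must equal the number of top blobs, false otherwise; implemented by the derived class
	virtual inline bool AutoTopBlobs()	Virtual; whether anonymous top blobs may be created automatically for the layer. If true, Net::Init() creates enough anonymous top blobs to satisfy the layer's ExactNumTopBlobs() or MinTopBlobs() requirement
Member variables	LayerParameter layer_param_	Stores the LayerParameter
	Phase phase_	The phase: TRAIN or TEST
	vector<shared_ptr<Blob<Dtype> > > blobs_	The layer's internal weights and bias terms, organized as Blobs
	vector<bool> param_propagate_down_	Flags marking whether the error gradient is computed for each parameter blob
	vector<Dtype> loss_	Indicates, for each top blob, whether it has a non-zero weight in the objective function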
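How these pieces fit together can be sketched with a much reduced base class. Everything here is hypothetical and written for illustration only: `MiniBlob`, `MiniLayer`, and `DoubleLayer` are not Caffe types, only the CPU path is modelled, the loss returned by the real Forward is omitted, and the placement of the blob-count check inside SetUp is an assumption of this sketch.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for caffe::Blob<Dtype>: values plus gradients.
struct MiniBlob {
  std::vector<float> data;
  std::vector<float> diff;
};

class MiniLayer {
 public:
  virtual ~MiniLayer() {}

  // Non-virtual entry point: check the blob count, then let the derived
  // class do its own setup and shape its outputs.
  void SetUp(const std::vector<MiniBlob*>& bottom,
             const std::vector<MiniBlob*>& top) {
    if (ExactNumBottomBlobs() >= 0) {
      assert(static_cast<int>(bottom.size()) == ExactNumBottomBlobs());
    }
    LayerSetUp(bottom, top);
    Reshape(bottom, top);
  }

  // Wrappers that forward to the device-specific virtual functions.
  void Forward(const std::vector<MiniBlob*>& bottom,
               const std::vector<MiniBlob*>& top) {
    Forward_cpu(bottom, top);
  }
  void Backward(const std::vector<MiniBlob*>& top,
                const std::vector<bool>& propagate_down,
                const std::vector<MiniBlob*>& bottom) {
    Backward_cpu(top, propagate_down, bottom);
  }

  virtual int ExactNumBottomBlobs() const { return -1; }  // -1: no requirement
  virtual void LayerSetUp(const std::vector<MiniBlob*>&,
                          const std::vector<MiniBlob*>&) {}
  virtual void Reshape(const std::vector<MiniBlob*>& bottom,
                       const std::vector<MiniBlob*>& top) = 0;

 protected:
  virtual void Forward_cpu(const std::vector<MiniBlob*>& bottom,
                           const std::vector<MiniBlob*>& top) = 0;
  virtual void Backward_cpu(const std::vector<MiniBlob*>& top,
                            const std::vector<bool>& propagate_down,
                            const std::vector<MiniBlob*>& bottom) = 0;
};

// Hypothetical derived layer: top = 2 * bottom.
class DoubleLayer : public MiniLayer {
 public:
  int ExactNumBottomBlobs() const override { return 1; }
  void Reshape(const std::vector<MiniBlob*>& bottom,
               const std::vector<MiniBlob*>& top) override {
    // The output shape follows the input shape.
    top[0]->data.resize(bottom[0]->data.size());
    top[0]->diff.resize(bottom[0]->data.size());
  }

 protected:
  void Forward_cpu(const std::vector<MiniBlob*>& bottom,
                   const std::vector<MiniBlob*>& top) override {
    for (std::size_t i = 0; i < bottom[0]->data.size(); ++i) {
      top[0]->data[i] = 2.0f * bottom[0]->data[i];
    }
  }
  void Backward_cpu(const std::vector<MiniBlob*>& top,
                    const std::vector<bool>& propagate_down,
                    const std::vector<MiniBlob*>& bottom) override {
    if (!propagate_down[0]) return;  // Gradient not requested for bottom 0.
    bottom[0]->diff.resize(top[0]->diff.size());
    for (std::size_t i = 0; i < top[0]->diff.size(); ++i) {
      bottom[0]->diff[i] = 2.0f * top[0]->diff[i];  // d(2x)/dx = 2
    }
  }
};
```

The design point is that callers only ever use the non-virtual SetUp, Forward, and Backward of the base class, while each derived layer fills in the virtual hooks, which mirrors the split between wrapper functions and virtual functions in the table above.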