pytorch中获取模型input/outputshape实例_获取模型输入输出形状资源-CSDN下载

5星 · 超过95%的资源 181 浏览量 2020-09-18 05:00:39 上传评论 1 收藏 42KB PDF 举报

在PyTorch中，获取模型的输入(input)和输出(output)形状(shape)并不像在TensorFlow或Caffe那样直接，因为PyTorch的设计更注重灵活性。然而，可以通过编写自定义代码来实现这一功能。以下是一个实例，展示了如何通过遍历模型的层并计算其前向传播输出的形状来获取输入和输出形状。我们需要导入必要的库： ```python from collections import OrderedDict import torch from torch.autograd import Variable import torch.nn as nn ``` 接下来，定义一个函数`get_output_size`，这个函数递归地处理输出，如果输出是元组，它会继续处理每个元素： ```python def get_output_size(summary_dict, output): if isinstance(output, tuple): for i in range(len(output)): summary_dict[i] = OrderedDict() summary_dict[i] = get_output_size(summary_dict[i], output[i]) else: summary_dict['output_shape'] = list(output.size()) return summary_dict ``` 然后，定义主函数`summary`，它接受输入尺寸(input_size)和模型(model)作为参数。这个函数会遍历模型的所有层，并记录输入和输出形状： ```python def summary(input_size, model): def register_hook(module): def hook(module, input, output): class_name = str(module.__class__).split('.')[-1].split("'")[0] module_idx = len(summary) m_key = '%s-%i' % (class_name, module_idx + 1) summary[m_key] = OrderedDict() summary[m_key]['input_shape'] = list(input[0].size()) summary[m_key] = get_output_size(summary[m_key], output) params = 0 if hasattr(module, 'weight'): params += torch.prod(torch.LongTensor(list(module.weight.size()))) if module.weight.requires_grad: summary[m_key]['trainable'] = True else: summary[m_key]['trainable'] = False # 如果有偏置项，可以添加类似处理 # if hasattr(module, 'bias'): # params += torch.prod(torch.LongTensor(list(module.bias.size()))) summary[m_key]['nb_params'] = params if not isinstance(module, nn.Sequential) and \ not isinstance(module, nn.ModuleList) and \ not (module == model): hooks.append(module.register_forward_hook(hook)) # 检查是否有多个输入到网络 if isinstance(input_size[0], (list, tuple)): x = [Variable(torch.rand(1, *in_size)) for in_size in input_size] else: x = Variable(torch.rand(1, *input_size)) # 创建属性 summary = OrderedDict() hooks = [] # 注册hook model.apply(register_hook) # 运行前向传播 with torch.no_grad(): model(x) # 移除所有hook for h in hooks: h.remove() return summary ``` 在上述代码中，`register_hook`是一个内部辅助函数，用于注册一个hook到每个模块上，当前向传播执行时，`hook`函数会被调用，从而记录每个模块的输入和输出形状。`summary`函数首先创建一个空的OrderedDict，然后遍历模型的所有子模块，注册hook。在运行前向传播后，所有的形状信息都会被记录下来。注意，这个方法需要构造一个实际的输入数据调用`model(x)`，以便让模型的前向传播执行。此外，此方法仅适用于那些权重存储在`weight`属性中的模块，对于像RNN这样的模块，可能需要额外的处理，因为它们的权重和偏置存储方式可能不同。你可以通过调用`summary(input_size, model)`并传入你的模型和输入尺寸来获取模型的输入输出形状信息。例如，如果你有一个卷积神经网络(CNN)，并且知道输入图像的尺寸是224x224x3，你可以这样使用： ```python input_size = (3, 224, 224) model = your_cnn_model print(summary(input_size, model)) ``` 这将输出一个详细的字典，包含每个层的输入形状、输出形状、是否训练以及参数数量。这个信息对于理解和调试模型非常有用。

资源推荐

资源详情

资源评论

pytorch中获取模型中获取模型input/output shape实例实例

今天小编就为大家分享一篇pytorch中获取模型input/output shape实例，具有很好的参考价值，希望对大家有所

帮助。一起跟随小编过来看看吧

Pytorch官方目前无法像tensorflow, caffe那样直接给出shape信息，详见

https://github.com/pytorch/pytorch/pull/3043

以下代码算一种workaround。由于CNN, RNN等模块实现不一样，添加其他模块支持可能需要改代码。

例如RNN中bias是bool类型，其权重也不是存于weight属性中，不过我们只关注shape够用了。

该方法必须构造一个输入调用forward后（model(x)调用）才可获取shape

#coding:utf-8

from collections import OrderedDict

import torch

from torch.autograd import Variable

import torch.nn as nn

import models.crnn as crnn

import json

def get_output_size(summary_dict, output):

if isinstance(output, tuple):

for i in xrange(len(output)):

summary_dict[i] = OrderedDict()

summary_dict[i] = get_output_size(summary_dict[i],output[i])

else:

summary_dict['output_shape'] = list(output.size())

return summary_dict

def summary(input_size, model):

def register_hook(module):

def hook(module, input, output):

class_name = str(module.__class__).split('.')[-1].split("'")[0]

module_idx = len(summary)

m_key = '%s-%i' % (class_name, module_idx+1)

summary[m_key] = OrderedDict()

summary[m_key]['input_shape'] = list(input[0].size())

summary[m_key] = get_output_size(summary[m_key], output)

params = 0

if hasattr(module, 'weight'):

params += torch.prod(torch.LongTensor(list(module.weight.size())))

if module.weight.requires_grad:

summary[m_key]['trainable'] = True

else:

summary[m_key]['trainable'] = False

#if hasattr(module, 'bias'):

# params += torch.prod(torch.LongTensor(list(module.bias.size())))

summary[m_key]['nb_params'] = params

if not isinstance(module, nn.Sequential) and \

not isinstance(module, nn.ModuleList) and \

not (module == model):

hooks.append(module.register_forward_hook(hook))

# check if there are multiple inputs to the network

if isinstance(input_size[0], (list, tuple)):

x = [Variable(torch.rand(1,*in_size)) for in_size in input_size]

else:

x = Variable(torch.rand(1,*input_size))

# create properties

summary = OrderedDict()

hooks = []

# register hook

model.apply(register_hook)

# make a forward pass

model(x)

# remove these hooks

for h in hooks:

h.remove()

本内容试读结束，登录后可阅读更多

下载后可阅读完整内容，剩余3页未读，立即下载

评论收藏

内容反馈

泡泡SOHO

2023-06-20

如果你是Pytorch新手，这篇文章是一个不错的指南。
王佛伟

2023-06-20

我之前看了许多文章，但这篇是最能直接解决我的问题的。
光与火花

2023-06-20

这篇文件帮助我更好地理解了Pytorch中的模型结构，很实用。
宏馨

2023-06-20

我个人认为这篇文章的示例很实用，让我快速定位到了自己的问题。
禁忌的爱

2023-06-20

作者讲解的很详细，适合初学者掌握。

前往

页

weixin_38544152

粉丝: 4

pytorch中获取模型input/output shape实例

Pytorch 卷积中的 Input Shape用法

Pytorch模型训练实用教程

基于python指定包的安装路径方法

图像还原工具箱（PyTorch）。 USRNet，DnCNN，FFDNet，SRMD，DPSR，MSRResNet，ESRGAN，IMDN的培训和测试代码-Python开发

PyTorch实现的图像恢复/去噪工具箱-python

pytorch查看通道数 维数 尺寸大小方式

pytorch qat 2////////////

基于python+pytorch和ResNet模型的文字/非文字场景图像快速分类系统，具有多引擎爬虫功能+友好的GUI界面+源码（毕业设计&课程设计&项目开发）

pytorch-tutorial-master.zip

pytorch-0.4.1

Pytorch 模型训练实用教程 代码免费下载

基于pytorch和bert模型的中文新闻文本分类项目源码.zip

基于Pytorch的UNet语义分割模型与代码

ResNet代码（超详细注释）+数据集，pytorch实现

基于pytorch的中文语言模型预训练模型源码

使用LSTM实现C-MAPSS数据集里面的剩余寿命预测（Pytorch）

pytorch resnet 101 模型参数数据

基于pytorch的谷歌自然语言处理模型BERT代码实现

xception pytorch 预训练模型.zip

pytorch vit base 16 预训练模型

60分钟入门pytorch

d2l-zh-pytorch.pdf

基于Pytorch声纹识别模型EcapaTdnn全部模型参数文件

在Pytorch版本中生成模型的集合

基于PyTorch和Transformer模型进行中文文本分类项目源码+文档说明（高分项目）.zip

PyTorch模型部署实例

pytorch 模型可视化的例子

java jna 调用pytorch c++模型推理

你好，你好。

matlab中存档算法代码-BayesChemEng:贝叶斯推断适用于昂贵的实验，具有多个不同的设置（设计变量）且重复次数很少。侧重于化学工程

最新资源

pytorch查看通道数维数尺寸大小方式

Pytorch 模型训练实用教程代码免费下载