PyTorch Day by Day 2: Tensor Operations

1. Indexing and Slicing

import torch

# dim 0 comes first
a = torch.rand(4, 3, 28, 28)
a[0].shape     # indexes dim 0: takes out the whole first image -> torch.Size([3, 28, 28])
a[0, 0].shape  # channel 0 of image 0 -> torch.Size([28, 28])
a[0, 0, 2, 4]  # indexing every dim yields a 0-dim scalar tensor

Python's slicing syntax carries over: start:end:step

# slicing along each dimension
# select first/last N
a = torch.rand(4, 3, 28, 28)
a[:2].shape                    # torch.Size([2, 3, 28, 28])
a[:2, :1, :, :].shape          # torch.Size([2, 1, 28, 28])
a[:2, 1:, :, :].shape          # torch.Size([2, 2, 28, 28])
a[:2, -1:, :, :].shape         # torch.Size([2, 1, 28, 28])
a[:, :, 0:28:2, 0:28:2].shape  # sample every other pixel -> torch.Size([4, 3, 14, 14])
a[:, :, ::2, ::2].shape        # same as above

Sampling with an explicit index tensor:

a = torch.rand(4, 3, 28, 28)
a.index_select(0, torch.tensor([0, 2])).shape  # torch.Size([2, 3, 28, 28])
a.index_select(1, torch.tensor([1, 2])).shape  # torch.Size([4, 2, 28, 28])
a.index_select(2, torch.arange(28)).shape      # torch.Size([4, 3, 28, 28])
a.index_select(2, torch.arange(8)).shape       # torch.Size([4, 3, 8, 28])


The special slice symbol ... is a convenience: it stands in for as many full-slice : as needed.

a.shape             # torch.Size([4, 3, 28, 28])
a[...].shape        # torch.Size([4, 3, 28, 28])
a[0, ...].shape     # torch.Size([3, 28, 28])
a[:, 1, ...].shape  # torch.Size([4, 28, 28])
a[..., :2].shape    # torch.Size([4, 3, 28, 2])
# when ... appears, indices written after it apply to the rightmost dims

Selecting by mask: index by the True entries of a boolean mask.

t = torch.randint(0, 9, size=(3, 3))
mask = t.ge(5)  # boolean mask, True where the element is >= 5 (similar to MATLAB logical indexing)
t_select = torch.masked_select(t, mask)  # returns the selected values as a flattened 1-D tensor
print(t)
print(mask)
print(t_select)


Select by flattened index (torch.take)

src = torch.tensor([[4, 3, 5], [6, 7, 8]])
torch.take(src, torch.tensor([0, 2, 5]))  # flattens first, then indexes -> tensor([4, 5, 8])

2. Dimension Transformations


View/reshape operations

t = torch.randperm(8)
t_reshape = torch.reshape(t, (2, 4))
print("t:{}\nt_reshape:\n{}".format(t, t_reshape))
t[0] = 1024
# when possible, the reshaped tensor shares memory with the original,
# so modifying one modifies the other
print("t:{}\nt_reshape:\n{}".format(t, t_reshape))
print("address of t.data: {}".format(id(t.data)))
print("address of t_reshape.data: {}".format(id(t_reshape.data)))


a = torch.rand(4, 1, 28, 28)
# a view must keep a sensible physical meaning; otherwise the
# dimension information is lost and the data becomes scrambled
a.view(4, 28*28)
a.view(4, 28*28).shape     # torch.Size([4, 784]): each image flattened to one row
a.view(4*28, 28).shape     # torch.Size([112, 28]): only rows matter now
a.view(4*1, 28, 28).shape  # torch.Size([4, 28, 28])
# the main pitfall:
b = a.view(4, 784)
b.view(4, 28, 28, 1)       # logically wrong: the original [N, C, H, W] layout information is lost
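
A side note not in the original notes, as a minimal sketch: view requires contiguous memory, while reshape falls back to copying when a view is impossible:

x = torch.rand(4, 3)
y = x.t()                 # transposing makes the tensor non-contiguous
# y.view(12)              # would raise a RuntimeError: view needs contiguous memory
y.reshape(12)             # works: reshape copies when it cannot return a view
y.contiguous().view(12)   # equivalent: make a contiguous copy first, then view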

Squeeze/unsqueeze

squeeze: removes dimensions (axes) of length 1.

  • torch.squeeze(input, dim=None, out=None)

  • dim: if None, all axes of length 1 are removed; if a dim is given, it is removed only when its length is 1.

Note: for unsqueeze, the valid index range is [-a.dim()-1, a.dim()+1).

t = torch.rand((1, 2, 3, 1))
t_sq = torch.squeeze(t)        # torch.Size([2, 3])
t_0 = torch.squeeze(t, dim=0)  # torch.Size([2, 3, 1])
t_1 = torch.squeeze(t, dim=1)  # torch.Size([1, 2, 3, 1]): dim 1 has length 2, unchanged
print(t.shape, t_sq.shape, t_0.shape, t_1.shape)


  • torch.unsqueeze(input, dim)
  • Purpose: inserts a dimension of length 1 at position dim (dim is required).

A practical use: adding a bias term during image processing:

b = torch.rand(32)
f = torch.rand(4, 32, 14, 14)
b = b.unsqueeze(1).unsqueeze(2).unsqueeze(0)
b.shape  # torch.Size([1, 32, 1, 1]): ready to broadcast against f


squeeze(index): when no index is given, every length-1 dimension is squeezed out.

b.shape              # torch.Size([1, 32, 1, 1])
b.squeeze().shape    # torch.Size([32])
b.squeeze(0).shape   # torch.Size([32, 1, 1])
b.squeeze(-1).shape  # torch.Size([1, 32, 1])
b.squeeze(1).shape   # torch.Size([1, 32, 1, 1]): dim 1 has length 32, unchanged

Expand/repeat

Expand: broadcasting (no data is copied)

Repeat: memory is copied

a = torch.rand(4, 32, 14, 14)
b.shape                         # torch.Size([1, 32, 1, 1])
b.expand(4, 32, 14, 14).shape   # torch.Size([4, 32, 14, 14])
b.expand(-1, 32, -1, -1).shape  # -1 means "keep this dim as is"
b.expand(-1, 32, -1, -4).shape  # invalid: only -1 is special; any other negative size is an error
b.shape
# each argument is the number of copies along that dim
b.repeat(4, 32, 1, 1).shape   # torch.Size([4, 1024, 1, 1])
b.repeat(4, 1, 1, 1).shape    # torch.Size([4, 32, 1, 1])
b.repeat(4, 1, 32, 32).shape  # torch.Size([4, 32, 32, 32])
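
A small check of the memory behavior (a sketch; it assumes b still has shape [1, 32, 1, 1] from above):

e = b.expand(4, 32, 14, 14)
r = b.repeat(4, 1, 14, 14)
e.stride()        # (0, 1, 0, 0): the expanded dims have stride 0, so no data is copied
r.stride()        # strides of a freshly allocated buffer: repeat really copies
e.shape, r.shape  # both torch.Size([4, 32, 14, 14])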

Transpose operation

t = torch.rand((2, 3, 4))
t_transpose = torch.transpose(t, dim0=1, dim1=2)  # swaps dims 1 and 2
print(t.shape)            # torch.Size([2, 3, 4])
print(t_transpose.shape)  # torch.Size([2, 4, 3])


3. Broadcasting (Automatic Expansion)

Expand: without copying data

Key idea

  • Insert a dim of size 1 in front; expand dims of size 1 to the target size

  • Feature maps: [4, 32, 14, 14]; bias: [32, 1, 1] => [1, 32, 1, 1] => [4, 32, 14, 14]

Automatic expansion: first insert leading dims of size 1, then expand the data along the size-1 dims.

Is it broadcasting-able?

▪ Match from the last dim backwards!

▪ If the current dim has size 1, expand it to the matching size

▪ If one side is missing a dim, insert a size-1 dim, then expand to the matching size

▪ Otherwise, NOT broadcastable

The trailing (small) dims must follow the rules above; the leading (large) dims are unconstrained.
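
A minimal sketch of these rules in action, reusing the feature-map/bias shapes from above:

f = torch.rand(4, 32, 14, 14)  # feature maps
bias = torch.rand(32, 1, 1)    # one bias per channel
(f + bias).shape               # bias -> [1, 32, 1, 1] -> [4, 32, 14, 14]
# torch.rand(4, 32, 14, 14) + torch.rand(2, 14, 14) would fail:
# at the channel dim 2 != 32 and neither side is 1, so it is not broadcastable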

4. Concatenation and Splitting

Merge or split

Cat / Stack / Split / Chunk

cat: every dim except the concatenation dim must be equal.

a=torch.rand(4,32,8)
b=torch.rand(5,32,8)
torch.cat([a,b],dim=0).shape
#torch.Size([9, 32, 8])


stack: creates a new dim and stacks the tensors along it.

a=torch.rand(4,32,8,8)
b=torch.rand(4,32,8,8)
torch.stack([a,b],dim=2).shape
#torch.Size([4, 32, 2, 8, 8])


chunk: splits a tensor into a given number of chunks along dim; returns a list of tensors.

Note: if the dim size is not evenly divisible, the last chunk is smaller than the others.

a = torch.ones((2, 5))
list_of_tensors = torch.chunk(a, dim=1, chunks=2)
for idx, chunk in enumerate(list_of_tensors):
    print("tensor {}: {}, shape is {}".format(idx + 1, chunk, chunk.shape))


split: splits a tensor along dim; returns a list of tensors.

torch.split(tensor, split_size_or_sections, dim=0)

split_size_or_sections: an int gives the length of each piece; a list splits into pieces with the listed lengths.

t = torch.ones((2, 5))
list_of_tensors1 = torch.split(t, 2, dim=1)
for idx, piece in enumerate(list_of_tensors1):  # loop variable renamed so t is not shadowed
    print("tensor {}: {}, shape is {}".format(idx + 1, piece, piece.shape))
list_of_tensors2 = torch.split(t, [2, 1, 2], dim=1)
# the list entries must sum to the length of the chosen dim
for idx, piece in enumerate(list_of_tensors2):
    print("tensor {}: {}, shape is {}".format(idx + 1, piece, piece.shape))


5. Math Operations

torch.add()                   torch.addcdiv()
torch.addcmul()               torch.sub()
torch.div()                   torch.mul()
torch.log(input, out=None)    torch.log10(input, out=None)
torch.log2(input, out=None)   torch.exp(input, out=None)
torch.pow()                   torch.abs(input, out=None)
torch.acos(input, out=None)   torch.cosh(input, out=None)
torch.cos(input, out=None)    torch.asin(input, out=None)
torch.atan(input, out=None)   torch.atan2(input, other, out=None)
# matrix multiplication
torch.mm(a, b)      # 2-D matrices only
torch.matmul(a, b)  # supports higher-dim (batched) inputs
a @ b               # operator form of matmul
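
A quick illustration of the 2-D vs. higher-dim behavior (a sketch; the shapes are arbitrary):

x = torch.rand(28, 64)
y = torch.rand(64, 32)
torch.mm(x, y).shape      # torch.Size([28, 32]); mm accepts only 2-D inputs
a = torch.rand(4, 3, 28, 64)
b = torch.rand(4, 3, 64, 32)
torch.matmul(a, b).shape  # torch.Size([4, 3, 28, 32]): batched over the leading dims
(a @ b).shape             # same result via the @ operator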

For convenience in deep learning, PyTorch also ships some fused built-ins:

torch.add(input, other, alpha=1, out=None)  # element-wise: input + alpha × other

torch.addcdiv(): computes input + value × tensor1 / tensor2

torch.addcmul(): computes input + value × tensor1 × tensor2

torch.addcmul(input, tensor1, tensor2, value=1, out=None)
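
A short worked example of both fused ops (values chosen only for illustration):

x = torch.ones(2, 2)
t1 = torch.full((2, 2), 6.0)
t2 = torch.full((2, 2), 2.0)
torch.addcmul(x, t1, t2, value=0.5)  # 1 + 0.5 * 6 * 2 = 7.0 everywhere
torch.addcdiv(x, t1, t2, value=0.5)  # 1 + 0.5 * 6 / 2 = 2.5 everywhere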
t_0 = torch.randn((3, 3))
t_1 = torch.ones_like(t_0)
t_add = torch.add(t_0, t_1, alpha=10)  # t_0 + 10 * t_1 (keyword form; the old positional alpha is deprecated)
print(t_0)
print(t_1)
print(t_add)
