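1. Background
A 0-size Tensor is a Tensor whose shape contains at least one dimension equal to 0. Such a Tensor still carries shape, dtype, and place information, but its element count is 0. 0-size Tensors appear frequently in certain workloads, such as slicing. Paddle's current support for 0-size Tensors is very weak and needs systematic investigation and improvement.
By taking part in this activity, you will learn how Paddle's operator library is designed and how Paddle's dynamic graph executes end to end, and you will gain a deep understanding of how a deep learning framework handles 0-size Tensors. You may also touch on Paddle's composite-operator mechanism and the compiler's symbolic shape inference mechanism. Dedicated staff will answer any questions you run into.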
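To make the definition concrete, here is a minimal sketch in plain Python that computes the element count implied by a shape. The helper names `numel` and `is_zero_size` are illustrative only and are not Paddle APIs; the sketch assumes shapes are given as tuples of non-negative integers.

```python
from math import prod


def numel(shape):
    """Element count implied by a shape; the empty shape () is a scalar with 1 element."""
    return prod(shape)


def is_zero_size(shape):
    """A shape is 0-size when at least one of its dimensions is 0."""
    return any(dim == 0 for dim in shape)


# A 0-size shape still has a well-defined rank and dimensions, but no elements.
for shape in [(2, 3), (2, 0, 3), (0,), ()]:
    print(shape, "numel =", numel(shape), "zero-size =", is_zero_size(shape))
```

Note that a scalar shape `()` is not 0-size: it has no dimensions, but it holds one element.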
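2. Task description
This open-source task improves support for 0-size Tensors in a set of Paddle APIs. When these APIs receive a 0-size Tensor today, they fail in ways such as incorrect numerical results, core dumps, CUDA errors, or raised exceptions. In most cases the cause is a kernel that does not account for the 0-size case, although the API's shape inference can also be involved; other causes need case-by-case analysis. The APIs to be fixed are listed in the table below.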
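As a rough illustration of the two causes named above (a kernel that ignores the 0-size case, and output shape inference), the following pure-Python sketch infers a broadcast output shape and returns early when that shape holds no elements. `broadcast_shape` and `elementwise_add` are hypothetical stand-ins, not Paddle functions, and the early return is one common way to guard a kernel rather than the fix used in any particular PR. The broadcasting rule shown is the NumPy-style rule, which is assumed here.

```python
from math import prod


def broadcast_shape(a, b):
    """NumPy-style broadcast of two shapes; a dimension of 0 is an ordinary size."""
    ndim = max(len(a), len(b))
    a = (1,) * (ndim - len(a)) + tuple(a)
    b = (1,) * (ndim - len(b)) + tuple(b)
    out = []
    for da, db in zip(a, b):
        if da == db or db == 1:
            out.append(da)
        elif da == 1:
            out.append(db)
        else:
            raise ValueError(f"cannot broadcast {a} with {b}")
    return tuple(out)


def elementwise_add(x, x_shape, y, y_shape):
    """Hypothetical add kernel over flat row-major lists."""
    out_shape = broadcast_shape(x_shape, y_shape)
    # 0-size guard: there is nothing to compute, but the output shape
    # must still be inferred correctly and returned.
    if prod(out_shape) == 0:
        return [], out_shape
    # Sketch limitation: the non-empty path only handles equal shapes.
    if tuple(x_shape) != tuple(y_shape):
        raise NotImplementedError("sketch only computes same-shape inputs")
    return [p + q for p, q in zip(x, y)], out_shape


print(elementwise_add([], (0, 3), [1.0, 2.0, 3.0], (1, 3)))
print(elementwise_add([1, 2], (2,), [3, 4], (2,)))
```

The point of the guard is that the output shape is computed before any element work, so a 0-size input yields an empty result of the right shape instead of reaching code that assumes at least one element.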
Important
Difficulty of each task: 0.025×🌟
A walkthrough of the tasks is available in the recording: https://meeting.tencent.com/crm/l59EWmRZc4 (00:16:00~00:36:00)
For a detailed introduction, see:
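- Sign-up format: 【报名】:2、3. Separate multiple tasks with the Chinese enumeration comma (、); a run of consecutive tasks can be written with a hyphen, for example 2-5.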
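- PR submission format: start the PR title with 【0-size Tensor Job2 No.xxx】, where xxx is the task number. The title must include Job2 so that it can be told apart from the first batch of tasks.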
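The two formats above can be checked mechanically. The sketch below is illustrative only: `parse_signup` and `pr_task_field` are hypothetical helper names, the upper bound of 106 is taken from the size of the task table, and accepting either a full-width or an ASCII colon after 【报名】 is an assumption.

```python
import re

TASK_COUNT = 106  # number of rows in the task table


def parse_signup(comment):
    """Turn a sign-up comment such as '【报名】:2、3' or '【报名】:2-5' into task numbers."""
    match = re.fullmatch(r"【报名】[::]\s*(.+)", comment.strip())
    if match is None:
        raise ValueError("comment does not follow the sign-up format")
    tasks = set()
    for part in match.group(1).split("、"):
        part = part.strip()
        if "-" in part:
            # A hyphen denotes an inclusive run of consecutive tasks.
            first, last = (int(p) for p in part.split("-"))
            tasks.update(range(first, last + 1))
        else:
            tasks.add(int(part))
    if not all(1 <= t <= TASK_COUNT for t in tasks):
        raise ValueError("task number out of range")
    return sorted(tasks)


# The title must start with the bracketed prefix and must contain 'Job2'.
PR_TITLE = re.compile(r"^【0-size Tensor Job2 No\.([^】]+)】")


def pr_task_field(title):
    """Return the text after 'No.' in a valid PR title prefix, or None if it is invalid."""
    match = PR_TITLE.match(title)
    return match.group(1) if match else None


print(parse_signup("【报名】:2、3"))
print(parse_signup("【报名】:2-5"))
print(pr_task_field("【0-size Tensor Job2 No.47】fix paddle.max"))
```

A title that omits Job2 is rejected, which is the case the submission rule warns about.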
No. | API | Assignee/Status | CPU/GPU/testcase |
---|---|---|---|
1 | paddle.add | @co63oc | gpu |
2 | paddle.add_n | @co63oc | gpu,cpu |
3 | paddle.bincount | @co63oc | gpu,cpu |
4 | paddle.broadcast_tensors | @co63oc | gpu |
5 | paddle.cdist | @co63oc | gpu,cpu |
6 | paddle.crop | @co63oc | gpu,cpu |
7 | paddle.cumprod | @co63oc | gpu |
8 | paddle.divide | @co63oc | gpu |
9 | paddle.dot | @co63oc | gpu,cpu |
10 | paddle.equal_all | @co63oc | gpu |
11 | paddle.expand_as | @zhengshengning @co63oc | gpu,cpu |
12 | paddle.full | @co63oc | gpu,cpu |
13 | paddle.gammainc | @co63oc | gpu |
14 | paddle.gammaincc | @co63oc | gpu |
15 | paddle.gather | @co63oc | gpu,cpu |
16 | paddle.gather_nd | @co63oc | gpu,cpu |
17 | paddle.geometric.segment_max | @co63oc | gpu,cpu |
18 | paddle.geometric.segment_mean | @co63oc | gpu,cpu |
19 | paddle.geometric.segment_min | @co63oc | gpu,cpu |
20 | paddle.geometric.segment_sum | @co63oc | gpu,cpu |
21 | paddle.geometric.send_u_recv | @zhengshengning | gpu,cpu |
22 | paddle.geometric.send_ue_recv | @zhengshengning | gpu,cpu |
23 | paddle.geometric.send_uv | @zhengshengning | gpu,cpu |
24 | paddle.histogram | @co63oc | gpu,cpu |
25 | paddle.increment | @co63oc | gpu,cpu |
26 | paddle.incubate.nn.functional.fused_bias_dropout_residual_layer_norm | @co63oc | gpu |
27 | paddle.incubate.nn.functional.fused_linear | @co63oc | gpu |
28 | paddle.incubate.nn.functional.fused_linear_activation | @co63oc | gpu |
29 | paddle.incubate.nn.functional.fused_multi_head_attention | @co63oc | gpu |
30 | paddle.incubate.nn.functional.swiglu | @co63oc | gpu,cpu |
31 | paddle.incubate.segment_max | @co63oc | gpu,cpu |
32 | paddle.incubate.segment_mean | @co63oc | gpu,cpu |
33 | paddle.incubate.segment_min | @co63oc | gpu,cpu |
34 | paddle.incubate.segment_sum | @co63oc | gpu,cpu |
35 | paddle.incubate.softmax_mask_fuse_upper_triangle | @co63oc | gpu,cpu |
36 | paddle.index_add | @co63oc | gpu,cpu |
37 | paddle.index_put | @co63oc | gpu,cpu |
38 | paddle.index_sample | @co63oc | gpu,cpu |
39 | paddle.index_select | @co63oc | gpu,cpu |
40 | paddle.linalg.cholesky | @co63oc | gpu,cpu |
41 | paddle.linalg.eigh | @co63oc | gpu,cpu |
42 | paddle.linalg.eigvals | @co63oc | gpu,cpu |
43 | paddle.linalg.lu | @zhengshengning | gpu,cpu |
44 | paddle.linalg.lu_unpack | @co63oc | gpu,cpu |
45 | paddle.linalg.svdvals | @co63oc | gpu,cpu |
46 | paddle.masked_scatter | @co63oc | gpu,cpu |
47 | paddle.max | @co63oc | gpu,cpu |
48 | paddle.median | @co63oc | gpu,cpu |
49 | paddle.min | @co63oc | gpu,cpu |
50 | paddle.mode | @co63oc | gpu,cpu |
51 | paddle.multiplex | @co63oc | gpu,cpu |
52 | paddle.multiply | @co63oc | gpu |
53 | paddle.nn.functional.adaptive_log_softmax_with_loss | @co63oc | gpu,cpu |
54 | paddle.nn.functional.binary_cross_entropy | @co63oc | gpu,cpu |
55 | paddle.nn.functional.cross_entropy | @co63oc | gpu,cpu |
56 | paddle.nn.functional.dice_loss | @co63oc | gpu,cpu |
57 | paddle.nn.functional.embedding | @co63oc | gpu,cpu |
58 | paddle.nn.functional.gather_tree | @co63oc | gpu,cpu |
59 | paddle.nn.functional.gaussian_nll_loss | @co63oc | gpu,cpu |
60 | paddle.nn.functional.instance_norm | @co63oc | gpu,cpu |
61 | paddle.nn.functional.log_loss | @co63oc | gpu,cpu |
62 | paddle.nn.functional.margin_cross_entropy | @co63oc | gpu,cpu |
63 | paddle.nn.functional.nll_loss | @zhengshengning @co63oc | gpu,cpu |
64 | paddle.nn.functional.one_hot | @co63oc | gpu |
65 | paddle.nn.functional.smooth_l1_loss | @co63oc | gpu |
66 | paddle.nn.functional.softmax_with_cross_entropy | @co63oc | gpu,cpu |
67 | paddle.nn.functional.upsample | @co63oc | gpu,cpu |
68 | paddle.pow | @co63oc | gpu |
69 | paddle.prod | @co63oc | gpu,cpu |
70 | paddle.put_along_axis | @co63oc | gpu,cpu |
71 | paddle.scale | @co63oc | gpu,cpu |
72 | paddle.scatter | @co63oc | gpu,cpu |
73 | paddle.scatter_nd | @co63oc | gpu,cpu |
74 | paddle.scatter_nd_add | @co63oc | gpu,cpu |
75 | paddle.shard_index | @co63oc | gpu,cpu |
76 | paddle.slice_scatter | @co63oc | cpu |
77 | paddle.squeeze | @co63oc | gpu,cpu |
78 | paddle.sum | @co63oc | gpu,cpu |
79 | `paddle.Tensor.__add__` | @co63oc | gpu |
80 | `paddle.Tensor.__mul__` | @co63oc | gpu |
81 | `paddle.Tensor.__truediv__` | @co63oc | gpu |
82 | paddle.Tensor.add | @co63oc | gpu |
83 | paddle.Tensor.cumprod | @co63oc | gpu |
84 | paddle.Tensor.gather | @co63oc | gpu,cpu |
85 | paddle.Tensor.gather_nd | @co63oc | gpu,cpu |
86 | paddle.Tensor.index_select | @co63oc | gpu,cpu |
87 | paddle.Tensor.masked_fill | @co63oc | gpu |
88 | paddle.Tensor.max | @co63oc | gpu,cpu |
89 | paddle.Tensor.mean | @zhengshengning | gpu,cpu |
90 | paddle.Tensor.min | @co63oc | gpu,cpu |
91 | paddle.Tensor.multiply | @co63oc | gpu |
92 | paddle.Tensor.pow | @co63oc | gpu |
93 | paddle.Tensor.prod | @co63oc | gpu,cpu |
94 | paddle.Tensor.put_along_axis | @co63oc | gpu,cpu |
95 | paddle.Tensor.real | @co63oc | gpu,cpu |
96 | paddle.Tensor.reshape | @zhengshengning | gpu,cpu |
97 | paddle.Tensor.set_ | @co63oc | gpu,cpu |
98 | paddle.Tensor.sum | @co63oc | gpu,cpu |
99 | paddle.Tensor.take_along_axis | @zhengshengning | gpu,cpu |
100 | paddle.topk | @co63oc | gpu,cpu |
101 | paddle.unfold | @zhengshengning | gpu |
102 | paddle.vander | @co63oc | gpu,cpu |
103 | paddle.vision.ops.deform_conv2d | @co63oc | gpu,cpu |
104 | paddle.vision.ops.distribute_fpn_proposals | @co63oc | gpu,cpu |
105 | paddle.vision.ops.roi_align | @co63oc | gpu,cpu |
106 | paddle.vision.ops.roi_pool | @zhengshengning | gpu,cpu |
Dashboard
Task direction | Number of tasks | Submissions / Claimed | Submission rate | Completed | Completion rate |
---|---|---|---|---|---|
0-size Tensor support tasks, batch 2 | 106 | 106 / 106 | 100.0% | 106 | 100.0% |
Statistics
Listed in no particular order: @co63oc (96) @zhengshengning (10)