Skip to content

Commit

Permalink
fix vst1.f32 of coeff sum at eltwise_arm layer
Browse files Browse the repository at this point in the history
In line 414:  "vmla.f32   q1, q0, %q6         \n",        destination register is q1 instead of q0,      So, replace the {d0-d1} of line 416 with {d2-d3}.
  • Loading branch information
hxbloom authored and nihui committed Mar 11, 2018
1 parent 57f89a0 commit 28154dc
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion src/layer/arm/eltwise_arm.cpp
Original file line number Diff line number Diff line change
Expand Up @@ -413,7 +413,7 @@ int Eltwise_arm::forward(const std::vector<Mat>& bottom_blobs, std::vector<Mat>&
"vld1.f32 {d2-d3}, [%2 :128] \n"
"vmla.f32 q1, q0, %q6 \n"
"subs %0, #1 \n"
"vst1.f32 {d0-d1}, [%2 :128]! \n"
"vst1.f32 {d2-d3}, [%2 :128]! \n"
"bne 0b \n"
: "=r"(nn), // %0
"=r"(ptr), // %1
Expand Down

0 comments on commit 28154dc

Please sign in to comment.