Opcode | Encoding | 16-bit | 32-bit | 64-bit | CPUID Feature Flag(s) | Description |
---|---|---|---|---|---|---|
66 0F 3A 0C /r ib BLENDPS xmm1, xmm2/m128, imm8 | rmi | Invalid | Valid | Valid | sse4.1 | Select packed single-precision floating-point values from xmm1 and xmm2/m128 using a mask specified in imm8. Store the result in xmm1. |
VEX.128.66.0F3A.WIG 0C /r ib VBLENDPS xmm1, xmm2, xmm3/m128, imm8 | rvmi | Invalid | Valid | Valid | avx | Select packed single-precision floating-point values from xmm2 and xmm3/m128 using a mask specified in imm8. Store the result in xmm1. |
VEX.256.66.0F3A.WIG 0C /r ib VBLENDPS ymm1, ymm2, ymm3/m256, imm8 | rvmi | Invalid | Valid | Valid | avx | Select packed single-precision floating-point values from ymm2 and ymm3/m128 using a mask specified in imm8. Store the result in ymm1. |
Encoding
Encoding | Operand 1 | Operand 2 | Operand 3 | Operand 4 |
---|---|---|---|---|
rmi | ModRM.reg[rw] | ModRM.r/m[r] | imm8 | |
rvmi | ModRM.reg[w] | VEX.vvvv[r] | ModRM.r/m[r] | imm8 |
Description
The (V)BLENDPS
instruction conditionally moves single-precision floating-point values from the two source operands. From the control byte, each bit, if cleared, will move from the first source operand, and, if set, will move from the second source operand. The result is stored in the destination operand.
This instruction, despite being named as if it operates on floating-point numbers, will work on 32-bit integers as well.
All forms except the legacy SSE one will zero the upper (untouched) bits.
Operation
public void BLENDPS(SimdF32 dest, SimdF32 src, U8 mask)
{
// if `mask.Bit[n]` is 0, `dest` will be copied into itself (i.e. nothing happens)
if (mask.Bit[0])
dest[0] = src[0];
if (mask.Bit[1])
dest[1] = src[1];
if (mask.Bit[2])
dest[2] = src[2];
if (mask.Bit[3])
dest[3] = src[3];
// dest[4..] is unmodified
}
void VBLENDPS_Vex(SimdF32 dest, SimdF32 src1, SimdF32 src2, U8 mask, int kl)
{
for (int n = 0; n < kl; n++)
dest[n] = mask.Bit[n] ? src2[n] : src1[n];
dest[kl..] = 0;
}
public void VBLENDPS_Vex128(SimdF32 dest, SimdF32 src1, SimdF32 src2, U8 mask) =>
VBLENDPS_Vex(dest, src1, src2, mask, 4);
public void VBLENDPS_Vex256(SimdF32 dest, SimdF32 src1, SimdF32 src2, U8 mask) =>
VBLENDPS_Vex(dest, src1, src2, mask, 8);
Intrinsics
__m128d _mm_blend_ps(__m128d v1, __m128d v2, const uint32_t mask)
__m256d _mm256_blend_ps(__m256d a, __m256d b, const uint32_t mask)
Exceptions
SIMD Floating-Point
None.Other Exceptions
VEX Encoded Form: See Type 4 Exception Conditions.