intrinsics
Intrinsics program (SSE) - g++ - help needed
This is the first time I am posting a question on stackoverflow, so please try and overlook any errors I may have made in formatting my question/code. But please do point the same out to me so I may b[详细]
2023-01-01 06:09 分类:问答Stack usage with MMX intrinsics and Microsoft C++
I have an inline assembler loop that cumulatively adds elements from an int32 data array with MMX instructions. In particular, it uses the fact that the MMX registers can accommodate 16 int32s to calc[详细]
2022-12-31 03:34 分类:问答Intel AVX intrinsics: any compatibility library out?
Are there any Intel AVX intrinsics library out? I\'m looking for something similar as \'sse2mmx.h\' header which fall-backs to MMX intrinsics if SSE2 integer intrinsics are not available on compile ti[详细]
2022-12-28 06:40 分类:问答does passing __m128i objects by reference to inline function cause these objects to be moved to stack?
I\'m writin开发者_JAVA百科g transpose function for 8x16bit vectors with SSE2 intrinsics. Since there are 8 arguments for that function (a matrix of 8x8x16bit size), I can\'t do anything but pass them[详细]
2022-12-28 00:21 分类:问答How does _mm_mwait work?
How does _mm_mwait from pmmintrin.h work? (I mean not the asm for it, but action and how this action is taken in NUMA systems. The store monitoring is easy to implement only on bus-based SMP systems w[详细]
2022-12-26 00:25 分类:问答Equivalent of InterlockedIncrement in Linux/gcc
It would be a very simple question (could be duplicated), but I was unable to find it. Win32 API provides a very开发者_StackOverflow handy set of atomic operations (as intrinsics) such as Interlocked[详细]
2022-12-17 03:54 分类:问答VC++ SSE intrinsic optimisation weirdness
I am performing a scattered read of 8-bit data from a file (De-Interleaving a 64 channel wave file).I am then combining them to be a single stream of bytes.The problem I\'m having is with my re-constr[详细]
2022-12-15 23:25 分类:问答x86 max/min asm instructions?
Are there any asm instructions that can speed up computation of min/max of vector of doubles/integers on Core i7 architecture?[详细]
2022-12-15 11:01 分类:问答How do I replace __asm jno no_oflow with an intristic in a VS2008 64bit build?
I have this code: __asm jno no_oflow overflow = 1; __asm no_oflow: It produces this nice warning: error C4235: nonstandard extension used : \'__asm\' keyword not supported on this architecture[详细]
2022-12-14 03:13 分类:问答C# fast crc32 calculation :
I\'ve profiled my application with Ants and found out that > 10% is in CRC32 calculations. (The CRC32-calculation is done in plain C#)[详细]
2022-12-13 14:54 分类:问答