sse
Data not aligned correctly in Visual Studio if run in debugger
I\'ve been working with SSE for a while now, and I\'ve seen my share of alignment issues. This, however, is beyond my understanding:[详细]
2023-02-15 08:07 分类:问答Array of sse type: Segmentation Fault
today I tried to initialize an array of the sse type __m128d. Unfortunately it didn\'t work - why? Is it generally impossible to create arrays of sse types (since they are register types?). The follow[详细]
2023-02-15 01:15 分类:问答Help me improve some more SSE2 code
I am looking for some help to improve this bilinear scaling sse2 code on core2 cpus On my Atom N270 and on an i7 this code is about 2x faster than the mmx code.But under core2 cpus it is only equal t[详细]
2023-02-13 07:58 分类:问答Alignment of vectors in LLVM's amd64 output
I\'m trying to use vectors inside structs with LLVM. I have the following C definition of my struct: struct Foo[详细]
2023-02-12 03:21 分类:问答Most performant way to subtract one array from another
I have the following code which is the bottleneck in one part of my application. All I do is subtract on Array from another. Both of these arrays have more around 100000 elements. I\'m trying to find[详细]
2023-02-11 19:24 分类:问答Load constant floats into SSE registers
I\'m trying to figure out an efficient way to load compile time constant floats into SSE(2/3) registers. I\'ve tried doing simple code like this,[详细]
2023-02-11 09:54 分类:问答G++ SSE memory alignment on the stack
I am attempting to re-write a raytracer using Streaming SIMD Extensions.My original raytracer used inline assembly and movups instructions to load data into the xmm registers.I have read that compiler[详细]
2023-02-10 08:44 分类:问答Using struct as SSE vector type in gcc?
Is it possible in GCC to use a struct or class as a vector type for SSE instructions? something like: typedef struct vfloat __attribute__((vector_size(16))) {[详细]
2023-02-09 12:14 分类:问答Building sse switches for GCC from /proc/cpuinfo
I\'ve got a Makefile that\'s I\'d like to parse the flags in /proc/cpuinfo and build up a list of available sse instruction sets to pass to gcc (-msse -msse2, e开发者_如何学编程tc).This is the best I\[详细]
2023-02-09 07:47 分类:问答SSE best way to set register to 0.0's and 1.0's?
I am doing some sse vector3 math. Generally, I set the 4th digit of my vector to 1.0f, as this makes most of my math work, but sometimes I need to set it to 0.0f.[详细]
2023-02-08 20:21 分类:问答