02-10-2007 12:13 PM
WARNING: PDF DOCUMENTPage 13: "Dual 128-bit SSE dataflow, Dual 128-bit loads per cycle"Page 14: "128-bit SSE and 128-bit Loads, 128b FADD, 128 bit FMUL, 128b SSE, 128b SSE"etc etc etc
02-11-2007 12:10 PM
Page 17: "128-bit SSE and 128-bit Loads, 128b FADD, 128b FMUL, 128b SSE, 128b SSE"
02-12-2007 10:56 AM
02-13-2007 02:16 AM
Hmmm, this is highly special and I don't see LabVIEW adding this just now. Yes The LabVIEW extended floating point potentially would have the ability to be adapted to that but currently makes use of the extended floating point format implemented in all x86 FPUs since about 386. It does use 10 or 12 bytes instead of 16. The flattened format for extended precision floats would however provide space for 16 bytes as that was necessary for some other CPU architectures.
@tarheel_hax0r wrote:
Bump for a Monday morning - anybody know anything about the capabilities of this "SSE 128" circuitry?"
02-13-2007
10:46 AM
- last edited on
05-05-2025
09:13 AM
by
Content Cleaner
02-13-2007 05:08 PM
AMD Details Native Quad-core Design Features for Breakthrough Performance and Advanced Power EfficienciesFebruary 12, 2007...High-performance computing (HPC) applications can benefit tremendously from a doubling of Barcelona’s floating-point execution pipeline to 128-bit width, which includes an AMD-only doubling of instruction and data delivery capabilities...Quad-Core AMD Opteron Processors are expected to be available in mid-2007...
02-14-2007 01:00 AM
@tarheel_hax0r wrote:
This is really maddening; to date, the only thing I can find at AMD's website is this single sentence, from a press release yesterday:But man, that sure does sound like 128-bit floating point calculations in hardware...
02-14-2007 02:48 AM
02-14-2007 04:41 AM
02-15-2007 11:20 AM
/* "quad" is a hypothetical 128-bit quad precision *//* floating point number, similar to "long double" *//* in recent versions of C++: */quad x, y, z;x = 1.000000000000000000000000000001;y = 1.000000000000000000000000000001;/* the hope was that "128b FADD" could perform the *//* following 128-bit addition in hardware: */z = x + y;
double x[2], y[2], z[2];x[0] = 1.000000000000000000000000000001;y[0] = 1.000000000000000000000000000001;x[1] = 2.000000000000000000000000000222;y[1] = 2.000000000000000000000000000222;/* Apparently the coordinates of the two "vectors" x & y *//* can be sent to "128b FADD" in parallel, and the following *//* two summations can be computed more or less simultaneously: */z[0] = x[0] + y[0];z[1] = x[1] + y[1];
PDF DOCUMENT: How JAVA's Floating-Point Hurts Everyone EverywherePDF DOCUMENT: Matlab's Loss is Nobody's Gain