Shoot... Editted out my previous garbage entirely.
One of Core2's many improvements over the Pentium4 stems from its shorter pipeline, but glancing at a few old articles from its introduction, I didn't fully absorb the details.
butc8, I think the figure you are looking at shows either Integer ops, or possibly SSE-optimized instructions.
Now that I'm home and have some peace and quiet, I went out and downloaded LINPACK, an old, classic benchmark. Full description quoted below for the curious. Results for
one core out of my C2Duo, running at 3.4 GHz, rated me at 1.85 GFLOPs. This implies that all four cores in a Q6600, similarly overclocked, put out
7.4 GFLOPs. (A general purpose metric would be: 0.544 GFLOPs per-GHz per-Core.)
I also have a copy that's optimized for SSE2 instructions, which reports 2.08 GFLOPs/core... but Folding cores benefit far more from SSE2, so I'm not sure what to make of that.
LINPACK
The LINPACK Benchmarks are a measure of a system's floating point computing power. Introduced by Jack Dongarra, they measure how fast a computer solves a dense n by n system of linear equations Ax=b, which is a common task in engineering. It was written in Fortran by Jack Dongarra, Jim Bunch, Cleve Moler, and Pete Stewart, and was intended for use on supercomputers in the 1970s and early 1980s.