"Robert Myers" <rmyers.RemoveThis@rustuck.com> wrote in message
news:bsgfrvcg4lfs92524p2r2i2tnqe3hhbs36@4ax.com...
> On Sun, 16 Nov 2003 16:50:50 GMT, "Yousuf Khan"
> <removethisspam.bjsk90.removethispam.RemoveThis@hotmail.com> wrote:
>
> >Multicore, symettric multi-threading, and 24MB of cache. Looks like this
one
> >was designed with help from the Alpha team that Intel just bought out
> >recently from HPaq.
I kind of doubt that: those people are reportedly all working on
Tanglewood, any Itanic SMT effort aimed at shipping in 2005 would have had
to have started at least a bit before the first of them settled in at Intel,
and while they may have offered comments I suspect that whatever SMT
mechanism may be incorporated into Itanic (I'm still a bit skeptical of this
report, but it does seem to be pretty wide-spread) differs sufficiently at a
very basic level from what they were working on for EV8 that their
experience may not have been directly transferrable.
> >
> > Yousuf Khan
> >
> >http://www.theinquirer.net/?article=12686
> >
>
> SMT was always aimed at Itanium.
Really? My impression is that the Itanic architecture was largely
established somewhat before SMT appeared on the horizon, that most of the
coordination by the University of Washington researchers was with DEC and
Alpha, and that SMT is particularly amenable to leveraging existing
mechanisms for out-of-order execution (e.g., in Alpha) that are
conspicuously absent in Itanic.
Intel may later have investigated ways to make use of SMT in Itanic, but I
think it was definitely a retrofit.
You can achieve most of the benefits
> of OoO execution without actually going OoO by using SMT helper
> threads.
Maybe. But without doubt one of the things that you sacrifice is power
efficiency (not that Itanic appears to worry about this much), since without
the OoO hardware facilities you don't have a clue whether the extra work
you're doing will be useful (and even if it is useful in preloading the
caches, when the *real* code path reaches that point the instructions still
get executed a second time anyway).
Such helper threads are also a lot more expensive in use of execution units
than OoO SMT mechanisms are (again, because of the redundant or useless
execution activity noted above), so you need more EUs (and thus more core
area, which starts to limit clock rates unless you go asynchronous) than
you'd need in an OoO SMT implementation to perform as well.
> If you're supporting two cores with four threads each,
Do you have a source for the suggestion that each Montecito core supports 4
threads?
the
> huge cache is inevitable.
Not if you're primarily using the SMT for helper threads (not that I'm
suggesting that this as a great idea).
- bill<!-- ~MESSAGE_AFTER~ -->
>> Stay informed about: Itanium Montecito stuff