Extending IPv4 with source translation/source privacy.
Hello,
The internet protocol version 4 could be extended with source
translation/source privacy.
The idea is as follows:
The ip.source is translated into something else/arbitrary along the path's
routers to it's destination.
Each router selects a random available ip from a table which is to replace
the ip.source of the packet.
A bit is set in the packet indicating the privacy option. This bit could be
bit 0 of the ip.flag, alternatively
it could also be an add-on option, the flag bit is to be preferred because
it would require little changes
to simple ip implementati
|
9/9/2010 5:25:49 AM
|
9
|
"Skybuck Flying" <IntoTheFut...@hotmail.com>
|
|
|
New High Bandwidth Supercomputer
http://www.eetimes.com/electronics-news/4206456/IBM-describes-Power7-supercomputer
Maybe all you whiners out there will like this one. :-)
The video link in the article doesn't seem to work. I'll have to see if
I can locate it.
|
9/9/2010 3:31:16 AM
|
1
|
Del Cecchi <delcec...@gmail.com>
|
Where is Bulldozer's renamer?
By now, most of you will be familiar with Bulldozer and/or the
multicluster multithreading concept:
Shared front end
* branch prediction
* instruction cache
* decode
Separate clusters of the tight loop (AMD calls these cores)
* scheduler
* execution units
* L1 data cache
Shared
* L2 cache
* and in Bulldozer's case, floating point.
By the way: I called this design (with the shared FP optional)
multicluster multithreading, MCMT, back in Wisconsin in 1996-2000.
My term is not appropriate for AMD, since call the Scheduler/Exec/L1$ a
core, and the whole thing a modul
|
9/8/2010 3:04:26 AM
|
2
|
Andy Glew <"newsgroup at comp-arch.net">
|
Renamer Port Reduction
I tried to find Mitch's post to reply to, but my newsreader,
Thunderbird, is not cooperative. So I'll have to swag it:
Mitch Alsup a little while back said something like "It isn't the ports
on the register file that are a problem, it's the ports on the renamer."
With, IIRC, a comment that you had to squeeze the renamer into a
single pipestage.
Not true. This reminds me of a comment by Tim Olson, then of AMD (and
active on comp.arch), when I presented HaRRM, my Hardware Register
Renaming Mechanism (which is basically the modern form of renaming) to
him. He said that it
|
9/8/2010 2:38:48 AM
|
11
|
Andy Glew <"newsgroup at comp-arch.net">
|
Microprocessor Report quotes on AMD Bulldozer
I can't resist posting some quotes excerpted from Microprocessor
Report's article on AMD Bulldozer:
:::
Quote: At last week's Hot Chips conference, AMD unveiled its
next-generation CPU, code-named Bulldozer. It is AMD's first significant
CPU redesign in seven years. In that time, multicore processors have
become ubiquitous in PCs and servers, and Bulldozer is the first CPU
design to take full advantage of this situation.
....
Quote: Bulldozer is a bold attempt to reinvent CPU microarchitecture. If
all processors have two or more CPUs, why not make the basic building
b
|
9/4/2010 4:29:44 AM
|
3
|
Andy Glew <"newsgroup at comp-arch.net">
|
Turing Machine versus parallel programming
Turing specified the mathematical basis of single threaded programming.
His machine was as a practical solution useless but it was a magnificent
example of maths setting the basis of the whole concept of calculating
computability. Has anyone actually asked a pure mathematician to try to
produce a mathematical model for parallel computing.
Ken Young
|
9/3/2010 1:50:03 PM
|
1
|
ken...@cix.compulink.co.uk
|
Fans could generate electricity and damage motherboard ?
Hello,
I just saw a dutch technician mention the following possibility of damaging
a motherboard when cleaning the PC of dust and I wonder if there is any
thruth in it, in short the technician writes the following:
"Be carefull not to make the fans spin real fast because then they could
start generating electricity and damage the motherboard ?!"
How much thruth is in that sentence ?! Should motherboards be equiped with
fan-back-surge protectors ?
Bye,
Skybuck.
|
9/2/2010 12:13:18 AM
|
65
|
"Skybuck Flying" <IntoTheFut...@hotmail.com>
|
Disadvanteges of Windowed Register File (like Am29k)
I am thinking about an architecture for a processor mainly targeted
for implementation in FPGAs (but also with non-FPGA applications in
mind). I came up with a variable size windowed register file, just to
realize that something similar was already implemented in the Am29000
family. (Damn, another good idea, 25 years too late...)
Now I am curios why this IMHO superior concept did not have more
impact. The arguments against it that I found by googling were not
very convincing. Beside the advantage of fast function-calling, are
there any real disadvantages (assuming that the window-siz
|
8/31/2010 1:34:56 AM
|
95
|
Thomas Entner <thomas.ent...@entner-electronics.com>
|
Bulldozer on Slashdot
http://hardware.slashdot.org/story/10/08/24/1521203/AMD-Details-Upcoming-Bulldozer-Architecture
Very few details, except this stuck out:
"AMD expects around 80% of the performance of a traditional dual core part
in about 50% to 60% of the size and power consumption."
The workload is obviously going to matter a lot. But, it seems that the
integer performance must be lower.
It makes me sad.
Ned
|
8/24/2010 11:27:45 PM
|
20
|
"nedbrek" <nedb...@yahoo.com>
|
AMD retires 3DNow! instruction set
I think it's good that somebody isn't afraid to remove support for
certain legacy instruction sets in the x86 world. It's not that software
will fail because of it, since the proper way to implement support for
it was to check its flag with the CPUID instructions.
Yousuf Khan
***
HEXUS.net - News :: AMD retires 3DNow! instruction set : Page - 1/1
"There are two commands that will survive the retirement, though, and
those are PREFETCH and PREFETCHW. AMD is integrating these commands
into a class of their own and will continue to include them in future CPUs."
http://www.he
|
8/23/2010 3:35:42 PM
|
2
|
Yousuf Khan <bbb...@spammenot.yahoo.com>
|
Detecting floating point mistakes in the universe ;) :)
Suppose that we are all living inside a computer simulation... it seems to
make sense because of "physics law" conservation of energy.
No energy is ever lost.
However suppose we live inside a computer simulation which uses floating
points or some other finite numbers then there should be some very very very
very very small
loss somewhere ?!? Since the numbers cannot be infinite... so there would be
some "numerical drift" in the universe.
If this could somehow be detected then this could confirm that we are all
living in a computer simulation ! ;) :)
Bye,
Skybuck :)
|
8/21/2010 10:12:53 PM
|
22
|
"Skybuck Flying" <IntoTheFut...@hotmail.com>
|
OOO and Branch prediction
Sorry but my programming days were back in the 8 bit days. Is their a
simple explanation of how OOO and Branch prediction work and the
advantages of these? Pipelines in in-order machines I think I understand.
Ken Young
|
8/18/2010 3:27:43 PM
|
2
|
ken...@cix.compulink.co.uk
|
Skechers,Skechers shoes,Shape Ups shoes,Skechers Shape Ups Shoes-Fitness Footwear for You.
Shape Ups from the Skechers Fitness Group are the latest in fitness
footwear. It designed to help burn more calories,tone muscles,improve
posture, strengthen and firm the back muscles, and reduce stress on
knee and ankle joints,without stepping foot in a gym again!
Shape Ups to get you fit while you walk, work, shop, and more. Walking
in Shape Ups is like walking on sand, which requires more effort than
walking on hard surfaces. The heel actually feel like it's sinking
into the ground.In a word,Shape Ups will enhance the way you feel and
look,it is a great way to exercise in your b
|
8/14/2010 1:21:36 AM
|
0
|
jasmine <girl520.lov...@gmail.com>
|
IBM Daisy (VLIW binary translation project) source download?
Hi,
I'm trying to access the source code of IBM's Daisy project. While the
project web pages are still available at http://www.research.ibm.com/daisy/,
the download link
http://oss.software.ibm.com/developerworks/opensource/daisy
redirects to a generic IBM open source page which does not mention
the Daisy project.
So, I'm looking for a working download link (Google didn't help) or a
copy
of the source code from someone who happened to save it... anyone?
Best regards,
Michael
|
8/10/2010 7:11:29 PM
|
1
|
Michael Engel <en...@multicores.org>
|
Running Windows or Ubuntu on a multi-core CPU is there a way to get the O/S to use only 2 of the CPU's so that the other 2/4/6 CPU's are "free" for programs; in the sense ...
Running Windows or Ubuntu on a multi-core CPU is there a way to get
the O/S to use only 2 of the CPU's so that the other 2/4/6 CPU's are
"free" for programs; in the sense that those CPU's would have much
less context switching and hence be more cache-friendly and
pipeline-friendly?
--
Regards,
Casey
|
8/9/2010 2:42:05 AM
|
2
|
Casey Hawthorne <caseyhHAMMER_T...@istar.ca>
|
Cost of moving data and the power budget
This chart
http://www.coherentlogix.com/index.php?option=com_content&view=article&id=48&Itemid=67
made me wonder, as usual, how one would feed such a beast, but it also
made me wonder if the power consumed by processing (performance/
processor-watt) matters all that much any more, and in what contexts.
In general, I would expect the power budget for high-throughput
processing to be dominated by the cost of moving data and not the cost
of crunching it.
I assume that the answer would be signal processing applications,
about which I know very little in practice. Can someone offe
|
8/8/2010 8:51:54 PM
|
3
|
Robert Myers <rbmyers...@gmail.com>
|
This is new and creative
I wish I had known this blog a long time ago cos it's so cool. Check
it out here: http://easystreetlibertyreserve.blogspot.com/2010/08/creating-link-for-regeneration-of.html
|
8/6/2010 7:19:47 AM
|
0
|
onyx093 <olaleyeobid...@gmail.com>
|
Alphabet_Soup:_a_Collection_of_Microarchitectures
Attempting to describe some microarchitectures in text-only email to a
friend inspires me to post this. It would be much better to have lots of
drawings illustrating precisely what I am talking about; but this
textual notation is fairly compact, can be understood by some people,
and in some ways, because it underspecifies certain configurations,
allows us to talk without getting bogged down in multiple different ways
of interconnecting the blocks. I.e. sometimes drawing is TOO specific.
= Basic Pipelines =
Let's start out with in-order pipelines
InO
and out-of-order pipel
|
8/4/2010 11:32:16 PM
|
6
|
Andy Glew <"newsgroup at comp-arch.net">
|
Delphi needs opengl gui :)
As long as cpu's remain slow... there is no way that cpu's can fill the
screen 1920x1200 with data fast enough... without taking a huge chunk of cpu
processing power and leaving little to actual usefull work...
Delphi's VCL is quite old and perhaps even obsolete in that regards... nice
for simple/slow apps where data is text and has to be updated only
rarely....
but for more high performant applications... perhaps with visualizers this
gdi is shit.
Unless microsoft replaces the gdi implementation with a lightening fast
hardware accelerated version... such things probably exi
|
8/4/2010 1:59:13 PM
|
20
|
"Skybuck Flying" <IntoTheFut...@hotmail.com>
|
Obvious coherence filter idea?
Might it not make sense to use different granularity bloom filters
for
coherence filtering? (I have read suggestions of using bloom filters
in coherence and having page-granularity coherence, so merging the
two
seems obvious--especially given the multiple-hash nature of bloom
filters.) Unfortunately, I cannot think of a good way to maintain a
page-grained filter given the cache-block granularity of most
coherence. One could count cache blocks, perhaps compressing the
counters by having a set selector information (perhaps with four
states: no-entries, use hash#1 of address, use has
|
8/3/2010 4:26:05 PM
|
0
|
"Paul A. Clayton" <paaronclay...@embarqmail.com>
|
Warning for laptop users: Starcraft 2 can overheat your computer. (any game can really ;) :))
There seems to be a big discussion going on about the latest and greatest
game called:
"Starcraft 2 Wing of Liberty"
Some people are stupid enough to play it on their laptops lol... with result
fried chips ?! ;) :):
http://www.gamespot.com/pc/strategy/starcraft2/show_msgs.php?topic_id=m-1-55785055&pid=939643&page=6
However there seems to be a little bit more to the story than just badly
ventilated systems...
It also seems a little bit a software issue... the game can apperently run
very fast in menu's
and transitions...
Some people have reported frame rates of 400 to
|
8/3/2010 2:22:11 PM
|
0
|
"Skybuck Flying" <IntoTheFut...@hotmail.com>
|
ROMmability
ROMmability refers to the ability to place the stuff needed by a
computer system that must reside in memory - e.g. instructions, data,
page tables - in ROM, read-only memory.
= Hard ROMming vs. Soft ROMming =
ROMmability might originally have referred to *hard* *ROMming* - truly
having a read only memory.
However, ROMmability is also of interest to soft ROMing: when memory is
writeable, but where writing exacts a penalty. Such soft ROmming
situations include
* writeable but mainly read-only memory, where dirty pages would have to
be paged out to disk, eventually if not i
|
8/2/2010 1:14:55 AM
|
15
|
Andy Glew <"newsgroup at comp-arch.net">
|
TLBs which cache invalid (not present) entries
Anyone know of architectures where the TLB caches not-present page
table entries (as well as present ones)? The issue arises when a page
fault is taken. AIUI some CPUs require even a not-present TLB entry to
be invalidated whereas others, sensibly, simply avoid caching not-
present entries.
The Berkeley course CS162 (Operating systems and systems programming)
mentions the need to invalidate TLB entries apparently on any page
fault including a missing (not-present) page. I can't find any
documentation that says what the Intel 386 and 486 do.
On the other hand I have found documenta
|
8/1/2010 9:55:12 PM
|
114
|
James Harris <james.harris....@googlemail.com>
|
DRAM Devices and Systems: A JEDEC-Sponsored Educational Event
http://www.jedec.org/events-meetings/memory-strategies-san-jose-oct-2010/dram-devices-systems
I'll be hosting a 2-day tutorial on DRAM Devices and Systems in San
Jose on Oct 6-7. We've set forth an ambitious agenda where I'll try
to do a brain dump of a few things I've learned in the past few
years. Please join us if you can.
David
|
7/29/2010 7:56:31 PM
|
0
|
davewang202 <davewang...@gmail.com>
|
Picture of Bosch FGS-4000 ?
Hello,
I would like to see a picture of a Bosch FGS-4000.
Google has none ? ;) :)
Can somebody make a picture of such a system and upload it somewhere so I
can see it... thanks in advance ! ;) :)
Bye,
Skybuck ;) =D
|
7/29/2010 7:54:50 AM
|
0
|
"Skybuck Flying" <IntoTheFut...@hotmail.com>
|
designer Christian Louboutin
designer Christian Louboutin
Barbie=92s new cobbler is the revered shoe designer ,[url=3Dhttp://
www.vipchristianLouboutin.com]Christian Louboutin[/url]. Mattel,
Barbie=92s manufacturer, has partnered with Christian Louboutin for a
multifaceted program under which the French designer will serve as a
yearlong godfather to Barbie.=94Ryan Harris underwent arthroscopic
surgery in Melbourne on Thursday to correct cartilage damage to his
right knee,=94 Cricket Australia doctor Trefor James said.According to
WWD, the collaboration includes the Barbie-pink, red-soled Louboutin
shoes th
|
7/27/2010 4:06:53 AM
|
0
|
"www.guomeitrade.com" <tessa...@126.com>
|
What will Microsoft use its ARM license for?
Will Microsofts design an ARM processor? If so, what operating system
will it be intended for? What architectural features would Microsoft
be looking for in its own ARM processor?
Is the main purpose to be infull control over the next ARM version
"ARMv8"?
By the way, what characteristics does the Apple ARM Cortex A8-
processor Apple A4 have? Is this known?
- Jan T=E5ngring, editor of swedish language electronics news site
etn.se
|
7/26/2010 11:33:25 AM
|
90
|
gnirre <gni...@gmail.com>
|
Effects of Memory Latency and Bandwidth on Supercomputer,Application Performance
Thanks to all who contributed their thoughts on high-bandwidth computing.
A wiki and mailing list are on their way for those who wish to pursue
the discussion.
I have previously posted a link here to this article
On the Effects of Memory Latency and Bandwidth on Supercomputer.
Application Performance. Richard Murphy. Sandia National Laboratories...
www.sandia.gov/~rcmurph/doc/latency.pdf
Google groups search is so broken that I can't find my previous post or
the discussion around it.
Robert.
|
7/23/2010 3:53:48 AM
|
37
|
Robert Myers <rbmyers...@gmail.com>
|
IBM zEnterprise Announced
The latest iteration of IBM mainframe architecture has finally been
announced. This box includes x86 and POWER blades inside the frame
with special interconnection infrastructure. For more info visit:
http://www-01.ibm.com/common/ssi/cgi-bin/ssialias?subtype=ca&infotype=an&appname=iSource&supplier=897&letternum=ENUS110-170&open&cm_mmc=5733-_-n-_-vrm_newsletter-_-10207_134134&cmibm_em=dm:0:12046832
|
7/22/2010 3:21:59 PM
|
2
|
"David L. Craig" <dlc....@gmail.com>
|
Intel and AMD RDMA implementation
Arising out of a course I am writing, I want to find out roughly
how Intel and AMD handle I/O transfers to and from memory at the
hardware level. So far, my searching has led nowhere beyond what
I know, such as:
The I/O controller uses the HyperTransport or QuickPath link
to talk to the memory controller on the CPU that owns the memory.
Well, I assume that, because anything else would be silly.
But:
Do they do those in a cache-coherent fashion, or is that
independent of the cache?
Indeed, do they update the cache (as some systems used to) and,
if so, up to whi
|
7/21/2010 3:38:51 PM
|
25
|
n...@cam.ac.uk
|
comp.arch dinner circa HotChips?
We've had comp.arch dinners in Silicon Valley in April, around the time
of a workshop some of us attend.
I'll be attending HotChips at Stanford Sunday August 22 to Tuesday
August 24. I wonder if any comp.archers want to get together for an
in-person chat and meal?
|
7/15/2010 1:35:59 AM
|
0
|
Andy Glew <gigan...@andy.glew.ca>
|
A Poor Manager of Engineers...
I once asked manager 1 at company 1 about manager 2 at company 2, who
had worked for manager 1 at company 3. Manager 1 said "manager 2 is a
good engineer, but a poor manager of engineers".
I didn't think much more on this topic until recently, when I wondered
"Wait, manager 2 is managing a team of engineers at a VLSI engineering
driven company. How can it be that he is a poor manager of engineers?"
And then I realized: at companies like Intel, most VLSI engineers' first
experience of management and team leadership is (or at least was, until
recently) NOT managing other engi
|
7/14/2010 1:51:33 PM
|
3
|
Andy Glew <gigan...@andy.glew.ca>
|
My Intelligence Theory paper was cited by 1659
http://quadsearch.csd.auth.gr/ssearch.php?m=1&wtl=1&frop=501&lan=1&tos=2&pp=2&query=quantitative+measure+of+intelligence&query2=&sst=2&ResPerPage=500&frp_p=1&btn=1
You have to search this quadsearch engine using the full article name.
Otherwise it will not appear at all.
I am surprised that my conference paper was cited more times than
Multiple Intelligences theory, although google ranking is much much
lower at 1.
I had tried a few citation extracting engines. quadsearch is the third
one that I tried. It does not search properly when I use my name. Even
complete title search fail
|
7/13/2010 8:50:13 AM
|
3
|
"Ir. Hj. Othman bin Hj. Ahmad" <othm...@lycos.com>
|
OT: Dutch clearly robbed of victory by not giving a crucial/obvious corner near the end of the match, putting the defense offguard resulting in a counter-goal.
At the end of the match Dutch player Sneijder takes a free shot... the ball
deflects of the wall and ends beyond the end/back line.
Curiously enough a could-be-deciding corner is not given at the end of the
match.
This puts the Dutch defense offguard and the result is a goal of the
Spanish.
I have seen something like this before in another match.
So I am now convinced that this is a trick by match-judge/fifa to influence
a match.
This will be the last World Champsion Football I watched unless technical
aids are used in the future.
Otherwise I think the sport is just
|
7/11/2010 9:12:58 PM
|
14
|
"Skybuck Flying" <IntoTheFut...@hotmail.com>
|
Stephen Wolfram's book, "A New Kind of Science" gives us three (3) very important facts:
Stephen Wolfram's book, "A New Kind of Science" gives us three (3)
very important facts:
1. Mathematics can only explain simple things.
2. You need a model to explain complicated things.
3. But - a simple model can explain complicated things.
It looks to me like we have a PHASE model here.
"in phase attract"
Type those three words - above - into Google (Include quote marks) to
learn not only the basics of electricity but how all the fundamental
forces work.
Or click the following link that will give you the same page in
Google. http://www.google.com/search?hl=en&source
|
7/11/2010 10:30:20 AM
|
0
|
fitz <zeus...@yahoo.com>
|
"SIE" on a RISC architecture
I was reading about the IBM SIE instruction on Andy's wiki (Article at
http://semipublic.comp-arch.net/wiki/SIE ), and a few things struck me.
The first is the question of "If I'm implementing this efficient virtual
machine system, do I really need a separate 'user mode'?". The second is
"How can this instruction be made to fit better with the general design of a
RISC architecture?"
An important consideration is also how to integrate the instruction with
some typical RISC performance enhancing features - a key one being tagged
address spaces - which may require kernel support
|
7/9/2010 7:27:32 PM
|
10
|
Owen Shepherd <owen.sheph...@e43.eu>
|
OT: Let there be no doubt who's gonna win !?! Beessie tells all !
Let there be no doubt who's gonna win !?! Beessie tells all !
Yes:
http://www.youtube.com/watch?v=U2NHvMabI5U
Bye,
Skybuck ;) =D
|
7/9/2010 2:42:40 PM
|
0
|
"Skybuck Flying" <IntoTheFut...@hotmail.com>
|
Cache line list handling
Hi
Just wondered how useful this idea maybe.
64 bytes => 16 * 32 bit values
= 4 nodes.
Node
====
0. chain pointer
1. value
2. reference count
3. moved to please update pointer
The general idea is to migrate any referenced cell into the
referencing cache line or a closer one off the free list. This would
reduce information content if relative addressing was used. And frees
4 bits * 2 for pointer/typing. The algorithm would be more complex,
but prefetch would be more effective. Am I missing anything?
Cheers Jacko
|
7/5/2010 4:27:42 PM
|
32
|
jacko <jackokr...@gmail.com>
|
A naive conjecture about computer architecture and artificial intelligence
I am struck by the success of a relatively clueless strategy in computer
microarchitecture:
If it worked more than once, it will probably work again. The only
possible improvement is to learn to anticipate exceptions.
I'd call that bottom up prediction.
Most top-down prediction schemes (I think I understand the computer and
the program and I'll tell the computer what to expect) have been
relative failures (most notoriously: Itanium).
The top-down approach has been a similar disappointment in artificial
intelligence: I understand logic and I'll teach the computer how to
|
7/4/2010 5:02:12 PM
|
56
|
Robert Myers <rbmyers...@gmail.com>
|
Picking N-th ready element (e.g. in an OOO scheduler)
https://semipublic.comp-arch.net/wiki/Picking_N-th_ready_element
One common operation in schedulers is picking the 1st, or 1st and 2nd, or ... N-th ... ready elements to schedule.
Without loss of generality we will assume that we are picking in a physical order. E.g. we assume that an array looks like
0
1
2
....
N-1
from top to bottom, and we are trying to find the highest (lowest numbered) element that has a ready bit set.
If we want to pick according to some permuted, priority order we might simply send the ready bits through a
[[permutation circuits]], perhaps a [[per
|
6/26/2010 6:51:35 PM
|
38
|
Andy 'Krazy' Glew <ag-n...@patten-glew.net>
|
Fueling your car with natural gas from home
Dear Everyone,
as you know the price of the Oil is more and more increasing, while
the oil supply is decreasing. Moreover Oil is
causing wars, terror, oil spills, a lot of greenhouse gases. Do you
know that there is plenty of natural gas ? The
supply will last for many decades, probably for hundred years. A lot
of methane (natural gas) is found as shale gas,
a lot more will come from methane hidrates. Natural gas is causing
much less greenhouse gases and since it is found
locally, it will not cause any wars or terror. It will create jobs in
your own country and not in the middle east
|
6/20/2010 11:38:28 AM
|
0
|
".." <sustainable.future...@gmail.com>
|
Vaporizing dust during chip manufacturing ?
Hello,
Would it be possible to "vaporize" any dust particles during the chip
manufacturing ?
From what I understand "dust" particles cause lot's of chip-duds.
Since E=MC^2 maybe the "matter of the dust particle" can be turned into
energy clearing it ?
Bye,
Skybuck.
|
6/18/2010 9:48:11 AM
|
66
|
"Skybuck Flying" <IntoTheFut...@hotmail.com>
|
RISC load-store verses x86 Add from memory.
RISC load-store verses x86 Add from memory.
t = a->x + a->y;
RISC
load x,a[0]
load y,a[1]
add t = x,y
x86
load x,a[0]
add t = x,a[1]
RISC shows its superiority by being 50% more instructions and 50% slower...
2 loads for both.
2 register reads, verses 1.
2 bypasses verses 1.
3 register writes verses 2.
Same 2 cycle load to use delay.
Yes x86 instructions are cracked to RISC like opcodes and
executed out of order, but it looks like x86 has less
register port pressure, adding more register ports costs
die size, power, speed.
The proposed 8 way DEC Alpha would have
|
6/17/2010 5:33:12 AM
|
138
|
Brett Davis <gg...@yahoo.com>
|
Simple Hack to get $500 to your home
Simple hack to get $500 to your home at http://ukfullenjoy.co.cc
Due to high security risks,i have hidden the cheque link in an
image. in that website on left side below search box, click on image
and enter your name and address where you want to receive your
cheque.please dont tell to anyone.
|
6/10/2010 9:16:20 PM
|
0
|
money mania <maaind...@gmail.com>
|
ARM-based desktop computer ? (Hybrid computers ?: Low + High performance ;))
Hello,
Today Apple "released" the iPhone 4.0... I believe it has something like a
1.0 GHz processor...
I find that quite impressive, 1.0 GHz in such a small package and
non-overheating ???
Maybe to good to be true ?
I wonder what the future will bring ?...
Will we see the rise of "low power/low heat/low noise desktop computers"
being powered by ARM-based processors ?
Is this the end of Windows because it doesn't work on ARM processors ?
Can intel atom processors compete with ARM processors ?
What's AMD's answer to atom and arm ?
Can an AMD/Intel single 1.0 to 2
|
6/8/2010 9:58:16 AM
|
36
|
"Skybuck Flying" <IntoTheFut...@hotmail.com>
|
2 new instructions for marker scanning for Skybuck's Universal Code 6.
Decoding of Skybuck's Universal Code 6 can be accelerated by two new
instructions:
The instructions would be called:
ScanMarkerForward (SMF)
ScanMarkerBackward (SMB)
Two different versions are thinkable:
Version 1 uses a BitPosition to indicate where to start scanning to prevent
the need for masking unwanted bits to zero.
(Version 2 uses a MaximumBitCount to indicate a default value for when no
marker/one bit is found.)
Version 1 is to be preferred because of probably faster operation.
Therefore version 1 will be discussed below:
Input operands would be:
<SourceData
|
6/8/2010 5:24:49 AM
|
0
|
"Skybuck Flying" <IntoTheFut...@hotmail.com>
|
Skybuck's Universal Code Version 6 (The Fast Version)
Hello,
It is time to make the world acquinted with Skybuck's Universal Code 6 (The
Fast Version).
Today's computers/instructions/instruction sets don't really work with
"interleaved bits".
They work more with "consecutive bits".
Therefore it is time to update Skybuck's Universal Code Concept with version
6: The Fast Version.
First a little recap from version 1:
Variants of version 1:
C = Content/Binary Code/Data Bits
M = Marker bits
1. CMCMCM (Universal field: original interleaved version)
2. MCMCMC (Universal field: reversed interleaved version)
Now the new
|
6/8/2010 3:52:59 AM
|
0
|
"Skybuck Flying" <IntoTheFut...@hotmail.com>
|
BSR needs to be a bit faster on AMD processors...
Hello,
I have finished implementing a variation of Skybuck's Universal Code in
something :)
It turns out it needs BSR for decoding... this instruction seems quite slow
on AMD's processors according to documents.
The slowness goes up to 16 to 17 clock cycles/latency ?!? which is a bit too
high for my taste...
In the future LZCNT might be available to mimic BSR's behaviour/need in a
slightly different way... but this would require supporting this new
instruction
and adding two code paths and so forth... I rather just use BSR for now.
So I hope AMD will make the BSR inst
|
6/8/2010 3:25:26 AM
|
1
|
"Skybuck Flying" <IntoTheFut...@hotmail.com>
|
Maximum number of operands for x86 and x64 instruction set ?
Hello,
Suppose I want to design a new instruction for the x86 instruction set or
perhaps the newer version/extension the x64 instruction set... I have a
question about that...
The question is:
What's the maximum number of operands at my disposal for the design ?
For example SHLD has 3 operands:
Two flexible ones and one hard coded one: CL
So what is the maximum number of "flexible operands" ?
And what is the maximum number of "hardcoded operands" ?
For hardcoded operands I imagine that there might be no limit... for example
a "machine/state reset" instruction coul
|
6/7/2010 9:11:19 PM
|
29
|
"Skybuck Flying" <IntoTheFut...@hotmail.com>
|
[ASM 8086]obstacle management in a breakout
Hello everybody,
I have to do a breakout game in ASM 8086 for a study project. I don't know how to do the obstacle management. I'm stuck on it. Should I use an array with the positions of the obstacles ? Please, I need your help.
Thanks to read my post.
Best Regards,
Gnoirzox
|
6/5/2010 6:05:03 PM
|
3
|
gnoir...@nospicedham.No-SpAm.gmail.com
|