Assembler and Disassembler

The Mops' PowerPC assembler and disassembler were originally written by Xan Gregg for Power MacForth. They've been adapted for use in the Mops environment as modules. The source files are "pasmMod.txt" and "disasm". The syntax is traditional Forth postfix-style.

Assembler colon definitions

You write a code definition thus:

`:ppc_code  someName`\
` </nowiki><var><ppc instructions></var><nowiki>`\
`;ppc_code`\

This will create a normal header for someName in the dictionary, then align the CDP to a 4-byte boundary (as required for PowerPC code), and compile the code. Note that if you tick someName, the resulting xt on the stack will be the address of the first instruction, minus 2. The extra 2 bytes are used internally (by Mops) as flags. Remember to add 2 if you need the actual address of the first instruction. If, however, you use EXECUTE, you pass the xt in the normal way.

We don't provide a full rundown of the assembly syntax here; however the comments in the source files are fairly extensive. So if you're the type of person who might want to write assembly language, you'll probably be able to figure out what to do! �especially as the file "test pasm" in the 'Module Source' folder has a definition containing all the PowerPC instructions supported by the assembler. Remember that it's a Forth-style postfix assembler. We also give a short example at the end of this section, and there are a number of code definitions in the source files in 'PPC source', especially the file "setup".

We don't provide any way of writing a method with the assembler. This should never be necessary for performance, and if you really need access to machine-level features, you can write a :ppc\_code word and call it from your method.

Accessing the dictionary

If you need to get the address of an item in the code or data area of the dictionary, you can do it with this syntax:

r0 ' <var>someWord</var> <var>dicaddr,</var>

This generates an addi instruction, using the appropriate base register and offset to place the address of the location you want into a register. We used r0 in this example, but you could have used any free register. For a location in the data area, remember that ticking the name of the item won't get you there�you have to do something like this:

r0 ' <var>myValue</var> >body <var>dicaddr,</var>

You can execute any code you like between naming the register and putting dicaddr,. You are in execution mode, and can execute any Mops words, so long as you leave just one item on the stack. One proviso is that at the moment, the target address must be within 32k bytes distance from where the base register points, as otherwise the address can't be generated with a single addi instruction. If the assembler gives you an out-of-range error on a line with dicaddr, this is probably the reason.

Executing colon definitions from code

In a word, don't. Mops colon definitions will expect various numbers of their parameters in registers when they're called, and the algorithm for working this out is quite complex (and might even change!). So you should really only use code definitions for words that don't call any other words.

Executing other code definitions from code

There's no problem with doing this, since you have full control of the registers. Remembering that the first instruction of a code definition starts two bytes after where the xt points, you call another code definition this way:

' <var>someWord</var> 2+ bl,

The Assembler automatically converts the absolute address of the first instruction of someWord, which we generate with 'someWord 2+', to the offset that is required by the bl instruction.

Assembler source

The assembler-related source files are all in the 'Module source' folder. They are:


pasmMod.txt the assembler disasm the disassembler (loaded by "pasmMod.txt") test pasm a big test definition with all the PPC instructions vectors pasm test another big test definition with all the AltiVec instructions


On entry to a code definition, the top-of-stack item is in register r4, the second item is in r3, and the next stack item is in memory, pointed to by the data stack pointer, r18. You can do whatever you like with the stack (within reason), but on exit from your definition you must observe this same convention.

Within the definition, you can use registers r5-12 freely for whatever you like without having to save or restore them. You can also use r23-31, but as Mops uses these for locals, they might contain values in use by the calling code. So you will need to save and restore any of these that you use.

The same applies to the floating point stack. On entry to a code definition, the top-of-stack item is in register fr2, the second item is in fr1, and the next stack item is in memory, pointed to by the FP stack pointer, r19. You can use fr0 and fr3-13 without saving or restoring them. fr14-31 are used for FP locals, so you can use them if you save and restore them.

PowerPC register usage

Here's a summary of the register usage in the PowerMops runtime environment:


r0 scratch r1 system stack pointer (leave it alone, normally)
r2 RTOC (Table Of Contents pointer -- leave alone)
r3 initially, second stack cell
r4 initially, top stack cell
r5 scratch
r6 scratch
r7 scratch
r8 scratch
r9 scratch
r10 scratch
r11 scratch, also used in system calls
r12 ditto
r13 base address of main dic code area
r14 base address of main dic data area
r15 base address of current module's code area
r16 base address of current module's data area
r17 return stack pointer (points to top cell)
r18 data stack pointer (points to top memory cell)
r19 floating point stack pointer (points to top memory cell)
r20 base address of current object
r21 loop counter I
r22 limit value for DO...LOOP
r23 can use if you save and restore
r24 ditto
r25 ditto
r26 ditto
r27 ditto
r28 ditto
r29 ditto
r30 ditto
r31 ditto
 
fr0 scratch
fr1 initially, second stack cell
fr2 initially, top stack cell
fr3 scratch
fr4 scratch
fr5 scratch
fr6 scratch
fr7 scratch
fr8 scratch
fr9 scratch
fr10 scratch
fr11 scratch
fr12 scratch
fr13 scratch
fr14 can use if you save and restore
fr15 ditto
fr16 ditto
fr17 ditto
fr18 ditto
fr19 ditto
fr20 ditto
fr21 ditto
fr22 ditto
fr23 ditto
fr24 ditto
fr25 ditto
fr26 ditto
fr27 ditto
fr28 ditto
fr29 ditto
fr30 ditto
fr31 ditto


Example

Finally, here's a short example, from the file "pnuc1" (in 'PPC source'). This is the definition of PICK.

`:ppc_code PICK`\
` r4 0 cmpi, \is it 0 pick?`\
` eq if,`\
` r4 r3 r3 or, \yes - copy TOS`\
` else,`\
` r5 r4 2 0 29 rlwinm, \no- mult index by 4 by left shift`\
` r5 r5 -4 addi, \and subtract 4 to get SP offset`\
` r4 r18 r5 lwzx, \grab the cell`\
` then,`\
` blr, \and return.`\
`;ppc_code`\

Note that PICK doesn't push or pop from the data stack, but simply replaces the top cell (in r4) with the stack cell that it fetches. Also, since it doesn't call any other routine, there's no need to save and restore the return address which is in the link register on entry. If we had needed to save and restore the link register, we would have put the instructions

r0 mflr, \save lr on return stack
r0 -4 rRP stwu,

at the beginning, and

r0 mtlr, \restore lr

at the end before the blr,.

Note also that we need to special-case the '0 pick' case, since the desired cell isn't in the memory part of the stack, but is already in r3.

For an example of a much longer code definition, have a look at (EX) in the file "setup" (in 'PPC source').

Warning: If you are already familiar with the old 68k Mops Assembler, note that you must write instructions at the end of your code routine so that it will return. The PowerPC Assembler doesn't do this for you, since it can't always know where your return address is. (It might not necessarily be in the link register at that point�you might have put it in the count register or have saved it on the return stack or somewhere else.)

Disassembler

The disassembler will disassemble PowerPC code from a range of addresses you specify, dumping its output to the Mops window.

You call the disassembler with one of the following words:


disasm\_word someWord Disassembles the word someWord. disasm\_xt ( xt \-- ) Disassembles the word with the given xt. disasm\_rng ( from to \-- ) Disassembles within the given address range. disasm\_cnt class="STACK" nowrap | ( from \#inst \-- ) Starts at 'from', disassembles the given number of instructions. disasm ( from \-- ) Starts at 'from', keeps going till you hit a key or until a blr instruction is seen (which normally comes at the end of a definition).