The Basics: Registers, Data Types and Memory
- Most of the x86 complexity comes from backwards compatibility
- Modern x86=IA32
- 8 general purpose 32 bit integer registers
- eax, ebx, ecx, edx, esi, edi, ebp, esp
- These can be addressed by 32 bits (e_x), lower 16 (_x), higher 8 bits in the lower 16 (_H), lower 8 bits in the lower 16 (_L)
- 2 special purpose 32 bit registers, instruction pointer and flags
- % denotes a register in assembly code
- 8 general purpose 32 bit integer registers
- x86 ISA supports:
- 2’s complement
- unsigned integers in widths 32, 16, 8 bits
- IEEE single double precision
- 80-bit intel floating point
- ASCII Strings
- BCD
- x86 is byte addressable every memory address (32 bits) points to a byte
- x86 uses little endian
- 0x12345678 is stored as
- 0x78, 0x56, 0x34, 0x,12 in consecutive increasing memory locations
- 0x12345678 is stored as
LC-3 to x86
Operate Instructions
- x86 has many more operations than LC3
- Arithmetic Operators:
- ADD, SUB, NEG, INC, DEC
- Logical Operators:
- AND, OR, XOR, NOT
- Shift Operators:
- SHL, SAR, SHR, ROL, ROR
- Arithmetic Operators:
- x86 typically has 2 argument instructions with the second one being the destination
- addl %eax, %ebx # EBX<-EBX+EAX
- Operands can optionally have “l”, “w’, or “b” attached to them
- l=32 bits (long)
- w=16 bits (word)
- b=8 bits (byte)
- Immediate values have to be preceded by the $ sign such as
- addl $20, %esp # ESP<-ESP+20
- $0x____=hex
- $0____=octal
- $(1-9)(0-9)=decimal cannot use this to add 0
- If the $ sign is not used a number is treated as a memory reference
- addl 20, %esp # ESP<-ESP+M[20]
- Addresses can be calculated in an operand as follows
- displacement(SR1, SR2, scale): displacement+SR1+(SR2*scale)
- displacement defaults to 0 if not specified
- scale defaults to 1 if not specified and can be 1,2,4,8
- The reason for this format is to allow one to select a data structure (SR1) and then furthur offset it (SR2) and if an array use an index (displacement)
- Access Nth element of an array of 32bit integers one could put a poitner to the base of the array into EBX and index N into ESI and execute
- movw (%ebx, %esi, 4), %eax # EAX<-M[EBX+ESI*4]
- if the array started at the 28th byte of the structure
- movw 28(%ebx, %esi, 4), %eax # EAX<-M[EBX+ESI*4]
- leal (%eax, %ebx), %ecx # ECX<-EAX+EBX
- displacement(SR1, SR2, scale): displacement+SR1+(SR2*scale)
- Interesting instruction: xorl %edx, %edx # EDX<-0
Data Movement Instructions
- LC-3 had three addressing modes for load and stores
- PC relative: LD/ST
- Indirect: LDI/STI
- Base+Offset: LDR/STR
- LEA
- x86 unifies the above (except LEA) into the mov instruction
- LDI/STI is not covered but can be done in 2 instructions
- LD/ST is technically unavailable but can be done with direct addressing where the address to be used is specified as an immediate value in the instruction
Condition Codes: Only 5 will be mentioned
- Sign Flag (SF): Records whether the last result represented a negative 2’s complement integer (MSB set)
- Zero Flag (ZF): If last result was exactly zero will be set
- Carry Flag (CF): If last result generated a carry or required a borrow. Some instructions use this flag for specialty purposes
- such as to check if the high word of multiplication is nonzero or not
- Overflow Flag (OF): Checks whether the last result overflowed if interpreted as 2’s complement operation also has specialty purposes
- Parity Flag (PF): Set if the last result has even # of 1s else cleared
- Not all instructions set flags
- eg. mov, lea, not
- To generate flags from these instructions call CMP (compare) or TEST instructions to set flags
- cmp: Performs a subtraction second arg-first arg and sets flags discard result
- test: Performs AND between two operands sets flags (OF and CF are cleared, SF, ZF, PF are set according to result) discard result
- Instructions that set flags do not have to change all flags
- eg. rol, ror which only affect OF and CF
- eg. inc and dec which affect all but CF
Conditional Branches
- There are 8 basic branch conditions and their inverses in x86
- Branches are listed below along with the conditions under which the branch is taken
- The table should be used as follows. After a comparison such as
- cmp %ebx,%esi # set flags based on (ESI – EBX)
- Choose the operator to place between ESI and EBX, based on the data type.
- If ESI and EBX hold unsigned values, and the branch should be taken if ESI ≤ EBX, use either JBE or JNA.
- If ESI and EBX hold signed values, and the branch should be taken if ESI > EBX, use either JG or JNLE.
- For branches other than JE/JNE based on instructions other than CMP, you should check the stated branch conditions rather than trying to use the table.
Other Control Instructions
- x86 uses INT instead of LC-3’s TRAP
- x86 also replaces JSR and JSRR with CALL which can be both direct and indirect
- indirect operands are preceded by an asterisk
- Call pushes the return address onto stack before changing EIP
- RET pops the return address off the stack before changing EIP
Labels, Comments, Directives, Pseudo-ops
- Labels can begin wtih any letter a period or underscore and must be terminated by a colon
- gcc generated assembly code uses a period to start all its labels
- Use # for comments ; for putting more code on one line and /* */ like C for multi line comments
- .ORIG/.END are inserted byt the compiler don’t bother yourself with it
- .GLOBAL/.EXTERN used to declare symbols to be visible externally and to be defined externally respectively
- Implies that assembly cannot identitify unidefined

- .INCLUDE directive tells the assembler to read in the contents of another file and to insert it in place of the directive (C’s #include)
Input and Output
- x86 can use both memory-mapped and instruction based I/O
- Uses specific registers for the I/O instructions
- Data must be put in the EAX register
- Data from ports are also read into EAX
- Port number can be specified as either an 8 bit immediate or loaded into DX
Other Useful Instructions
Stack Operations
- Stack abstraction directly has native PUSH and POP instructions
- ESP contains the address of the element on top of the stack
- Stack grows downward in addresses (smaller address on top of stack)
- pushl %eax # M[ESP-4]<-EAX, ESP<-ESP-4
- Equivalently
- movl %eax, -4(%esp) # M[ESP-4]<-EAX
- subl $4, %esp # ESP<-ESP-4
- pushl %eax # M[ESP-4]<-EAX, ESP<-ESP-4
- Other than a pop into the eflags register push and pop do not set flags
Multiplication and Division
- Both signed and unsigned are available
- Unsigned multiply MUL requires EAX to be one of the operands (or AX or AL)
- Places results of multiplication of high bits in EDX and low bits in EAX (or DX:AX or AX)
- Only the CF and OF flags have meaning (both imul and mul) all other flags are undefined (not unaffected they may chagne)
- Both CF and OF are set if the high bits of the result are nonzero
- Signed multiply IMUL allows both 2 and 3 operand forms
- High bits are discarded in signed multiplication
- imull %ebx,%eax # EAX ← EAX * EBX
imull $1000,%ebx,%eax # EAX ← 1000 * EBX
- Division works similarly to the multiplication
- Dividend must be placedin EDX:EAX (or DX:AX or AX)
- EAX (or AX, or AL) holds the quotient
- EDX (or DX, or AH) holds the remainder
- Overflows in the destination register will cause an exception to be generated
- Flags are undefined after division (no meaning subject to change)
Data Type Conversions
- MOV can be used to convert small ints into big ints
- MOVS(orig size)(newsize): Sign extension
- MOVZ(orig size)(newsize): Zero extension
- eg. MOVZBL: zero extend a byte into a long
- There are variations of this for the EAX register
- CBTW converts signed byte AL to a word AX
- CWTL converts a signed word AX to a long word EAX
- CLTD converts a signed long word EAX to a double word (EDX:EAX)
- Useful for preparing for IDIV (?)
- CWTD (caution): AX->DX:AX
The Calling Convention
Caller-saved registers (AKA volatile registers) are used to hold temporary quantities that need not be preserved across calls.
For that reason, it is the caller’s responsibility to push these registers onto the stack if it wants to restore this value after a procedure call.
Callee-saved registers (AKA non-volatile registers) are used to hold long-lived values that should be preserved across calls.
When the caller makes a procedure call, it can expect that those registers will hold the same value after the callee returns, making it the responsibility of the callee to save them and restore them before returning to the caller.
Parameters, return values, and registers
- Parameters passed from a function are pushed right to left onto the stack this allows for any number of parameters without needing a var to store # var or sentinels
- For pointers and integers < 32 bits the return value is placed in the EAX register
- For 32 < bits < 64 EDX stores high bits EAX stores low bits
- Floating point values are returned on top of the floating point stack
- Most registers are caller saved (eg ECX EFLAGS) but the ESP and EBP must be returned unchanged
- EBX ESI and EDI are callee saved

Caller Side: When a_func is called
- Push function parameters to stack (20, 10) called formals
- Call is executed which pushes EIP (pointing to ADD after call)
- EIP is changed to the start of a_func
- Execute func
- Remove paremeters from stack by using ADD
- Store the result in value (returned in EAX)
Callee Side
- Set up stack frame by pushing old frame pointer (EBP) to stack
- Copy ESP into EBP (old EBP at bottom of stack)
- If any callee-saved registers are used for a_func values are pushed to the stack (EBX, ESI, EDI)
- Stack pointer is updated to make room for local vars
- Tear down stack using the LEAVE instruction this restores the old values of EBP and ESP
- ESP<-EBP+4, EBP<-M[EBP]
- IF callee-saved registers must be restored us LEA to point the ESP to the upper most saved register and then a sequence POPs (the last into EBP) restores the original stack state
Miscellaneous
- Byte addressable (1 address=1 byte)
- 1 byte=8 bits (therefore long is 4 addresses)
- x86 ISA does not require alignment (16 bit values only even addresses 32 bit values multiple of 4 addresses)
- Arrays of data should be properly aligned
- Use the .ALIGN directive to do this