AArch64 is the 64-bit execution state introduced with the ARMv8-A architecture. It runs the A64 instruction set and provides 31 general-purpose 64-bit registers, a 64-bit address space, a cleaner exception-level model and improved performance, while AArch32 retains backward compatibility with 32-bit ARM and Thumb code.

How many registers does AArch64 have?

AArch64 provides 31 general-purpose registers named X0 to X30, each 64 bits wide, with 32-bit views named W0 to W30 that access the lower half. A special encoding for register 31 is interpreted either as the zero register (XZR/WZR) or the stack pointer (SP) depending on the instruction, and the program counter is no longer a general-purpose register.

What are exception levels in AArch64?

AArch64 replaces the older processor modes with four exception levels: EL0 for unprivileged application code, EL1 for the operating system kernel, EL2 for a hypervisor, and EL3 for the secure monitor. Higher exception levels are more privileged, and the model maps cleanly onto user space, OS, virtualization and TrustZone secure firmware.

What is the difference between AArch32 and AArch64?

AArch32 is the 32-bit state with 16 visible registers, banked registers, processor modes and the conditional-execution-rich ARM/Thumb instruction sets. AArch64 is the 64-bit state with 31 general-purpose registers, fixed 32-bit A64 instructions, exception levels instead of modes, a non-general-purpose PC, and PSTATE instead of the CPSR. AArch64 generally offers better performance and a cleaner design.

What is the AAPCS64 calling convention?

AAPCS64 is the procedure call standard for AArch64. The first eight integer or pointer arguments are passed in X0 to X7, the result is returned in X0, X9 to X15 are caller-saved temporaries, X19 to X28 are callee-saved, X29 is the frame pointer, and X30 is the link register holding the return address.

DAY 26 · ADVANCED (64-BIT & BEYOND)

AArch64 — The 64-bit Register Model

By EcrioniX · Updated Jun 6, 2026

Welcome to Phase 4. Everything you learned in Days 1–25 was mostly the 32-bit world. Now we step into AArch64 — the 64-bit architecture that powers modern phones, Apple Silicon Macs, AWS Graviton servers and the fastest ARM chips on Earth. The good news: it's actually cleaner and simpler than 32-bit ARM. Let's rebuild the programmer's model for 64 bits.

1. Two states: AArch64 and AArch32

ARMv8-A introduced a fundamental split. A core can run in one of two execution states:

AArch64 — the new 64-bit state, running the A64 instruction set. 64-bit registers, 64-bit addresses, 31 general-purpose registers.
AArch32 — backward-compatible 32-bit state, running the classic A32 (ARM) and T32 (Thumb) instruction sets you met in earlier lessons.

This is why a single modern chip can still run old 32-bit apps while delivering full 64-bit performance to new ones. Crucially, the state can only change on an exception boundary — you don't flip between 64-bit and 32-bit mid-function the way you interworked ARM and Thumb (Day 16). From here on we focus entirely on AArch64, the state all new software targets.

2. The register file: X0–X30

Here's the headline change. AArch64 gives you 31 general-purpose registers, each 64 bits wide, named X0 through X30. That's nearly double the 16 you had in AArch32 — and more registers means fewer trips to memory, which is a direct performance win.

Every X register has a 32-bit view called W0–W30 that accesses its lower 32 bits. Writing to a W register zeroes the upper 32 bits of the corresponding X register — a deliberate rule that avoids partial-register stalls.

MOV x0, #1 // full 64-bit register ADD w1, w2, w3 // 32-bit add; upper 32 bits of x1 become 0 ADD x4, x5, x6 // 64-bit add

Name	Width	Meaning
X0–X30	64-bit	general-purpose registers
W0–W30	32-bit	lower-half views of X0–X30
XZR / WZR	64/32	the zero register (reads 0, writes discarded)
SP	64-bit	the stack pointer (must stay 16-byte aligned)
PC	64-bit	program counter — not a general-purpose register

3. The clever bit: register 31

You may have noticed the registers stop at 30, not 31. Encoding slot 31 is special — depending on the instruction it means either the zero register (XZR/WZR) or the stack pointer (SP).

The zero register always reads as 0 and silently discards writes. It's wonderfully handy: comparing against zero, clearing a value, or discarding a result becomes free — no register wasted holding a constant 0.
The stack pointer is no longer a normal GPR you can accidentally clobber; it has its own role and a strict 16-byte alignment requirement.

MOV x0, xzr // set x0 = 0 (no immediate needed) CMP x1, xzr // compare x1 with zero STR xzr, [x2] // store a 64-bit zero to memory

4. The PC is no longer a register

In AArch32, the program counter was r15 — a general-purpose register you could read and even write to jump. Powerful, but a security and predictability nightmare. AArch64 removes the PC from the general register file. You can no longer do arithmetic on it or load into it directly; control flow happens only through proper branch instructions. This single change kills a whole class of exploits and makes the pipeline easier to build.

The link register is now X30: a BL (branch with link) saves the return address there, and you return with RET (which defaults to X30). Compare this to the AArch32 BX LR from Day 15.

5. From processor modes to exception levels

The old AArch32 modes (User, IRQ, FIQ, Supervisor…) from Day 5 are gone. AArch64 replaces them with a clean ladder of four Exception Levels:

Level	Runs	Privilege
EL0	applications (user space)	least
EL1	OS kernel (Linux, Android)	↑
EL2	hypervisor (virtualization)	↑↑
EL3	secure monitor (TrustZone, Day 23)	most

Each level (except EL0) has its own banked SP and its own system registers (with the _EL1, _EL2 suffixes you saw in Day 24). Exceptions move you up a level; the ERET instruction returns you down. This maps perfectly onto modern software: app → kernel → hypervisor → secure firmware.

6. PSTATE replaces the CPSR

The single CPSR register from Day 3 is replaced by PSTATE — processor state held as a set of independently accessible fields rather than one packed word. The familiar condition flags live here:

N, Z, C, V — Negative, Zero, Carry, oVerflow (same meanings as before).
Interrupt masks (D, A, I, F), the current exception level, and the stack-pointer selector (SPSel).

On taking an exception, PSTATE is saved into SPSR_ELx and the return address into ELR_ELx — the 64-bit equivalent of the banked SPSR/LR mechanism from Day 17.

7. The A64 instruction set — cleaner by design

A64 keeps the RISC spirit but tidies up the quirks:

Fixed 32-bit instructions — every A64 instruction is exactly 4 bytes (no Thumb mixing within A64).
No blanket conditional execution. The AArch32 trick of predicating every instruction is gone. Instead you get conditional branches plus efficient conditional select instructions like CSEL, CSET and CSINC that avoid branches without bloating the encoding.
LDP / STP replace LDM/STM — load or store a pair of registers in one instruction, the workhorse of function prologues/epilogues.
Larger immediates and PC-relative addressing (ADRP) for the 64-bit address space.

my_func: STP x29, x30, [sp, #-16]! // prologue: push frame ptr + link reg MOV x29, sp CMP x0, xzr CSEL x0, x1, x2, GT // x0 = (x0>0) ? x1 : x2 — branchless LDP x29, x30, [sp], #16 // epilogue: pop RET

8. The AAPCS64 calling convention

The 64-bit procedure call standard (cf. Day 15's AAPCS) takes full advantage of the larger register file:

Registers	Role	Preserved by
X0–X7	arguments 1–8; X0 = return value	caller-saved
X8	indirect result location	caller-saved
X9–X15	scratch / temporaries	caller-saved
X16–X17	intra-procedure-call (IP0/IP1)	caller-saved
X18	platform register (reserved)	platform
X19–X28	local variables	callee-saved
X29	frame pointer (FP)	callee-saved
X30	link register (LR)	special

With eight argument registers instead of four, most function calls pass everything in registers and never touch the stack — a real speed advantage over AArch32.

9. A glimpse beyond: SVE

AArch64 also opened the door to the Scalable Vector Extension (SVE/SVE2) — a vector instruction set whose register width is not fixed in the program (it can be 128 to 2048 bits depending on the chip), so the same binary scales across implementations. It's huge for HPC and machine learning. We'll meet ARM's mainstream vector engine, NEON, next lesson; SVE is its supercomputer-class cousin.

💡 Why 64-bit is more than "bigger numbers"

People assume 64-bit just means handling larger integers and more than 4 GB of RAM. True — but the bigger wins here are architectural: nearly 2× the registers, a cleaner instruction set, branchless conditional selects, 8 argument registers, and a simpler privilege model. That's why AArch64 code is often faster and easier for compilers to optimise.

✅ The mental model

AArch64 is the 64-bit state of ARMv8-A: 31 registers X0–X30 (with W views), a magic register-31 that's either XZR or SP, a PC that's no longer a GPR, four exception levels EL0–EL3 instead of modes, PSTATE instead of CPSR, the fixed-width A64 instruction set with LDP/STP and CSEL, and the AAPCS64 convention passing 8 args in registers. Cleaner, wider, faster.

🎯 Day 26 takeaways

ARMv8-A has two states: AArch64 (64-bit, A64) and AArch32 (compatibility).
31 GPRs X0–X30 + 32-bit W views; writing a W zeroes the upper half.
Register 31 = XZR/WZR (zero) or SP depending on instruction; PC is not a GPR.
Modes → four exception levels EL0–EL3; CPSR → PSTATE (N,Z,C,V + masks).
A64: fixed 32-bit, LDP/STP, CSEL (branchless), no blanket conditional execution.
AAPCS64: args in X0–X7, return X0, X19–X28 callee-saved, X29 FP, X30 LR.

Quick check

How many general-purpose registers does AArch64 have, and what are the two views called?
What two things can encoding "register 31" mean?
Why is removing the PC from the register file a good thing?
Which registers carry the first eight function arguments?

FAQ

What is AArch64?

The 64-bit execution state of ARMv8-A, running the A64 instruction set with 31 general-purpose registers and exception levels EL0–EL3.

X vs W registers?

X0–X30 are the full 64-bit registers; W0–W30 are their lower 32-bit views, and writing a W zeroes the upper 32 bits.

What replaced the CPSR and processor modes?

PSTATE holds the condition flags and masks; four exception levels (EL0–EL3) replace the old modes.

AArch32 vs AArch64?

AArch32 is 32-bit compatibility (16 regs, modes, conditional execution); AArch64 is 64-bit with 31 regs, exception levels, fixed A64 instructions and better performance.

← Back to the full course roadmap · RISC-V vs ARM →