What is the main difference between ASIC and FPGA?

An ASIC (Application-Specific Integrated Circuit) is a custom chip permanently fabricated for a single function — it cannot be reprogrammed after manufacture. An FPGA (Field-Programmable Gate Array) is a reconfigurable device whose logic is defined by a bitstream loaded at power-on and can be changed an unlimited number of times. The core trade-off: ASICs have high upfront NRE (non-recurring engineering) costs of $1M–$50M+ for mask sets but extremely low per-unit cost at volume. FPGAs have near-zero NRE but 10–50× higher per-unit cost, 5–10× higher power, and 5–10× lower performance than an equivalent ASIC.

What does NRE cost mean in chip design?

NRE stands for Non-Recurring Engineering cost — the one-time expense of designing and manufacturing an ASIC. It includes mask set costs (typically $1M–$15M for advanced nodes like 7nm/5nm), EDA tool licenses (Synopsys, Cadence, Siemens), IP licensing fees (ARM cores, PHYs, memory compilers), verification engineers, tapeout services, and initial silicon bring-up. NRE costs are paid once regardless of how many units are produced. At high volumes (millions of units), the NRE is amortized across units and the ASIC's lower per-unit cost results in total cost savings that far exceed the upfront investment.

When should I choose FPGA over ASIC?

Choose FPGA when: (1) volume is below 10,000–100,000 units (NRE never pays off), (2) the design needs field updates or reconfiguration after deployment, (3) you need rapid prototyping before ASIC commitment, (4) time-to-market is more critical than per-unit cost or power, (5) the application can tolerate 5–10× higher power than ASIC. FPGAs are ideal for prototyping, low-volume industrial/military applications, research platforms, and applications with evolving standards (network protocol processors, software-defined radio).

What is the FPGA-to-ASIC flow in industry?

The standard industry flow is: (1) develop and verify RTL on FPGA (often multiple FPGA prototyping boards for large SoCs), (2) achieve system-level validation on FPGA including software/firmware bring-up, (3) run ASIC synthesis, STA, DFT insertion, place-and-route, and signoff, (4) tape out to foundry. The FPGA prototype stage reduces ASIC risk enormously because most functional bugs are found before committing to expensive mask sets. Companies like Apple, Qualcomm, and NVIDIA all use large FPGA prototyping farms before every major chip tapeout.

How much faster is an ASIC than an equivalent FPGA?

A typical ASIC implementation runs 5–10× faster than the equivalent FPGA implementation of the same logic. This gap comes from: (1) ASIC uses optimized standard cells with sizes tuned per path, while FPGA uses fixed LUT delays; (2) ASIC routing is optimized by the P&R tool, while FPGA routing has fixed wire segments and switch matrices; (3) ASIC can use custom flip-flops, memory macros, and analog PHYs unavailable on FPGA. A design that runs at 100 MHz on FPGA typically achieves 500–1000 MHz in ASIC at an equivalent node. Power efficiency is similarly 5–10× better in ASIC.

What is the typical ASIC design flow?

ASIC design flow: (1) Specification and architecture definition, (2) RTL design in Verilog/SystemVerilog, (3) Functional verification (simulation, UVM testbenches, formal verification), (4) Logic synthesis (Synopsys Design Compiler or Cadence Genus — RTL → gate-level netlist), (5) STA (PrimeTime or Tempus — timing signoff), (6) DFT (scan insertion, ATPG), (7) Physical design — floorplanning, placement, CTS, routing (Innovus, ICC2), (8) Physical verification — LVS, DRC, antenna checks, (9) Tapeout — GDSII sent to foundry. Total cycle from tapeout to first silicon: 8–16 weeks depending on foundry and node.

Hardware Fundamentals

ASIC vs FPGA —
The Complete Engineer's Guide

By EcrioniX · Updated Jun 6, 2026

EcrioniX · General· ~20 min read· NRE Cost · PPA · Design Flow · Decision Framework

FPGA or ASIC? It depends on volume, time-to-market, power budget, and reconfigurability requirements. This guide covers every dimension engineers, architects, and product managers weigh before committing to silicon.

Side-by-Side Comparison

Dimension	ASIC	FPGA
NRE Cost	$1M – $50M+ (mask set, tools, IP)	~$0 (device cost only)
Per-Unit Cost	Cents – a few dollars at high volume	$10 – $1,000+ (device cost)
Clock Frequency	500 MHz – 5+ GHz	50 MHz – 700 MHz (DSP-heavy)
Power Efficiency	5–10× better than FPGA	Higher due to SRAM routing fabric
Area Efficiency	10–30× smaller vs FPGA logic	LUT overhead, fixed logic blocks
Reconfigurability	None — fixed after fabrication	Unlimited — reload bitstream
Time to First Silicon	12–36 months (full tapeout cycle)	Days to weeks (bitstream)
Analog/Mixed-Signal	Full support (PLLs, ADC, DAC on-chip)	Limited (built-in PLLs, SERDES only)
IP Ecosystem	Foundry-specific hard IPs	Rich soft IP library from vendors
Volume Break-even	~50,000 – 500,000 units (node & NRE dependent)
Risk on Bug	Respin = $1M+	Reprogram in hours
Design Flow	Synthesis → P&R → signoff → mask → fab	Synthesis → map → P&R → bitstream
Best Use Cases	High-volume consumer, networking, AI	Prototyping, low-vol, field-updateable

NRE Cost Deep-Dive

NRE is the killer variable that drives every ASIC vs FPGA decision. Here's where the money goes at a 7nm tapeout:

Mask Set (7nm)

$10M – $15M

Full mask layers for logic, metal, contacts. Wafer fabrication bill of materials.

EDA Tool Licenses

$2M – $5M

Synopsys DC + PT + ICC2 or Cadence suite. Annual contracts, typically team-wide.

Hard IP Licensing

$0.5M – $5M

ARM Cortex cores, PCIe/SERDES PHYs, memory compilers, USB, MIPI.

Verification Engineers

$1M – $3M

UVM testbench development, emulation, formal — often 3× RTL effort.

Physical Design Team

$0.5M – $2M

Floorplan, CTS, P&R, signoff closure (STA, LVS, DRC, power).

DFT & Characterization

$0.5M – $1.5M

Scan insertion, ATPG patterns, MBIST, package characterization, bringup.

At 28nm, mask sets drop to $3M–$6M — which is why many mid-volume chips (10K–100K units) tape out at mature nodes for cost reasons rather than chasing PPA at advanced nodes.

Performance: Why ASICs Are Faster

The FPGA–ASIC frequency gap is not primarily about process node — it is structural:

Cell Sizing Optimization

ASIC P&R tools size each standard cell individually to meet timing on its specific path. A data-path gate on the critical path gets a large, fast (but power-hungry) variant; non-critical gates get minimum-size cells. FPGAs use fixed LUT configurations — every LUT has the same delay regardless of load.

Routing Freedom

ASIC routers place wires anywhere on any metal layer, with custom spacing and width tuning. FPGAs have a fixed routing fabric — multiplexer-based switch matrices that add delay at every programmable junction. A 4-hop FPGA route easily adds 500ps–1ns that would be 50–100ps in ASIC metal.

Custom Memory & PHYs

ASIC integrates optimized SRAM macros, high-speed SERDES (56G, 112G), and PLLs designed specifically for the target frequency. FPGA BRAMs, SERDES, and clock resources are generic and shared across all possible user designs.

Technology Node

Leading ASIC products use TSMC 3nm/5nm, while even the latest Xilinx UltraScale+ / Intel Stratix devices are at 14nm–16nm. A 7nm ASIC runs on a more advanced process than a 16nm FPGA — compounding the speed and power advantage.

Power Efficiency: The Physics Gap

FPGAs consume far more dynamic power than ASICs for three structural reasons:

Power Source	ASIC	FPGA
Routing fabric	Direct metal — minimal	SRAM-gated muxes switch every cycle
Logic overhead	One gate per function	6-input LUT for even 1-input function
Configuration SRAM	None	Millions of SRAM bits leaking constantly
Process node	3nm–7nm typical for new designs	14nm–16nm for latest high-end FPGAs
Result	~0.01–0.1 pJ/op	~0.5–5 pJ/op for equivalent logic

Design Flow Comparison

ASIC Design Flow

RTL Design & Functional Verification

Verilog/SystemVerilog RTL, UVM testbenches, formal verification, coverage closure. Typically 60% of total project time.

Logic Synthesis

Synopsys Design Compiler or Cadence Genus maps RTL to a gate-level netlist using the foundry's standard cell library. SDC constraints guide timing.

Static Timing Analysis (STA)

PrimeTime or Tempus verifies setup/hold margins at all PVT corners. Timing closure is iterative — synthesis → STA → ECO → repeat.

Physical Design (P&R)

Cadence Innovus or Synopsys ICC2: floorplanning → power planning → placement → CTS → routing → filler/decap insertion.

Physical Verification & Signoff

DRC (Design Rule Check), LVS (Layout vs Schematic), antenna checks, IR drop (Voltus/RedHawk), thermal. All must pass before GDSII submission.

Tapeout → Fabrication → Bringup

GDSII sent to foundry (TSMC, Samsung, GlobalFoundries). 8–16 weeks to first wafers. Bringup: power sequencing, scan test, functional test, yield monitoring.

FPGA Design Flow

RTL Design

Same RTL as ASIC (good RTL is portable) — but inference patterns matter: use BRAM inference templates, DSP multiply patterns, register-balanced pipelines.

Synthesis & Technology Mapping

Vivado (Xilinx) or Quartus Prime (Intel): maps RTL to LUTs, DSPs, BRAMs, and CARRY chains on the target device.

Place & Route

Tool places LUTs/FFs on the FPGA fabric and routes connections through the switch matrix. Timing-driven P&R tries to meet timing constraints.

Bitstream Generation & Download

Tool generates a binary bitstream (Xilinx: .bit / .bin). Loaded via JTAG or from SPI flash. FPGA configures in milliseconds at power-on.

When to Choose Each

Choose ASIC When…

Volume exceeds 500K–1M units (NRE is amortized)
Power budget is critical (battery devices, data center $/W)
Performance needs >1 GHz or custom analog blocks
Competitive differentiation requires a proprietary chip
Design is stable and unlikely to need field updates
Regulatory requirements mandate custom silicon (automotive ASIL-D)
Long product lifetime justifies upfront investment

Choose FPGA When…

Volume is below 50K–100K units
Rapid prototyping before ASIC commitment
Field updates required post-deployment
Evolving standards (network protocol processors)
Short project timelines, tight schedules
Research / academic / low-volume industrial
ASIC as the target but need SW/FW bringup early

Real-World Examples

Product	Choice	Why
Apple M-series	ASIC (TSMC 3nm)	Hundreds of millions of devices; extreme PPA requirements
Nvidia H100 GPU	ASIC (TSMC 4nm)	Data center scale; power efficiency critical at 700W TDP
Network white-box switch	FPGA (Xilinx UltraScale+)	Protocol updates (P4 programmable forwarding), low-to-mid volume
Radar signal processor	FPGA (Intel Stratix)	Classified waveform updates, military volume ~1K units
Google TPU v4	ASIC (custom)	AI inference at scale; 10× efficiency advantage over GPU at workload
FPGA prototyping farm	FPGA (Synopsys HAPS, Cadence Palladium)	Pre-silicon validation of ASIC before $15M mask commit
Automotive ADAS SoC	ASIC (28nm / 16nm)	ASIL-D safety, 10M+ vehicle volume, fixed real-time algorithm

The FPGA-First, ASIC-Later Strategy

For products with a genuine path to high volume, the industry standard is a two-phase approach:

Phase 1

FPGA Prototype

Deploy the first product revision on FPGA. Ship to early customers. Bring up software stack, firmware, and drivers. Find real-world bugs without ASIC risk. A $200K FPGA prototyping platform catches 80%+ of functional bugs before tapeout.

Phase 2

ASIC Tapeout

Once the design is functionally validated on FPGA, begin ASIC flow in parallel. The verified RTL transfers directly to ASIC synthesis. Only net-new risk is physical design and process-specific timing. Silicon bringup is de-risked because software was validated on FPGA.

Phase 3

ASIC Production

First ASIC silicon replaces FPGA boards in production. FPGA hardware may remain deployed in low-volume markets or field-upgrade-sensitive segments while ASIC serves high-volume production.

Frequently Asked Questions

What is the break-even volume for ASIC vs FPGA?+

Break-even depends on NRE and per-unit cost delta. At 28nm (NRE ~$5M) with FPGA device at $100 and ASIC at $5 per unit, break-even is ~53,000 units ($5M / $95 saving). At 7nm (NRE ~$15M) with a $500 FPGA vs $10 ASIC, break-even is ~30,000 units. Rule of thumb: below 50K units, FPGA almost always wins on total cost; above 500K units, ASIC almost always wins. The 50K–500K zone is a judgment call on node, product lifetime, and power requirements.

Can FPGA RTL be reused directly for ASIC?+

Yes, with important caveats. Generic RTL (pure synchronous logic, parameterized modules, standard coding style) is directly portable. FPGA-specific constructs that do NOT port to ASIC: BRAM instantiation (replace with SRAM macros), DSP48 blocks (replace with operator-inferred multipliers or hard multiplier macros), SERDES primitives, PLL instantiation, tri-state IOBUF primitives, and Vivado/Quartus IP cores. Well-structured RTL with an FPGA/ASIC abstraction layer at the physical interface layer can achieve 90%+ RTL reuse.

What is a structured ASIC?+

A structured ASIC is a middle ground — it uses a pre-fabricated base layer (like an FPGA without the programming fabric) and only customizes the upper metal layers. NRE drops to $100K–$500K vs $5M+ for full-custom. Performance is between FPGA and full ASIC. Examples: eASIC (now Intel), Faraday eFPGA-based structured ASICs. Used for 100K–1M volume products where full ASIC NRE is hard to justify but FPGA power/performance is insufficient.

Does FPGA design require different verification than ASIC?+

FPGA functional verification is identical — simulate the RTL in ModelSim, Xcelium, or VCS; write directed and constrained-random tests. What differs: FPGA in-circuit emulation (Vivado ILA, SignalTap) replaces post-silicon JTAG debug; timing closure is vendor-tool-specific (Vivado Timing Report vs PrimeTime); FPGA DFT is not needed (no scan, no ATPG — JTAG boundary scan built in). ASIC adds DFT (scan), formal signoff, emulation (Palladium, Veloce), and STA/IR/electromigration signoff steps that FPGAs skip entirely.

What FPGA vendors and families should I know?+

AMD/Xilinx: Artix-7 (low-cost), Kintex/Virtex UltraScale+ (high-performance, 16nm), Versal (ACAP — FPGA + AI engine + hard NoC). Intel: Cyclone (low-cost), Arria (mid-range), Stratix 10 (high-performance, 14nm Intel FinFET). Microchip: PolarFire (25G SERDES, low power). Lattice: ECP5, Nexus (small, power-sensitive edge applications). For prototyping large ASICs, Xilinx VU19P (9B ASIC gate capacity) and Intel Agilex are standard. Aldec Riviera, Synopsys HAPS, and Cadence Protium are multi-FPGA prototyping systems used by large chip companies.

← Previous

What Is an FPGA?

Semiconductor Industry 101

ASIC vs FPGA —The Complete Engineer's Guide