Switch Hardware Architecture
Switch Hardware Architecture
BRKRST-3069
www.ciscolivevirtual.com
Agenda
Overview Concept System Design Mechanical / Physical Design Buffer Design
Hardware Engineering
Software Engineering
Forwarding Design
ASIC Engineering
BRKRST-3069 2012 Cisco and/or its affiliates. All rights reserved. Cisco Public
Overview
Timeline
Product Requirements Document
ASIC
Requirements
Plan
Micro Architecture
Implementation
Final Netlist
Power On
Hardware
HW Design Detailed Design Mechanical Drawing Fab Out P0 P1 P2 A-0
Mechanical
Electrical Manufacturing
MDVT
PCB Layout BOM
RDT
EDVT
Software
SW Functional Spec SW Design Spec Unit Test Plan Unit Integration Plan
Software Test
Master Test Plans
BRKRST-3069
Automation
Cisco Public
Regression
FCS
4
BRKRST-3069
Cisco Public
Concept
Concept
What customer problem will the product solve? Vision Technology Market
Cost
Life Cycle
How Big?
Time to Market
Differentiation Innovation
BRKRST-3069
Cisco Public
Consolidate IP, Storage, and IPC networks onto a single Ethernet fabric and deliver innovative features and services that provide value to our customers.
BRKRST-3069
Cisco Public
DC Evolutionary Innovation
2013 Phase 3 2011 Phase 2 Terabit Slot 2009 Phase 1 Terabit Slot
10G Access 40 / 100G Aggregation Unified Fabric 10GbE Access 10GbE Aggregation Unified Fabric 10G Aggregation
BRKRST-3069 2012 Cisco and/or its affiliates. All rights reserved. Cisco Public
F2 Series
High Level Goals
48 Ports 1/10G Line Rate 64 Bytes
Low Latency
L2MP, TRILL, FEX, FCoE, L3 Forwarding Optimise for Data Centre IPv4 & IPv6 Equal Performance Cost target
BRKRST-3069 2012 Cisco and/or its affiliates. All rights reserved. Cisco Public
10
System Design
BRKRST-3069
Cisco Public
12
Data Plane
Buffering No packet drop
Control plane
Modular Restartable (including activeactive state handling) Non-disruptive code load & activation
Throughput
Port count
Modular
No single point of failure In-order delivery Future protocol compatibility
BRKRST-3069 2012 Cisco and/or its affiliates. All rights reserved.
13
Mechanical Design
48 x 1G BaseT N7K-M148GT-11 Nexus 7010 Rear
Fabric N7K-C7010-FAB-1
BRKRST-3069 2012 Cisco and/or its affiliates. All rights reserved. Cisco Public
15
Ejectors
BRKRST-3069 2012 Cisco and/or its affiliates. All rights reserved. Cisco Public
16
BRKRST-3069
Cisco Public
17
Buffer Design
Cisco Public
Switch B
10 Links @ 1Gbps Each Bandwidth = 10Gbps Flow Bandwidth = 1Gbps Serialisation Delay = 20uS
BRKRST-3069 2012 Cisco and/or its affiliates. All rights reserved.
1 Link @ 10Gbps Each Bandwidth = 10Gbps Flow Bandwidth = 10Gbps Serialisation Delay = 2uS
19
Switch B
Switch A
Switch A
Single ASIC
Scalability limited by memory bandwidth/size Typically optimised for fixed configuration Cost effective with small port counts
Output 3 Output 1
20
Input 3
BRKRST-3069
Cisco Public
Input 1
Output 2
Input 2
Switch Architecture
Mesh
Crossbar
BRKRST-3069
Cisco Public
21
Arbiter 3 4 5
Superframes
Fabric
Superframes
Ingress
Egress
SP SP SP
WRED
WRED
WRED
WRED
WRED
WRED
WRED
DWRR DWRR
WRED WRED
DWRR
8 1
BRKRST-3069
Cisco Public
22
Forwarding Design
BRKRST-3069
Cisco Public
24
How fast?
GT/s PCI express v1 PCI express v2 2.5 5 Serdes (Gbps) 2.525 5G Encoding
PCI express v3
10G Ethernet
7.99
10.3125
128b/130b
64b/66b
BRKRST-3069
Cisco Public
25
BRKRST-3069
Cisco Public
26
Hash Table
Trie
Result
BRKRST-3069 2012 Cisco and/or its affiliates. All rights reserved. Cisco Public
27
CAMs
Content Addressable Memory
01001110
1 2 3 4
Hit!
Result
28
Hash Tables
Input MAC Address 0000.c000.0001
Pages
Page Size
BRKRST-3069
29
Tries
Many different *tries
Bitwise Trie
Balanced Trie Patricia Trie Fixed or Variable Stride Tries
30
L3 Table: Design 1
Rewrite Information IPv4 Unicast FIB VRF / Prefix / Mask / Paths / Offset 1 / 10.1.2.0 / 24 / 4 / 1 1 / 10.1.3.0 / 24 / 1 / 5 3 / 10.1.2.0 / 24 / 2 / 9 3 / 10.1.3.0 / 24 / 2 / 9 ADJ 1 - Rewrite SRC A+DST A MAC ADJ 2 - Rewrite SRC A+DST B MAC
H A S H
ADJ 3 - Rewrite SRC A+DST C MAC ADJ 4 - Rewrite SRC A+DST D MAC ADJ 5 - Rewrite SRC A+DST D MAC ADJ 6 - Rewrite SRC A+DST F MAC ADJ 7 - Rewrite SRC A+DST G MAC ADJ 8 - Rewrite SRC A+DST H MAC ADJ 9 - Rewrite SRC A+DST I MAC
Software View
BRKRST-3069
Cisco Public
31
L3 Table: Design 2
Path Table IPv4/v6 Unicast FIB VPN / Prefix / Mask / Paths / Offset 1 / 10.1.2.0 / 24 / 4 / 1 1 / 10.1.3.0 / 24 / 1 / 5 3 / 10.1.2.0 / 24 / 2 / 6 3 / 10.1.3.0 / 24 / 2 / 6 Path 1 Rewrite Information ADJ 1 - Rewrite SRC A+DST A MAC ADJ 2 - Rewrite SRC A+DST B MAC ADJ 3 - Rewrite SRC A+DST C MAC ADJ 4 - Rewrite SRC A+DST D MAC ADJ 5 - Rewrite SRC A+DST E MAC ADJ 6 - Rewrite SRC A+DST F MAC ADJ 7 - Rewrite SRC A+DST G MAC ADJ 8 - Rewrite SRC A+DST H MAC ADJ 9 - Rewrite SRC A+DST I MAC
H A S H
Path 1
Path 2
Software View
BRKRST-3069
Cisco Public
32
BRKRST-3069
Cisco Public
33
Forwarding Design
Design 1
L2 Table Parse Packet L3 Table Ingress Security ACLs Adjacency Table Egress Security ACLs Egress QoS ACL Input / Output Policing Fwd Decision Update Statistics
Design 2
L3 Table (x2)
Parse Packet L2 Table VPN CAM Ingress Security ACLs Ingress QoS ACL Adjacency Table Egress Security ACLs Egress QoS ACL Input / Output Policing Fwd Decision Update Statistics
BRKRST-3069
Cisco Public
34
Forwarding Design
Design 2
L3 Table (x2) Parse Packet L2 Table
VPN Table
Adjacency Table
Fwd Decision
Update Statistics
Design 3
Parse Packet
Adjacency Table
Fwd Decision
Update Statistics
BRKRST-3069
35
References
Network Algorithmics,: An Interdisciplinary Approach to Designing Fast Networked Devices George Varghese Art of Computer Programming Vol 1-4, Donald E. Knuth Introduction to Algorithms, Third Edition Thomas H. Cormen, Charles E. Leiserson, Ronald L. Rivest and Clifford Stein
36
ASIC Engineering
ASICs vs FPGAs
ASIC - Application Specific Integrated Circuit A finished IC which is built to the exact specification & functionality of the customer Can make optimal use of the underlying silicon circuits Low part cost, High upfront investment Significant development time FPGA (EPLD) Field Programmable Gate Array An IC that can be configured with the required functionality after it is installed into a target system Flexibility vs. sub-optimal use of underlying silicon circuits Higher part cost Shorter development time
BRKRST-3069
Cisco Public
38
CMOS
VDD in VSS out in
out
p+ n-well
p+
n+
n+
39
Gates
module nand2(a,b,c) input a,b; ouput c; begin c <= !(a & b); end
BRKRST-3069
Cisco Public
40
Silicon Wafer With same number of defects per wafer, smaller Die size results in higher yield per wafer
BRKRST-3069 2012 Cisco and/or its affiliates. All rights reserved. Cisco Public
41
Synthesis
Turn RTL into Gates and Logical Connections
Netlist
Gates and Logical Interconnections
Floor Plan
Overall Block and Function Placement
Placement
Specific Gate Placement
Route
Layout physical interconnection
GDSII
One file per layer (photomask)
Foundry Production
Metal layers on Wafer
Device Test
Test Dies on Wafer
Packaging
Cut wafer into dies Dies into IC packages
BRKRST-3069
Cisco Public
42
Synthesis
Turn RTL into Gates and Logical Connections
Netlist
Gates and Logical Interconnections
ASIC Customer
- Cisco
Floor Plan
Overall Block and Function Placement
Placement
Specific Gate Placement
ASIC Vendor
Route
Layout physical interconnection
GDSII
One file per layer (photomask)
Foundry Production
Metal layers on Wafer
Silicon Foundry
Device Test
Test Dies on Wafer
Packaging
Cut wafer into dies Dies into IC packages
BRKRST-3069
Cisco Public
43
Micro Architecture
Design Review
Implementation Final Netlist Handoff
Floorplan Netlist
12-26 Weeks ~52 Weeks ~12 Weeks Power On Release to Production
~12 Weeks
BRKRST-3069 2012 Cisco and/or its affiliates. All rights reserved. Cisco Public
44
F2 ASIC - Clipper
Technology Die Size IBM Cu-65 18.0x18.3mm
Total SRAM
Total eDRAM Total TCAM
33.3Mb
134Mb 2.94Mb
Register Array
Logic Gates Signal Pin
1.34Mb
45M 186
Package IO
840
BRKRST-3069
Cisco Public
45
46
ASIC Packaging
Electrical parasitics of the chip package are critical
Silicon Die
BRKRST-3069
Cisco Public
47
Hardware Engineering
F2 Block Diagram
Central Arbiter To Spine Cards
Lightning
Sacramento
Clipper
Clipper
Clipper
Clipper
Clipper
Clipper
Clipper
Clipper
Clipper
Clipper
Clipper
Clipper
BRKRST-3069
Cisco Public
49
Thermal Modelling
Component Case Temperatures Temperature Contours
BRKRST-3069
Cisco Public
50
20 Layers
BRKRST-3069
Cisco Public
51
EDVT
(Electronic Design Validation Test)
All tests performed using offline diagnostics and again with NXOS
On-board power supplies have voltages margined to +5% & -5% Temperature testing occurs while Soaking for 12 hours at 55o C and -5o C Ramping between extremes at 1o C per minute
52
All new products including systems and boards are subject to RDT.
BRKRST-3069 2012 Cisco and/or its affiliates. All rights reserved. Cisco Public
53
Power Consumption
Skew Parts
Data Sheet
Typical 340W
Maximum 400W
BRKRST-3069
Cisco Public
54
Diagnostics run during system Boot-Up, after OIR, On-Demand using the CLI, or as Health Checks in the background.
Problem Areas:
Hardware Components (ASICs)
55
Software Engineering
NXOS Architecture
Layer-2 Protocols
VLAN mgr STP UDLD CDP
Layer-3 Protocols
OSPF BGP GLBP HSRP
Storage Protocols
Other Services
IGMP snp
LACP
802.1X
CTS
EIGRP
PIM
VRRP
SNMP
Interface Management
Kernel
BRKRST-3069 2012 Cisco and/or its affiliates. All rights reserved. Cisco Public
57
Multi-threaded
Scalability with SMP and multi-core CPUs Faster Route Re-convergence Lower mean-time-to-recovery
Real-Time
Real-Time preemptive scheduling System operational when CPU is 100%
Modularity
Most of the features are conditional Can be enabled/disabled independently Maximises efficiency Minimises resources utilisation
58
Software Engineering
SW Functional Spec SW Design Spec Unit Test Plan
} mfib_hw_oif_t;
OIF
OIF
OIF Info
Pltfm Data: adj_ptr[]
OIF Info
Pltfm Data: adj_ptr[]
OIF Info
Pltfm Data: adj_ptr[]
OIF
OIF
ADJ RAM
MD Adj
RIT RAM
ccc=7 OIF Adj ptr 1 OIF Adj ptr 2
MDT
Idx1 Idx2
BRKRST-3069
Cisco Public
59
Design Review
BRKRST-3069
Cisco Public
60
Development Test
Master Test Plans Functional Test Plans Automation Regression FCS
Testing of completed integrated feature Test for interactions with other features and functions Test for interoperability with Cisco and 3rd party devices Build scripts to automate testing so is repeatable on future releases
BRKRST-3069 2012 Cisco and/or its affiliates. All rights reserved. Cisco Public
61
Micro Architecure
Implementation
Final Netlist
Power On
Hardware
HW Design Mechanical Electrical Manufacturing Detailed Design Mechanical Drawing MDVT PCB Layout BOM EDVT RDT
Fab Out
P0
P1
P2
A-0
Software
SW Functional Spec SW Design Spec Unit Test Plan Unit Integration Plan
Software Test
Master Test Plans
BRKRST-3069
Automation
Cisco Public
Regression
FCS
62
Q&A
Dont forget to activate your Cisco Live Virtual account for access to all session materials, communities, and on-demand and live activities throughout the year. Activate your account at any internet station or visit www.ciscolivevirtual.com.
Cisco Public
BRKRST-3069
64
BRKRST-3069
Cisco Public
65