

# Mission Peak Reference Server System for NGSFF SSD

Design Guidelines for NGSFF Server Development

White Paper

#### **Executive Summary**

NGSFF is the state-of-the-art next generation SSD form factor invented by Samsung to meet requirements of data centers and enterprise servers. To develop a server system based on NGSFF SSD, server vendors are required to spend a significant amount of time to design thermal simulation and Si simulation. Server vendors also have to collaborate with socket vendors to develop an appropriate socket for their designed system.

To show the superior features of NGSFF SSD for server systems, Samsung already designed and developed a NGSFF server system, called Mission Peak, and has provided a number of public and private demonstrations. Even though this Mission Peak server system was designed as a reference system, due to its excellent design, the Mission Peak design can be used for commercial level server production today. To help server system vendors who want to quickly develop NGSFF server systems as their commercial product, Samsung has published this white paper explaining the Mission Peak NGSFF reference server system.

This document describes Mission Peak architecture, designs of each component in detail, thermal simulation and measurement in a real Mission Peak server system, as well as signal integrity simulation in order to help server system vendors to build NGSFF server system in a more timeefficient manner.



1. Next Generation Small Form Factor (NGSFF) for SSD: This new form factor includes many features for enterprise server and data center server systems. Double/quadruple the Capacity of M.2, supporting multiple interfaces (PCIe, SAS/SATA, Gen-Z), High Availability through PCIe dual port supporting, Hot-insertion/removal, 12V power direct support, power-disabling capability for software reset of SSD, etc. NGSFF is under standardization in JEDEC, and its technical specification was approved by the technical committee in '18 June. For the detail and advantages of NGSFF, please refer to the white paper on NGSFF SSD.



#### Table 1 shows the overall features of the Mission Peak NGSFF reference system:

| Feature                     | Description / Comments                                                                                                                                                                                                                                                                                                                                                                                                                                           |
|-----------------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| Form Factor                 | Standard EIA-3101U Rack mount Chassis, less than 800mm deep                                                                                                                                                                                                                                                                                                                                                                                                      |
| SSDs Drive Bays             | Supporting 36 NGSFF NVMe front-loaded, hot swappable SSD bays                                                                                                                                                                                                                                                                                                                                                                                                    |
| Boot Drive                  | On motherboard M.2                                                                                                                                                                                                                                                                                                                                                                                                                                               |
| System Firmware             | BIOS and BMC Features: ACPI, PXE boot, Wake-on-LAN, AC loss recovery, IPMI 2.0 KCS interface, SMBIOS, Serial console redirection, BIOS Boot Specification, BIOS Recovery Mode, SRIOV, iSCSI, TPM support, PCIe NTB                                                                                                                                                                                                                                               |
| Front Panel                 | <b>4 Indicator LEDs:</b> Power Status, SSD activity, network activity, fault<br><b>Others:</b> Power on/off, 2 x USB 2.0 ports                                                                                                                                                                                                                                                                                                                                   |
| Motherboard<br>Key Features | Processors: 2x Intel <sup>®</sup> Xeon <sup>®</sup> Processors E5-2600 v5 (Skylake)<br>QPI Speeds: 9.6 GT/s, 8 GT/s, 7.2 GT/s<br>Chipset: Intel <sup>®</sup> C620 Chipset<br>System Memory: 24 DIMM slots (2 CPU sockets × 6 channels/CPU socket × two slots/channel)(DDR4 2400 (2DPC) / 2666(1DPC) MHz)<br>LAN • For CPU0, 1 x 100GbE (PCIE add on card) + 1 x 50GbE single port (OCP)<br>• For CPU1, 1 x 100GbE (OCP) + 1 x 50GbE dual port (PCIE add on card) |

Table 1: Mission Peak Reference System

### **Architecture Overview**

Mission Peak is a rack mounted all-flash based storage server in a 1U EIA-310D (19") based on NGSFF SSDs. As shown in Figure 1, Mission Peak is comprised of 36x NGSFF SSD drives connected to a motherboard via a PLX PCIe switch fabric, dual Intel Xeon processors and 2x100Gbe and 2x50Gbe NIC cards.

The Mission Peak platform CPU is using Intel Skylake, which offers 48x PCIe Gen3 lanes. For balancing between internal SSD performance and outof-the-box performance through Network Interface Card (NIC), 24x PCIe lanes of each CPU are allocated to PCIe switch for NGSFF connections, and the remaining 24x PCIe lanes are allocated to PCIe Ethernet cards. So, all the bandwidth generated between the CPU and NGSFF SSDs can be fully delivered out of the server through the NIC. To avoid cross traffic between CPUs, which can degrade overall performance, the Mission Peak system is designed in symmetric architecture as shown in Figure 2.



Figure 2: Mission Peak Data-path Architecture

Figure 3 shows the top-level server system design of Mission Peak.

The Mission Peak system contains the following components and features:

1) Dual Skylake Xeon E5-based Mission Peak Motherboard: It offers two Skylake CPUs, 24 DDR4 DIMMs, two PCIe x16 lanes slots and two OCP PCIe slots.

2) PCIe Switch Mid-plane: It contains two PLX9797 PCIe switches to expand 24 PCIe lanes to 18 x 4 PCIe lanes in order to support 18 NGSFF SSDs for each CPU

**3)** Orthogonal Connector: It allows the NGSFF SSD to be inserted directly into the connector without backplane, while provide enough air channel for cooling.

4) Two PSUs: Each PSU can support up to 1600W.

5) Four Ethernet cards: System support 2x100Gbe Ethernet cards and 2x 50Gbe OCP Ethernet cards.

Figure 4 shows the layout design for Mission Peak including the above components. A detailed design overview of each component is provided in the following sections.



Figure 3: Mission Peak Server System Top-Level Design

Figure 4: Mission Peak System Board Layout Design



#### **Mission Peak Motherboard Design**

The Mission Peak Motherboard is a full featured motherboard for a server system based on Intel Xeon Skylake.

The motherboard offers 6 memory channels of DDR4 2400 per socket, which offers the user a total of 24 DIMMs for large memory capacity to maximize the performance.

It provides 2 UPI<sup>2</sup> high speed coherent connections between two processors. Intel Skylake offers a total of 3 UPI channels at the socket level. It is up to the system designer to decide how many UPI channels are used. Connecting more UPI channels will offer a higher speed interconnection between processors, however at the same time, it will require more routing channels or PCB layers, and requires higher power consumption. The designer needs to balance the factors based on their system requirements. For Mission Peak, we selected 2 UPI channels to balance the power/ PCB cost and performance.



Figure 5: Mission Peak Motherboard Block Diagram

#### The Mission Peak Motherboard supports the following major IO interfaces:

#### PCIE:

#### Ethernet LAN:

- Super slot-#0: 48 lanes PCIe Gen3 lanes from CPU0
- Super slot-#1: 48 lanes PCIe Gen3 lanes from CPU1
- 2x M.2 support PCI-E Gen3 X2

- 2x100Gbe, 2 x 50Gbe
- 1x Gbe RJ45 dedicated to BMC management

#### 10:

- 1x external VGA port, 1x COM port
- 2x USB type A connectors + 1x RJ45 for BMC management port

<sup>2.</sup> UltraPath Interconnect (UPI) : A point-to-point processor interconnect. UPI is a low-latency coherent interconnects for scalable multiprocessor systems with a shared address space. It uses a directory-based home snoop coherency protocol with a transfer speed of up to 10.4 GT/s. Supporting processors typically have two or three UPI links.



### **PCIe Lane Distribution**

There are 2 PCIe super slots to which each CPU connects all its PCIe lanes (i.e. 48 PCIe Gen3 lanes are connected to a superslot). To maximize the performance of the NGSFF SSD in 1U system, all 48 PCIe lanes in the SkyLake CPU are fully used for the Mission Peak system design.

To avoid the bottleneck of the bandwidth, PCIe topology is designed in balanced and symmetric topolgy (i.e. out of 48 PCIe lanes per CPU, 24 PCIe lanes go to NIC and 24 PCIe lanes are used to connect to NGSFF SSDs). As a result, the bandwidth between SSDs to/from CPU and CPU to/ from NIC are balanced to prevent bottlenecking. And the connection between two CPUs are designed symmetrically to minimize the data traffic between CPUs, which could be raised due to asymetricity.

| Super Slot # | PCIe Lanes | Used By | Allocation For                            |
|--------------|------------|---------|-------------------------------------------|
| Slot-#1      | 24         | CPU0    | Riser card and cable connect to backplane |
|              | 16         |         | 100Gbe PCIe adding Card                   |
|              | 8          |         | 50Gbe OCP Mezz Card                       |
| Slot-#2      | 24         | CPU1    | Riser card and cable connect to backplane |
|              | 16         |         | 100Gbe OCP Mezz Module                    |
|              | 8          |         | 50Gbe PCIe adding card                    |

Table 2 – PCIe Lane Distribution (i.e Bandwidth Distribution)

### Mid-Plane Design

Mission Peak Mid-plane is the system board located between the Motherboard and the NGSFF SSDs. As shown in Figure 6, it contains two PCIe switch PEX9797. Each PCIe switch expands PCIe 24x lanes to 18 PCIe x4 lanes to support 18 NGSFF SSDs. Therefore, a total 36 NGSFF SSDs are supported in 1U size Mission Park reference system. Midplane also includes 9 Orthorgonal connectors, which is supporting 4 NGSFF SSD each, to support a total of 36 NGSFF SSDs. The NGSFF SSD can be directly inserted into the Orthogonal connector without additional backplane. The Midplane contains four Slimline connectors, which can be connected with the high speed cables for 48 PCIe lanes from the Motherboard to PEX9797 PCIe switch chip.

The Mid-plane also provides a power sequencing circuit. The Mid-plane provides the main 12V power to the NGSFF SSDs when both presence detection pins--PRSNT1#(Pin 67) and PRSNT2#(Pin 6)--are detected correctly.

# Architecture of Mission Peak NGSFF Server



Figure 6: Mission Peak Mid-Plane Board and Connections with Mission Peak Motherboard via the Mission Peak Raiser Board

### **Riser Board Design**

PCIe riser boards are the board used to convert the PCIe slot connection to the cable connection, so that the PCIe signals from the CPU can be connected to PCIe switches in Mid-plane. To keep good signal integrity, the riser card comprises a clock buffer chip and a PCIe re-driver chip to enhance the PCIe signal before sending to the cable connectors. Each riser also contains a DC-DC voltage converter chip to convert voltage from 12V to 3.3V for the on board re-drivers.



Figure 7: Mission Peak Riser Comprises DC-DC Converter and PCIe Re-driver

### Tray Design for NGSFF SSD

The NGSFF SSD needs to be mounted on a tray for system maintainer's easier insertion/removal into/from the Mission Peak system. The Mission Peak tray for NGSFF SSDs plays four major roles:

- 1) Physical/electrical protection of the NGSFF SSD
- 2) Locking the NGSFF SSD on the tray in a correct position
- 3) Guide the NGSFF SSD to securely plug in/out
- 4) Provide LED indication holes on the front panel of the tray

Figure 8 illustrates the Mission Peak NGSFF tray design and its detail features. The NGSFF Tray provides the four mounting stands for the NGSFF SSD to be mounted on using screws. The user can use the Front Handle to lift the latch up for the NGSFF Tray to be pulled out from the server system. LED indication holes on the front panel are designed to display the NGSFF LED light, which shows the NGSFF SSD's status. Figure 9 illustrates how the NGSFF SSD mounted on NGSFF Tray is inserted into the Front Bay of the Mission Peak server system.



Figure 8: NGSFF Tray Design

Figure 9: Tray with NGSFF Loading to System

### Server-side Connector Design for NGSFF SSD

The connector was designed with the following goals:

- 1) It shall support the hot-plug feature of the NGSFF SSD form factor specification (i.e. the pins on orthogonal connector shall have two staging pin groups; 12V pre-charge pin [Pin 36] and two presence detect signal pins--PRSNT1# [Pin 67] and PRSNT2# [Pin 6]—are the 1st mating group pins; Other pins are in 2nd mating group pins).
- 2) It shall have the role of backplane to dock the NGSFF SSD into system.
- 3) It shall support high density design.
- 4) It shall support better air flow for effective cooling compared to existing backplane design.
- 5) It shall provide more signal integrity margin than the existing backplane designs to support PCIe Gen3 and Gen4.



# Architecture of Mission Peak NGSFF Server

Figure 10 shows the orthogonal connector, which is developed with Amphenol, satisfying the above design goals.



Figure 10: Connector Design

#### **Power Supply Design**

Mission Peak contains two power supply units (PSU) to support a redundancy power supply function. Power redundancy for data center and enterprise systems is very important. When one power supply unit fails, redundant power supply units supply power seamlessly without any system malfunction. Additionally, each PSU should be hot swappable to easily replace the PSUs. This feature is called serviceability which is one of the most important requirements for data centers and enterprise servers.

Other key factors for PSU is power density and power efficiency. Mission Peak carefully selected and customized the PSU design to support very high-power density and power efficiency. Mission Peak PSU supports up to 1600W, and the power efficiency is > 90%. It provides enough power for the current system and leaves the power margin for future function expansion.

#### Power Disable (Power Reset)

Figure 11 shows a system level circuit for power disable using a PWDIS pin. When system software issues an IPMI <sup>3</sup> command to the BMC (Board Management Controller), a PWDIS signal is applied to reset the power of the connected NGSFF SSD via I2C, GPIO expander and MOSFET circuits.

3. IPMI is an open, industry-standard interface that was designed for the management of server systems over a number of different types of networks. IPMI functionality includes field-replaceable unit (FRU) inventory reporting, system monitoring, logging of system events, system recovery (including system resets and power on and power off capabilities), and alerting. The monitoring, logging, system recovery, and alerting functions available through IPMI provide access to the manageability that is built into the platform hardware.



Figure 11: Block Diagram to Reset Power of NGSFF SSD Using PWDIS Signal Pin

Figure 12 shows the schematic for I2C to GPIO expander using a PCA9555PW chip.

Figure 13 shows a gate implementation using a 2N7002DW MOSFET chip. Figure 14 shows a schematic of the NGSFF connector.





Figure 12: (Example) I2C to GPIO Expander, PCA9555PW

Figure 13: (Example) MOSFET, 2N7002DW to Drive the Signal



Figure 14: NGSFF SSD Connector Side Schematic

### **Thermal Simulation**

Since the Mission Peak is a high-density and high-performance server, the efficient cooling capability of the server system is one of the most important factors to be considered.

Figure 15 shows the Mission Peak thermal simulation model. It is based on the Mission Peak mechanical model plugged with all components with their thermal characteristics. The ambient temperature is assumed as 35°C in front area. The air-flow direction is from front to rear. The air passes through the NGSFF, Mid-plane/PCIe switch, fans, CPUs/DIMMs, PCH, high speed Ethernet cards and then heated air is pushed out of the box through rear panel grids.

For the Ethernet Card area, because the air is already pre-heated by NGSFF SSDs, CPU, DRAMs, etc., it is more challenging to cool the entire area. To resolve that, there are three Air Ducts added to the Ethernet Card area to guide and distribute the air effectively.

The open area ratio on the rear panel is also an important consideration. If the open ratio is too low, no matter how well the air duct is designed and how fast the air fan runs, it can be hard for air to be moved out of the chassis to carry enough heat out the box.

Figure 16 shows the temperature distribution simulation results. The simulation results show that the temperature inside the 1U server box is managed, distributed, and controlled very well.



Figure 15: Thermal Simulation Model

Figure 16: Thermal Distribution (Simulation Results)

Since managing the temperature of the SSD is the most important in a server system, a cutting plane mechanism is also used to investigate the temperature near the NGSFF SSD in more detail. The plane is cut in the middle of the NGSFF area, and the thermal distribution of the cut plane is zoomed in as shown in Figure 17. The simulation results indicate that the ambient temperature around NGSFF SSD area is kept around 38~42°C, even though the 36 NGSFF SSDs are fully running. This means that ambient temperature near the NGSFF SSDs is kept in good and safe condition while running NGSFF SSDs.



Figure 17: NGSFF Area Cut Plane to Thermal Analysis



# Thermal & Signal Integrity (SI) Testing

Figure 18 shows the air flow speed distribution. It is proved that the orthogonal connector provides good air channel for air to pass through the high density NGSFF SSD area with high speed evenly.

Figure 19 illustrates how air flows through the Mission Peak chassis. The air flows smoothly through the system. There is no swirling or back air in the chassis.



Figure 18: Air-Flow distribution (Simulation Results)



Figure 19: Mission Peak Air Flow Path (Simulation Result)

### Thermal Test Result on Real Mission Peak 1U Server System

To verify the simulation results, an actual thermal test was performed at the NGSFF SSD area in a real Mission Peak server system. Thermistors was attached on the key components on the NGSFF SSD in order to test the left hand side, center and right hand side of the server system while using IOMETER software to fully operate the 36 NGSFF SSDs.

Detailed temperature measurement conditions for the thermal testing are as below:

- 1) Input Ambient Temperature: 35.5 °C
- 2) Fan Duty Cycle: 100%
- Software to run 36 NGSFF SSDs at the same time: IO-METER (5K Random Write 100%)
- 4) Tray Guide only (Thermal Spreader for cooling is not attached)
- 5) Measured points in NGSFF SSDs in each measured area: NVMe Controller, Temp Sensor at controller side, RAM for NVMe Controller, PMIC, NAND chip, temperature sensor at NAND side.

Table 3 shows the thermal testing results. Temperatures are within the SSD product specification, including keeping the temperature at no higher than 71.1 °C in all NGSFF SSDs.



Figure 20: Thermal Testing Area in Actual Mission Peak Server System

| Measured area in<br>Mission Peak server system | Measured Temperature<br>(Max) |  |
|------------------------------------------------|-------------------------------|--|
| Left Side                                      | 71.1 °C                       |  |
| Center                                         | 67.5 °C                       |  |
| Right                                          | 69.6 °C                       |  |

Table 3: Thermal Testing Result on Real Mission Peak Server System

# Thermal & Signal Integrity (SI) Testing

### Signal Integrity

As shown in Figure 21, the PCIe Gen4 channel loss for the whole channel should be less than 28dB@8GHz and the chip-to-PCIe connector channel loss should be less than 20dB@8GHz. In the case of PCIe Gen3, the entire channel loss should be less than 20dB@4GHz, and chip-to-PCIe connector channel loss should be less than 16dB@4GHz.

Figure 22 illustrates the signal integrity (SI) thermal simulation channel topology. It is identical to the real Mission Peak server system configuration. Since the PCIe Switch improves signal integrity characteristics, the PCIe Switch model was not included to make a worst case condition for SI simulation.



Figure 21: PCIe Gen4 Channel Loss Specification



Figure 22: Channel Topology for SI Simulation







Figure 24: Eye Density (Measurement Result)



Figure 23: BER Eye Diagram (Simulation Result)

# Vibration and Shock Test

#### **Vibration Testing During Operation**

A vibration test was performed to verify the reliability of the system design. It vibrates the system in X, Y and Z axis, respectively, while the system is running with the IOMETER benchmark program in order to stress CPUs, memory and NGSFF SSDs on the Mission Peak Server system. Vibration test conditions were as follows:

1) Sinusoidal vibration -0.5G's 0-Peak

2) 5-500-5Hz Sweep @ Octaves / minute

3)10 sweeps

4) Bottom only tested for 3 axes

Figure 25 shows the setup of the vibration test. The operational vibration test is performed on X axis bed, Y axis bed and Z axis bed respectively.

#### **Operating Vibration Test**







X axis

Y axis

Z axis

Figure 25: Vibration Testing Setup

As shown in Table 5, all vibration tests were passed. The performance measured in IOMETER is within the 10% variance during those vibration tests.

| Test Result           | Befo                  | re Test              | After Test                       |
|-----------------------|-----------------------|----------------------|----------------------------------|
| Physical damage check | Р                     | Pass                 |                                  |
| Functional check      | Pass                  |                      | Pass                             |
|                       | IOMETER Test Results  |                      |                                  |
| Axis                  | Before Test<br>(IOPS) | After Test<br>(IOPS) | Function Test<br>(≤10% Variance) |
| SSD at X axis         | 34,704                | 34,638               | Pass                             |
| SSD at Y axis         | 34,678                | 33,633               | Pass                             |
| SSD at Z axis         | 34,570                | 34,593               | Pass                             |

Table 4: Vibration Test Results

### **Shock Testing During Operation**

Shock testing conditions were as follow:

- 1) Half sine wave shock
- 2) 6G (Gravity) with pulse duration of 11ms
- 3) 5 shocks for each of the three axes

Figure 26 shows the setup of the shock testing. The operational shock test is performed on X, Y and Z axes. The shock waveform is shown for each axis setup. The 6G half sine shock is a particularly hard condition.

#### **Operating Shock Test**



X axis

Y axis

Z axis

Figure 26: Shock Testing Setup

As shown in Table 5, all shock tests were passed. The performance measured in IOMETER is within the 10% variance during those shock tests.

| Test Result           | Befo                  | re Test              | After Test                       |
|-----------------------|-----------------------|----------------------|----------------------------------|
| Physical damage check | P                     | Pass                 |                                  |
| Functional check      | Pass                  |                      | Pass                             |
|                       | IOMETER Test Results  |                      |                                  |
| Axis                  | Before Test<br>(IOPS) | After Test<br>(IOPS) | Function Test<br>(≤10% Variance) |
| SSD at X axis         | 34,738                | 34,738               | Pass                             |
| SSD at Y axis         | 34,750                | 34,749               | Pass                             |
| SSD at Z axis         | 34,818                | 34,814               | Pass                             |

Table 5: Shock test result

# Summary

NGSFF is the state-of-the-art SSD device form factor which is optimized for data center and enterprise server systems. To further help vendors' NGSFF server designs, Samsung developed the Mission Peak 1U server and demonstrated its benefits through many public and private exhibitions and demonstrations.

This white paper explains the details of the architecture and design of Mission Peak, including the Mission Peak Motherboard, PCIe lane Distribution, Mid-plane Board, Riser Board, Tray for NGSFF SSD, Server-side Connector, Power Supply circuit and Power disabling circuit. Reliability is also a very important consideration for servers. Thermal simulation and Signal integration simulations as well as vibration and shock tests were performed to make sure that the Mission Peak reference design can be used for commercial product development.

If your company wants to develop NGSFF SSD based server systems, please contact us to get more detailed design information and technical support. If you want to get the technical information on Samsung's NGSFF SSD product, please refer to the NGSFF SSD form factor white paper or the NGSFF SSD product brochure.

#### About Samsung Electronics Co., Ltd.

Samsung inspires the world and shapes the future with transformative ideas and technologies. The company is redefining the worlds of TVs, smartphones, wearable devices, tablets, digital appliances, network systems, and memory, system LSI and LED solutions. For the latest news, please visit the Samsung Newsroom at **news.samsung.com**.

Copyright © 2018 Samsung Electronics Co., Ltd. All rights reserved. Samsung is a registered trademark of Samsung Electronics Co., Ltd. Specifications and designs are subject to change without notice. Nonmetric weights and measurements are approximate. All data were deemed correct at time of creation. Samsung is not liable for errors or omissions. All brand, product, service names and logos are trademarks and/or registered trademarks of their respective owners and are hereby recognized and acknowledged.

Fio is a registered trademark of Fio Corporation. Intel is a trademark of Intel Corporation in the U.S. and/or other countries. Linux is a registered trademark of Linus Torvalds. PCI Express and PCIe are registered trademarks of PCI-SIG. Toggle is a registered trademark of Toggle, Inc.