Extending Flash Memory Through Adaptive Tuning

4 downloads 7429 Views 1MB Size Report
Aug 14, 2013 ... Flash Memory Summit 2013. Santa Clara ... 2013. Santa Clara, CA. Landscape 3. 10. Time. Quality. Param 1 level ... The Not-So-Secret Sauce.
Extending Flash Memory Through Adaptive Parameter Tuning

Prof. Conor Ryan CTO - Software [email protected]

Flash Memory Summit 2013 Santa Clara, CA

Take Home Messages

• 20x increase in endurance • Works in addition to ECC • Lightweight, autonomic system • Use the least force possible

Flash Memory Summit 2013 Santa Clara, CA

2

At a glance

• NVMdurance Pathfinder – Suite of Machine Learning Algorithms – Searches hypertopology of register values – Plots “safe” temporal paths through hypertopology

• NVMdurance Navigator – Autonomic system – Navigates the paths Flash Memory Summit 2013 Santa Clara, CA

3

Error correction Error Correction Corrects errors after they occur

With NVMdurance Errors are delayed

Flash Memory Summit 2013 Santa Clara, CA

4

The Problem Force

Force applied at Time X Pass after 3K cycles

Pass after 1K cycles

Time Flash Memory Summit 2013 Santa Clara, CA

Solution Force Pass after 30K cycles Vary force over time Pass after 20K cycles

Pass after 10K cycles Pass after 5K cycles

Time Flash Memory Summit 2013 Santa Clara, CA

Solution Force Pass after 30K cycles

What values to change to? Vary force over time When to make the changes? Which registers to change?

Pass after 20K cycles

Pass after 10K cycles Pass after 5K cycles

Time Flash Memory Summit 2013 Santa Clara, CA

Landscape Value increases over time

Param 1 Level

Time

Flash Memory Summit 2013 Santa Clara, CA

8

Landscape 2 Quality

Time

Param 1 level

Flash Memory Summit 2013 Santa Clara, CA

9

Landscape 3 Quality

Time

Param 1 level

Flash Memory Summit 2013 Santa Clara, CA

10

Landscape 4 Quality

Time

Param 1 level

Flash Memory Summit 2013 Santa Clara, CA

11

NVMdurance Pathfinder

• Each free parameter adds three dimensions • 828 possible VALUE combinations – 1.93428131138341 X 1025 at each possible time step – JEDEC style tests (3 weeks per set) • 1.1128741791521 X 1024 YEARS

– Vary over time? Flash Memory Summit 2013 Santa Clara, CA

12

The Not-So-Secret Sauce

• Reduce Dimensionality – Understand the silicon

• Guide the search – Be sensible, if not insightful

• Test only what has to be tested – Or at least know what NOT to test

• ….This is still an astronomically difficult problem! Flash Memory Summit 2013 Santa Clara, CA

13

The Secret Sauce

• 90 dimensional space – Plot hypertopology

• Calculate set of “safe” paths – The space is huge

• Tune paths on the fly – All Flash is different; “The Tolstoy Principle” • Factory testing abuses all SSDs in the same way; • Field use abuses SSDs in many different ways Flash Memory Summit 2013 Santa Clara, CA

14

NVMdurance Pathfinder

Start

Flash Memory Summit 2013 Santa Clara, CA

Destination

15

NVMdurance Pathfinder

Start

Destination … and beyond!

Flash Memory Summit 2013 Santa Clara, CA

16

NVMdurance Pathfinder Destination

Asteroid Belt NVMdurance Pathfinder identifies a number of viable routes through the asteroids

Flash Memory Summit 2013 Santa Clara, CA

Start

17

NVMdurance Navigator

Asteroid Belt Live on device NVMdurance Navigator chooses which path to use, based on the “health” of the device.

Flash Memory Summit 2013 Santa Clara, CA

18

NVMdurance Navigator

Asteroid Belt Live on device NVMdurance Navigator chooses which path to use, based on the “health” of the device.

Flash Memory Summit 2013 Santa Clara, CA

19

NVMdurance Navigator

Asteroid Belt Live on device NVMdurance Navigator chooses which path to use, based on the “health” of the device.

Flash Memory Summit 2013 Santa Clara, CA

20

NVMdurance Navigator

Asteroid Belt At any time NVMdurance Navigator may change to another one of the previously identified viable paths.

Flash Memory Summit 2013 Santa Clara, CA

21

NVMdurance Navigator

Asteroid Belt At any time NVMdurance Navigator may change to another one of the previously identified viable paths.

Flash Memory Summit 2013 Santa Clara, CA

22

NVMdurance Navigator

Asteroid Belt At any time NVMdurance Navigator may change to another one of the previously identified viable paths.

Flash Memory Summit 2013 Santa Clara, CA

23

NVMdurance Navigator

Asteroid Belt At any time NVMdurance Navigator may change to another one of the previously identified viable paths.

Flash Memory Summit 2013 Santa Clara, CA

24

NVMdurance Navigator

Asteroid Belt At any time NVMdurance Navigator may change to another one of the previously identified viable paths.

Flash Memory Summit 2013 Santa Clara, CA

25

NVMdurance Navigator

Asteroid Belt At any time NVMdurance Navigator may change to another one of the previously identified viable paths.

Flash Memory Summit 2013 Santa Clara, CA

26

NVMdurance Navigator

Asteroid Belt At any time NVMdurance Navigator may change to another one of the previously identified viable paths.

Flash Memory Summit 2013 Santa Clara, CA

27

NVMdurance system Offline Preproduction Parameter Discovery NVMdurance Pathfinder: Discover routes through multidimensional space such that every parameter set passes retention for that point of life

Autonomic (runs live on the SSD controller) NVMdurance Navigator: Observes deterioration of the SSD; chooses when to change parameters

Flash Memory Summit 2013 Santa Clara, CA

28

Experiments

Device Endurance Retention Intrinsic Endurance (3 months retention)

Flash Memory Summit 2013 Santa Clara, CA

3X 3,000 12 months 8,000

29

Results

Type

p/e Cycles

Retention

3X Default

3,000 12 months

3X Default Intrinsic (lower retention)

8,000 3 months

NVMdurance

60,000 3 months

20x Improvement

Flash Memory Summit 2013 Santa Clara, CA

30

BER

Results Detailed

1000s of p/e cycles Flash Memory Summit 2013 Santa Clara, CA

31

BER

Results Detailed

ECC fail point

1000s of p/e cycles Flash Memory Summit 2013 Santa Clara, CA

32

BER

Results Detailed

Default Static Parameters Pre-retention BER

1000s of p/e cycles Flash Memory Summit 2013 Santa Clara, CA

33

BER

Results Detailed

Default Static Parameters Post-retention BER Fail: 10K p/e cycles

1000s of p/e cycles Flash Memory Summit 2013 Santa Clara, CA

34

BER

Results Detailed

NVMdurance Preretention BER

1000s of p/e cycles Flash Memory Summit 2013 Santa Clara, CA

35

BER

Results Detailed

NVMdurance Postretention BER Pass: 60 K p/e cycles

1000s of p/e cycles Flash Memory Summit 2013 Santa Clara, CA

36

BER

Results Detailed

Rated endurance of NVMdurance 60K p/e cycles Rated endurance of default set: 8K p/e cycles

1000s of p/e cycles Flash Memory Summit 2013 Santa Clara, CA

37

Projections

Type 1X

p/e Cycles

Retention 3,000 3 months

NVMdurance

30,000 3 months

NVMdurance Navigator 2.0

60,000 3 months

10x Improvement In developing NVMdurance 1.0 we have identified several additional techniques that we will implement to deliver further endurance gains in later versions of the product.

Flash Memory Summit 2013 Santa Clara, CA

38

Conclusions

• Avoid the problem of live optimization of parameters – Most work done pre-production

• 20x increase in endurance on 3X – 10x increase on 1X

• NVMdurance productized; will be deployed on live enterprise SSDs in 2014 • Stop by and see us at booth #917 Flash Memory Summit 2013 Santa Clara, CA

39