Computer Architecture: A Quantitative Approach, Sixth Edition
Chapter 6: Warehouse-Scale Computers to Exploit Request-Level and Data-Level Parallelism
Copyright 2019, Elsevier Inc. All rights Reserved

Introduction

  • Warehouse-scale computer (WSC)
      • Provides Internet services: search, social networking, online maps, video sharing, online shopping, email, cloud computing, etc.
  • Differences with HPC clusters:
      • Clusters have higher performance processors and network
      • Clusters emphasize thread-level parallelism, WSCs emphasize request-level parallelism
  • Differences with datacenters:
      • Datacenters consolidate different machines and software into one location
      • Datacenters emphasize virtual machines and hardware heterogeneity in order to serve varied customers

Important design factors for WSC:

  • Cost-performance
      • Small savings add up
  • Energy efficiency
      • Affects power distribution and cooling
      • Work per joule
  • Dependability via redundancy
  • Network I/O
  • Interactive and batch processing workloads

How WSCs differ from servers:

  • Ample computational parallelism is not a concern
      • Most jobs are totally independent
      • Request-level parallelism
  • Operational costs count
      • Power consumption is a primary, not secondary, constraint when designing the system
  • Scale and its opportunities and problems
      • Can afford to build customized systems since WSCs require volume purchases
  • Location counts
      • Real estate, power cost; Internet, end-user, and workforce availability
  • Computing efficiently at low utilization
  • Scale and the opportunities/problems associated with scale
      • Unique challenges: custom hardware, failures
      • Unique opportunities: bulk discounts

Efficiency and Cost of WSC

  • Location of WSC
      • Proximity to Internet backbones, electricity cost, property tax rates, low risk from earthquakes, floods, and hurricanes
  • Power distribution

Programming Models and Workloads for WSCs

  • Batch processing framework: MapReduce
      • Map: applies a programmer-supplied function to each logical input record
          • Runs on thousands of computers
          • Provides new set of key-value pairs as intermediate values
      • Reduce: collapses values using another programmer-supplied function

Example:

    map(String key, String value):
        // key: document name
        // value: document contents
        for each word w in value:
            EmitIntermediate(w, "1");  // Produce list of all words

    reduce(String key, Iterator values):
        // key: a word
        // values: a list of counts
        int result = 0;
        for each v in values:
            result += ParseInt(v);  // get integer from key-value pair
        Emit(AsString(result));

Availability:

  • Use replicas of data across different servers
  • Use relaxed consistency:
      • No need for all replicas to always agree
  • File systems: GFS and Colossus
  • Databases: Dynamo and BigTable

MapReduce runtime:

  • MapReduce runtime environment schedules map and reduce tasks to WSC nodes
      • Workload demands often vary considerably
      • Scheduler assigns tasks based on completion of prior tasks
  • Tail latency/execution time variability: a single slow task can hold up a large MapReduce job
      • Runtime libraries replicate tasks near the end of the job and take the result from whichever copy finishes first
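The word-count pseudocode above can be tried out in a single process. The following is a minimal Python sketch, not the MapReduce library itself: the names `map_fn`, `reduce_fn`, and `run_mapreduce` are made up for illustration, and the "shuffle" that a real runtime performs across thousands of machines is reduced here to grouping values in a dictionary.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple


def map_fn(key: str, value: str) -> Iterator[Tuple[str, str]]:
    """key: document name; value: document contents."""
    for word in value.split():
        # Emit one intermediate (word, "1") pair per occurrence.
        yield word, "1"


def reduce_fn(key: str, values: Iterable[str]) -> str:
    """key: a word; values: the counts emitted for that word."""
    return str(sum(int(v) for v in values))


def run_mapreduce(documents: Dict[str, str]) -> Dict[str, str]:
    """Hypothetical single-process driver: map, group by key, reduce."""
    intermediate: Dict[str, List[str]] = defaultdict(list)
    for name, contents in documents.items():
        for word, count in map_fn(name, contents):
            intermediate[word].append(count)  # stand-in for the shuffle phase
    return {word: reduce_fn(word, counts) for word, counts in intermediate.items()}


if __name__ == "__main__":
    docs = {"a.txt": "the quick fox", "b.txt": "the lazy dog the end"}
    for word, total in sorted(run_mapreduce(docs).items()):
        print(word, total)
```

Because each call to `map_fn` touches only one document and each call to `reduce_fn` touches only one key, the calls are independent of one another, which is what lets a real runtime spread them over many nodes.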

Computer Architecture of WSC

  • WSCs often use a hierarchy of networks for interconnection
  • Each 19-inch rack holds 48 1U servers connected to a rack switch
  • Rack switches are uplinked to a switch higher in the hierarchy
      • Uplink has 6-24X lower bandwidth
      • Goal is to maximize locality of communication relative to the rack
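One way to read the 6-24X figure is as an oversubscription ratio: the bandwidth of the server-facing ports divided by the bandwidth of the uplinks. The sketch below assumes every port on the rack switch runs at the same speed and that the switch has between 2 and 8 uplinks; those uplink counts are an assumption chosen because they reproduce the stated range for a 48-server rack, and the function name is hypothetical.

```python
def oversubscription_ratio(server_ports: int, uplink_ports: int) -> float:
    """Ratio of in-rack bandwidth to uplink bandwidth, assuming equal port speeds."""
    if server_ports <= 0 or uplink_ports <= 0:
        raise ValueError("port counts must be positive")
    return server_ports / uplink_ports


SERVERS_PER_RACK = 48  # from the slide: 48 1U servers per rack

if __name__ == "__main__":
    for uplinks in (8, 2):  # assumed uplink counts, not stated on the slide
        ratio = oversubscription_ratio(SERVERS_PER_RACK, uplinks)
        print(f"{uplinks} uplinks: {ratio:.0f}X less bandwidth leaving the rack")
```

The higher the ratio, the more it pays to keep communicating tasks inside the same rack.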

Storage

  • Storage options:
      • Use disks inside the servers, or
      • Network attached storage through Infiniband
  • WSCs generally rely on local disks
  • Google File System (GFS) uses local disks and maintains at least three replicas

Array Switch

  • Switch that connects an array of racks
      • Array switch should have 10X the bisection bandwidth of a rack switch
      • Cost of an n-port switch grows as n^2
      • Often utilize content addressable memory chips and FPGAs
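To get a feel for what quadratic growth means for the array switch, the sketch below compares port counts under the simplifying assumption that cost is exactly proportional to n^2, with no fixed or linear terms. The function name and the port counts in the usage example are illustrative, not taken from the slides.

```python
def relative_switch_cost(ports: int, baseline_ports: int) -> float:
    """Cost of a `ports`-port switch relative to a `baseline_ports`-port one,
    assuming cost grows exactly as n^2 (a simplification)."""
    if ports <= 0 or baseline_ports <= 0:
        raise ValueError("port counts must be positive")
    return (ports / baseline_ports) ** 2


if __name__ == "__main__":
    # Doubling the port count quadruples the modeled cost.
    print(relative_switch_cost(96, 48))
```

Under this model a switch with ten times the ports costs a hundred times as much, which is one reason the upper levels of the hierarchy are expensive.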

WSC Memory Hierarchy

  • Servers can access DRAM and disks on other servers using a NUMA-style interface

Physical Infrastructure and Costs of WSC

Cooling

  • Air conditioning used to cool server room
      • 64 F to 71 F
      • Keep temperature higher (closer to 71 F)
  • Cooling towers can also be used
      • Minimum temperature is the wet bulb temperature

  • Cooling system also uses water (evaporation and spills)
      • E.g. 70,000 to 200,000 gallons per day for an 8 MW facility
  • Power cost breakdown:
      • Chillers: 30-50% of the power used by the IT equipment
      • Air conditioning: 10-20% of the IT power, mostly due to fans
  • How many servers can a WSC support?
      • Each server:
          • Nameplate power rating gives maximum power consumption
          • To get actual, measure power under actual workloads
      • Oversubscribe cumulative server power by 40%, but monitor power closely

Determining the maximum server capacity

  • Nameplate power rating: maximum power that a server can draw
  • Better approach: measure under various workloads
  • Oversubscribe by 40%
  • Typical power usage by component:
      • Processors: 42%
      • DRAM: 12%
      • Disks: 14%
      • Networking: 5%
      • Cooling: 15%
      • Power overhead: 8%
      • Miscellaneous: 4%
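The capacity question above comes down to simple arithmetic. The sketch below reads "oversubscribe by 40%" as deploying 40% more servers than the sum of nameplate ratings would allow; that reading, the function names, and the 400 W nameplate figure are assumptions for illustration. The 8 MW budget reuses the facility size mentioned in the water-usage example.

```python
def servers_by_nameplate(it_power_budget_w: int, nameplate_w: int) -> int:
    """Servers that fit if each is assumed to draw its full nameplate power."""
    if it_power_budget_w < 0 or nameplate_w <= 0:
        raise ValueError("budget must be non-negative and nameplate positive")
    return it_power_budget_w // nameplate_w


def servers_with_oversubscription(
    it_power_budget_w: int, nameplate_w: int, oversubscription_pct: int = 40
) -> int:
    """Servers deployed when the nameplate-based count is oversubscribed.

    Integer arithmetic is used so the result is not affected by
    floating-point rounding.
    """
    base = servers_by_nameplate(it_power_budget_w, nameplate_w)
    return base * (100 + oversubscription_pct) // 100


if __name__ == "__main__":
    budget_w = 8_000_000  # 8 MW of IT power, an illustrative budget
    nameplate_w = 400     # hypothetical nameplate rating per server
    print(servers_by_nameplate(budget_w, nameplate_w))
    print(servers_with_oversubscription(budget_w, nameplate_w))
```

The extra servers only fit because measured power under real workloads sits below nameplate, which is why the slide pairs oversubscription with close power monitoring.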

Measuring Efficiency of a WSC

  • Power Utilization Effectiveness (PUE)
      • = Total facility power / IT equipment power
      • Median PUE in a 2006 study was 1.69
  • Performance
      • Latency is an important metric because it is seen by users
      • Bing study: users will use search less as response time increases
      • Service Level Objectives (SLOs)/Service Level Agreements (SLAs)
          • E.g. 99% of requests be below 100 ms

Cost of a WSC

  • Capital expenditures (CAPEX)
      • Cost to build a WSC
      • $9 to 13/watt
  • Operational expenditures (OPEX)
      • Cost to operate a WSC

Cloud Computing

  • Amazon Web Services
      • Virtual Machines: Linux/Xen
      • Low cost
      • Open source software
      • Initially no guarantee of service
      • No contract
  • Cloud Computing Growth
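The PUE ratio and the $9 to 13/watt construction cost quoted earlier are easy to turn into numbers. The sketch below is illustrative only: the function names are made up, the 8 MW figure is borrowed from the water-usage example and treated here as the IT (critical) load, and applying the per-watt cost to that load is an assumption.

```python
from typing import Tuple


def pue(total_facility_power_w: float, it_equipment_power_w: float) -> float:
    """Power Utilization Effectiveness = total facility power / IT equipment power."""
    if it_equipment_power_w <= 0:
        raise ValueError("IT equipment power must be positive")
    return total_facility_power_w / it_equipment_power_w


def facility_power_w(it_equipment_power_w: float, pue_value: float) -> float:
    """Total facility power implied by an IT load and a PUE."""
    return it_equipment_power_w * pue_value


def capex_range_usd(
    critical_power_w: int, low_usd_per_w: int = 9, high_usd_per_w: int = 13
) -> Tuple[int, int]:
    """Build-cost range using the $9 to 13/watt figure from the slide."""
    return critical_power_w * low_usd_per_w, critical_power_w * high_usd_per_w


if __name__ == "__main__":
    it_load_w = 8_000_000  # assumed IT load of 8 MW
    # At the 2006 median PUE of 1.69, each IT watt needs 0.69 W of overhead.
    print(facility_power_w(it_load_w, 1.69))
    print(capex_range_usd(it_load_w))
```

A PUE closer to 1.0 means more of the facility's power reaches the IT equipment rather than cooling and power distribution.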

Fallacies and Pitfalls

  • Fallacy: Cloud computing providers are losing money
      • AWS has a margin of 25%, Amazon retail 3%
  • Pitfall: Focusing on average performance instead of 99th percentile performance
  • Pitfall: Using too wimpy a processor when trying to improve WSC cost-performance
  • Pitfall: Inconsistent measure of PUE by different companies
  • Fallacy: Capital costs of the WSC facility are higher than for the servers that it houses
  • Pitfall: Trying to save power with inactive low power modes versus active low power modes
  • Fallacy: Given improvements in DRAM dependability and the fault tolerance of WSC systems software, there is no need to spend extra for ECC memory in a WSC
  • Pitfall: Coping effectively with microsecond (e.g. Flash and 100 GbE) delays as opposed to nanosecond or millisecond delays
  • Fallacy: Turning off hardware during periods of low activity improves the cost-performance of a WSC
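The gap between average and 99th percentile performance is easy to demonstrate. The sketch below uses made-up latency samples and a nearest-rank percentile; the function names are hypothetical, and the 100 ms limit mirrors the example SLO quoted earlier.

```python
from statistics import mean
from typing import List, Sequence


def percentile(samples: Sequence[float], pct: int) -> float:
    """Nearest-rank percentile for an integer pct in 1..100."""
    if not samples:
        raise ValueError("samples must not be empty")
    if not 1 <= pct <= 100:
        raise ValueError("pct must be between 1 and 100")
    ordered = sorted(samples)
    rank = (pct * len(ordered) + 99) // 100  # integer ceiling of pct * n / 100
    return ordered[rank - 1]


def meets_slo(samples: Sequence[float], pct: int = 99, limit_ms: float = 100.0) -> bool:
    """True if at least pct% of the requests finished below limit_ms."""
    return percentile(samples, pct) < limit_ms


if __name__ == "__main__":
    # Hypothetical workload: 97 fast requests and 3 very slow ones.
    latencies_ms: List[float] = [20.0] * 97 + [900.0] * 3
    print("mean:", mean(latencies_ms))
    print("p99:", percentile(latencies_ms, 99))
    print("meets SLO:", meets_slo(latencies_ms))
```

In this made-up workload the mean sits well under 100 ms while the 99th percentile does not, so a report based on the average would hide the slow requests that users actually notice.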
