TIENE EN SU CESTA DE LA COMPRA
en total 0,00 €
This book starts with the basic premise that a service is comprised of the 3Ps-products, processes, and people. Moreover, these entities and their sub-entities interlink to support the services that end users require to run and support a business. This widens the scope of any availability design far beyond hardware and software. It also increases the potential for service failure for reasons beyond just hardware and software; the concept of logical outages.
High Availability IT Services details the considerations for designing and running highly available ´services´ and not just the systems infrastructure that supports those services. Providing an overview of virtualization and cloud computing, it supplies a detailed look at availability, redundancy, fault tolerance, and security. It also stresses the importance of human factors.
The book starts off by providing an availability primer and detailing the reasons why you need to be concerned with high availability. Next, it outlines the theory of reliability and availability and the elements of actual practices in this high availability (HA) area, including Service Level Agreements (SLAs) and Change Management.
Examining what the major hardware and software vendors have to offer in the HA world, the book considers the ubiquitous world of clouds and virtualization as well as the availability considerations they present.
The book examines high availability concepts and architectures such as reliability, availability, and serviceability (RAS); clusters; grids; and redundant arrays of independent disks (RAID) storage. It also covers the role of security in providing high availability, cluster offerings, emergent Linux clusters, online transaction processing (OLTP), and relational databases.
AN AVAILABILITY PRIMER
Preamble: A View from 30,000 Feet
Do You Know. . .?
Availability in Perspective
Murphy's Law of Availability
Availability Drivers in Flux: What Percentage of Business Is Critical?
Historical View of Availability: The First 7 × 24 Requirements?
Historical Availability Scenarios
Planar Technology
Power-On Self-Test
Other Diagnostics
Component Repair
In-Flight Diagnostics
Summary
Reliability and Availability
Introduction to Reliability, Availability, and Serviceability
RAS Moves Beyond Hardware
Availability: An Overview
Some Definitions
Quantitative Availability
Availability: 7 R's (SNIA)
Availability and Change
Change All around Us
Software: Effect of Change
Operations: Effect of Change
Monitoring and Change
Automation: The Solution?
Data Center Automation
Network Change/Configuration Automation
Automation Vendors
Types of Availability
Binary Availability
Duke of York Availability
Hierarchy of Failures
Hierarchy Example
State Parameters
Types of Nonavailability (Outages)
Logical Outage Examples
Summary
Planning for Availability and Recovery
Why Bother?
What Is a Business Continuity Plan?
What Is a BIA?
What Is DR?
Relationships: BC, BIA, and DR
Recovery Logistics
Business Continuity
Downtime: Who or What Is to Blame?
Elements of Failure: Interaction of the Wares
Summary
DR/BC Source Documents
Reliability: Background and Basics
Introduction
IT Structure-Schematic
IT Structure-Hardware Overview
Service Level Agreements
Service Level Agreements: The Dawn of Realism
What Is an SLA?
Why Is an SLA Important?
Service Life Cycle
Concept of User Service
Elements of Service Management
Introduction
Scope of Service Management
User Support
Operations Support
Systems Management
Service Management Hierarchy
The Effective Service
Services versus Systems
Availability Concepts
First Dip in the Water
Availability Parameters
Summary
What Is High Availability?
IDC and Availability
Availability Classification
Availability: Outage Analogy
A Recovery Analogy
Availability: Redundancy
Availability: Fault Tolerance
Sample List of Availability Requirements
System Architecture
Availability: Single Node
Dynamic Reconfiguration/Hot Repair of System Components
Disaster Backup and Recovery
System Administration Facilities
HA Costs Money So Why Bother?
Cost Impact Analysis
HA: Cost versus Benefit
Penalty for Nonavailability
Organizations: Attitude to HA
Aberdeen Group Study: February 2012
Outage Loss Factors (Percentage of Loss)
Software Failure Costs
Assessing the Cost of HA
Performance and Availability
HA Design: Top 10 Mistakes
The Development of HA
Servers
Systems and Subsystems Development
Production Clusters
Availability Architectures
RAS Features
Hot-Plug Hardware
Processors
Memory
Input/Output
Storage
Power/Cooling
Fault Tolerance
Outline of Server Domain Architecture
Introduction
Domain/LPAR Structure
Outline of Cluster Architecture
Cluster Configurations: Commercial Cluster
Cluster Components
Hardware
Software
Commercial LB
Commercial Performance
Commercial HA
HPC Clusters
Generic HPC Cluster
HPC Cluster: Oscar Configuration
HPC Cluster: Availability
HPC Cluster: Applications
HA in Scientific Computing
Topics in HPC Reliability: Summary
Errors in Cluster HA Design
Outline of Grid Computing
Grid Availability
Commercial Grid Computing
Outline of RAID Architecture
Origins of RAID
RAID Architecture and Levels
Hardware
Software
Hardware versus Software RAID
RAID Striping: Fundamental to RAID
RAID Configurations
RAID Components
ECC
Parity
RAID Level 0
RAID Level 1
RAID Level 3
RAID Level 5
RAID Level 6
RAID Level 10
RAID 0 + 1 Schematic
RAID 10 Schematic
RAID Level 30
RAID Level 50
RAID Level 51
RAID Level 60
RAID Level 100
Less Relevant RAID
RAID Level 2
RAID Level 4
RAID Level 7
Standard RAID Storage Efficiency
SSDs and RAID
SSD Longevity
Hybrid RAID: SSD and HDD
SSD References
Post-RAID Environment
Big Data: The Issue
Data Loss Overview
Big Data: Solutions?
Non-RAID RAID
Erasure Codes
RAID Successor Qualifications
EC Overview
EC Recovery Scope
Self-Healing Storage
Summary
AVAILABILITY THEORY AND PRACTICE
High Availability: Theory
Some Math
Guide to Reliability Graphs
Probability Density Function
Cumulative Distribution Function
Availability Probabilities
Lusser's Law
Availability Concepts
Hardware Reliability: The Bathtub Curve
Software Reliability: The Bathtub Curve
Simple Math of Availability
Availability
Nonavailability
Mean Time between Failures
Mean Time to Repair
Online Availability Tool
Availability Equation I: Time Factors in an Outage
Availability Equation II
Effect of Redundant Blocks on Availability
Parallel (Redundant) Compon