ITIL Study Guide

 

By Robert Perrine

 

Fall 2004

 


Table of Contents

ITIL Study Guide. 1

1.         Revision History. 3

2.         Introduction. 4

3.         Service Desk. 5

4.         Incident Management 14

5.         Problem Management 27

6.         Configuration Management 41

7.         Change Management 55

8.         Release Management 74

9.         Service Level Management 92

10.      Financial Management 105

11.      Capacity Management 114

12.      IT Service Continuity Management 127

13.      Availability Management 138


 

1.                Revision History

Date

Version

Author

Description

26 Oct 2004

0.8

Robert Perrine

First preliminary release for comment.

28-Oct-2004

0.9

Robert Perrine

Wider distribution of this preliminary release.

29-Oct-2004

1.0

Robert Perrine

Added Synopsis to each Process.

 


 

2.                Introduction

I compiled this study guide to organize the ITIL Service Support and Service Delivery material into a uniform format. This study guide is not a stand-alone publication, but simply an organized synopsis of the contents in the ITIL guides. Wherever possible, text and images have been directly copied from the ITIL publication. In copying text and images, my intent is not to break copyright laws, but to create a supplement to the publications from the British OGC.

 

It is my understanding that teaching materials may contain up to 10% of a publication without breaking the copyright laws in the United States of America. Also, the court system in the United States of America tends to prevail on the concept that damage only occurs if there is a financial loss. Thus, it is imperative that this material only be used in conjunction with a licensed copy of the ITIL Service Support and Service Delivery guides. And it is imperative that no charge be made for this material. I believe that by observing both of those precautions I am within the legal bounds allowed by the copyright laws of the United States of America. If you have information to the contrary, please notify me so that I may take action to correct any infringements.

 

As for this document, no warranty is offered or implied as to the suitable of this material for use. If you find errors or discrepancies, please send an email and I will do my best to correct the situation. Please understand that this is a voluntary work in progress.

 

Also, please remember that this is a supplement, not a replacement for the ITIL Service Delivery and Service Support guides. While this material may aid your studies, I strongly emphasize the importance of reading the actual OGC materials.

 

Robert Perrine

robert@RobertPerrine.biz

Fall 2004

Irvine, California, United States of America

 

 



3.1.          Service Desk Synopsis

Goals

Global approach

Integrated

Handles incidents & Problems

Provides interface to other processes

 

Benefits

Improved Customer service, perception and satisfaction

Single point of contact

Better-quality and speedier

Improved teamwork and communication

Focused and proactive

Reduced negative

Better managed infrastructure

Improved usage of resources

Increased productivity of business personnel

Better management information

 

Critical Success Factors

Business needs understood

Customer requirements understood

Investment in training

Service objectives, goals and deliverables clearly defined

Service levels are practical, agreed, and regularly reviewed

Benefits accepted by the business

 

Key Performance Indicators

Number of requests

Requests taking most time

Request taking Customer longest time

Which Customers require the most support?

 

Possible Implementation Problems

Unrealistic expectations

Back-door support options

Misaligned communications

 

Tools

Configuration Management Database

Service Catalog

Service Level Agreements, Operational Level Agreements and Underpinning Contracts

Users Manual

Support procedures

Notification and escalation lists

Customer Surveys

Knowledge base including list of known errors

Soft skills such as active listening

Time tracking

Reports

Work load analysis

 

Activities

Receiving calls & Customer liaison

Recording and tracking Incidents and complaints

Customers informed

Initial assessment & attempt to resolve

Monitoring and escalation by SLA

Full life-cycle

Communicate changes

Coordinating support groups

Management information and recommendations

Identify Problems

Highlight Customer training needs

Confirm closure with Customer

Contribute to Problem diagnosis

 

Inputs

Data feeds (assets, etc)

Incidents: Desktop, Security, Network, System, Application & Operational

Requests for status

Questions

Service Requests

 

Outputs

Management Information and Monitoring

Support: Desktop, Security, Network, System, Application & Operational

Third Party Support

Sales, Purchase, Contract & Account Management Support

Communicate status and schedules

 

Manager’s Duties

Coordinate distributed SD resources

Support the Service Management processes

Implement Incident Management

Workload monitoring to manage staffing level

Management reporting

Conduct & analyze customer satisfaction surveys

Classification Process Reviews

Review & revise SD tools & procedures

 

Other Roles

First Line, Second Line


 

3.2.          Service Desk Goal Statement

The Service Desk extends the range of services and offers a more global-focused approach, allowing business processes to be integrated into the Service Management infrastructure. It not only handles incidents, Problems and questions, but also provides an interface for other activities such as customer Change requests, maintenance contracts, software licenses, Service Level Management, Configuration Management, Availability Management, Financial Management for IT Services, and IT Service Continuity Management.

3.3.          Service Desk Benefits

·         Improved Customer service, perception and satisfaction

·         Increased accessibility through a single point of contact, communication, and information

·         Better-quality and speedier turnaround of customer requests

·         Improved teamwork and communication

·         Enhanced focus and a proactive approach to service provision

·         A reduced negative business impact

·         Better managed infrastructure and control

·         Improved usage of IT support resources and increased productivity of business personnel

·         More meaningful management information for decision support” (SS 4.1.8)

3.4.          Service Desk Critical Success Factors

·         Business needs are understood

·         Customer requirements are understood

·         Investment is made in training for Customers, support teams and Service Desk staff

·         Service objectives, goals and deliverables are clearly defined

·         Service levels are practical, agreed, and regularly reviewed

·         The benefits are accepted by the business (SS 4.10.1)

3.5.          Service Desk Key Performance Indicators

·         "The number of requests being handled by the Service Desk, with this initially needing to take into account requests passed directly to second-line support groups

·         The types of request that staff are spending the most time on, such as

·         Equipment failures

·         Business application problems

·         Telecommunications

·         Customer queries

·         Installations/upgrades

·         The types of request that are taking the longest time to turnaround to the Customer by:

·         First-line support

·         Second-line internal support groups

·         Third-party support groups

·         Suppliers

·         Which Customers require the most support, and in which areas – this information can be used to develop training programs for both Customer staff and Service Desk and support staff" (SS 4.4.7)

3.6.          Service Desk Possible Implementation Problems

·         Unrealistic expectations

·         Charging can cause the Users to find back-door support options

·         Misaligned communications

3.7.          Service Desk Tools

·         Configuration Management Database

·         Service Catalog

·         Service Level Agreements, Operational Level Agreements and Underpinning Contracts

·         Users Manual

·         Support procedures

·         Notification and escalation lists

·         Customer Surveys

·         Knowledge base including list of known errors

·         Soft skills such as active listening

·         Time tracking

·         Reports

·         Work load analysis

3.8.          Service Desk Inputs and Outputs

When fully implemented the Service Desk will serve as the single interface for IS Users. The following diagram illustrates the types of inputs and outputs into the Service Desk.

Inputs:

·         Data feeds (assets, etc)

·         Incidents: Desktop, Security, Network, System, Application & Operational

·         Requests for status

·         Questions

·         Service Requests

o        Incident product purchases

o        Application queries

o        Requests for equipment moves, installations, upgrades and enhancements

o        Requests for consumables

Outputs:

·         Management Information and Monitoring

·         Support: Desktop, Security, Network, System, Application & Operational

·         Third Party Support

·         Sales, Purchase, Contract & Account Management Support

·         Communicate status and schedules

Activities:

·         "Receiving calls, first-line Customer liaison

·         Recording and tracking Incidents and complaints

·         Keeping Customers informed on request status and progress

·         Making an initial assessment of requests, attempting to resolve them or refer them to someone who can, based on agreed service levels

·         Monitoring and escalation procedures relative to the appropriate SLA

·         Managing the request life-cycle, including closure and verification

·         Communicating planned and short-term changes of service levels to Customers

·         Coordinating second-line and third-party support groups

·         Providing management information and recommendations for service improvement

·         Identifying Problems

·         Highlighting Customer training and education needs

·         Closing Incidents and confirmation with the Customer

·         Contributing to Problem identification" (SS 4.4.1)

3.8.1.      Service Desk Inputs and Outputs Diagram

 

3.8.2.      Service Desk Inputs and Outputs Description

Interface

Description

Data feeds (assets, etc)

The Service Desk is the communications center for the users and for IS Support. It is the hub of multiple types of data, including Security, Network, System, Application and other Operational Incidents. It is also the recipient of multiple types of communication including automated interfaces, manual data interfaces and human interfaces.

  • Automated monitoring will feed Incidents into the Service Desk. Where possible, those Incidents feeds should be directly integrated into the Configuration Management Database (CMDB) in order to automatically create new Incident Records.
  • Where the automated monitoring cannot be interfaced, the Service Desk will be called upon to read emails, browse monitoring consoles and check web pages to find data and then manually create Incident Records.
  • Additionally, the Service Desk is the primary interface for both users and IS support. Phone calls and emails from users and IS support must be processed manually and keyed into the CMDB to create new Incident Records.

Security Incidents

Network & System Incidents

Application Incidents

Operational Incidents

Service Desk

The Service Desk is an organization with multiple responsibilities. Those responsibilities are documented within the ITIL processes. The primary responsibility is the implementation of the Incident Management process.

Management Information and Monitoring

As the communications center for IS, the Service Desk is the organization responsible for ensuring that all Incidents are recorded in the CMDB as Incident Records. That consolidation allows management to quickly assess the health of the organization and examine trends.

Desktop Support

The Service Desk is a two-way communications center. Incidents that cannot be managed within the Service Desk are escalated outward functionally to support groups and hierarchically to management.

The Service Desk also provides the interface between the technical support teams and the users for status updates, to facilitate diagnosis, to implement work-arounds and other activities as required supporting the ITIL processes.

Network Support

Application Support

Systems & Operations Support

Third Party Support

When escalations and implementations are managed by third party support companies, the Service Desk is the organization called on to facilitate communication. This activity is simply an extension of the communication described above for IS support.

Sales, Purchase, Contract & Account Management Support

As the single point of contact for the user community, the Service Desk is also called upon to provide communications for other processes. These other communications should be documented as Service Requests so that there is a clear distinction between the support of the infrastructure and the support of the users.

 

3.9.          Service Desk as a Function

Service Desk is a Function and not a Process. The primary function of Service Desk is to implement the Incident Management process. Please refer to the description of Incident Management and review that process flow as that process flow is the focus of the Service Desk.

3.10.      Service Desk Relationships with other Service Management Processes

Service Desk with Incident Management

Service Desk can be the customer interface to the ITIL processes (2.11)

Service Desk can implement Incident Management (2.11)

Service Desk can provide life-cycle management of Incidents including: receive, record, classify, prioritize, triage, notify, escalate, track, communicate status and report.

Tools, such as the list of Known Errors, used by Service Desk also support Incident Management.

Second level and third level Incident Management may be outside the Service Desk.

Service Desk should document issues that impact the SLA.

Service Desk should define the relationship to "Super Users".

Service Desk should implement and publish customer satisfaction surveys.

Service Desk can feed Incidents into the Incident Management process

Service Desk is responsible for the monitoring and resolution of all Incidents

Second level and third level Incident Management may be outside the Service Desk.

Service Desk is responsible for registering all Incidents

The Service Desk will generally be responsible for Incident resolution and often handles up to 85% of the Incidents

The Service Desk monitors the progress on all registered Incidents

Service Desk assigns the Priority

It is recommended that all five Service Support processes and the Service Desk be implemented as a unit

Incident closure should be handled by the Service Desk even if the resolution was determined outside of the Service Desk

Incidents, Problems, Known Errors, Requests for Change should all be stored or linked into the Configuration Management Database so that that records can be shared (2.9)

Changes may create new Incidents (2.9)

There should be a close interface between Incident, Problem, Change Management and the Service Desk (2.9)

 

Service Desk with Problem Management

Service Desk can be the customer interface to the ITIL processes.

Service Desk interfaces with Problem Management include Problem identification, escalation and enforcement of SLA/OLA requirements.

Service Desk should escalate to Problem Management when it is unclear how to proceed with Incident Management.

Problem Management must only accept Problems from the Service Desk or the Service Desk will loose control of the process

Service Desk is responsible for disseminating the information about the work-around or resolution once Problem Management defines it

Problem Management must be kept separate from the Service Desk due to conflicts in interests.

 

 

Service Desk with Configuration Management

Service Desk can be the customer interface to the ITIL processes.

Service Desk should be integrated with Configuration Management.

Service Desk is dependent on the Configuration Management tools.

 

Service Desk with Change Management

Service Desk can be the customer interface to the ITIL processes (2.11)

Service Desk should be integrated with Change Management.

Even self-service style "Service Desk" functionality must integrate with Change Management.

Service Desk might act as the focal point for Requests for Change from the Users, distribute the Forward Schedule of Change on behalf of Change Management and keep the Users informed regarding the progress of their Changes (2.11)

The Service Desk may be given delegation to implement standard Changes to circumvent Incidents within its sphere of authority. The scope of such Changes should be predefined and the Change Management function should be informed about all such Changes. Prior approval of Change Management is essential before Changes of specification of any Configuration Item are implemented  (2.11)

Change Management can delegate authority to the Service Desk to implement changes

The Service Desk is usually asked to handle customer notification regarding Changes

Effective Change Management relies on the Service Desk and Configuration, Problem and Release Management

The Service Desk must be kept informed of the details of each Change (2.7)

 

Service Desk with Release Management

Service Desk can be the customer interface to the ITIL processes.

Service Desk should be integrated with Release Management.

Even self-service style "Service Desk" functionality must integrate with Release Management.

Service considerations must be included in release planning.

The Service Desk must be informed about any changes in the vendor support agreements

Release Management should inform the users when there are changes to the vendor support agreements, but the Service Desk may need to repeat that communication

Problem Management and Service Desk personnel must be trained to use and support the new Release

 

Service Desk with Service Level Management

Service Desk can be the customer interface to the ITIL processes.

Service Desk uses Service Level Agreements and can provide enforcement on the Service Level Agreement and Operational Level Agreement requirements.

Service Desk uses the Service Catalog to define agreed expectations and identify the appropriate support process.

The Service Desk is in the direct firing line of any impact on the SLAs and as such needs rapid information flows (2.11)

Service Desk Incident Records can be used to identify the services when constructing a Service Catalog

Verify that the Service Level Agreements are achievable before agreeing

It is important that the Customer reports Incidents promptly so that the Incident can be resolved within the time allotted in the Service Level Agreement

Many organizations use their Service Desk linked to a comprehensive Configuration Management Database to monitor the Customer's perception of Availability

The Service Desk can monitor Incident response times and Incident resolution times

The Service Desk can monitor compliance with the Underpinning Contracts

The Service Level Agreements related to Incidents and Problems must be clearly visible in the Service Desk tool to facilitate communication, monitoring and escalation

The Customer's perception of response time should be recorded and Problem Management should investigate repetitions

Planning for a new Service should include the Service Desk, Change, Configuration and Release Management to verify resource and training requirements

 

Service Desk with Financial Management

Service Desk can be the customer interface to the ITIL processes.

Service Desk can integrate with Financial Management.

Service Desk and integrated Service Management processes are likely to be able o provide data to be input to cost calculation (SD 5.5.8)

Training materials must be produced to explain the IT Accounting and/or Charging system. The Service Desk might be one of the users of this documentation (SD 5.6.1)

Sources of data for the costing / charging model are Service Desk calls, workload data from the Capacity Management Database, inventories from the Configuration Management Database and planned acquisitions (Capacity Plan) (SD 5.6.2)

 

Service Desk with Capacity Management

Service Desk can integrate with IT Service Capacity Management.

Service Desk can recommend improvements to Capacity Management.

Service Desk may refer performance related Incidents to Capacity Management to assess (SD 6.2)

Capacity Management works with Problem Management to give Incident Management diagnostic tools, automated alerting and updates to Known Errors (SD 6.7.1)

 

Service Desk with IT Service Continuity Management

Service Desk can be the customer interface to the ITIL processes.

Service Desk can integrate with IT Service Continuity Management.

Service Continuity will interface with Service Desk for statistics on prior activities (SD 7.1.8)

Service Continuity is not focused minor technical faults unless that fault can cause a major impact on the business. Those risks are to be worked through Incident and Problem Management, addressed through Availability and Problem Management and resolved through Change and Configuration (and Release) Management (SD 7.2.2)

Criteria need to be established in advance as to when an Incident or Problem will be escalated to Service Continuity Management (SD 7.3.5)

The Service Desk has an important role to play if business continuity is invoked (2.4)

 

Service Desk with Availability Management

Service Desk can be the customer interface to the ITIL processes.

Service Desk and Incident Management handling of an Incident can significantly impact how this Incident affects Availability (processes, notifications, escalations, diagnostics, detection) (SD 8.5.4)

Diagnostic tools can help Problem Management resolve faster (SD 8.5.4)

The Incident should only be closed when the Users have verified that Service has been restored (SD 8.5.4)

Every component should have a designated maintenance strategy (planned downtime) incorporated into the service agreements (SD 8.5.6)

Service Desk, Change and Problem Management should provide data on downtimes (planned and unplanned), but this should be augmented by automated monitoring (SD 8.8.1)

Service Desk, Incident,  Problem, Change and Service Level Management provide inputs into a Service Outage Analysis (SD 8.9.8)

Available Management can use the Expanded Incident LifeCycle and work closely with Incident and Problem Management to insure that failures are not repeated (SD 8.9.9)

 


4.                Incident Management


4.1.          Incident Management Synopsis

Goals

Restore normal service operation quickly

Minimize adverse impact

Maintain service quality & availability

 

Benefits

Reduced business impact

Timely resolution

Proactive identification of enhancements

Improved monitoring on SLAs

Management information on service quality

Better staff utilization

Elimination of lost Incidents

More accurate CMDB information

Improved User and Customer satisfaction

 

Critical Success Factors

Up-to-date CMDB

Knowledge Base

Automated Incident Management system

Close link to SLM for SLA targets

 

Key Performance Indicators

Numbers of Incidents

Mean time to resolution

Percentage within agreed response time

Average cost per Incident

Percentage closed by Service Desk

Incidents per SD workstation

Percentage resolved without a visit

 

Possible Implementation Problems

Management or staff commitment

Clarity about business needs

Practices not reviewed or changed

Poor objectives, goals and responsibilities

No provision of agreed Customer service levels

Lack of knowledge

Inadequate training

Lack of integration

Tools to automate the process

Resistance to change

 

 

Tools

CMDB with automated matching

Automated monitoring to create Incidents

Escalation timers

Ability to route to other groups

Automation for Classification

Auto-Call Distribution telephone system

Diagnostic tools

Knowledge base including list of known errors

 

Activities

Incident detection and recording

Classification and initial support

Investigation and diagnosis

Resolution and recovery

Incident closure

Ownership, monitor, track and communicate

 

Inputs

Incident details

Configuration Items

Matching against Problems and Known Errors

Resolution details

Response on RFC

 

Outputs

RFC for resolution

Updated resolution & Work-arounds

Resolved and closed Incidents

Communication to Customers

Management information (reports)

 

Manager’s Duties

Drive efficiency and effectiveness of IM Process

Produce management information

Managing the work of IM staff

Monitoring the effectiveness of IM

Make recommendations for improvement

Develop and maintain the IM systems

 

Other Roles

Service Desk Manager

First, Second & Third-line


 


 

4.2.          Incident Management Goal Statement

The primary goal of the Incident Management process is to restore normal service operation as quickly as possible and minimize the adverse impact on business operations, thus ensuring that the best possible levels of service quality and availability are maintained. ‘Normal service operation’ is defined here as service operation within Service level agreement (SLA) limits.

4.3.          Incident Management Benefits

“For the business as a whole:

·         Reduced business impact of Incidents by timely resolution, thereby increasing effectiveness

·         The proactive identification of beneficial system enhancements and amendments

·         The availability of business-focused management information related to the SLA

For the IT organization in particular:

·         Improved monitoring, allowing performance against SLAs to be accurately measured

·         Improved management information on aspects of service quality

·         Better staff utilization, leading to greater efficiency

·         Elimination of lost or incorrect Incidents and service requests

·         More accurate CMDB information (giving an ongoing audit while registering Incidents)

·         Improved User and Customer satisfaction” (SS 5.4)

4.4.          Incident Management Critical Success Factors

·         "An up-to-date CMDB is a prerequisite for an efficiently working Incident Management process. If a CMDB is not available, information about Configuration Items (CIs) related to Incidents should be obtained manually, and determining impact and urgency will be much more difficult and time-consuming.

·         A ‘knowledge base’ in the form of an up-to-date Problem/error database should be developed to provide for resolutions and Work-arounds. This will greatly speed up the process of resolving Incidents. Third-party Known Error databases should also be available to assist in this process.

·         An effectively automated system for Incident Management is fundamental to the success of a Service Desk. Paper-based systems are not really practical or necessary, now that good and cheap support tools are available.

·         Forge a close link with the Service-Level Management (SLM) process to obtain necessary Incident response targets. Timely Incident resolution will satisfy Customers and Users." (SS 5.5.2)

4.5.          Incident Management Key Performance Indicators

·         "Total numbers of Incidents

·         Mean elapsed time to achieve Incident resolution or circumvention, broken down by impact code

·         Percentage of Incidents handled within agreed response time (Incident response-time targets may be specified in SLAs, for example, by impact code)

·         Average cost per Incident

·         Percentage of Incidents closed by the Service Desk without reference to other levels of support

·         Incidents processed per Service Desk workstation

·         Number and percentage of Incidents resolved remotely, without the need for a visit" (SS 5.9)

4.6.          Incident Management Possible Implementation Problems

·         “No visible management or staff commitment, resulting in non-availability of resources for implementation

·         Lack of clarity about business needs

·         Working practices not being reviewed or changed

·         Poorly defined service objectives, goals and responsibilities

·         No provision of agreed Customer service levels

·         Lack of knowledge for resolving Incidents

·         Inadequate training for staff

·         Lack of integration with other processes

·         Lack of, or expense of, tools to automate the process

·         Resistance to change” (SS 5.5.3)

4.7.          Incident Management Tools

·         Configuration Management Database with automation for matching the Incident to the appropriate Configuration Item

·         Automated monitoring with an interface to automatically create Incident Records

·         Escalation timers

·         Ability to route Incident Records to other Service Desk locations and to other groups

·         Automation for Classification

·         Auto-Call Distribution telephone system

·         Diagnostic tools

·         Knowledge base including list of known errors

4.8.          Incident Management Inputs and Outputs

Inputs:

·         Incident details sourced from (for example) Service Desk, networks or computer operations

·         configuration details from the Configuration Management Database (CMDB)

·         response from Incident matching against Problems and Known Errors

·         resolution details

·         response on RFC to effect resolution for Incident(s)

Outputs:

·         RFC for Incident resolution; updated Incident record (including resolution and/or Work-arounds)

·         resolved and closed Incidents

·         communication to Customers

·         management information (reports)

Incident Management activities:

·         Incident detection and recording

·         Classification and initial support

·         investigation and diagnosis

·         resolution and recovery

·         Incident closure

·         Incident ownership, monitoring, tracking and communication

4.8.1.      Incident Management Inputs and Outputs Diagram

 

 

 

4.8.2.      Incident Management Inputs and Outputs Description

Interface

Description

Service Desk

Service Desk acts as the interface into Incident Management for most of the users of the process. The Service Desk accepts customer input from telephone calls, emails and other communications and generates Incident Records. The Service Desk may also be authorized to accept input from various automation processes and transcribe those inputs into Incident Records.

Service Desk also communicates outwardly from Incident Management to inform users and management of status, provide work-arounds and confirm resolutions.

The extent of the interface between the Service Desk and Incident Management is further detailed in the description of the Service Desk Function.

Computer Operations

Computer Operations and Networking are simply placeholders to indicate 1) that the IS associates and contractors are also users of the Incident Management system and 2) that automated monitoring jobs should be configured to feed Incidents into the Incident Management system.  These source objects are included as a reminder that both the human and automation interfaces can either interface through the Service Desk or directly into the Incident Management system.

Networking

Procedures

All of the ITIL processes are interconnected and interdependent. Procedural documents used by all of the ITIL processes should include the process for reporting Incidents into the central Incident Management process. This will ensure consistency in the classification, management, notification and escalation processes.

Other sources of Incident

The ITIL definition for an Incident is: “any event which is not part of the standard operation of a service and which causes, or may cause, an interruption to, or a reduction in, the quality of that service.” Every “Incident” must be reported into the central Incident Management system.

Service Request Procedures

The ITIL definition of a Service Request is simply: “Every Incident not being a failure in the IT Infrastructure.” The intent here is to designate whether an Incident Record is processed through the Incident Management process or through some other standard process. The purpose for this separation is to ensure that metrics on the reliability of the infrastructure are not inflated by “business as usual” user requests.

CMDB

CMDB stands for the “Configuration Management DataBase”. The complete definition of the infrastructure, the relationships between the components and the interrelationships of those components with the other ITIL processes is to be accessible through one central repository. The definition and usage of this repository is contained in the Configuration Management section of this document.

Change Management Process

Incident Management generates both inputs into Change Management and it receives outputs from that process. Inputs from Incident Management into Change Management will be made through the Request for Change process. The outputs from Change Management to Incident Management consist of status updates and management reports including the Forward Schedule of Changes. The Forward Schedule of Changes is used by Incident Management to advise the users and customers about the schedule for implementing the required Resolutions.

Problem Error Database

The Problem Error Database is an essential tool maintained by the Problem Management process. Since Problem Management is not included in this phase of the ITIL implementation, it is important to document a deviation from the defined ITIL processes.

Incident Management Process

The Incident Management Process is defined in the next section of this document.

 

4.9.          Incident Management Process Flow

4.9.1.      Incident Management Process Flow Diagram

4.9.2.      Incident Management Process Flow Description

Aspect

Description

Ownership, monitoring, tracking and communication

Incident Management has an overriding requirement to take ownership of Incidents, monitor the Incident, track progress on restoring normal service and communicate. This aspect of the overall process is distinct in that it is interrupt driven. In contrast, the other aspects of the Incident Management are sequential.

Incident detection and recording

The sequential aspect of Incident Management begins when an Incident is detected. Detection can be a manual process or it can be implemented through automation.

Initial classification and support

As soon as the Incident has been detected, it is important to create an Incident Record to match this Incident with the appropriate Configuration Item(s). The Classification process then identifies the Category, Impact and Urgency for the Incident. Using that information the appropriate resource is identified

Service Request

The ITIL definition of a Service Request is simply: “Every Incident not being a failure in the IT Infrastructure.” The intent here is to designate whether an Incident Record is processed through the Incident Management process or through some other standard process. The purpose for this separation is to ensure that metrics on the reliability of the infrastructure are not inflated by “business as usual” user requests.

Service Request procedure

Pre-defined procedures are required for all Service Requests.

Investigation and diagnosis

Incidents that are “a failure in the IT infrastructure” must be investigated and diagnosed. Incident Management experience should be captured into standard diagnostic processes. Those processes will typically include a check against related open Incidents, a search through the Problem Error Database, a check against the Forward Schedule of Changes, use of appropriate diagnostic tools and standardized troubleshooting processes.

Resolution and recovery

“The primary goal of the Incident Management process is to restore normal service operation as quickly as possible.” This can be accomplished through a break-fix process, by implementing a work-around, or by following the defined process for implementing a Change. The precise boundaries of responsibility must be defined to ensure that Incident Management does not overreach into Problem Management or Release Management.

Incident closure

Incident Closure is always the responsibility of the Incident Management team, regardless of which ITIL Process actually implemented the restoration of service. Incident Closure consists of verifying with the user that reported the Incident that service has been restored to a normal level.

 

4.10.      Incident Management Line Escalation

4.10.1.  Incident Management Line Escalation Diagram

 

4.10.2.  Incident Management Line Escalation Description

Aspect

Description

Detect and record

The sequential aspect of Incident Management begins when an Incident is detected. Detection can be a manual process or it can be implemented through automation.

Request

The ITIL definition of a Service Request is simply: “Every Incident not being a failure in the IT Infrastructure.” The intent here is to designate whether an Incident Record is processed through the Incident Management process or through some other standard process. The purpose for this separation is to ensure that metrics on the reliability of the infrastructure are not inflated by “business as usual” user requests.

Service Request procedure

Pre-defined procedures are required for all Service Requests.

Initial Support

As soon as the Incident has been detected, it is important to create an Incident Record to match this Incident with the appropriate Configuration Item(s). The Classification process then identifies the Category, Impact and Urgency for the Incident. Using that information the appropriate resource is identified

Resolved

Resolution of an Incident simply means that they symptom is no longer occurring. It is important to distinguish between Problem Resolution, implying a permanent fix, and the “restoration of normal service”. Restoration can occur through a break-fix, a work-around, through a permanent fix or simply because the error is no longer occurring.

Resolution / Recovery

Once normal service is possible, it is necessary to “recover”. Recovery can be as simple as reprinting lost reports or as complex as restoring a corrupted database. The process for recovery should be included in the process “script” for this classification of Incident.

Investigate / Diagnose

Incidents that are “a failure in the IT infrastructure” must be investigated and diagnosed. Incident Management experience should be captured into standard diagnostic processes. Those processes will typically include a check against related open Incidents, a search through the Problem Error Database, a check against the Forward Schedule of Changes, use of appropriate diagnostic tools and standardized troubleshooting processes.

ETC

Processes must be defined for functional escalation beyond the skill set available to the Incident Management team.

Close

Incident Closure is always the responsibility of the Incident Management team, regardless of which ITIL Process actually implemented the restoration of service. Incident Closure consists of verifying with the user that reported the Incident that service has been restored to a normal level.

 

4.11.      Incident Management Relationships with other Service Management Processes

Please refer to the prior section for the relationships of Incident Management with Service Desk.

Incident Management with Problem Management

The Incident Management process includes searching for matches against open Problems, Known Errors and resolution details

Incidents are linked to Problem Records when there is a match

Incident Management can generate a Problem Record when the cause of the Incident is unknown

Incident Management is responsible for finding a resolution or a work-around

When Incident Management finds a work-around they should pass it to Problem Management for evaluation

When Problem Management finds a work-around or resolution they need to pass it over to Incident Management

It is recommended that all five Service Support processes and the Service Desk be implemented as a unit

Problem Management should be notified if there is a Major Incident

Incidents, Problems, Known Errors, Requests for Change should all be stored or linked into the Configuration Management Database so that that records can be shared (2.9)

Changes may create new Incidents (2.9)

There should be a close interface between Incident, Problem, Change Management and the Service Desk (2.9)

Incident Management forwards Incident details and work-arounds to Problem Management

Incident Management uses the documentation on known errors and work-arounds

Problem Management reacts to Incidents when escalated

Problem Management should proactively examine the history of Incidents and find trends

Improvements in the Knowledge database should allow Incident Management to resolve more Incidents and do so more quickly

Problem Management should be implemented in parallel with Incident Management, or after Incident Management

Problem Management needs to work closely with Incident Management to create and update the Categories

Problem Management can create or deploy diagnostic tools and then train the Incident Management team on how to use them

 

Incident Management with Configuration Management

Incident Management utilizes the Configuration Management Database and links Incidents to Configuration Items

Incident Management may uncover discrepancies in the Configuration Management Database

Inconsistencies in the Configuration Management Database should be reported to Configuration Management

It is recommended that all five Service Support processes and the Service Desk be implemented as a unit

Incident Management should use the Configuration Management database to assess the number of users impacted by an Incident

Incidents, Problems, Known Errors, Requests for Change should all be stored or linked into the Configuration Management Database so that that records can be shared (2.9)

Changes may create new Incidents (2.9)

There should be a close interface between Incident, Problem, Change Management and the Service Desk (2.9)

Provide an accurate representation of the infrastructure to support other ITIL processes

Configuration Management must be able to report on all Configuration Items impacted by an Incident or a Problem

All of the ITIL processes should be entered into the Configuration Management Database as Configuration Items and subject to Change Management (3.2)

The Configuration Management Database is used to link Incidents, Problems and other records to the Configuration Items and the Users (2.6)

 

Incident Management with Change Management

Requests for Change are not regarded as Incidents.

Incident Management will use the response from a Request for Change to implement a work-around or resolution

It is recommended that all five Service Support processes and the Service Desk be implemented as a unit

Replacing faulty hardware is not considered a Change

Incidents, Problems, Known Errors, Requests for Change should all be stored or linked into the Configuration Management Database so that that records can be shared (2.9)

Changes may create new Incidents (2.9)

There should be a close interface between Incident, Problem, Change Management and the Service Desk (2.9)

Incident Management is authorized to make corrections, but should seek authorization for Changes

 

Incident Management with Release Management

It is recommended that all five Service Support processes and the Service Desk be implemented as a unit

The Definitive Hardware Store can be used in the event of a Major Incident

Problem Management and Service Desk personnel must be trained to use and support the new Release

Release procedures may be an integral part of Incident and Problem Management (2.7)

 

Incident Management with Service Level Management

The detailed history that Incident Management maintains on each Incident is helpful when evaluating Service Level Agreement breaches

Functional Escalation should occur before the Service Level Agreement time has expired

Hierarchical Escalation occurs when the Service Level Agreement time is likely to expire

Service Level Management defines the time to resolve goals, often based upon Categories

Incident Management uses the time to resolve goals and the Category to determine Urgency

Incident Management gathers the data for the evaluation of Service Level Agreement compliance

Incident priorities and escalation procedures need to be agreed as part of the Service Level Management process and documented in the Service Level Agreements (2.9)

Service Desk Incident Records can be used to identify the services when constructing a Service Catalog

It is best to link the Incident Records, Problem Records, Requests for Change, etc. to defined Services

Recent Incidents and longstanding Problems often cloud the negotiation on Service Level Agreements

Verify that the Service Level Agreements are achievable before agreeing

It is important that the Customer reports Incidents promptly so that the Incident can be resolved within the time allotted in the Service Level Agreement

Many organizations use their Service Desk linked to a comprehensive Configuration Management Database to monitor the Customer's perception of Availability

The Service Desk can monitor Incident response times and Incident resolution times

The Service Level Agreements related to Incidents and Problems must be clearly visible in the Service Desk tool to facilitate communication, monitoring and escalation

The Customer's perception of response time should be recorded and Problem Management should investigate repetitions

Some of the most important targets set in the Service Level Agreements relate to service Availability and thus require Incident resolution within agreed periods (2.1)

 

Incident Management with Financial Management

Incident Management needs to update Incident Records with appropriate costing codes and hours worked

Financial Management assists the other ITIL Processes in assessing the cost-effectiveness of decisions (SD 5.1.6)

Service Desk and integrated Service Management processes are likely to be able o provide data to be input to cost calculation (SD 5.5.8)

 

Incident Management with Capacity Management

Incident Management often finds options for beneficial system enhancement

The history of Incidents and Problems is a key input into Capacity planning (SD 6.2)

Other Service Management processes can refer Incidents and Problems to Service Capacity Management when there is a breach or a risk of a breach against a Service Level Agreement. Service Capacity Management will then refer those Incidents or Problems to Resource Capacity Management (2.3, SD 6.2.2, 6.2.3, 6.7)

Reports from the Capacity Management Database are useful to many of the other Service Management processes, but especially so to Service Level Management when reviewing Service Levels and to Incident and Problem Management when investigating and diagnosing Incidents and Problems (SD 6.3.5)

It is necessary to review the Capacity Management Database to ensure the data collected is at the appropriate level and for the appropriate values so that the monitoring is not wasting resources and not missing data vital to the Incident and Problem Management processes (SD 6.6)

Incident Management must inform Capacity Management regarding all Incidents related to capacity or performance (SD 6.7.1)

Capacity Management works with Problem Management to give Incident Management diagnostic tools, automated alerting and updates to Known Errors (SD 6.7.1)

 

Incident Management with IT Service Continuity Management

Service Continuity is not focused minor technical faults unless that fault can cause a major impact on the business. Those risks are to be worked through Incident and Problem Management, addressed through Availability and Problem Management and resolved through Change and Configuration (and Release) Management (SD 7.2.2)

Criteria need to be established in advance as to when an Incident or Problem will be escalated to Service Continuity Management (SD 7.3.5)

 

Incident Management with Availability Management

A history of prior Incidents and Problems is used in Availability planning to assess component failures

Availability Management is dependent upon the maturity of Incident, Problem and Change Management

It might be necessary to improve upon the Incident, Problem, Change and Release Management processes to achieve higher levels of Availability (SD 8.5.3)

Service Desk and Incident Management handling of an Incident can significantly impact how this Incident affects Availability (processes, notifications, escalations, diagnostics, detection) (SD 8.5.4)

Diagnostic tools can help Problem Management resolve faster (SD 8.5.4)

The Incident should only be closed when the Users have verified that Service has been restored (SD 8.5.4)

Incident and Problem Management should provide input into the Availability reports to highlight Problem "black spots" and areas for increased preventative maintenance (SD 8.7.7)

Service Desk, Change and Problem Management should provide data on downtimes (planned and unplanned), but this should be augmented by automated monitoring (SD 8.8.1)

Service Desk, Incident,  Problem, Change and Service Level Management provide inputs into a Service Outage Analysis (SD 8.9.8)

Available Management can use the Expanded Incident LifeCycle and work closely with Incident and Problem Management to insure that failures are not repeated (SD 8.9.9)

 


5.                Problem Management


5.1.          Problem Management Synopsis

Goals

Minimize the adverse impact from IR & PR

Prevent recurrence

Seek root cause

Initiate actions to improve or correct

 

Benefits

Improved IT service quality

Incident volume reduction

Permanent solutions

Improved organizational learning

Better first-time fix rate for SD

 

Critical Success Factors

Effective automated registration & classification

Achievable objectives to use Problem-solving talents

Balance potential conflict between IM & PM

 

Key Performance Indicators

Number of Problems and errors grouped

Elapsed time to close

Elapsed time on outstanding

Expected resolution time on outstanding

Mean and maximum elapsed time to confirm KE

Any temporary resolution actions

 

Possible Implementation Problems

Absence of good IM process

Absence of good historical data

Failing to link IR with PR and KE

Management commitment

Do not undermine SD with back-door

Time spent on Knowledge Base

Inaccurate prioritization

 

Tools

CMDB

Scientific Method

Ishikawa

Brainstorming

Flowcharting

 

Activities

Problem control

Error control

Proactive prevention

Identify trends

Management information from PM data

Major Problem reviews

 

Inputs

Incident details from IM

Configuration Items

Work-arounds from IM

 

Outputs

Known Errors

Requests for Change

Updated PRs with resolutions & work-arounds

Properly closed PRs

Matched IR to PR and KE

Management information

 

Manager’s Duties

Develop and maintain Problem Control process

Efficiency and effectiveness of Problem Control

Produce management information

Manage Problem support staff

Allocating resources

Effectiveness of Error Control

Recommendations to improve Error Control

Develop and maintain Problem/Error Control systems

Efficiency and effectiveness of Proactive activities

 

Other Roles

Problem support team

 

 

 

 

 

 


 


 

5.2.          Problem Management Goal Statement

The goal of Problem Management is to minimize the adverse impact of Incidents and Problems on the business that are caused by errors within the IT Infrastructure, and to prevent recurrence of Incidents related to these errors. In order to achieve this goal, Problem Management seeks to get to the root cause of Incidents and then initiate actions to improve or correct the situation.

5.3.          Problem Management Benefits

·         Improved IT service quality

·         Incident volume reduction

·         Permanent solutions

·         Improved organizational learning

·         Better first-time fix rate at the Service Desk”  (SS 6.4)

5.4.          Problem Management Critical Success Factors

·         “An effective automated registration of Incidents, with an effective classification, is fundamental for the success of Problem Management.

·         Setting achievable objectives and making use of the Problem-solving talents of existing staff is a key activity. Consider ‘part-time’ Problem Management, whereby staff set aside periods when they will look at Problems away from the daily fire-fighting pressures.

·         In view of the potentially conflicting interests between Incident Management and Problem Management (paragraph 6.3.1), good cooperation between both processes is essential. Both also have enormous synergy, which can help. Support staff, often involved in both processes, should be aware of the importance of balancing activities between the two.” (SS 6.5.2)

5.5.          Problem Management Key Performance Indicators

·         “The number of Problems and errors split by:

o        status

o        service

o        impact

o        category

o        User group

·         The total elapsed time on closed Problems

·         The elapsed time to date on outstanding Problems

·         The mean and maximum elapsed time to close Problems or confirm a Known Error, from the time of raising the Problem record, by impact code and by support group (including vendors)

·         Any temporary resolution actions

·         The expected resolution time for outstanding Problems

·         The total elapsed time for closed Problems” (SS 6.10.1)

5.6.          Problem Management Possible Implementation Problems

·         “Absence of a good Incident control process, and thus the absence of detailed historical data on Incidents (necessary for the correct identification of Problems).

·         Failure to link Incident records with Problem/error records, means a failure to gain many of the potential benefits. This is a key feature in moving from reactive support to a more planned and proactive support approach.

·         Lack of management commitment, so that support staff (usually also involved with reactive Incident control activities) cannot allocate sufficient time to structural Problem-solving activities.

·         The undermining of the Service Desk role. Problem Management staff should accept support requests only from authorized sources. Difficulties will arise if the process deals with requests from many sources since multiple reports of Incidents with the same cause may not be interpreted in the same way.

·         Failure to set aside time to build and maintain the knowledge base will restrict the delivery of benefits.

·         An inability to determine accurately the business impact of Incidents and Problems. Consequently the business-critical Incidents and Problems are not given the correct priority.“ (SS 6.5.3)

5.7.          Problem Management Tools

·         Configuration Management Database for a record of the Configuration Items, Incidents, Problems and Request for Change

·         Kepner and Tregoe analysis (Scientific Method)

·         Ishikawa Diagrams

·         Brainstorming

·         Flowcharting

5.8.          Problem Management Inputs and Outputs

Inputs:

·         “Incident details from Incident Management

·         Configuration details from the Configuration Management Database (CMDB)

·         Any defined Work-arounds (from Incident Management)” (SS 6.2)

Outputs:

·         “Known Errors

·         A Request for Change (RFC)

·         An updated Problem record (including a solution and/or any available Work-arounds)

·         For a resolved Problem, a closed Problem record

·         Response from Incident matching to Problems and Known Errors

·         Management information” (SS 6.2)

Activities:

·         “Problem control

·         Error control

·         The proactive prevention of Problems

·         Identifying trends

·         Obtaining management information from Problem Management data

·         The completion of major Problem reviews” (SS 6.2)

5.8.1.      Problem Management Inputs and Outputs Diagram


5.8.2.      Problem Management Inputs and Outputs Description

Interface

Description

Incident Details

The standard pathway into Problem Management is from Incident Management. Incident Records are worked by Incident Management until

  • matching the Incident to existing Problems and Known Errors is not successful during the stage of Incident initial support and classification
  • analysis of Incident data reveals recurrent Incidents
  • analysis of Incident data reveals Incidents that are not yet matched to existing Problems or Known Errors
  • analysis of the IT infrastructure indicates a Problem that could potentially lead to Incidents
  • a major or significant Incident (serious and adverse impact on services to the Customer) occurs for which a structural solution has to be found

Incident Management will then escalate to Problem Management and Problem Management will use the history of the activity recorded in the Incident Record(s) to create a new Problem Record. Subsequent Incidents that match this Problem Record will be linked to this Problem Record by Incident Management. Those additional Incidents will be evaluated by Problem Management to glean additional information about the underlying Problem.

Configuration Details

The Configuration Management Database (CMDB) is the core system that supports the ITIL processes. Incident Records and Problem Records will be linked to the Configuration Items in the CMDB. The details about those Configuration Items and the relationships between Configuration Items are valuable to Problem Management in diagnosing the Problem, assessing the impact, and preparing the resulting Request for Change.

Work-Arounds

Incident Management often generates work-arounds to Problems in order to “restore normal operations as quickly as possible”. Those work-arounds must be linked to the Problem Record so that Incident Management can quickly access this information and not expend time reinventing something that has already been developed. Additionally, the work-arounds often provide valuable information that aids Problem Management in determining the underlying cause.

Problem Management

The Problem Management process has six primary activities, as shown in the preceding diagram. The flow through the Problem Management process is also documented in the following section of this document.

Problem Control

“The Problem control process is concerned with handling Problems in an efficient and effective way. The aim of Problem control is to identify the root cause, such as the CIs that are at fault, and to provide the Service Desk with information and advice on Work-arounds when available.”

Error Control

“Error control covers the processes involved in progressing Known errors until they are eliminated by the successful implementation of a Change under the control of the Change Management process. The objective of error control is to be aware of errors, to monitor them and to eliminate them when feasible and cost-justifiable.”

Proactive

“Proactive Problem Management covers the activities aimed at identifying and resolving Problems before Incidents occur. These activities are:

  • Trend analysis
  • Targeting support action
  • Providing information to the organization

By redirecting the efforts of an organization from reacting to large numbers of Incidents to preventing Incidents, an organization provides a better service to its Customers and makes more effect use of the available resources within the IT support organization.”

Identify Trends

Management Info

Major Problem Reviews

On completion of the resolution of every major Problem, Problem Management should complete a major Problem review. The appropriate people involved in the resolution should be called to the review to determine:

  • What was done right
  • What was done wrong
  • What could be done better next time
  • How to prevent the Problem from happening again

Known Errors

ITIL defines a Known Error as “an Incident or Problem for which the root cause is known and for which a temporary Work-around or a permanent alternative has been identified. If a business case exists, an RFC will be raised, but, in any event, it remains a known error unless it is permanently fixed by a Change.”

Request for Change

An important distinction between Problem Management and Incident Management is that the Problem Management produces resolutions that require a Change to the environment. The Request for Change is the document used by Problem Management to interface with Change Management.

Updated Problem Record

Problem Management is responsible for maintaining an accurate and current status in the Problem Record in order to communicate with the other ITIL processes. The Service Desk will use those Problem Record updates to communicate status to the user community.

Problem Resolution

A Problem Resolution is a Change that will resolve the Problem and the Incidents linked to that Problem. This is a “permanent fix”. For example, if an Incident Record states that a web server process is no longer serving web pages, Incident Management may “restore normal operations” by stopping and restarting the web server process. That is a work-around. Problem Management may become engaged because the Incident recurs. Problem Management will then come up with a Problem Resolution, such as applying a patch to the operating system.

Incident matching

Problem Management and Incident Management work together to link Incidents to Problems so that the Release Management process can implement the Problem Resolution on each impacted Configuration Item.

Management Information

Problem Management is responsible for providing key management information including an analysis on the number of Problems and a summary on the type of closure on each.

 

5.9.          Problem Management Problem Control Process Flow

5.9.1.      Problem Management Problem Control Process Flow Diagram

 

5.9.2.      Problem Management Problem Control Process Flow Description

Aspect

Description

Tracking and monitoring of Problems

Tracking and monitoring of Problems is an ongoing asynchronous activity. Problems need to be checked for changes in the reported errors, for diagnostic input, for third party resolutions and for communication from the users, customers and other ITIL processes. Problem Management is also responsible for updating the Problem Record on a consistent basis so that the other ITIL processes and the Service Desk can use the Problem Record for immediate status.

Problem identification and recording

The standard pathway into Problem Management is from Incident Management. Incident Records are worked by Incident Management until

  • matching the Incident to existing Problems and Known Errors is not successful during the stage of Incident initial support and classification
  • analysis of Incident data reveals recurrent Incidents
  • analysis of Incident data reveals Incidents that are not yet matched to existing Problems or Known Errors
  • analysis of the IT infrastructure indicates a Problem that could potentially lead to Incidents
  • a major or significant Incident (serious and adverse impact on services to the Customer) occurs for which a structural solution has to be found

Incident Management will then escalate to Problem Management and Problem Management will use the history of the activity recorded in the Incident Record(s) to create a new Problem Record. Subsequent Incidents that match this Problem Record will be linked to this Problem Record by Incident Management. Those additional Incidents will be evaluated by Problem Management to glean additional information about the underlying Problem.

Problem classification

When a Problem is identified, the amount of effort required to detect and recover the failing CI(s) has to be determined. Therefore it is important to be aware of the impact of the Problem on existing service levels. This process is known as ‘classification’. In practice, support effort is allocated to only a small proportion of Problems linked to a single Incident.

The steps involved in Problem classification are similar to the steps in classifying Incidents; they are to determine:

  • category
  • impact
  • urgency
  • priority

Problem investigating and diagnosis

“The process of Problem investigation is similar to that of Incident investigation – but the primary objective of each process is significantly different. Incident Management’s aim is rapid restoration of service, whereas Problem Management’s aim is diagnosis of the underlying cause. Investigation activities should include available Work-arounds for the Incidents related to the Problem, as registered in the Incident record database. Problem Management activities should include updating recommended Work-arounds in the Problem record, to support Incident control.”

RFC and possible resolution and closure

The Problem Control investigation may uncover the Change required to resolve the Problem. When that occurs, Problem Control will need to follow the steps described for how Error Control records the Error resolution, generates and monitors a Request for Change, and closes the Problem when the Change has successfully resolved the Problem to the satisfaction of the user.

Error Control

“Error control covers the processes involved in progressing Known errors until they are eliminated by the successful implementation of a Change under the control of the Change Management process. The objective of error control is to be aware of errors, to monitor them and to eliminate them when feasible and cost-justifiable.” Once the Problem Control activities have been completed, the Known Error and the Problem(s) linked to that Known Error are passed to Error Control for resolution.


5.10.      Problem Management Error Control Process Flow

5.10.1.  Problem Management Error Control Process Flow Diagram

5.10.2.  Problem Management Error Control Process Flow Description

Aspect

Description

Problem Control

“The Problem control process is concerned with handling Problems in an efficient and effective way. The aim of Problem control is to identify the root cause, such as the CIs that are at fault, and to provide the Service Desk with information and advice on Work-arounds when available.” Once the Problem Control activities have been completed, the Known Error and the Problem(s) linked to that Known Error are passed to Error Control for resolution.

Tracking and monitoring errors

Tracking and monitoring of Errors is an ongoing asynchronous activity. Errors need to be checked for changes in the reported symptoms, for diagnostic input, for third party resolutions and for communication from the users, customers and other ITIL processes. Problem Management is also responsible for updating the Error Record on a consistent basis so that the other ITIL processes and the Service Desk can use the Error Record for status.

Identify and record error

When the person performing the Error Control activities is not the person that performed the Problem Control activities, it is necessary to begin Error Control by reviewing the history of the Incident(s), work-around(s), Problem(s) and Known Error.

Assess Error

Error Assessment is the process of evaluating alternative resolutions to find the resolution that achieves the desired results cost effectively.

Record Error resolution

Once the appropriate resolution has been identified, it is important to record those details so that the Incident(s), work-around(s), Problem(s) and Known Error link to this resolution. Note that this step is highlighted here, but is already called for as part of the ongoing activity of “Tracking and monitoring errors”.

RFC

An RFC is a Request for Change. The Request for Change is the document used by Problem Management to interface with Change Management in order to Release the resolution into the production environment.

Change successfully implemented

Change Management assesses the success of a Change by verifying user satisfaction. Problem Management interfaces with Change Management to determine the implementation status and verify that the resolution is satisfactory.

Close Error and associated Problem(s)

Once the resolution has been successfully implemented, Problem Management will close the Known Error and all associated Problem Records with appropriate comments. Incident Management will use the Known Errors database to link new Incidents to this Known Error and identify the appropriate resolution.

 

5.11.      Problem Management Relationships with other Service Management Processes

Please refer to the prior section for the relationship of Problem Management with Service Desk and Incident Management

Problem Management with Configuration Management

Problem Management utilizes the Configuration Management Database to identify details about the Incidents and Problems

It is sometimes necessary to create dummy Configuration Items to allow tracking of Problems

All support documentation must be made available to the Problem Management team

Problem Management should use the contact information in the Configuration Management Database to determine who needs to know about a work-around or resolution

Provide an accurate representation of the infrastructure to support other ITIL processes

Configuration Management must be able to report on all Configuration Items impacted by an Incident or a Problem

Problem Management uses the Configuration Management Database during trend analysis

All of the ITIL processes should be entered into the Configuration Management Database as Configuration Items and subject to Change Management (3.2)

The Configuration Management Database is used to link Incidents, Problems and other records to the Configuration Items and the Users (2.6)

 

Problem Management with Change Management

Request for Change is submitted in order to implement a resolution

Problem Management should examine records of recent changes to see if those changes contributed to the Problem

Problem Management should advise Change Management as to the urgency of the Problem and can escalate if the Problem becomes more critical

Requests for Change can encompass processes as well as hardware and software

Problem Management only leads to a change when there is "a state different from a previously defined condition"

Effective Change Management relies on the Service Desk and Configuration, Problem and Release Management

 

Problem Management with Release Management

Problem Management records should trace from Incident to Problem to Known Error to Request for Change to Release and change implementation

A Release is typically designed to resolve problems and enhance services

An Emergency Fix generally resolves a small number of Problems

A Minor Release generally supersedes by inclusion prior Emergency Fixes as well as incorporating a few enhancements

A Major Release often includes enhancements that make prior Problem fixes redundant

A Package Release bundles multiple Releases, often from different applications, together into one Release

Procedures need to be defined for Problem escalation as related to the Release process

Once the Release is successful, Problems and enhancement requests resolved by the Release should be closed

Problems caused by the Release should be linked to the Release

Problem Management and Service Desk personnel must be trained to use and support the new Release

Release procedures may be an integral part of Incident and Problem Management (2.7)

 

Problem Management with Service Level Management

Problem Management needs to be aware of the impact that Problems are having on Service Levels

Problems reported to vendors need to be monitored to insure that the vendor responds in a reasonable time or within the time defined in the Underpinning Contract when defined there

Problem resolution needs to be measured against the Service Level Agreements

Problem identification and resolution is easier when Service Level Management has built a strong relationship between IT and the IT Customers

It is best to link the Incident Records, Problem Records, Requests for Change, etc. to defined Services

Recent Incidents and longstanding Problems often cloud the negotiation on Service Level Agreements

It is best to pilot the Service Level Agreements with a Customer group that has a small number of Problems

Many organizations use their Service Desk linked to a comprehensive Configuration Management Database to monitor the Customer's perception of Availability

The Service Level Agreements related to Incidents and Problems must be clearly visible in the Service Desk tool to facilitate communication, monitoring and escalation

The Customer's perception of response time should be recorded and Problem Management should investigate repetitions

Service Review meetings should be held monthly, or at least quarterly to review the results and initiate a Service Improvement Program if warranted

 

Problem Management with Financial Management

Resolutions need to be cost effective

Some Problems do not justify the expenditure required for resolution

Financial Management assists the other ITIL Processes in assessing the cost-effectiveness of decisions (SD 5.1.6)

Service Desk and integrated Service Management processes are likely to be able o provide data to be input to cost calculation (SD 5.5.8)

Financial Management should conduct periodic audits to ensure that the Change and Problem Management processes are strictly followed (SD 5.7.11)

 

Problem Management with Capacity Management

Capacity Management should be the "focal point" for all performance and capacity Problems (SD 6.1.1, 6.1.3)

The history of Incidents and Problems is a key input into Capacity planning (SD 6.2)

Proactive Changes and Service Improvements are key outputs from Capacity Planning (SD 6.2)

Capacity Management should preempt performance Problems (SD 6.2)

Other Service Management processes can refer Incidents and Problems to Service Capacity Management when there is a breach or a risk of a breach against a Service Level Agreement. Service Capacity Management will then refer those Incidents or Problems to Resource Capacity Management (2.3, SD 6.2.2, 6.2.3)

Service and Resource Capacity Management must be proactive, anticipate the impact that Changes will have on the infrastructure and take action to preclude Problems (SD 6.2.2, 6.2.3)

Reports from the Capacity Management Database are useful to many of the other Service Management processes, but especially so to Service Level Management when reviewing Service Levels and to Incident and Problem Management when investigating and diagnosing Incidents and Problems (SD 6.3.5)

Capacity Management should reduce the number of urgent Changes and reduce the risk of Problems (SD 6.4.2)

It is necessary to review the Capacity Management Database to ensure the data collected is at the appropriate level and for the appropriate values so that the monitoring is not wasting resources and not missing data vital to the Incident and Problem Management processes (SD 6.6)

Capacity Management works with Problem Management to give Incident Management diagnostic tools, automated alerting and updates to Known Errors (SD 6.7.1)

Problem Management expertise may be called on to assist in resolving Problems related to capacity or performance (SD 6.7.2)

The Capacity Management Database needs to be linked into the Configuration Management Database to assist Service Management in assessing proposed Service Level Agreements and Problem Management in diagnosing Problems (SD 6.7.5)

Slow performance and capacity Problems can result in Unavailability of the Service to the users (SD 6.7.9)

 

Problem Management with IT Service Continuity Management

Service Continuity is not focused minor technical faults unless that fault can cause a major impact on the business. Those risks are to be worked through Incident and Problem Management, addressed through Availability and Problem Management and resolved through Change and Configuration (and Release) Management (SD 7.2.2)

Criteria need to be established in advance as to when an Incident or Problem will be escalated to Service Continuity Management (SD 7.3.5)

 

Problem Management with Availability Management

Much of Availability Managements work is in the detection and avoidance of Problems

Fragility of components

Problem Management needs to work closely with Availability Management to identify trends and instigate remedial action (2.9)

A history of prior Incidents and Problems is used in Availability planning to assess component failures

Availability Management is dependent upon the maturity of Incident, Problem and Change Management

It might be necessary to improve upon the Incident, Problem, Change and Release Management processes to achieve higher levels of Availability (SD 8.5.3)

Diagnostic tools can help Problem Management resolve faster (SD 8.5.4)

Scheduling multiple Changes within one downtime can increase Availability, but it has the risks of increased complexity in Release Management and increased complexity for Problem Management when something goes wrong (SD 8.5.6)

Incident and Problem Management should provide input into the Availability reports to highlight Problem "black spots" and areas for increased preventative maintenance (SD 8.7.7)

Service Desk, Change and Problem Management should provide data on downtimes (planned and unplanned), but this should be augmented by automated monitoring (SD 8.8.1)

Service Desk, Incident,  Problem, Change and Service Level Management provide inputs into a Service Outage Analysis (SD 8.9.8)

Available Management can use the Expanded Incident LifeCycle and work closely with Incident and Problem Management to insure that failures are not repeated (SD 8.9.9)


6.                Configuration Management


6.1.          Configuration Management Synopsis

Goals

Account for all IT assets

Provide CIs to other Processes

Sound basis for IM, PM, CM & RM

Verify and correct CIs

 

Benefits

Accurate information on CIs

Control valuable CIs

Adhere to legal obligations

Help Financial planning

Make Changes visible

Contribute to Contingency planning

Improve Release Management

Security by controlling versions

Reduce unauthorized software

Perform Impact Analysis on Changes

Trend data for Problem Management

 

Critical Success Factors

Correct & accurate CIs to support SM

Defined procedures and tools

Identify, label, record CIs with history

Train organization on control processes

Reporting and auditing

 

Key Performance Indicators

Unauthorized configurations

IR & PR from wrongly made Changes

RFCs failed due to bad CIs or version control

Cycle time to approve & implement Changes

Licenses unused

Exceptions found in configuration audit

Unauthorized IT components

 

Possible Implementation Problems

CIs at wrong level

Inadequate analysis & design

Schedules overly ambitious

Commitment lacking

Too bureaucratic

Routinely circumvented

Processes inefficient or errors in process

Unrealistic expectations about tool

Tool lacks flexibility

Insufficient Change or Release processes

Unrealistic expectations about process

Controls not in place

 

Tools

Configuration Management Database

Software configuration management tools

Change Management support (impact analysis)

Automation for configuration auditing

Automation for release deployment

Definitive Software Library

Definitive Hardware Store

Document management system

Design, analysis and build tools

Report generation tools

 

Activities

Configuration Management planning

Configuration Identification

Control of CIs

Configuration status accounting

Configuration verification and audit

CMDB back-ups, archives and housekeeping

Add value to the SM organization

 

Inputs

Incidents, Problems, Known Errors & RFCs with updates

SW & HW for DSL/DHS

Configuration audits

 

Outputs

Forward Schedule of Changes

Projected Service Availability

RFC, assessment and updates

Releases from DSL/DHS & library

Updated CIs and relationships

Management information

Linkages to the CDB

 

Manager’s Duties

Implements CM policy agreed with Service Manager

Develop/improve CM systems

Propose/agree CM scope, Plan, standards & procedures

Awareness campaign

Arrange for recruit/train of staff

Propose/agree CI conventions & standards

Provides management information

RFC impact assessment & updates CI/RFC per Change

Configuration Audits and corrective actions

 

Other Roles

Configuration Librarian



 

6.2.          Configuration Management Goal Statement

The goals of Configuration Management are to:

·         Account for all the IT assets and configurations within the organization and its services

·         Provide accurate information on configurations and their documentation to support all the other Service Management processes

·         Provide a sound basis for Incident Management, Problem Management, Change Management and Release Management

·         Verify the configuration records against the infrastructure and correct any exceptions

6.3.          Configuration Management Benefits

·         Provide accurate information on CIs and their documentation

·         Controlling valuable CIs

·         Facilitating adherence to legal obligations

·         Helping with financial and expenditure planning

·         Making software Changes visible

·         Contributing to contingency planning

·         Supporting and improving Release Management

·         Improving security by controlling of the versions of CIs in use

·         Enabling organizations to reduce the use of unauthorized software

·         Allowing the organization to perform impact analysis and schedule Changes safely, efficiently and effectively

·         Provide Problem Management with data on trends (SS 7.4.1)

6.4.          Configuration Management Critical Success Factors

"Providing everyone working in Service Management and support with correct and accurate information on the present configurations with their physical and functional specifications

Defining and documenting the procedures and working practices to be followed

Identifying, labeling and recording the names and versions of the CIs that make up the IT services, infrastructure and their relationships

Controlling and storing definitive, authorized and trusted copies of specifications, documentation and software

Reporting the current status and history of all items on the IT infrastructure

Ensuring that all Changes to CIs are recorded as soon as practicable

Tracking and reconciling the actual state of the IT infrastructure against the authorized configuration records and data

Educating and training the organization in the control processes

Reporting metrics on CIs, Changes and Releases

Auditing and reporting exceptions to infrastructure standards and Configuration Management procedures" (SS 7.5.2)

6.5.          Configuration Management Key Performance Indicators

·         "Occasions when the ‘configuration’ is not as authorized

·         Incidents and Problems that can be traced back to wrongly made Changes

·         RFCs that were not completed successfully because of poor impact assessment, incorrect data in the CMDB, or poor version control

·         The cycle time to approve and implement Changes

·         Licenses that have been wasted or not put into use at a particular location

·         Exceptions reported during configuration audits

·         Unauthorized IT components detected in use" (SS 7.7.2)

6.6.          Configuration Management Possible Implementation Problems

·         CIs are defined at the wrong level with too much detail

·         Implementation is attempted without adequate analysis and design

·         Tactical schedules are overly ambitious

·         Commitment is lacking

·         The process is perceived to be too bureaucratic or rigorous

·         The process is routinely circumvented

·         Processes are inefficient and error-prone

·         Expectations of what the tool can do are unrealistic

·         The chosen tool may lack flexibility

·         Configuration Management implemented without Change and Release Management

·         Expectations about the process are unrealistic

·         Proper configuration control is not in place (SS 7.4.2)

6.7.          Configuration Management Tools

·         Configuration Management Database

·         Software configuration management tools

·         Change Management support (impact analysis)

·         Automation for configuration auditing

·         Automation for release deployment

·         Definitive Software Library

·         Definitive Hardware Store

·         Document management system

·         Design, analysis and build tools

·         Report generation tools

6.8.          Configuration Management Inputs and Outputs

All of the ITIL Service Management processes and functions are dependent upon the Configuration Management process. The primary interfaces, however, are with Service Desk, Incident Management, Change Management and Configuration Management. Those flows are included in this document.

Inputs:

·         Incidents, Problems, Known Errors and updates

·         RFC data and status updates

·         SW & HW for DSL/DHS

·         Configuration audits

Outputs:

·         Forward Schedule of Changes

·         Projected Service Availability

·         RFC, assessment and updates

·         Releases from DSL/DHS & library

·         Updated CIs and relationships

·         Management information

·         Linkages to the CDB

Activities:

·         Configuration Management planning

·         Configuration Identification

o        Configuration structures and the selection of CIs

o        CI types and life-cycles

o        CI relationships

o        Identification of software and document libraries

o        Identification of configuration baselines

o        Naming conventions

o        Labeling CIs

·         Control of CIs

o        Registration of new CIs and versions

o        Software developed in house

o        Off-the-shelf CIs

o        New CIs and versions from building and releasing

o        Updating CIs

o        License control

o        Updating and archiving configuration records of withdrawn/decommissioned CIs

o        Protecting the integrity of configurations

o        Updating the CMDB after checking the physical items

·         Configuration status accounting

·         Configuration verification and audit

·         CMDB back-ups, archives and housekeeping

·         Adding value to the Service Management organization

6.8.1.      Configuration Management Inputs and Outputs for the Incident Life-Cycle

The primary focus of the Service Desk will be Incident Management. The relationship of Incident Management with the Configuration Management Database is illustrated below.

 

6.8.2.      Configuration Management Inputs and Outputs for the Incident Life-Cycle

Interface

Description

Start

The Incident Life-Cycle begins with first notification of an Incident. The majority of all Incidents will be reported to the Service Desk, but some Incidents currently route directly to an alternative support process.

Incident

Whether the Incident is directly reported to the Service Desk or to an alternative support process, the first action to take is to create an Incident Record. Incident Records are to be stored in the Configuration Management Database to facilitate matching against the Configuration Item and to support trending and reporting against Configuration Items.

The Incident Management process is responsible for restoration of normal service. The scope boundary for Incident Management must be clearly defined so that Incident Management has a clear delineation from Problem Management. Once Incident Management has reached that boundary, it is the responsibility of Incident Management to escalate to Problem Management.

Problem

Escalation to Problem Management is documented by creating a Problem Record in the Configuration Management Database. Problem Records should be in the Configuration Management Database for reasons similar to those listed above for Incidents. Additionally, it is important that searches and reports can quickly identify the linkages between Incidents and Problems.

The flow through Problem Management is divided into Problem Control and Error Control. The primary duty of Problem Control is to take symptoms and isolate them to the underlying cause. Once isolated, the underlying cause can be defined as a Known Error.

Known Error

ITIL defines a Known Error as: “An Incident or Problem for which the root cause is known and for which a temporary Work-around or a permanent alternative has been identified. If a business case exists, an RFC will be raised, but, in any event, it remains a known error unless it is permanently fixed by a Change.” It is important that Known Errors be recorded in the Configuration Management Database to:

1)      Allow Problem Management to link the Known Error to the Problem Records and thus to the Incident Records

2)      Allow the Service Desk to advise the user on the status of the investigation into a permanent resolution to their Problem

3)      Allow Configuration Management to advise Change Management with the list of Configuration Items

Request for Change

Once it is recognized that a Change is required in order to resolve a Known Error, the Problem Management team must to create a Request for Change. The Request for Change belongs in the Configuration Management Database so that it can be linked to the Known Error, and thus linked to the Problem Records, Incident Records and underlying Configuration Items.

Change Authorized

The status of the Request for Change must be tracked. The flow through the Change Management process is described elsewhere in this document. Each status update, however, must be recorded in the Configuration Management Database so that it is clearly visible to all who need status. The Service Desk, as the primary interface to the users, will use this status, via the linkage to the user’s Configuration Items, to advise the users as to the progress on resolving their Problem.

Change tested, Implemented and Released

Release Management must be able to update the Request for Change and appropriate Configuration Items to document the progress of their work.

End

The Incident Life-Cycle ends when the underlying Problem has been resolved to the satisfaction of the user and all tributary Incidents closed.

6.8.3.      Configuration Management Inputs and Outputs for Change Management

The complexity of the interface between Change Management and Configuration Management is illustrated in the following diagram and text.

6.8.4.      Configuration Management Inputs and Outputs for Change Management

Interface

Description

Request for Change Register and allocation RFC number

A Request for Change is the entry point into the Change Management process. It is important that all Requests for Change be recorded in the Configuration Management Database so that they can be properly linked to the Configuration Items that will be affected.

Reports & Configuration audit to check environment

Many Changes have prerequisites. It is the responsibility of Configuration Management to ensure that the Configuration Items are accurate so that the impact assessment can proceed.

CMDB

CMDB stands for Configuration Management Database. This is the central repository for all configuration information. It is also recommended that all Incident Records, Problem Records, Known Errors and Requests for Change be linked through this repository.

Definitive Software Library

The library in which the definitive authorized versions of all software CIs are stored and protected. It is a physical library or storage repository where master copies of software versions are placed. This one logical storage area may in reality consist of one or more physical software libraries or file stores. They should be separate from development and test file store areas. The DSL may also include a physical store to hold master copies of bought-in software, e.g. a fireproof safe. Only authorized software should be accepted into the DSL, strictly controlled by Change and Release Management.

The DSL exists not directly because of the needs of the Configuration Management process, but as a common base for the Release Management and Configuration Management processes.

Assess

Analyze impact, broker agreement where assessors differ

Change Management is dependent upon Configuration Management to provide an accurate and comprehensive description of the impact from the proposed Change. Change Management will then use that assessment during evaluation and approval.

Reports on CIs, areas & parties impacted

The key benefit of a Configuration Management Database is the definition of the relationships between Configuration Items. Configuration Management makes use of those relationship definitions when assessing the impact that a proposed Change will have on the infrastructure and on the vital business functions.

Approved Change

The Change Advisory Board will meet and reject, approve or approve with conditions.

Update CM records

It is the responsibility of Configuration Management to record the status of the Request for Change in the Configuration Management Database. This is an on-going effort, but it is highlighted here as a reminder of the importance of this effort.

Implement Change

This will involve pre-release testing where software changes are required

The Change Management process will authorize a release schedule based upon the details accompanying the Request for Change. Release Management will then ensure that the appropriate development and testing processes are followed.

Release & distribute new versions of software and hardware with documentation

When the Release is ready, Release Management is authorized to begin distributing the Release. Implementation of the Release, however, must follow the pre-approved schedule.

Baseline, release software from DSL & update DSL & CM records

When the Release is ready, Release Management will provide Configuration Management with a release package. Configuration Management is responsible for uploading the Release into the Definitive Software Library and updating the appropriate Configuration Management Database records.

Post-Implementation Review

Has change met originator's expectations?

“Following successful implementation of Changes to resolve errors, the relevant Known error record(s) is closed, together with any associated Incident or Problem records. Consideration should be given to inserting into the process an interim status, on the Incident, Known Error and Problem records, of ‘Closed pending PIR’ to ensure that the fixes have actually worked. A Post-Implementation Review (PIR) can then confirm the effectiveness of the solution prior to final closure.”

Close Change

The Change can only be closed when the user has verified that the Problem has been resolved.

Check all CM records were updated

It is the responsibility of Configuration Management to insure that all of the linked Requests for Change, Known Errors, Problem Records, Incident Records and Configuration Items are appropriate updated to record the implementation of the Release and the resolution of the Problem.

End

The end of this Process.

 

6.9.          Configuration Management Process Flow

6.9.1.      Configuration Management Process Flow Diagram

6.9.2.      Configuration Management Process Flow Description

Aspect

Description

New

When Configuration Management first initiates, it is necessary to begin planning for the Configuration Management Process. This same task recurs whenever significant changes are scheduled.

Planning

ITIL defines planning as “defining the purpose, scope, objectives, policies and procedures and the organizational and technical context, for Configuration Management.”

Identification

ITIL defines Identification as “selecting and identifying the configuration structures for all the infrastructure’s CIs, including their ‘owner’, their interrelationships and configuration documentation. It includes allocating identifiers and version numbers for CIs, labeling each item, and entering it on the Configuration management database (CMDB).” The key distinction between an ITIL Configuration Management Database and a simple asset tracking system is the ability to define and track the relationships between the Configuration Items.

Control

Control is the core within Configuration Management. ITIL defines this as “ensuring that only authorized and identifiable CIs are accepted and recorded, from receipt to disposal. It ensures that no CI is added, modified, replaced or removed without appropriate controlling documentation, e.g. an approved Change request, and an updated specification.” That definition is too succinct in that Control is the daily activity for the Configuration Management group.

On Demand

While Planning, Identification and Control flow in a linear sequence, “Status Accounting” and “Verification and Audit” are only executed as required.

Status Accounting

ITIL defines Status accounting as “the reporting of all current and historical data concerned with each CI throughout its life cycle. This enables Changes to CIs and their records to be traceable, e.g. tracking the status of a CI as it changes from one state to another for instance ‘under development’, ‘being tested’, ‘live’, or ‘withdrawn’.” This is the activity of generating reports, updating status and supplying data to the other ITIL processes.

Verification and Audit

Periodically it is best practice to validate the Configuration Items by performing a physical inventory of the assets. ITIL defines this activity as “a series of reviews and audits that verify the physical existence of CIs and check that they are correctly recorded in the Configuration management system.”

 

6.10.      Configuration Management Relationships with other Service Management Processes

Please refer to the prior section for the relationship of Configuration Management with Service Desk, Incident and Problem Management

Configuration Management with Change Management

Provide an accurate representation of the infrastructure to support other ITIL processes (2.6)

Verify that no Configuration Items are modified without going through the Change Management process

Configuration Management works together with Change Management and Release Management to deploy new and updated deliverables

Policies should be jointly developed by Configuration Management, Change Management and Release Management

Configuration Management is responsible for assessing the impact from proposed changes (2.6)

Configuration Management needs to document the history of changes for each Configuration Item

Configuration Management is responsible for verifying that the environment is compatible with the proposed change

New builds should be verified against the specifications and documented as an "as-built" configuration

Audits should be conducted by Configuration Management to insure that all changes have gone through the proper process

All of the ITIL processes should be entered into the Configuration Management Database as Configuration Items and subject to Change Management (2.6, 3.2)

It is best practice to implement Change and Configuration Management at the same time

Configuration Management is responsible for assessing the impact from proposed changes

Configuration Management needs to document the history of changes for each Configuration Item

The manager of the Configuration process can also manage the Change process

Effective Change Management relies on the Service Desk and Configuration, Problem and Release Management

Change Management depends upon the accuracy of the configuration data to ensure the full impact of making Changes is known (2.7)

 

Configuration Management with Release Management

Provide an accurate representation of the infrastructure to support other ITIL processes

Configuration Management works together with Change Management and Release Management to deploy new and updated deliverables

Policies should be jointly developed by Configuration Management, Change Management and Release Management

Release Management is responsible for populating the Definitive Software Library

Release Management must insure that all changes are reflected in the Configuration Management Database

Details about the build process and build environment should be recorded when a build is moved into the Definitive Software Library

All of the ITIL processes should be entered into the Configuration Management Database as Configuration Items and subject to Change Management (3.2)

Release Management is dependent upon the success of Configuration Management (2.6)

Release Management is to insure that the required software, hardware, documentation, etc are properly archived (DSL, DHS, etc) and that only authorized versions are installed

Release Management coordinates all Changes with Configuration Management and Change Management (2.8)

Release Management works with Configuration Management and Change Management to insure that the Configuration Management Database is accurately maintained (2.7)

Release Management should work to update the Configuration Management Database during the build cycle in anticipation of the release

When a Release is managed in phases, the Configuration Management Database should use a unique identifier to link the individual releases into one Release

The Definitive Software Library must be protected from unauthorized change

All additions to the Definitive Software Library must have corresponding entries in the Configuration Management Database

The Configuration Management Database should designate where the Release is to be applied

The Configuration Management Database should be updated with status to indicate receipt of the media, etc.

The detail description on how to build the Release should be a Configuration Item

The detailed description of the test environment should be a Configuration Item

The process for disposing of retired assets must be documented

Configuration Management should ensure that all pre-requisites have been met

Configuration Management should ensure that the Release has been successfully applied to all designated locations

Configuration Management and Release Management can work as one organization or can be combined with Change Management

Minimizing the number of variants will make Release Management easier

Ideally the users should be able to quickly determine the version of every component on their desktop from one screen

 

Configuration Management with Service Level Management

Provide an accurate representation of the infrastructure to support other ITIL processes (2.6)

Service Level Management can use the Configuration Management Database to house details about the services and link those services to Configuration Items

All of the ITIL processes should be entered into the Configuration Management Database as Configuration Items and subject to Change Management (3.2)

The Service Catalog can be stored as Configuration Items in the Configuration Management Database

It is best to link the Incident Records, Problem Records, Requests for Change, etc. to defined Services

Many organizations use their Service Desk linked to a comprehensive Configuration Management Database to monitor the Customer's perception of Availability

Changes to a Service Level Agreement must be reflected in the Service Catalog

Planning for a new Service should include the Service Desk, Change, Configuration and Release Management to verify resource and training requirements

 

Configuration Management with Financial Management

Configuration Management is responsible for verifying receipt of new Configuration Items

The Configuration Management Database describes the corporate assets

All of the ITIL processes should be entered into the Configuration Management Database as Configuration Items and subject to Change Management (3.2)

Financial Management needs to know the components utilized by each business unit especially when Charging is in place (2.6)

Financial Management works with Capacity, Configuration and Service Level Management to estimate the costs for the desired Capacity (2.2, SD 5.1.5)

Financial Management assists the other ITIL Processes in assessing the cost-effectiveness of decisions (SD 5.1.6)

Cost is an attribute of a Configuration Item (SD 5.1.5)

Service Desk and integrated Service Management processes are likely to be able o provide data to be input to cost calculation (SD 5.5.8)

The IT Accounting group must be trained on the fundamentals of Capacity, Change, Configuration and Service Level Management (SD 5.5.9)

Financial Management should conduct periodic audits to ensure that the interfaces into Configuration, Capacity and Service Level Management are working and provide the necessary workload information (SD 5.7.11)

Financial Management should work with Configuration Management to ensure that the Configuration Items are up-to-date and accurate (SD 5.7.11)

 

Configuration Management with Capacity Management

Provide an accurate representation of the infrastructure to support other ITIL processes

The Capacity Plan and other Service Management Plans should be identified in the Configuration Management Database

All of the ITIL processes should be entered into the Configuration Management Database as Configuration Items and subject to Change Management (3.2)

Capacity Management should work with Change, Configuration and Release Management to create and implement Requests for Change to ensure that appropriate Capacity is available (2.3)

The performance requirements for a Service should be recorded as Configuration Items (SD 6.2.1)

The capacity and configuration of Configuration Items should be accessible from both the Capacity Management Database and the Configuration Management Database (SD 6.2.1)

The Capacity Management and Configuration Management Databases can be merged into one database (SD 6.2.1)

Capacity Management monitoring is linked through the Configuration Management Database to Changes both to predict the probable impact and to asses the actual impact (SD 6.3.2)

The Capacity Management Database needs to be linked into the Configuration Management Database to assist Service Management in assessing proposed Service Level Agreements and Problem Management in diagnosing Problems (SD 6.7.5)

 

Configuration Management with IT Service Continuity Management

Provide an accurate representation of the infrastructure to support other ITIL processes

The definition of the Configuration Items and the copies in the Definitive Software Library can be used during disaster recovery

All of the ITIL processes should be entered into the Configuration Management Database as Configuration Items and subject to Change Management (3.2)

Service Continuity and Availability Management rely on the Configuration Management Database to identify components to perform risk analysis and component failure impact analysis (2.6)

Configuration Management should verify that the restored configuration matches the prior configuration

A successful Service Continuity program is dependent upon rigorous Configuration and Change Management processes (SD 7.1.4)

Service Continuity is dependent upon Configuration Management for the definition of the core infrastructure (SD 7.1.8)

Service Continuity is not focused minor technical faults unless that fault can cause a major impact on the business. Those risks are to be worked through Incident and Problem Management, addressed through Availability and Problem Management and resolved through Change and Configuration (and Release) Management (SD 7.2.2)

Recovery planning requires an in-depth understanding of the relationships between the components in the infrastructure (SD 7.3.3)

 

Configuration Management with Availability Management

All of the ITIL processes should be entered into the Configuration Management Database as Configuration Items and subject to Change Management (3.2)

Configuration and monitoring data on each component is used in Availability planning

The Configuration Management Database serves Availability Management through definitions of the configuration and by recording Incidents, Problems and Requests for Change (SD 8.8.2)


7.                Change Management


7.1.          Change Management Synopsis

Goals

Standardized methods

Efficient and prompt handling of Changes

Minimize impact from Change

Improve day-to-day operations

 

Benefits

IT aligned with business

Increased visibility and communication

Improved risk assessment

Reduced impact from Changes

Fewer Changes backed-out

Provide data to PM and Availability

Increased productivity of users

Fewer IT diversions

Ability to absorb more Changes

Enhanced business perspective on IT

 

Critical Success Factors

CM requires an appropriate tool

Implement CM with Configuration & Release

Plan, test and train for CM process

Management commitment

Audit trails in the tool

CAB requires authority

Introduce regular Change Slots

 

Key Performance Indicators

Number of Changes, grouped

Breakdown reasons for Change

Number successful

Number backed-out

Number secondary Incidents

Number and trends on RFCs

Size of review backlog

High RFCs due to fragile components

Prior year comparison

Number of RFCs rejected

 

Possible Implementation Problems

Too bureaucratic

Cultural difficulties with single CM system

Process circumvented

Dependent on Configuration Management

 

Tools

CMDB with CI, IR, PR, KE and RFCs

 

Activities

Filtering Changes

Managing Changes and the Change Process

Chairing the CAB and the CAB/EC

Reviewing and closing RFCs

Management reporting

 

Inputs

RFCs

CMDB

FSC

 

Outputs

FSC

RFCs

CAB minutes

Management reports

 

Manager’s Duties

Receive, log, prioritize, filter RFCs

Agenda, plan & conduct CAB & CAB/EC

Authorize Changes based on CAB

Issue FSC via Service Desk

Liaise & coordinate Change process

Update log & RFC

Provide management reports & trends

 

Other Roles

Change Advisory Board

Change Advisory Board/Emergency Committee



 

7.2.          Change Management Goal Statement

The goal of the Change management process is to ensure that standardized methods and procedures are used for efficient and prompt handling of all Changes, in order to minimize the impact of Change-related Incidents upon service quality, and consequently to improve the day-to-day operations of the organization.

7.3.          Change Management Benefits

·         "Better alignment of IT services to business requirements

·         Increased visibility and communication of Changes to both business and service-support staff

·         Improved risk assessment

·         A reduced adverse impact of Changes on the quality of services and on SLAs

·         Better assessment of the cost of proposed Changes before they are incurred

·         Fewer Changes that have to be backed-out, along with an increased ability to do this more easily when necessary

·         Improved Problem and Availability Management through the use of valuable management information relating to changes accumulated through the Change Management process

·         Increased productivity of Users – through less disruption and, higher-quality services

·         Increased productivity of key personnel through less need for diversion from planned duties to implement urgent Changes or back-out erroneous Changes

·         Greater ability to absorb a large volume of Changes

·         An enhanced business perception of IT through an improved quality of service and a professional approach" (SS 8.4.1)

7.4.          Change Management Critical Success Factors

·         CM requires an appropriate tool

·         Implement CM with Configuration & Release

·         Plan, test and train for CM process

·         Management commitment

·         Audit trails in the tool

·         CAB requires authority

·         Introduce regular Change Slots

7.5.          Change Management Key Performance Indicators

·         “The number of Changes implemented in the period, in total and by CI, configuration type, service, etc

·         A breakdown of the reasons for Change (User requests, enhancements, business requirements, service call/Incident/Problem fixes, procedures/training improvement, etc)

·         The number of Changes successful

·         The number of Changes backed-out, together with the reasons (e.g. incorrect assessment, bad build)

·         The number of Incidents traced to Changes (broken down into problem-severity levels) and the reasons (e.g. incorrect assessment, bad build)

·         The number of RFCs (and any trends in origination)

·         The number of implemented Changes reviewed, and the size of review backlogs (broken down over time)

·         High incidences of RFCs/PRs (Problem Records) relating to one CI (these are worthy of special attention), giving the reasons (e.g. volatile User requirement, fragile component, bad build)

·         Figures from previous periods (last period, last year) for comparison

·         The number of RFCs rejected

·         The proportion of implemented Changes that are not successful (in total and broken down by CI)

·         Change backlogs, broken down by CI and by stage in the Change Management process" (SS 8.7)

7.6.          Change Management Possible Implementation Problems

·         The process is perceived to be too bureaucratic or rigorous

·         Cultural difficulties in getting staff, Customers and Users to adopt a single Change Management system

·         The process is routinely circumvented

·         Change Management implemented without Configuration Management

7.7.          Change Management Tools

"For all but the smallest of organizations, a Configuration management-based tool, capable of storing all relevant configuration items (CIs), and the important relationships between them, should be used. Such a tool should have the following facilities:

·         Requests for Change (RFC) and Problem Records (PR) stored upon the same database, in an easily accessible format

·         The ability to identify the relationship between RFCs, PRs and CIs

·         The capability to link RFCs to projects

·         The means to identify easily the other CIs that will be impacted whenever a Change to any specific CI is proposed

·         Automatic production of requests for impact and resources assessment to the ‘owners’ of the impacted CIs

·         The ability for all authorized personnel to submit RFCs from their own terminal or location

·         The ability to ‘progress’ requests through the appropriate stages of authorization and implementation and to maintain clear records of this progress

·         The ability to allow Change Management staff, Change builders, testers, etc to add text to Change records

·         Clear definition of back-out procedures should a Change cause problems

·         Automatic warnings of any RFCs that exceed pre-specified time periods during any stage

·         Automatic prompting to carry out reviews of implemented Changes

·         Automatic generation of management and trend information relating to Changes

·         The ability to build Changes

·         Automatic production of Forward Schedule of Changes (FSC)

·         A process/workflow feature" (SS 8.8)

7.8.          Change Management Inputs and Outputs

Inputs:

·         Requests for Change

·         Configuration Management DataBase

·         Forward Schedule of Changes

Outputs:

·         Updated Forward Schedule of Changes

·         Updated Requests for Change

·         Change Advisory Board (CAB) minutes and actions

·         Change Management Reports

Activities:

·         Filtering Changes

·         Managing Changes and the Change Process

·         Chairing the CAB and the Change Advisory Board/Emergency Committee

·         Reviewing and closing Requests for Change

·         Management reporting

7.8.1.      Change Management Inputs and Outputs Diagram

 

7.8.2.      Change Management Inputs and Outputs Description

Interface

Description

Incident, Problem, Known Error and Request for Change linked to Configuration Items

The Configuration Management Database (CMDB) is the central repository for ITIL information. A Change is initiated with a Request for Change (RFC). That RFC should be linked to the relevant data in the CMDB including a history of Incidents and Problems. The Known Errors that this RFC is to resolve and indirect linkages to all of the Configuration Items that will be impacted by this Change.

Change Management Process and Policy

The details underlying this Change Management process must be documented in predefined processes and policies. The Change Manager uses those policies when assigning the Change to a Category, assigning a Priority and assessing the RFC. Other players in the Change Management process include the roles of Change Initiator, Change Builder, Independent Tester, Change Implementer, Change Advisory Board and Change Advisory Board / Emergency Committee. All of those roles and their responsibilities must be defined in the Change Management policies.

Build, implementation and back-out templates, policies and plans

Few Changes are truly new. Thus, those responsible for building the product, defining the implementation, testing and back out plans should have standard templates to work from. Additionally, when this work overlaps with Release Management and Configuration Management then policies for those processes must also be referenced.

Results from Testing

ITIL recommends assigning the testing of the Change, the implementation plan and the back out plan to an Independent Tester. That tester will report the results of those tests to the Change Builder and the Change Manager.

Results from Implementation

The results from implementing the Change will be reported to the Change Manager along with any secondary Incidents or Problems generated by the Change process.

User evaluation of results

An RFC is only successful if the user is satisfied that the Problem(s) addressed by this RFC have been successfully resolved.

Change Management

The flow through the Change Management process is illustrated in the diagrams immediately below this section.

Request for Change status updates, assessment and schedule

The Change Manager is responsible for coordinating the Change and providing updates to the Configuration Manager so that the RFC can be updated.

Forward Scheduled of Change, and Planned Schedule of Availability

This output is from the Configuration Management process and is included here only to fill in an obvious gap.

Communication with Service Desk, Users, Customers and other ITIL processes

This output is from the Configuration Management process and is included here only to fill in an obvious gap.

Change Review assessment, process adjustments and process metrics

Periodic Change Reviews will be scheduled. The Change Review can either be done by the Change Manager or by the Change Advisory Board (CAB). The purpose for the Change Review is to assess the success of the Change and evaluate the process.

 

7.9.          Change Management Process Flow

The Change Management Process Flow has been separated into three. The first diagram illustrated the Authorization Flow. The second diagram shows the Implementation Flow. The first and second diagrams are really one process, separated into two diagrams to simplify the presentation. The third diagram is an alternate path that should be followed when the Change requires urgent processing.

7.9.1.      Change Management Authorization Flow Diagram

 

7.9.2.      Change Management Authorization Flow Description

Aspect

Description

Start

The Change Management process formally begins with the creation of a Request for Change (RFC). This process can begin with a form transmitted through email. The person who creates the RFC is called the Change Initiator.

Change Manager filters requests

The Change Manager assesses the RFC for completeness.

Configuration Manager logs the RFC

The Configuration Manager is responsible for administration of the RFC. All RFCs will be stored in the Configuration Management Database (CMDB). After the Change Manager filters the RFC, the Configuration Manager will enter the RFC into the CMDB.

Accepted

The Change Manager can either accept the RFC for processing, or reject it and return it to the Change Initiator for correction.

Change Manager assigned initial priority

The Change Manager will review the RFC and assign an initial priority.

Configuration Manager updates log

The Configuration Manager will shadow the Change Manager and record the priority in the CMDB.

Urgent ?

If the RFC is Urgently needed, the Change Manager will divert the RFC from the normal flow over to the Urgency Flow.

See Urgency Flow

The Urgency Flow is documented later in this document.

Change Manager decides on category

The Change Manager will continue the evaluation of this RFC and assign it to the appropriate category based upon the scope of the Change.

Configuration Manager reports on impact