Comprehensive Testing Creates “Operational Familiarity” Between Companies and Their Disaster Recovery Providers

By Rodney Tyler|2022-03-29T19:05:19+00:00August 1st, 2008|0 Comments

During the recovery efforts following Hurricane Ike, organizations that con­ducted comprehensive tests of their recovery plans were operational in the allotted time with very little – if any – complica­tions. One of the benefits observed within the orga­nizations that tested was a sense of “operational familiarity” between the recovering organization and the Disaster Recovery (DR) provider.

A simple way to define operational familiarity is the personnel of two dif­ferent organizations understanding how to work best with each other, develop­ing a sense of reliance and trust in the others’ abilities. In many situations, the operational familiarity between the organizations was just as important as the technical know-how of the personnel and properly functioning equipment.

Testing provided organizations and DR coordinators a “point of reference,” and when issues would arise, it was not uncommon for one to say to the other, “Let’s configure it like the last time we tested.” Having conducted comprehen­sive testing, both organizations were aware of the others’ capabilities, per­sonalities and how best to work with each other.

In addition to the increased recovery time, the operational familiarity cre­ated a sense of purpose and experience among those that had already complet­ed the test. Each department knew what was expected of them, that it could be done, and knew what the other was doing. Additionally, the company was aware of the full extent of support the DR provider could provide. However, several organizations with untested or insufficiently tested DR plans did not do as well.

In an untested disaster recovery envi­ronment, hardware and software issues are probably easier to resolve than per­sonality conflicts. Finding out a server isn’t properly functioning in a disaster situation would be extremely disruptive to meeting a set recovery time. Just as, if not more disruptive, is discovering an individual responsible for an aspect of an organization’s recovery does not work well with others under pressure.

To illustrate the point: One organiza­tion completed a comprehensive test about a month before Hurricane Ike. The organization discovered some soft­ware issues they were able to resolve. After Hurricane Ike, the organization declared and was fully operational with­in hours of a mobile recovery center being on site. Because of testing, the recovery was a success.

On the other hand, another organiza­tion never tested, then declared after Hurricane Ike. The mobile recovery center arrived on site, and was ready to be populated; all of the DR provider’s hardware and software was in order within a few hours of deploying to the site. However, the company was having a software error on their end. The soft­ware glitch probably would have been revealed if prior testing had occurred.

Instead of being able to install and bring up several workstations simulta­neously, the IT director had to bring each one up individually, taking more than two hours, the time projected to bring the workstations up. Due to a lack of testing, the IT director was not comfortable listening to the suggestions of the DR personnel. The recovering organization then sent another special­ist from another city 200 miles away to the site to provide a new perspective on the problem.

The new specialist arrived on site and fixed the problem within the hour. Within four hours, the site was fully operational. The solution the new specialist implemented was the same solution the DR provider personnel had recommended.

Another observation: Organizations that tested had a faster recovery time, creating more time and resources to find solutions to logistical and per­sonnel issues. Hurricane Ike severely hampered supply lines for everything from gasoline to food. Power was out for several weeks in some areas, and many places had to scramble to find portable toilets. Finding fuel for vehicles, let alone generators, was difficult. Because of testing and trust in their DR pro­vider, the technical side of the business recovery was not an issue the company had to worry about.

To create operational familiarity, tests must be as realistic and stressful as a controlled setting can provide, such as setting challenging time lines to be met to be considered a successful test. The test should closely resemble an organization’s actual recovery plan. An organization’s leadership also has to support and establish a serious tone for the testing.

If a DR provider only allows a cus­tomer to “test” in the DR provider’s parking lot, or does not test all parts of the recovery plan, there are sev­eral variables left to be determined in a disaster. Organizations should be able to “declare” a test, as they would an actual disaster. Additionally, if a DR provider makes simply testing a challenge; it raises the question of how responsive the DR provider will be in an actual disaster.

Testing enables an organization to verify that in a disaster, their busi­ness operations will be taken care of, allowing the organization to focus on the actual impact of the disaster. Test­ing also fosters confidence, competence, communication and understanding – operational familiarity – between two organization’s personnel. Waiting until a disaster to implement an untested plan could end up being the costliest “test” of a DR plan.

Recommend0 recommendationsPublished in Incident & Crisis Response

Share This Story, Choose Your Platform!

About the Author: Rodney Tyler

Rodney Tyler, Services Development Manager for Rentsys Recovery Services, Inc., has been in the Disaster Recovery Industry for more than 10 years and has helped multiple Fortune 500 companies develop business recovery strategies. For more information, go to www.rentsysrecovery.com or call at 866.489.9585.

Leave A Comment

This site uses Akismet to reduce spam. Learn how your comment data is processed.