For more than 20 years NetWorker was tape right to the bone. But no longer.
Let me explain.
Tape in the 90’s pretty much was the entire secondary storage market. Reading data from disk and packaging that data for tape was NetWorker's very first media handling function and as such it was tape’s performance and access characteristics which defined how NetWorker would write data to storage for the years which followed.
When NetWorker looked beyond tape to what was then an up and coming long term retention format, Optical media, NetWorker treated it exactly as if it were a tape. This was driven by a desire for a consistent if somewhat device independent approach to media management. Around 1996 disk as a backup target emerged blinking into the daylight and NetWorker embraced this new storage target with a File type device…which behaved exactly as if it were a tape.
But quickly you start hitting the specific limitations of tape as a medium. Tape didn’t support concurrent read/write access, tape didn’t groom space if a backup failed or was aborted, tape didn’t, didn’t, didn’t, etc, ect.
Having moved into a new decade and with a view of still keeping management consistency in 2003 with the release of NetWorker 7 came the Advanced File Type Device (AFTD) which did support concurrent read/write operations, did groom space from failed operations and whatnot. But NetWorker did this by pretending each AFTD behaved kind of like tape and a half.
That's a dramatic over simplification of the technology involved but it is accurate that the entire device model paradigm was driven entirely by a view of the world that after it was all said and done, your backup data would end up on a tape cartridge.
That is no longer the paradigm.
In NetWorker 8.0 the storage layer now treats disk as disk while keeping the required operational consistency. For any existing NetWorker users one of the things you’ll first notice in 8.0 is the AFTD Read Only device (AFTD.RO) is gone. It's gone because it's no longer required and the media storage layer has been entirely decoupled from the previous tape-centric design to one where multiple instances of nsrmmd (NetWorker media multiplexor daemon) are now capable of working with devices concurrently.
In 8.0 when using disk for backup, the same disk volume can be now mounted to multiple different AFTDs. This allows NetWorker to load balance backup & restore operations across multiple storage nodes and multiple device paths taking into account how busy devices and storage nodes are at the the time the operation request occurs.
This reduces overall device resource contention across NetWorker deployments, can increase backup and recovery success rates as it can provide multipathing to storage which increases overall system availability and does so while offering a migration path for all of the AFTDs deployed in production today.
Two other major pieces of functionality gained from the new disk as disk approach come in the shape of synthetic fulls, where a NetWorker storage node will aggregate a full backup and a series of incremental backups to generate a restorable full backup and Client Direct.
Before we look at today with Client Direct we should first look back to how things were done.
The NetWorker three tier model was introduced in NetWorker 5.0 and designed to scale the overall throughput of the backup infrastructure and the number of clients and devices a NetWorker Server could manage. At it's heart, NetWorker Clients transmit backups to NetWorker Storage Nodes which in turn perform read & write operations to media storage devices, all while the NetWorker Server manages Client & Storage Node operations, the schedule and the catalogue. Clients, Storage Nodes & Server are the three tiers.
With the ever shortening backup & recovery windows, much greater numbers of clients and expanding numbers of devices required to back up those clients, it was possible the system resources of Storage Node hardware itself could become a gating factor in performance. And to this comes Client Direct.
As the name implies this allows a backup client to directly write to and recover from disk based storage devices without a Storage Node being in the data path at all. Data no longer passes through the storage node, it just goes directly to the storage target. Be that target Data Domain with BOOST, SAN or NAS.
Client Direct can greatly simplify deployments, which has the knock on effect of increasing availability due to fewer things to go wrong, while reducing overall cost as Storage Nodes move from pushing the data to storage to just monitoring device availability and forwarding any metadata updates to the server. Bottom line, with Client Direct you require fewer Storage Nodes.
Disk as disk. Synthetic Fulls. Client Direct.
These are the key elements of the NetWorker 8.0 move to a world without tape. In NetWorker 8.0 by treating disk as disk it has unlocked greater functionality inside the software.
Functionality which was simply not possible with NetWorker's previous tape focused paradigm.