The endowment is the sum of money that, deposited with the data and invested at interest, suffices to pay for the storage of (in this case) a terabyte "forever", which in this model means 100 years.

- Provide a set of parameters and compute the model's estimate of the endowment.
- Provide a set of parameters and draw a graph of how the model's estimate of the endowment varies with the DiscountRate and KryderRate.

- *DriveCost*: the initial cost per drive, assumed constant in real dollars.
- *DriveTeraByte*: the initial number of TB of useful data per drive (i.e. excluding overhead).
- *KryderRate*: the annual percentage by which DriveTeraByte increases.
- *DriveLife*: working drives are replaced after this many years.
- *DriveFailRate*: percentage of drives that fail each year.

- *SlotCost*: the initial non-media cost of a rack (servers, networking, etc.) divided by the number of drive slots.
- *SlotRate*: the annual percentage by which SlotCost decreases in real terms.
- *SlotLife*: racks are replaced after this many years.

- *SlotCostPerYear*: the initial running cost per year (labor, power, etc.) divided by the number of drive slots.
- *LaborPowerRate*: the annual percentage by which SlotCostPerYear increases in real terms.
- *ReplicationFactor*: the number of copies. This need not be an integer, to account for erasure coding.

- *DiscountRate*: the annual real interest obtained by investing the remaining endowment.
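To make the roles of these parameters concrete, here is a minimal Python sketch of how an endowment could be computed from them. It is not the code behind this model, only an illustration under stated assumptions: the names `StorageParams` and `endowment` are hypothetical, drives and racks are assumed to be bought in year 0 and then every DriveLife and SlotLife years, the failure allowance is assumed to be the fraction of a batch expected to fail over its service life, and each year's cost is discounted back to year 0 at DiscountRate.

```python
from dataclasses import dataclass


@dataclass
class StorageParams:
    # Field names mirror the parameters above; all rates are percentages.
    drive_cost: float          # DriveCost
    drive_terabyte: float      # DriveTeraByte
    kryder_rate: float         # KryderRate
    drive_life: int            # DriveLife
    drive_fail_rate: float     # DriveFailRate
    slot_cost: float           # SlotCost
    slot_rate: float           # SlotRate
    slot_life: int             # SlotLife
    slot_cost_per_year: float  # SlotCostPerYear
    labor_power_rate: float    # LaborPowerRate
    replication_factor: float  # ReplicationFactor
    discount_rate: float       # DiscountRate


def endowment(p: StorageParams, years: int = 100) -> float:
    """Present value of storing one terabyte for `years` years (simplified)."""
    total = 0.0
    drives = 0.0  # drives (and therefore slots) currently holding the data
    for year in range(years):
        cost = 0.0
        if year % p.drive_life == 0:
            # Assumption: the whole batch is replaced with drives whose
            # capacity has grown at the Kryder rate since year 0.
            capacity = p.drive_terabyte * (1 + p.kryder_rate / 100) ** year
            drives = p.replication_factor / capacity
            # Assumption: inflate the purchase by the fraction of the batch
            # expected to fail during its service life.
            failing = 1 - (1 - p.drive_fail_rate / 100) ** p.drive_life
            cost += drives * p.drive_cost * (1 + failing)
        if year % p.slot_life == 0:
            # Rack (non-media) capital cost, falling at SlotRate per year.
            cost += drives * p.slot_cost * (1 - p.slot_rate / 100) ** year
        # Running cost, rising at LaborPowerRate per year.
        cost += (drives * p.slot_cost_per_year
                 * (1 + p.labor_power_rate / 100) ** year)
        # Discount this year's cost back to year 0.
        total += cost / (1 + p.discount_rate / 100) ** year
    return total


if __name__ == "__main__":
    # Illustrative placeholder values only, not the model's defaults.
    example = StorageParams(
        drive_cost=250.0, drive_terabyte=8.0, kryder_rate=10.0,
        drive_life=4, drive_fail_rate=2.0,
        slot_cost=150.0, slot_rate=0.0, slot_life=8,
        slot_cost_per_year=100.0, labor_power_rate=4.0,
        replication_factor=2.0, discount_rate=2.0,
    )
    print(f"Endowment per TB: ${endowment(example):,.2f}")
```

Sweeping `kryder_rate` and `discount_rate` over a grid and calling `endowment` for each pair would give the kind of data needed for the graph described above.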

- Unlike earlier published research, this model ignores the cost of ingesting the data in the first place, and of accessing it later. Experience suggests the following rule of thumb: ingest is half the total lifetime cost, storage is one-third, and access is one-sixth. Thus a reasonable estimate of the total preservation cost of a terabyte is three times the result of this model.
- The model assumes that the parameters are constant through time. Historically, interest rates, the Kryder rate, labor costs, etc. have varied, so they should properly be modelled using Monte Carlo techniques with a probability distribution for each such parameter. Real interest rates can go negative, disk cost per terabyte can spike upwards (as it did after the Thai floods), and so on. These low-probability events can have a large effect on the endowment needed, but are excluded from this model.
- There are a number of different possible policies for handling the inevitable disk failures, and different ways to model each of them. This model assumes that it is possible to predict at the time a batch of disks is purchased what proportion of them will fail, and inflates the purchase cost by that factor. This models the policy of buying extra drives so that failures can be replaced by the same drive model.
- The model assumes that drives are replaced after DriveLife years even though they are still working. Continuing to use the drives beyond this can have significant effects on the endowment; see this paper.
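Two of the notes above reduce to simple arithmetic, sketched here in Python. The function names are hypothetical. The split of total cost follows the rule of thumb stated above. The failure allowance is only one plausible reading of "inflates the purchase cost by that factor": it assumes the factor is one plus the fraction of a batch expected to fail over DriveLife years.

```python
def total_preservation_cost(storage_endowment: float) -> dict[str, float]:
    """Scale the model's storage-only result to a total lifetime cost.

    Rule of thumb from the text: ingest is 1/2 of the total, storage 1/3,
    access 1/6, so the total is three times the storage cost.
    """
    total = 3 * storage_endowment
    return {
        "ingest": total / 2,
        "storage": storage_endowment,
        "access": total / 6,
        "total": total,
    }


def failure_inflation(drive_fail_rate: float, drive_life: int) -> float:
    """Factor by which a drive purchase is inflated to cover failures.

    Assumption: a constant annual failure percentage, so the fraction of a
    batch failing over its service life is 1 - (1 - rate)^life, and that
    many spare drives are bought up front.
    """
    survive = (1 - drive_fail_rate / 100) ** drive_life
    return 1 + (1 - survive)


if __name__ == "__main__":
    # Illustrative input only: a storage endowment of $1,000 per terabyte.
    for part, dollars in total_preservation_cost(1000.0).items():
        print(f"{part:>8}: ${dollars:,.2f}")
    print(f"purchase inflation: {failure_inflation(2.0, 4):.4f}")
```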