Mean time between failures (MTBF) is the predicted elapsed time between inherent failures of a system during operation. MTBF can be calculated as the arithmetic mean (average) time between failures of a system. The MTBF is typically part of a model that assumes the failed system is immediately repaired (MTTR), as a part of a renewal process. This is in contrast to the mean time to failure (MTTF), which measures average time to failures with the modeling assumption that the failed system is not repaired (infinite repair rate).
The definition of MTBF depends on the definition of what is considered a system failure. For complex, repairable systems, failures are considered to be those out of design conditions which place the system out of service and into a state for repair. Failures which occur that can be left or maintained in an unrepaired condition, and do not place the system out of service, are not considered failures under this definition. In addition, units that are taken down for routine scheduled maintenance or inventory control, are not considered within the definition of failure.
Overview
CORRECTION TO THIS PICTURE IS NEEDED. MTBF is time BETWEEN FAILURES and the measurement of the time of failure must be consistent (start at start of failure for both failures); therefore the MTBF = MTTF + MTTR. The value that is provided in the picture is MTTF and not MTBF.
For each observation, downtime is the instantaneous time it went down, which is after (i.e. greater than) the moment it went up, uptime. The difference (downtime minus uptime) is the amount of time it was operating between these two events.
MTBF value prediction is an important element in the development of products.
Reliability engineers / design engineers, often utilize Reliability Software to calculate products' MTBF according to various methods/standards (MIL-HDBK-217F, Telcordia SR332, Siemens Norm, FIDES,UTE 80-810 (RDF2000), etc.). However, these "prediction" methods are not intended to reflect fielded MTBF as is commonly believed. The intent of these tools is to focus design efforts on the weak links in the design.
Formal definition of MTBF
By referring to the figure above, the MTBF is the sum of the operational periods divided by the number of observed failures. If the "Down time" (with space) refers to the start of "downtime" (without space) and "up time" (with space) refers to the start of "uptime" (without space), the formula will be:
:
The MTBF is often denoted by the Greek letter θ, or
:
The MTBF can be defined in terms of the expected value of the density function ƒ(t)
:
where ƒ is the density function of time until failure – satisfying the standard requirement of density functions –
:
Variations of MTBF
There are many variations of MTBF, such as
mean time between system aborts (MTBSA) or
mean time between critical failures (MTBCF) or
mean time between unit replacement (MTBUR). Such nomenclature is used when it is desirable to differentiate among types of failures, such as critical and non-critical failures. For example, in an automobile, the failure of the FM radio does not prevent the primary operation of the vehicle.
Mean time to failure (MTTF) is sometimes used instead of MTBF in cases where a system is replaced after a failure, since MTBF denotes time between failures in a system which is repaired.
Notes
See also
Failure rate
Mean time to repair
Power-On Hours
References
Jones, James V., Integrated Logistics Support Handbook, McGraw–Hill Professional, 3rd edition (June 8, 2006), ISBN 0071471685
External links
CARE (Computer Aided Reliability Engineering) MTBF Prediction
Usenet FAQ about MTBF.
Reliability and Availability Basics
Summary including MTTF discussion
Disk failures in the real world: What does an MTTF of 1,000,000 hours mean to you?
Category:Survival analysis
Category:Reliability analysis