# Failure rate

**Failure rate** is the frequency with which an engineered system or component fails, expressed, for example, in failures per hour. It is often denoted by the Greek letter λ (lambda) and is important in reliability engineering.

The failure rate of a system usually depends on time, with the rate varying over the life cycle of the system. For example, an automobile's failure rate in its fifth year of service may be many times greater than its failure rate during its first year of service. One does not expect to replace an exhaust pipe, overhaul the brakes, or have major transmission problems in a new vehicle.

In practice, the mean time between failures (MTBF, 1/λ) is often reported instead of the failure rate. This is valid and useful if the failure rate may be assumed constant – often used for complex units / systems, electronics – and is a general agreement in some reliability standards (Military and Aerospace). It does in this case *only* relate to the flat region of the bathtub curve, also called the "useful life period". Because of this, it is incorrect to extrapolate MTBF to give an estimate of the service life time of a component, which will typically be much less than suggested by the MTBF due to the much higher failure rates in the "end-of-life wearout" part of the "bathtub curve".

The reason for the preferred use for MTBF numbers is that the use of large positive numbers (such as 2000 hours) is more intuitive and easier to remember than very small numbers (such as 0.0005 per hour).

The MTBF is an important system parameter in systems where failure rate needs to be managed, in particular for safety systems. The MTBF appears frequently in the engineering design requirements, and governs frequency of required system maintenance and inspections. In special processes called renewal processes, where the time to recover from failure can be neglected and the likelihood of failure remains constant with respect to time, the failure rate is simply the multiplicative inverse of the MTBF (1/λ).

A similar ratio used in the transport industries, especially in railways and trucking is "mean distance between failures", a variation which attempts to correlate actual loaded distances to similar reliability needs and practices.

Failure rates are important factors in the insurance, finance, commerce and regulatory industries and fundamental to the design of safe systems in a wide variety of applications.

## Contents

## Failure rate in the discrete sense

The failure rate can be defined as the following:

- The total number of failures within an item population, divided by the total time expended by that population, during a particular measurement interval under stated conditions. (MacDiarmid,
*et al.*)

Although the failure rate, , is often thought of as the probability that a failure occurs in a specified interval given no failure before time , it is not actually a probability because it can exceed 1. Erroneous expression of the failure rate in % could result in incorrect perception of the measure, especially if it would be measured from repairable systems and multiple systems with non-constant failure rates or different operation times. It can be defined with the aid of the reliability function, also called the survival function, , the probability of no failure before time .

over a time interval from (or ) to and is defined as . Note that this is a conditional probability, hence the in the denominator.

The function is a CONDITIONAL probability of the failure DENSITY function. The condition is that the failure has not occurred at time .

Hazard rate and ROCOF (rate of occurrence of failures) are often incorrectly seen as the same and equal to the failure rate.

## Failure rate in the continuous sense

Calculating the failure rate for ever smaller intervals of time, results in the **Template:Visible anchor** (also called **hazard rate**), . This becomes the *instantaneous* failure rate as tends to zero:

A continuous failure rate depends on the existence of a **failure distribution**, , which is a cumulative distribution function that describes the probability of failure (at least) up to and including time *t*,

where is the failure time.
The failure distribution function is the integral of the failure *density* function, *f*(*t*),

The hazard function can be defined now as

Many probability distributions can be used to model the failure distribution (*see List of important probability distributions*). A common model is the **exponential failure distribution**,

which is based on the exponential density function. The hazard rate function for this is:

Thus, for an exponential failure distribution, the hazard rate is a constant with respect to time (that is, the distribution is "memory-less"). For other distributions, such as a Weibull distribution or a log-normal distribution, the hazard function may not be constant with respect to time. For some such as the deterministic distribution{{ safesubst:#invoke:Unsubst||date=__DATE__ |$B=
{{#invoke:Category handler|main}}{{#invoke:Category handler|main}}^{[citation needed]}
}} it is monotonic increasing (analogous to "wearing out"), for others such as the Pareto distribution it is monotonic decreasing (analogous to "burning in"), while for many it is not monotonic.

## Decreasing failure rate

A decreasing failure rate (DFR) describes a phenomenon where the probability of an event in a fixed time interval in the future decreases over time. A decreasing failure rate can describe a period of "infant mortality" where earlier failures are eliminated or corrected^{[1]} and corresponds to the situation where λ(*t*) is a decreasing function.

Mixtures of DFR variables are DFR.^{[2]} Mixtures of exponentially distributed random variables are hyperexponentially distributed.

### Renewal processes

For a renewal process with DFR renewal function, inter-renewal times are concave.^{[2]}^{[3]} Brown conjectured the converse, that DFR is also necessary for the inter-renewal times to be concave,^{[4]} however it has been shown that this conjecture holds neither in the discrete case^{[3]} or continuous case.^{[5]}

### Applications

Increasing failure rate is an intuitive concept caused by components wearing out. Decreasing failure rate describes a system which improves with age.^{[6]}
Decreasing failure rates have been found in the lifetimes of spacecraft, Baker and Baker commenting that "those spacecraft that last, last on and on."^{[7]}^{[8]} The reliability of aircraft air conditioning systems were individually found to have an exponential distribution, and thus in the pooled population a DFR.^{[6]}

### Coefficient of variation

When the failure rate is decreasing the coefficient of variation is ⩾ 1, and when the failure rate is increasing the coefficient of variation is ⩽ 1.^{[9]} Note that this result only holds when the failure rate is defined for all t ⩾ 0^{[10]} and that the converse result (coefficient of variation determining nature of failure rate) does not hold.

## Failure rate data

Failure rate data can be obtained in several ways. The most common means are:

- Historical data about the device or system under consideration
- Many organizations maintain internal databases of failure information on the devices or systems that they produce, which can be used to calculate failure rates for those devices or systems. For new devices or systems, the historical data for similar devices or systems can serve as a useful estimate.
- Government and commercial failure rate data
- Handbooks of failure rate data for various components are available from government and commercial sources. MIL-HDBK-217F,
*Reliability Prediction of Electronic Equipment*, is a military standard that provides failure rate data for many military electronic components. Several failure rate data sources are available commercially that focus on commercial components, including some non-electronic components. - Testing
- The most accurate source of data is to test samples of the actual devices or systems in order to generate failure data. This is often prohibitively expensive or impractical, so that the previous data sources are often used instead.

### Units

Failure rates can be expressed using any measure of time, but **hours** is the most common unit in practice. Other units, such as miles, revolutions, etc., can also be used in place of "time" units.

Failure rates are often expressed in engineering notation as failures per million, or 10^{−6}, especially for individual components, since their failure rates are often very low.

The **Failures In Time** (**FIT**) rate of a device is the number of failures that can be expected in one billion (10^{9}) device-hours of operation. (E.g. 1000 devices for 1 million hours, or 1 million devices for 1000 hours each, or some other combination.) This term is used particularly by the semiconductor industry.

The relationship of FIT to MTBF may be expressed as: MTBF = 1,000,000,000 x 1/FIT.

### Additivity

Under certain engineering assumptions (e.g. besides the above assumptions for a constant failure rate, the assumption that the considered system has no relevant redundancies), the failure rate for a complex system is simply the sum of the individual failure rates of its components, as long as the units are consistent, e.g. failures per million hours. This permits testing of individual components or subsystems, whose failure rates are then added to obtain the total system failure rate.{{ safesubst:#invoke:Unsubst||date=__DATE__ |$B=
{{#invoke:Category handler|main}}{{#invoke:Category handler|main}}^{[citation needed]}
}}

### Example

Suppose it is desired to estimate the failure rate of a certain component. A test can be performed to estimate its failure rate. Ten identical components are each tested until they either fail or reach 1000 hours, at which time the test is terminated for that component. (The level of statistical confidence is not considered in this example.) The results are as follows:

Estimated failure rate is

or 799.8 failures for every million hours of operation.

## Estimation

The Nelson–Aalen estimator can be used to estimate the cumulative hazard rate function.

## See also

{{#invoke:Portal|portal}}

## References

- ↑ Template:Cite doi
- ↑
^{2.0}^{2.1}Template:Cite doi - ↑
^{3.0}^{3.1}Template:Cite doi - ↑ Template:Cite doi
- ↑ Template:Cite doi
- ↑
^{6.0}^{6.1}Template:Cite doi - ↑ Template:Cite doi
- ↑ Template:Cite doi
- ↑ Template:Cite doi
- ↑ {{#invoke:citation/CS1|citation |CitationClass=book }}

- {{#invoke:citation/CS1|citation

|CitationClass=book }}

- {{#invoke:citation/CS1|citation

|CitationClass=book }}

- Federal Standard 1037C
- {{#invoke:citation/CS1|citation

|CitationClass=book }}

- {{#invoke:Citation/CS1|citation

|CitationClass=journal }}

- {{#invoke:citation/CS1|citation

|CitationClass=book }}

- {{#invoke:citation/CS1|citation

|CitationClass=book }}

- {{#invoke:Citation/CS1|citation

|CitationClass=journal }}

- {{#invoke:citation/CS1|citation

|CitationClass=book }}

- {{#invoke:citation/CS1|citation

|CitationClass=book }}

## External links

*Reliability Prediction of Electronic Equipment*, MIL-HDBK-217F(2), (DOD download site.)- Bathtub curve issues by ASQC.
- Fault Tolerant Computing in Industrial Automation by Hubert Kirrmann, ABB Research Center, Switzerland
- Usenet FAQ about MTBF
- Reliability and Availability Basics
- Product failure behavior and wear out

ar:معدل الإخفاق he:פונקציית סיכון ru:Интенсивность отказов uk:Інтенсивність відмов