- Research
- Open access
- Published:
Extended inverse Lindley distribution: properties and application
SpringerPlus volume 4, Article number: 690 (2015)
Abstract
In this paper, we introduce an extension of the inverse Lindley distribution, which offers more flexibility in modeling upside-down bathtub lifetime data. Some statistical properties of the proposed distribution are explicitly derived. These include density and hazard rate functions with their behavior, moments, moment generating function, skewness, kurtosis measures, and quantile function. Maximum likelihood estimation of the parameters and their estimated asymptotic distribution and confidence intervals are derived. Rényi entropy as a measure of the uncertainty in the model is derived. The application of the model to a real data set i.e., the flood levels for the Susquehanna river at Harrisburg, Pennsylvania, over 20 four-year periods from 1890 to 1969 is compared to the fit attained by some other well-known existing distributions.
Background
Survival and reliability analysis is a very important branch of statistics. It has many applications in many applied sciences, such as engineering, public health, actuarial science, biomedical studies, demography, and industrial reliability. The failure behavior of any system can be considered as a random variable due to the variations from one system to another resulting from the nature of the system. Therefore, it seems logical to find a statistical model for the failure of the system. In other applications, survival data are categorized by their hazard rate, e.g., the number of deaths per unit in a period of time. The modeling of survival data depends on the behavior of the hazard rate. The hazard rate may belong to the monotone (non-increasing and non-decreasing hazard rate) or non-monotone (bathtub and upside-down bathtub [UBT] or unimodal hazard rate). Several lifetime models have been suggested in statistics literature to model survival data. The Weibull distribution is one of the most popular and widely used models in life testing and reliability theory. Lindley (1958) suggested a one-parameter distribution as an alternative model for survival data. This model is known as Lindley distribution. However, we suggest that Weibull and Lindley distributions are restricted when data shows non-monotone hazard rate shapes, such as the unimodal hazard rate function (Almalki and Nadarajah 2014; Almalki and Yuan 2013).
There are several real applications where the data show the non-monotone shape for their hazard rate. For example, Langlands et al. (1997) studied the data of 3878 cases of breast carcinoma seen in Edinburgh from 1954 to 1964 and noticed that mortality was initially low in the first year, reaching a peak in the subsequent years, and then declining slowly. Another real problem was analyzed by Efron (1988) who, using head and neck cancer data, found the hazard rate initially increased, reached a maximum, and decreased before it finally stabilized due to therapy. The inverse versions of some existing probability distributions, such as inverse Weibull, inverse Gaussian, inverse gamma, and inverse Lindley, show non-monotone shapes for their hazard rates; hence, we were able to model a non-monotone shape data.
Erto and Rapone (1984) showed that the inverse Weibull distribution is a good fit for survival data, such as the time to breakdown of an insulating fluid subjected to the action of constant tension. The use of Inverse Weibull was comprehensively described by Murthy et al. (2004). Glen (2011) proposed the inverse gamma distribution as a lifetime model in the context of reliability and survival studies. Recently, a new upside-down bathtub-shaped hazard rate model for survival data analysis was proposed by Sharma et al. (2014) by using transmuted Rayleigh distribution. Sharma et al. (2015a) introduced the inverse Lindley distribution as a one parameter model for a stress-strength reliability model. Sharma et al. (2015b) generalized the inverse Lindley into a two parameter model called “the generalized inverse Lindley distribution.” Finally, a new reliability model of inverse gamma distribution referred to as “the generalized inverse gamma distribution” was proposed by Mead (2015), which includes the inverse exponential, inverse Rayleigh, inverse Weibull, inverse gamma, inverse Chi square, and other inverse distributions.
The Lindley distribution was proposed by Lindley (1958) in the context of the Bayes theorem as a counter example of fiducial statistics with the probability density function (pdf)
Shanker et al. (2013) proposed two parameter extensions of the Lindley distribution with the pdf
Ghitany et al. (2008) discussed the Lindley distribution and its applications extensively and showed that the Lindley distribution is a better fit than the exponential distribution based on the waiting time at the bank for service. The inverse Lindley distribution was proposed by Sharma et al. (2015a) using the transformation \(X = \frac{1}{Y}\) with the pdf
where \(Y\) is a random variable having pdf (1).
Another two parameter inverse Lindley distribution introduced by Sharma et al. (2015a), called “the generalized inverse Lindley distribution,” is a new statistical inverse model for upside-down bathtub survival data that uses the transformation \(X = Y^{{ - \frac{1}{\alpha }}}\) with the pdf
with \(Y\) being a random variable having pdf (1).
Using the transformation \(X = Z^{{ - \frac{1}{\alpha }}}\), we introduce a more flexible distribution with three parameters called “extended inverse Lindley distribution”, (EIL) and this gives us a better fit for upside-down bathtub data.
The aim of this paper is to introduce a new inverse Lindley distribution with its mathematical properties. These include the shapes of the density and hazard rate functions, the moments, moment generating function and some associated measures, the quantile function, and stochastic orderings. Maximum likelihood estimation of the model parameters and their asymptotic standard distribution and confidence interval are derived. Rényi entropy as a measure of the uncertainty in the model is derived. Application of the model to a real data set is finally presented and compared to the fit attained by some other well-known distributions.
The extended inverse Lindley distribution
An extended inverse Lindley distribution with parameters \(\theta ,\beta\), and \(\alpha\) is defined by its probability density function and cumulative distribution function according to the definition.
Definition
Let \(Z\) be a random variable having pdf (2), then the random variable \(X = Z^{{ - \frac{1}{\alpha }}}\) is said to follow an EIL distribution with probability density function
and cumulative distribution function (cdf)
Remark
The pdf (3) can be shown as a mixture of two distributions as follows:
where
We see that the EPL is a two-component mixture of inverse Weibull distribution (with shape \(\alpha\) and scale \(\theta\)), and a generalized inverse gamma distribution (with shape parameters \(2,\alpha\) and scale \(\theta\)), with the mixing proportion \(p = \theta /(\theta + \beta )\).
We use \(X \sim EIL(\theta ,\beta ,\alpha )\) to denote the random variable that has EIL distribution with parameters \(\theta ,\beta ,\alpha\) and the pdf and cdf in (3) and (4), respectively.
The derivative of \(f(x)\) is obtained from (3) as
where
with
Clearly, \(f^{\prime}(x)\) and \(\psi (y)\) have the same sign and \(\psi (y)\) is a unimodal quadratic function that attains its maximum value at the point \(y\) whenever \(\psi (y) = 0\); hence, the mode of \(f(x)\) is given by
In Fig. 1, we plot the pdf of the EIL distribution for some values of \(\theta ,\beta ,\alpha\) and the behavior of \(f(x)\).
Survival and hazard functions
The survival and hazard rate functions of the EIL distribution are respectively given by
and
The behavior of \(h(x)\) in (6) of the \(EIL(\theta ,\beta ,\alpha )\) for different values of the parameters \(\theta ,\beta\), and \(\alpha\) are showed graphically in Fig. 2.
Moments, moment generating function, and associated measures
Theorem 1
Let \(X\) be a random variable that follows the EIL distribution with pdf as in (3), then the rth row moment (about the origin) is given by
and the moment generating function (mgf) is given by
where \(\Gamma a = \int\limits_{0}^{\infty } {x^{a - 1} } e^{ - x} dx.\)
Proof
\(\mu_{r}^{{\prime }} = E(x^{r} ) = \int\limits_{ - \infty }^{\infty } {x^{r} } f(x)dx\)
For \(X \sim EIL(\theta ,\beta ,\alpha )\), we have
Letting \(y = x^{\alpha }\), we have
Using \(\int\limits_{0}^{\infty } {\frac{{e^{{ - \frac{a}{x}}} }}{{x^{b + 1} }}} dx = \frac{\Gamma b}{{a^{b} }},\) the definition of inverse gamma, the above expression is reduced to
The mgf of a continuous random variable \(X,\) when it exists, is given by
For \(X \sim EIL(\theta ,\beta ,\alpha )\), we have
Using \(e^{tx} = \sum\nolimits_{n = 0}^{\infty } {\frac{{t^{n} x^{n} }}{{n\text{!}}}} ,\) the series expansion, the above expression is reduced to
Letting \(y = x^{\alpha }\), we have
Using \(\int\limits_{0}^{\infty } {\frac{{e^{{ - \frac{a}{x}}} }}{{x^{b + 1} }}} dx = \frac{\Gamma b}{{a^{b} }},\) the definition of inverse gamma, the moment generating function for the EIL distribution is given by
The mean and the variance of the EIL distribution are, respectively,
The \(skewness{\text{ and }}kurtosis\) measures can be obtained from the expressions
upon substituting for the row moments in (7).
Quantile function
Theorem 2
Let \(X\) be a random variable with the pdf in (3). Then, the quantile function, say \(Q(p)\) is
where \(\theta ,\beta ,\alpha \text{ > }0,\text{ }p \in (0,1)\), and \(W_{ - 1} (.)\) is the negative Lambert \(W\) function.
Proof
We have \(Q(p) = F^{ - 1} (p),\text{ }p \in (0,1)\), which implies \(F(Q(p)) = p\). By substitution, we get
When we multiply both sides by \(- (\beta + \theta )e^{ - (\theta + \beta )}\), and raise them to \(\beta\), we have the Lambert equation
Hence, we have the negative Lambert \(W\) function of the real argument \(- p(\theta + \beta )e^{ - \theta - \beta }\). i.e.,
thus, by solving this equation for \(Q(P)\), the proof is complete.
Special cases of the EIL distribution
The EIL distribution contains some well-known distributions as sub-models, described below in brief.
Inverse Lindley distribution
The inverse Lindley distribution (IL) shown by Sharma et al. (2015b) is a special case of the EIL distribution; \(\alpha = \beta = 1.\) Using (3) and (4), the pdf and cdf is given by
The associated hazard rate function using (6) is given by
The generalized inverse Lindley distribution
The generalized inverse Lindley distribution (GIL) as shown by Sharma et al. (2015b) is a special case of the EIL distribution; \(\beta = 1.\) Using (3) and (4), the pdf and cdf are respectively given by
The associated hazard rate function using (6) is given by
The rth row moment for the GIL is then given by
and the mgf is given by
Inverse Weibull distribution
The inverse Weibull distribution (IW) is a special case of EIL distribution; \(\beta = 0\). Using (3) and (4), the pdf and cdf are respectively given by
The associated hazard rate function using (6) is given by
Stochastic orderings
Stochastic orderings of positive continuous random variables is an important tool used judge comparative behavior. A random variable \(X\) is said to be smaller than a random variable \(Y\) in the following contexts:
-
(a)
Stochastic order \((X \le_{st} Y){\text{ if }}F_{X} (x) \le F_{Y} (x){ \forall }x;\)
-
(b)
Hazard rate order \((X \le_{hr} Y){\text{ if }}h_{X} (x) \ge h_{Y} (x){ \forall }x;\)
-
(c)
Mean residual life order \((X \le_{mrl} Y){\text{ if }}m_{X} (x) \le m_{Y} (x){ \forall }x;\) and
-
(d)
Likelihood ratio order \((X \le_{lr} Y){\text{ if }}f_{X} (x)/f_{Y} (x)\text{ }{\text{decreases in }}x.\)
The following implications (Shaked and Shanthikumar 1994) are well known:
The following theorem shows that the EIL distribution is ordered with respect to “likelihood ratio” ordering.
Theorem 3
Let \(X \sim {\textit{PL}}(\theta_{1} ,\beta_{1,} \alpha_{1} )\;{\textit{and}}\;Y \sim {\textit{PL}}(\theta_{2} ,\beta_{2,} \alpha_{2} ).\) \({\textit{If}}\;\beta_{1} = \beta_{2}\; {\textit{and}}\; \theta_{2} \ge \theta_{1}\; {\textit{(or if}}\; \theta_{1} = \theta_{2} \; {\textit{and}} \; \beta_{2} \ge \beta_{1} ), \; {\textit{then}} \; X \ge_{lr} Y.\; {\textit{Hence,}}\) \(X \ge_{hr} Y,X \ge_{mrl} \;Y{\textit{and}}\; X \ge_{st} Y.\)
Proof
We have
Setting \(\alpha_{1} = \alpha_{2} = \alpha ,\) we have \(\frac{{f_{X} (x)}}{{f_{Y} (x)}} = \frac{{\theta_{1}^{2} }}{{\theta_{2}^{2} }}\frac{{\theta_{2} + \beta_{2} }}{{\theta_{1} + \beta_{1} }}\frac{{\beta_{1} + x^{\alpha } }}{{\beta_{2} + x^{\alpha } }}e^{{(\theta_{2} - \theta_{1} )x^{ - \alpha } }}\), which is decreasing in \(x\) for\(\beta_{1} = \beta_{2} {\text{ and }}\theta_{2} \ge \theta_{1} {\text{ (or if }}\theta_{1} = \theta_{2} {\text{ and }}\beta_{2} \ge \beta_{1} ).\) This implies \(X \le_{lr} Y\). Hence, \(X \le_{hr} Y,X \le_{mrl} Y{\text{ and }}X \le_{st} Y.\)
Estimation and inference
Let \(X_{1} , \ldots ,X_{n}\) be a random sample with observed values \(x_{1} , \ldots ,x_{n}\) from EIL distribution. Let \(\Theta = (\theta ,\beta ,\alpha )\) be the \(3{ \times }1\) parameter vector. The log likelihood function is given by
The score function \(U_{n} (\Theta ) = ({\partial }\ln /\partial \theta ,{\partial }\ln /\partial \beta ,{\partial }\ln /\partial \alpha )^{T}\) is given by
The maximum likelihood estimation (MLE) of \(\Theta\) say \(\{\Theta \}\) is obtained by solving the nonlinear system \(U_{n} (\rm{x};\Theta ) = 0\). This nonlinear system of equations does not have a closed form. For interval estimation and hypothesis tests on the model parameters, we require the observed information matrix
where the elements of \(I_{n} \left( \varTheta \right)\) are the second partial derivatives of \(U_{n} (\Theta )\). Under standard regular conditions for large sample approximation (Cox and Hinkley, 1974) that are fulfilled for the proposed model, the distribution of \(\{\Theta \}\) is approximately \(N_{3} (\Theta ,J_{n} (\Theta )^{ - 1} ),\) where \(J_{n} (\Theta ) = E[I_{n} (\Theta )].\) Whenever the parameters are in the interior of the parameter space but not on the boundary, the asymptotic distribution of \(\sqrt n (\{\Theta \} -\Theta )\) is \(N_{3} (0,J(\Theta )^{ - 1} ),\) where \(J(\Theta )^{ - 1} = \mathop {\lim }\limits_{n \to \infty } n^{ - 1} I_{n} (\Theta )\) is the unit information matrix and \(p\) is the number of parameters of the distribution. The asymptotic multivariate normal \(N_{3} (\Theta ,I_{n} (\{\Theta \})^{ - 1} )\) distribution of \(\{\Theta \}\) can be used to approximate the confidence interval for the parameters, hazard rate, and survival functions. An \(100 (1 - \gamma )\) asymptotic confidence interval for parameter \(\Theta _{i}\) is given by
where \(\widehat{{I^{ii} }}\) is the \((i,i)\) diagonal element of \(I_{n} (\{\Theta \})^{ - 1}\) for \(i = 1, \ldots ,3\) and \(Z_{{\frac{\gamma }{2}}}\) is the quantile \(1 - \gamma /2\) of the standard normal distribution.
Rényi entropy
Entropy is a measure of variation of the uncertainty in the distribution of any random variable. It provides important tools to indicate variety in distributions at particular moments in time and to analyze evolutionary processes over time. For a given probability distribution, Rényi (1961) gave an expression of the entropy function, so called Rényi entropy, defined by
where \(\gamma \text{ > }0{\text{ and }}\gamma \ne 0.\) For EIL distribution in (3), we have
Now using the fact that \((1 + z)^{\gamma } = \sum\limits_{j = 0}^{\infty } {\left( {\begin{array}{*{20}c} \gamma \\ j \\ \end{array} } \right)} z^{j} ,\) we have
We substitute \(y = x^{\alpha }\) and use the \(\int\limits_{0}^{\infty } {\frac{{e^{{ - \frac{a}{x}}} }}{{x^{b + 1} }}} dx = \frac{\Gamma b}{{a^{b} }}\) definition of inverse gamma so that
where \(\Gamma a = \int\limits_{0}^{\infty } {x^{a - 1} } e^{ - x} dx.\)
Application
In this section, we demonstrate the applicability of the EIL model for a real data. The data listed in Table 1 represents the flood levels for the Susquehanna River at Harrisburg, Pennsylvania, over 20 four-year periods from 1890 to 1969 and was obtained in a civil engineering context and give the maximum flood level (in millions of cubic feet per second). This data have been widely used by authors and were initially reported by Dumonceaux and Antle (1973). Upadhyay and Peshwani (2003) applied a Bayesian analysis for model comparison between lognormal and Weibull models and concluded that the lognormal fit the data better than the Weibull model. Singh et al. (2013) reported that inverse Weibull distribution fits this data better than other distributions, such as gamma, Weibull, flexible Weibull, and lognormal.
For this data, we fit the proposed \(EIL(\theta ,\beta ,\alpha )\), the sub models that were introduced in “Special cases of the EIL distribution” and the three parameters generalized inverse Weibull proposed by De Gusmao et al. (2011), as well as.
The expectation–maximization (EM) algorithm is used to estimate the model parameters. The MLEs of the parameters, the Kolmogorov‒Smirnov statistics (K–S) with its respective p value, and the maximized log likelihood (logL) for the above distributions as well as our proposed model are given in Table 2. They indicate that the EIL distribution (proposed model) fits the data better than the other distributions. The \(EIL(\theta ,\beta ,\alpha )\) takes the smallest K-S test statistic value and the largest value of its corresponding p-value. In addition, it takes the largest log likelihood. The fitted densities and the empirical distribution versus the fitted cumulative distributions of all models for this data are shown in Figs. 3 and 4, respectively.
Concluding remarks
In this paper, a new three-parameter inverse distribution, called extended inverse Lindley distribution, was introduced and studied in detail. This model has more flexibility than other types of inverse distributions (one, two and three parameters) due to the shape of its density as well as its hazard rate functions. It was shown that the density of the new distribution can be expressed as two components of the Weibull density function and a generalized gamma density function. We introduced the pdf, cdf, hazard rate function, the moments, moment generating function, and the quantile function in simple mathematical forms. Maximum likelihood estimation of the model parameters and their asymptotic standard distribution and confidence interval are derived. Rényi entropy as a measure of the uncertainty in the model is derived. Application of the model to a real data set is presented and compared to the fit attained by some other well-known inverse Lindley and inverse Weibull distributions, such as inverse Lindley, generalized inverse Lindley, inverse Weibull and generalized inverse Weibull.
References
Almalki S, Nadarajah S (2014) Modifications of the Weibull distribution: a review. Reliab Eng Syst Safety 124:32–55
Almalki S, Yuan J (2013) A new modified Weibull distribution. Reliab Eng Syst Safety 111:164–170
Cox D, Hinkley D (1974) Theoretical statistics. Chapman and Hall, London
De Gusmao F, Ortega E, Cordeiro G (2011) The generalized inverse Weibull distribution. Stat Papers 52:591–619
Dumonceaux R, Antle C (1973) Discrimination between the lognormal and Weibull distribution. Technometrics 15:923–926
Efron B (1988) Logistic regression, survival analysis, and the Kaplan-Meier curve. J Am Stat Assoc 83:414–425
Erto P, Rapone M (1984) Non-informative and practical Bayesian confidence bounds for reliable life in the Weibull model. Reliab Eng 7:181–191
Ghitany M, Atieh B, Nadadrajah S (2008) Lindley distribution and its applications. Math Comp Simul 78:493–506
Glen A (2011) On the inverse gamma as a survival distribution. J Qual Technol 43:158–166
Langlands A, Pocock S, Kerr G, Gore S (1997) Long-term survival of patients with breast cancer: a study of the curability of the disease. Br Med J 2:1247–1251
Lindley D (1958) Fiducial distributions and bays theorem. J Roy Stat Soc 20(1):102–107
Mead M (2015) Generalized inverse gamma distribution and its applications in reliability communications. Commun Stat Theory Methods 44:1426–1435
Murthy D, Xie M, Jiang R (2004) Weibull models. John Wiley & Sons, Hoboken
Renyi A (1961) On measure of entropy and information. In: Proceedings of the 4th Berkeley Symposium on Mathematical Statistics and Probability 1. University of California Press, Berkeley, pp 547–561
Shaked M, Shanthikumar J (1994) Stochastic orders and their applications. Academic Press, Boston
Shanker R, Sharma S, Shanker R (2013) A two-parameter Lindley distribution for modeling waiting and survival time series data. Appl Math 4:363–368
Sharma V, Singh S, Singh U (2014) A new upside-down bathtub shaped hazard rate model for survival data analysis. Appl Math Comput 239:242–253
Sharma V, Singh S, Singh U, Agiwal V (2015a) The inverse Lindley distribution: a stress-strength reliability model with applications to head and neck cancer data. J Indus Prod Eng 32(3):162–173
Sharma V, Singh S, Singh U, Merovci F (2015) The generalized inverse Lindley distribution: A new inverse statistical model for the study of upside-down bathtub survival data. Commun Stat Theory Methods, preprint
Singh S, Singh U, Sharma V (2013) Bayesian prediction of future observations from inverse Weibull distribution based on Type-II hybrid censored sample. Int J Adv Stat Probab 1:32–43
Upadhyay S, Peshwani M (2003) Choice between Weibull and log-normal models: a simulation-based Bayesian study. Commun Stat Theory Methods 32:381–405
Acknowledgements
The author is grateful to the Deanship of Scientific Research at King Saud University represented by the Research Center at the College of Business for financially supporting this research.
Competing interests
The author declares that there were no competing interests.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
About this article
Cite this article
Alkarni, S.H. Extended inverse Lindley distribution: properties and application. SpringerPlus 4, 690 (2015). https://doi.org/10.1186/s40064-015-1489-2
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s40064-015-1489-2