T-normal family of distributions: a new approach to generalize the normal distribution

Alzaatreh, Ayman; Lee, Carl; Famoye, Felix

doi:10.1186/2195-5832-1-16

Research
Open access
Published: 08 July 2014

T-normal family of distributions: a new approach to generalize the normal distribution

Ayman Alzaatreh¹,
Carl Lee² &
Felix Famoye²

Journal of Statistical Distributions and Applications volume 1, Article number: 16 (2014) Cite this article

5127 Accesses
44 Citations
2 Altmetric
Metrics details

Abstract

The idea of generating skewed distributions from normal has been of great interest among researchers for decades. This paper proposes four families of generalized normal distributions using the T-X framework. These four families of distributions are named as T-normal families arising from the quantile functions of (i) standard exponential, (ii) standard log-logistic, (iii) standard logistic and (iv) standard extreme value distributions. Some general properties including moments, mean deviations and Shannon entropy of the T-normal family are studied. Four new generalized normal distributions are developed using the T-normal method. Some properties of these four generalized normal distributions are studied in detail. The shapes of the proposed T-normal distributions can be symmetric, skewed to the right, skewed to the left, or bimodal. Two data sets, one skewed unimodal and the other bimodal, are fitted by using the generalized T-normal distributions.

AMS 2010 Subject Classification

60E05; 62E15; 62P10

1 Introduction

The normal distribution is perhaps the most commonly used probability distribution in both statistical theory and applications. The normal distribution was first used by de Moivre (1733) in the literature as an approximation to the binomial distribution. However, the development of the normal distribution by Gauss (1809, 1816) became the standard used in the modern statistics. Hence, the normal distribution is also commonly known as the Gaussian distribution. Properties of the normal distribution have been well developed (e.g., see Johnson et al. 1994; Patel and Read 1996). The distribution also plays an important role in generating new distributions.

Methods for developing generalized normal distributions seemed very limited until Azzalini (1985). A random variable X_λ is said to follow the skewed normal distribution S N (λ) if the probability density function (PDF) of X_λis g (x|λ) = 2ϕ (x) Φ (λ x), where ϕ (x) and Φ (x) are N (0,1) PDF and cumulative distribution function (CDF) respectively. Various extensions of S N (λ) have been proposed and studied (e.g., Arellano-Valle et al. 2004; Arnold and Beaver 2002; Arnold et al. 2007; Choudhury and Abdul 2011; Balakrishnan 2002; Gupta and Gupta 2004; Sharafi and Behboodian 2008; Yadegari et al. 2008). For reviews on skewed normal and its generalization, one may refer to Kotz and Vicari (2005) and Lee et al. (2013). Pourahmadi (2007) showed that the skewed normal distribution SN (λ) approaches half-normal as λ→∞. This explains why skewed normal distribution is limited in fitting real data. In order to allow for fitting diverse magnitudes of skewness, various works have been done by introducing different methods to capture the magnitude of the skewness.

Fernández and Steel (1998) introduced a two-piece PDF as $g (x) = \{\begin{matrix} cf (α x), x \geq 0, \\ cf (x / α), x < 0 . \end{matrix}$ , c > 0 and α > 0, where f is a symmetric PDF defined on ℜ, which is unimodal and symmetric around 0. When f is normal, it is a generalized skewed normal. Kotz and Vicari (2005) suggested that α and 1/α be replaced by α₁ and α₂ respectively, in order to have more flexibility of controlling skewness. Another general framework that introduces a skew mechanism to symmetric distributions was given by Ferreira and Steel (2006). The corresponding skew family is g (x| f, q) = f (x) q [ F (x)],x ∈ ℜ. The PDF g(x| f, q) is a weighted version of f(x), with the weight function given by q [ F (x)]. If q follows the uniform distribution, then, g = f. When f is normal, this is a general framework for developing skewed normal distributions.

Eugene et al. (2002) introduced the beta-generated family of distributions with CDF

G (x) = \int_{0}^{F (x)} b (t) dt,

(1.1)

where b (t) is the PDF of the beta random variable and F (x) is the CDF of any random variable. The corresponding PDF to (1.1) is given by

g (x) = \frac{1}{B (α, β)} f (x) F^{α - 1} (x) {(1 - F (x))}^{β - 1}, α, β > 0 .

(1.2)

If F is Φ, the CDF of the normal distribution, equation (1.2) defines the beta-normal distribution. If α and β are integers, (1.2) is the α^th order statistic of the random sample of size (α + β - 1).

The beta-normal distribution can be unimodal or bimodal and it has been applied to fit a variety of real data including bimodal cases (Famoye et al. 2004). The main distinction between the method of skewed normal and the beta-generated normal is that the skewed normal method introduces a skewing mechanism into the normal distribution to generate skewed normal family. The skewness of the distribution is estimated by the skewing parameter. On the other hand, the beta-normal distribution is generated by adding more parameters using beta distribution as the generator. Thus, the skewness is not directly defined by a specific parameter; rather it is the combination of all shape parameters that play the role of measuring skewness. For detailed review about the methods for generating continuous distributions, including the normal distribution, one may refer to Lee et al. (2013).

Alzaatreh et al. (2013) extended the beta generated family and defined the T - X (W) family. The CDF of the T - X (W) distribution is $G (x) = \int_{a}^{W (F (x))} r (t) dt$ , where r (t) is the PDF of the random variable T with support (a, b) for - ∞ ≤ a < b ≤ ∞. The function W (F (x)) is monotonic and absolutely continuous. Aljarrah et al. (2014) took W (F(x)) to be the quantile function of a random variable Y and defined the T - X {Y} family as

G (x) = \int_{a}^{Q_{Y} (F (x))} r (t) dt = R (Q_{Y} (F (x)),

(1.3)

where Q_Y(p) is the quantile function of the random variable Y. In (1.1), X is used as a random variable having CDF F (x) and then as a random variable having CDF G (x) which may be confusing. This article first gives a unified notation to re-define the T - X {Y} as T - R {Y} and proposes several different generalizations of the normal distribution using the T - R{Y} framework.

Section 2 gives the unified definition of T - R {Y} family and defines several new generalized normal families. Section 3 gives some general properties of the proposed generalized normal families. Section 4 defines some new generalized normal distributions and studies some of their properties. Section 5 provides some applications to numerical data sets and the paper ends with a short summary and conclusions.

2 T-normal families of distributions

Let T, R and Y be random variables with CDF F_T(x) = P (T ≤ x),F_R(x) = P (R ≤ x) and F_Y(x) = P (Y ≤ x). The corresponding quantile functions are Q_T(p), Q_R(p) and Q_Y(p), where the quantile function is defined as Q_Z(p) = inf{z : F_Z(z) ≥ p}, 0 < p < 1. If densities exist, we denote them by f_T(x), f_R(x) and f_Y(x). Now assume the random variable T ∈ (a,b) and Y ∈ (c,d), for - ∞ ≤ a < b ≤ ∞ and - ∞ ≤ c < d ≤ ∞. Following the technique proposed by Aljarrah et al. (2014), the CDF of the random variable X is defined as

F_{X} (x) = \int_{a}^{Q_{Y} (F_{R} (x))} f_{T} (t) dt = P [T \leq Q_{Y} (F_{R} (x))] = F_{T} (Q_{Y} (F_{R} (x))) .

(2.1)

Note that (2.1) is an alternative expression to (1.3) without using X in two different situations. Hereafter, the family of distributions in (2.1) will be called the T- R{Y} family of distributions.

Remark 1.

If X follows the distribution in (2.1), it is easy to see that

(i)
$X \overset{d}{=} Q_{R} (F_{Y} (T))$
,
(ii)
Q _X(p) = Q _R(F _Y(Q _T(p))),
(iii)
If $T \overset{d}{=} Y$ then $X \overset{d}{=} R$ and
(iv)
If $Y \overset{d}{=} R$ then $X \overset{d}{=} T$ .

The corresponding PDF associated with (2.1) is

f_{X} (x) = f_{T} (Q_{Y} (F_{R} (x))) \times {Q^{'}}_{Y} (F_{R} (x)) \times f_{R} (x),

(2.2)

where ${Q^{'}}_{Y} (F_{R}) = \frac{d}{d F_{R}} Q_{Y} (F_{R})$ . Using the fact that Q_Y(F_Y(x)) = x, it follows that Q^′_Y(F_Y(x)) × f_Y(x) = 1 so that Q^′_Y(p) = 1/f_Y(Q_Y(p)). By taking p = F_R(x), (2.2) reduces to

f_{X} (x) = f_{R} (x) \times \frac{f_{T} (Q_{Y} (F_{R} (x)))}{f_{Y} (Q_{Y} (F_{R} (x)))} .

(2.3)

From (2.1) and (2.3), the hazard function of the random variable X can be written as

\begin{align} h_{X} (x) & = \frac{f_{X} (x)}{1 - F_{X} (x)} \\ = \frac{f_{R} (x)}{1 - F_{R} (x)} \times \frac{1 - F_{Y} (Q_{Y} (F_{R} (x)))}{f_{Y} (Q_{Y} (F_{R} (x)))} \times \frac{f_{T} (Q_{Y} (F_{R} (x)))}{1 - F_{T} (Q_{Y} (F_{R} (x)))} \\ = h_{R} (x) \times \frac{h_{T} (Q_{Y} (F_{R} (x)))}{h_{Y} (Q_{Y} (F_{R} (x)))} . \end{align}

(2.4)

One can see from (2.3) and (2.4) that

\frac{f_{X} (x)}{f_{R} (x)} = \frac{f_{T} (Q_{Y} (F_{R} (x)))}{f_{Y} (Q_{Y} (F_{R} (x)))} and \frac{h_{X} (x)}{h_{R} (x)} = \frac{h_{T} (Q_{Y} (F_{R} (x)))}{h_{Y} (Q_{Y} (F_{R} (x)))} .

Some general properties of the T - R {Y} family were recently studied in the literature, for more details see Aljarrah et al. (2014). Equivalent expressions to (2.2) - (2.4) are given in Aljarrah et al. (2014) by using the T - X {Y} notation. Table 1 gives some distributions of the T - R {Y} families based on quantile functions of some standard forms of distribution and some commonly used random variables T. The explicit expression of a T - R {Y} family can be obtained using (2.3) for different combinations of random variables T, R, and Y.

Table 1 Families of T - R { Y } distributions based on different choices for the random variables Y ^∗ and T

Full size table

Several extensions from Table 1 can be made. First, one can use the quantile function of non-standard distributions, such as non-standard exponential, log-logistic, logistic, extreme value, and Weibull. For example, the quantile function of log-logistic is Q_Y(p) = α (p/(1-p))^{1/ β}, α, β > 0. By using this Q_Y function, two additional parameters corresponding to the log-logistic distribution may be added to the T - R{log-logistic} family. Aljarrah et al. (2014) gave a more detailed list of T - R {Y} distributions based on quantile functions of non-standard distributions. Secondly, one can introduce exponentiated and scale parameters by replacing F_T(x) by $F_{T}^{δ} (αx), α, δ > 0$ as well as for F_R(x).

If R is a normal random variable with PDF f_R(x) = ϕ (x) and CDF F_R(x) = Φ (x), then (2.1) gives the T - normal{Y} family of distributions as

F_{X} (x) = \int_{a}^{Q_{Y} (Φ (x))} f_{T} (t) dt = F_{T} (Q_{Y} (Φ (x))) .

(2.5)

The corresponding PDF associated with (2.5) is

f_{X} (x) = f_{T} (Q_{Y} (Φ (x))) \times {Q^{'}}_{Y} (Φ (x)) \times ϕ (x) = ϕ (x) \times \frac{f_{T} (Q_{Y} (Φ (x)))}{f_{Y} (Q_{Y} (Φ (x)))} .

(2.6)

The hazard function of the T - normal {Y} family is given by $h_{X} (x) = h_{ϕ} (x) \times \frac{h_{T} (Q_{Y} (Φ (x)))}{h_{Y} (Q_{Y} (Φ (x)))}$ , where h_ϕ(x) = ϕ (x)/(1 - Φ (x)).

The T-normal {Y} family is a general framework for generating many different generalizations of the normal distribution. Various existing generalizations of normal distributions can be obtained based on this framework. The beta normal (Eugene et al. 2002), Kumaraswamy normal (Cordeiro and de Castro 2011), and generalized beta-generated normal (Alexander et al. 2012) belong to the T - normal{standard uniform} families. The gamma-normal distribution studied by Alzaatreh, et al. (2014) is a member of T - normal{standard exponential} family. For distribution “parsimony”, we will focus on the quantile functions of standard distributions in order to limit the number of parameters. Generalizations from using non-standard quantile functions or adding exponentiated and/or scale parameters can be derived in a straightforward manner. In the following, we define the families of generalized normal (GN) distributions, T-normal{Y}, using the standard quantile functions (b)-(e) defined in Table 1.

2.1 Family of GN distributions from the quantile function of exponential distribution (T- N{exponential})

By using the quantile function (b) in Table 1: Q_Y(Φ (x)) = - log(1 - Φ (x)), the corresponding CDF to (2.5) is

F_{X} (x) = F_{T} \{- log (1 - Φ (x))\} = F_{T} (H_{ϕ} (x)),

(2.7)

and the corresponding PDF is

f_{X} (x) = \frac{ϕ (x)}{1 - Φ (x)} f_{T} (- log (1 - Φ (x))) = h_{ϕ} (x) f_{T} (H_{ϕ} (x)),

(2.8)

where h_ϕ(x) and H_ϕ(x) = - log[ 1 - Φ (x)] are the hazard and cumulative hazard functions for the normal distribution, respectively. Thus, this family of GN distributions is denoted as T - N{exponential}, which arises from the “hazard function of the normal distribution”.

2.2 Family of GN distributions from the quantile function of log-logistic distribution (T- N{log-logistic})

By using the quantile function (c) in Table 1: Q_Y(Φ (x)) = Φ (x)/(1 - Φ (x)), the corresponding CDF to (2.5) is

F_{X} (x) = F_{T} \{Φ (x) / (1 - Φ (x))\},

(2.9)

and the corresponding PDF is

f_{X} (x) = \frac{ϕ (x)}{{(1 - Φ (x))}^{2}} f_{T} (\frac{Φ (x)}{1 - Φ (x)}) .

(2.10)

The family of GN distributions in (2.9) is denoted as T - N{log-logistic}, which arises from the “odds of the normal distribution”.

2.3 Family of GN distributions from the quantile function of logistic distribution (T- N{logistic})

By using the quantile function (d) in Table 1: Q_Y(Φ (x)) = log (Φ (x)/(1 - Φ (x))), the corresponding CDF to (2.5) is

F_{X} (x) = F_{T} \{log [Φ (x) / (1 - Φ (x))]\},

(2.11)

and the corresponding PDF is

f_{X} (x) = \frac{ϕ (x)}{Φ (x) (1 - Φ (x))} f_{T} (log \{Φ (x) / (1 - Φ (x))\}) = \frac{h_{ϕ} (x)}{Φ (x)} f_{T} (log \{\frac{Φ (x)}{1 - Φ (x)}\}) .

(2.12)

The family of GN distributions in (2.11) is denoted as T-N{logistic}, which arises from the “logit function of the normal distribution”.

2.4 Family of GN distributions from the quantile function of extreme value distribution (T- N{extreme value})

By using the quantile function (e) in Table 1: Q_Y(Φ (x)) = log(- log(1 - Φ (x)), the corresponding CDF to (2.5) is

F_{X} (x) = F_{T} \{log [- log (1 - Φ (x))]\},

(2.13)

and the corresponding PDF is

f_{X} (x) = \frac{ϕ (x)}{- (1 - Φ (x)) log (1 - Φ (x))} f_{T} \{log (- log (1 - Φ (x))\} = \frac{h_{ϕ} (x)}{H_{ϕ} (x)} f_{T} (log (H_{ϕ} (x)) .

(2.14)

The family of GN distributions in (2.13) is denoted as T - N{extreme value}, which arises from the “extreme value function of the normal distribution”.

3 Some properties of the T- normal family of distributions

In this section, some of the general properties of the T-normal family will be discussed.

Lemma 1

(Transformation). For any random variable T with PDF f_T(x), then the random variable

(i)
X = Φ ^{- 1}(1 - e ^{- T}) follows the distribution of T - N{exponential} family in (2.7).
(ii)
X = Φ ^{- 1} (T/(1 + T)) follows the distribution of T - N{log-logistic} family in (2.9).
(iii)
X = Φ ^{- 1}(e ^T/(1 + e ^T)) follows the distribution of T - N{logistic} family in (2.11).
(iv)
$X = Φ^{- 1} (1 - e^{- e^{T}})$
follows the distribution of T - N {extreme value} family in (2.13).

Proof.

The result follows immediately from Remark 1(i).

Lemma 1 gives the relationships between the random variable X and the random variable T. These relationships can be used to generate random samples from X by using T. For example, one can simulate the random variable X which follows the distribution of T - N{exponential} family in (2.7) by first simulating random variable T from the PDF f_T(x) and then computing X = Φ^{- 1} (1 - e^{- T}), which has the CDF F_X(x).

Lemma 2.

The quantile functions for the (i) T - N{exponential}, (ii) T - N{log-logistic}, (iii) T - N{logistic}, and (iv) T - N{extreme value} distributions, are respectively,

(i)
$Q_{X} (p) = Φ^{- 1} \{1 - e^{- Q_{T} (p)}\}$
,
(ii)
Q _X(p) = Φ ^{- 1} {Q _T(p)/(1 + Q _T(p))},
(iii)
$Q_{X} (p) = Φ^{- 1} \{e^{Q_{T} (p)} / (1 + e^{Q_{T} (p)})\}$
,
(iv)
$Q_{X} (p) = Φ^{- 1} \{1 - e^{- e^{Q_{T} (p)}}\}$
.

Proof.

The result follows directly from Remark 1(ii).

Theorem 1.

The mode(s) of the T - N {Y} family are the solutions of the equation

x = μ + σ^{2} ϕ (x) \{\frac{{Q^{'}}^{'}_{Y} (Φ (x))}{{Q^{'}}_{Y} (Φ (x))} + \frac{{f^{'}}_{T} (Q_{Y} (Φ (x)))}{f_{T} (Q_{Y} (Φ (x)))} {Q^{'}}_{Y} (Φ (x))\} .

(3.1)

Proof.

One can show the result in (3.1) by setting the derivative of the equation (2.6) to zero and then using the fact that ϕ^′ (x) = - σ^{- 2} (x - μ) ϕ (x).

Corollary 1.

The mode(s) of the (i) T - N {exponential}, (ii) T - N {log-logistic}, (iii) T - N {logistic}, and (iv) T - N {extreme value} distributions, respectively, are the solutions of the equations

\begin{align} (i) x & = μ + σ^{2} h_{ϕ} (x) \{1 + \frac{{f^{'}}_{T} (H_{ϕ} (x))}{f_{T} (H_{ϕ} (x))}\}, \\ (ii) x & = μ + σ^{2} h_{ϕ} (x) \{2 + \frac{{f^{'}}_{T} (Φ (x) / (1 - Φ (x)))}{(1 - Φ (x)) f_{T} (Φ (x) / (1 - Φ (x)))}\}, \\ (iii) x & = μ + σ^{2} \frac{h_{ϕ} (x)}{Φ (x)} \{\frac{{f^{'}}_{T} (log \{Φ (x) / (1 - Φ (x))\})}{f_{T} (log \{Φ (x) / (1 - Φ (x))\})} + 2 Φ (x) - 1\}, \\ (iv) x & = μ + σ^{2} \frac{h_{ϕ} (x)}{H_{ϕ} (x)} \{\frac{{f^{'}}_{T} {log (H_{ϕ} (x))}}{f_{T} {log (H_{ϕ} (x))}} + H_{ϕ} (x) - 1\} . \end{align}

(3.2)

Note that the results in Theorem 1 do not imply that the mode is unique. It is possible that there is more than one mode for some of these GN distributions. For example, the logistic - N {logistic} distribution given in section 4 is a bimodal distribution. If T follows the gamma distribution with parameters α and β, equation (3.2) can be simplified to

x = μ + σ^{2} h_{ϕ} (x) \{1 + \frac{{f^{'}}_{T} (H_{ϕ} (x))}{f_{T} (H_{ϕ} (x))}\} = μ + σ^{2} h_{ϕ} (x) [(α - 1) / H_{ϕ} (x) - β^{- 1} + 1] .

This agrees with the result obtained by Alzaatreh et al. (2014) for the gamma-normal distribution.

The entropy of a random variable X is a measure of variation of uncertainty (Rényi 1961). Shannon’s entropy for a random variable X with PDF g(x) is defined as E{- log(g(X))}.

Theorem 2.

The Shannon’s entropies for the T - N {Y} family is given by

σ_{X} = σ_{T} + E (log f_{Y} (T)) + log (σ \sqrt{2 π}) + \frac{1}{2 σ^{2}} E {(X - μ)}^{2} .

(3.3)

Proof.

Since $X \overset{d}{=} Q_{R} (F_{Y} (T))$ , this implies that $T \overset{d}{=} Q_{Y} (F_{R} (X))$ . Hence, from (2.3) we have $f_{X} (x) = \frac{f_{T} (t)}{f_{Y} (t)} \times f_{R} (x)$ . This implies

σ_{X} = σ_{T} + E (log f_{Y} (T)) - E (log f_{R} (X)) .

(3.4)

For the T - N {Y} family we have f_R(x) = ϕ (x), so

log (ϕ (x)) = - log (σ \sqrt{2 π}) - {[(x - μ) / σ]}^{2} / 2 .

(3.5)

The result in (3.3) follows from (3.4) and (3.5).

Corollary 2.

The Shannon’s entropies for the (i) T - N {exponential}, (ii) T - N {log-logistic}, (iii) T - N {logistic}, and (iv) T - N {extreme value} distributions, respectively, are given by

\begin{align} (i) σ_{X} & = log (σ \sqrt{2 π}) - μ_{T} + σ_{T} + E {(X - μ)}^{2} / (2 σ^{2}), \\ (ii) σ_{X} & = log (σ \sqrt{2 π}) - 2 E (log (1 + T)) + σ_{T} + E {(X - μ)}^{2} / (2 σ^{2}), \\ (iii) σ_{X} & = log (σ \sqrt{2 π}) - 2 E (log (1 + e^{T})) + μ_{T} + σ_{T} + E {(X - μ)}^{2} / (2 σ^{2}), \\ (iv) σ_{X} & = log (σ \sqrt{2 π}) - E (e^{T}) + μ_{T} + σ_{T} + E {(X - μ)}^{2} / (2 σ^{2}) . \end{align}

(3.6)

Proof.

The results in (i)-(iv) can be easily shown using (3.3) and the fact that f_Y(T) = e^{- T}, (1 + T)^{- 2}, e^T(1 + e^T)^{- 2} and $e^{T} e^{- e^{T}}$ for exponential, log-logistic, logistic and extreme value, respectively.

Theorem 3.

The r^th non-central moments of the (i) T - N{exponential}, (ii) T - N {log-logistic}, (iii) T - N{logistic}, and (iv) T - N {extreme value} distributions, respectively, can be expressed as

(i) E (X^{r}) = \sum_{j = 0}^{r} \sum_{k_{1}, \dots, k_{j} = 0}^{\infty} \sum_{i = 0}^{2 s_{j} + j} 2^{j / 2} σ^{j} μ^{r - j} A (\underset{̲}{k}) (\binom{r}{j}) (\binom{2 s_{j} + j}{i}) {(- 2)}^{i} M_{T} (- i),

(3.7)

(ii) E (X^{r}) = \sum_{j = 0}^{r} \sum_{k_{1}, k_{2}, \dots, k_{j} = 0}^{\infty} 2^{j / 2} σ^{j} μ^{r - j} A (\underset{̲}{k}) (\binom{r}{j}) E \{{(\frac{T - 1}{T + 1})}^{2 s_{j} + j}\},

(3.8)

(iii) E (X^{r}) = \sum_{j = 0}^{r} \sum_{k_{1}, k_{2}, \dots, k_{j} = 0}^{\infty} 2^{j / 2} σ^{j} μ^{r - j} A (\underset{̲}{k}) (\binom{r}{j}) E \{{(\frac{e^{T}}{1 + e^{T}})}^{2 s_{j} + j}\},

(3.9)

(iv) E (X^{r}) = \sum_{j = 0}^{r} \sum_{k_{1}, \dots, k_{j} = 0}^{\infty} \sum_{i = 0}^{2 s_{j} + j} 2^{j / 2} σ^{j} μ^{r - j} A (\underset{̲}{k}) (\binom{r}{j}) (\binom{2 s_{j} + j}{i}) {(- 2)}^{i} M_{e^{T}} (- i),

(3.10)

where $A (\underset{̲}{k}) = A (k_{1}, k_{2}, \dots, k_{j}) = {(\sqrt{π} / 2)}^{2 s_{j} + j} a_{k_{1}} a_{k_{2}} \dots a_{k_{j}}$ , s_j= k₁ + k₂ + ⋯ + k_j, M_T( - i) =E (e^{- iT}), $a_{k} = \frac{c_{k}}{2 k + 1}$ , $c_{k} = \sum_{j = 0}^{k - 1} \frac{c_{j} c_{k - 1 - j}}{(j + 1) (2 j + 1)}$ , and c₀ = 1.

Proof.

We first show (3.7). By using Lemma 1, the r^th moments for the T - N{exponential} distribution can be written as E (X^r) = E (Φ^{- 1}(1 - e^{- T}))^r. Since

Φ^{- 1} (1 - e^{- T}) = \sqrt{2} σ er f^{- 1} (1 - 2 e^{- T}) + μ,

the r^th moments can be written as

E (X^{r}) = E {(\sqrt{2} σ er f^{- 1} (1 - 2 e^{- T}) + μ)}^{r} = \sum_{j = 0}^{r} (\binom{r}{j}) 2^{j / 2} σ^{j} E \{{(er f^{- 1} (1 - e^{- T}))}^{j}\} μ^{r - j} .

(3.11)

On using the series representation for erf^{- 1} (1 - 2e^{- T}) (Wolfram.com, 2014), we get ${erf}^{- 1} (1 - 2 e^{- T}) = \sum_{k = 0}^{\infty} a_{k} {(\sqrt{π} / 2)}^{2 k + 1} {(1 - 2 e^{- T})}^{2 k + 1}$ , where $a_{k} = \frac{c_{k}}{2 k + 1}$ , $c_{k} = \sum_{j = 0}^{k - 1} \frac{c_{j} c_{k - 1 - j}}{(j + 1) (2 j + 1)}$ , and c₀ = 1. This implies

{({erf}^{- 1} (1 - 2 e^{- T}))}^{j} = \sum_{k_{1}, k_{2}, \dots, k_{j} = 0}^{\infty} A (k_{1}, k_{2}, \dots, k_{j}) {(1 - 2 e^{- T})}^{2 s_{j} + j},

(3.12)

where $A (k_{1}, k_{2}, \dots, k_{j}) = {(\sqrt{π} / 2)}^{2 s_{j} + j} a_{k_{1}} a_{k_{2}} \dots a_{k_{j}}$ and s_j= k₁ + k₂ + ⋯ + k_j. By using the binomial expansion on ${(1 - 2 e^{- T})}^{2 s_{j} + j}$ , (3.12) can be written as

{(er f^{- 1} (1 - 2 e^{- T}))}^{j} = \sum_{k_{1}, k_{2}, \dots, k_{j} = 0}^{\infty} \sum_{i = 0}^{2 s_{j} + j} A (k_{1}, k_{2}, \dots, k_{j}) (\binom{2 s_{j} + j}{i}) {(- 2)}^{i} e^{- iT} .

(3.13)

The result of (3.7) follows by using equation (3.13) in equation (3.11). The results of (3.8)-(3.10) can be obtained by applying the same technique for (3.7).

If T follows the gamma distribution with parameters α and β for the T - N {exponential}, we obtain the term M_T( - i)=(1 + β i)^{- α} in (3.7). Thus, (3.7) gives the r^th non-central moment of gamma-N{exponential} distribution as shown in Alzaatreh et al. (2014).

The deviation from the mean and the deviation from the median are used to measure the dispersion and the spread in a population from the center. The mean deviation from the mean is denoted by D (μ), and the mean deviation from the median M is denoted by D (M).

Theorem 4.

D (μ) and D (M) for each of (i) T - N{exponential}, (ii) T - N {log - logistic}, (iii) T - N{logistic}, and (iv) T - N{extreme value} distributions, respectively, are

\begin{matrix} (i) & D (μ) & = \sqrt{2} σ \sum_{k = 0}^{\infty} \sum_{i = 0}^{2 k + 1} A (k) (\binom{2 k + 1}{i}) {(- 2)}^{i + 1} S_{e^{- u}} (μ, 0, i), \end{matrix}

(3.14)

\begin{matrix} D (M) & = \sqrt{2} σ \sum_{k = 0}^{\infty} \sum_{i = 0}^{2 k + 1} A (k) (\binom{2 k + 1}{i}) {(- 2)}^{i + 1} S_{e^{- u}} (M, 0, i), \end{matrix}

(3.15)

where $S_{ξ} (c, a, α) = \int_{a}^{Q_{Y} (Φ (c))} ξ^{α} f_{T} (u) du$ and Q_Y(Φ (c)) = - log(1 - Φ (c)).

\begin{matrix} (ii) & D (μ) & = - \sqrt{8} σ \sum_{k = 0}^{\infty} A (k) S_{\frac{u - 1}{u + 1}} (μ, 0, 2 k + 1), \end{matrix}

(3.16)

\begin{matrix} D (M) & = - \sqrt{8} σ \sum_{k = 0}^{\infty} A (k) S_{\frac{u - 1}{u + 1}} (M, 0, 2 k + 1), \end{matrix}

(3.17)

where Q_Y(Φ (c)) = Φ (c)/(1 - Φ (c)).

\begin{matrix} (iii) & D (μ) & = - \sqrt{8} σ \sum_{k = 0}^{\infty} A (k) S_{\frac{e^{u}}{1 + e^{u}}} (μ, - \infty, 2 k + 1), \end{matrix}

(3.18)

\begin{matrix} D (M) & = - \sqrt{8} σ \sum_{k = 0}^{\infty} A (k) S_{\frac{e^{u}}{1 + e^{u}}} (M, - \infty, 2 k + 1), \end{matrix}

(3.19)

where Q_Y(Φ (c)) = log {Φ (c)/(1 - Φ (c))}.

\begin{matrix} (iv) & D (μ) & = \sqrt{2} σ \sum_{k = 0}^{\infty} \sum_{i = 0}^{2 k + 1} A (k) (\binom{2 k + 1}{i}) {(- 2)}^{i} S_{e^{- e^{u}}} (μ, - \infty, i), \end{matrix}

(3.20)

\begin{matrix} D (M) & = \sqrt{2} σ \sum_{k = 0}^{\infty} \sum_{i = 0}^{2 k + 1} A (k) (\binom{2 k + 1}{i}) {(- 2)}^{i} S_{e^{- e^{u}}} (M, - \infty, i), \end{matrix}

(3.21)

where Q_Y(Φ (c)) = log {- log(1 - Φ (c))}.

Proof.

By definitions of D (μ) and D (M),

\begin{align} D (μ) & = \int_{- \infty}^{μ} (μ - x) f_{X} (x) dx + \int_{μ}^{\infty} (x - μ) f_{X} (x) dx = 2 \int_{- \infty}^{μ} (μ - x) f_{X} (x) dx \\ = 2 μ F_{X} (μ) - 2 \int_{- \infty}^{μ} x f_{X} (x) dx. \end{align}

(3.22)

\begin{align} D (M) & = \int_{- \infty}^{M} (M - x) f_{X} (x) dx + \int_{M}^{\infty} (x - M) f_{X} (x) dx \\ = 2 \int_{- \infty}^{M} (M - x) f_{X} (x) dx + E (X) - M \\ = μ - 2 \int_{- \infty}^{M} x f_{X} (x) dx. \end{align}

(3.23)

We first proof the results (3.14) and (3.15) for the T - N{exponential} family. Defining the integral

I_{c} = \int_{- \infty}^{c} x f_{X} (x) dx = \int_{- \infty}^{c} \frac{xϕ (x)}{1 - Φ (x)} f_{T} \{- log (1 - Φ (x)\} dx,

(3.24)

and using the substitution u = - log (1 - Φ (x)), (3.24) can be written as

I_{c} = \int_{0}^{- log (1 - Φ (c))} Φ^{- 1} (1 - e^{- u}) f_{T} (u) du.

(3.25)

By using similar approach as in Theorem 3, the equation (3.25) can be written as

I_{c} = μ F_{X} (c) + \sqrt{2} σ \sum_{k = 0}^{\infty} \sum_{i = 0}^{2 k + 1} A (k) (\binom{2 k + 1}{i}) {(- 2)}^{i} S_{e^{- u}} (c, 0, i),

(3.26)

where A (k) is defined in the proof of Theorem 3, $S_{ξ} (c, a, α) = \int_{a}^{Q_{Y} (Φ (c))} ξ^{α} f_{T} (u) du$ and Q_Y(Φ (c)) = - log (1 - Φ (c)). The results in (3.14) and (3.15) follow by using (3.26) in (3.22) and (3.23). Applying the same techniques of showing (3.14) and (3.15), one can show the results of (3.16) and (3.17) for (ii), (3.18) and (3.19) for (iii), and (3.20) and (3.21) for (iv).

4 Some examples of GN families of distributions with different T distributions

In this section different T distributions are used to generate different GN distributions. In the following subsections, we present four new GN distributions namely, Weibull - N{exponential}, exponential - N{log-logistic}, logistic - N {logistic} and logistic - N{extreme value}. For illustrative purposes, we study some properties of the Weibull - N{logistic} distribution. To conserve space, properties of other GN distributions are not given. One can follow the same method to study the properties of other GN distributions.

4.1 The Weibull - N{exponential} distribution

If a random variable T follows the Weibull distribution with parameters c and γ, then $f_{T} (x) = c γ^{- 1} {(\frac{x}{γ})}^{c - 1} e^{- {(\frac{x}{γ})}^{c}}, c, γ > 0$ . From (2.8), the PDF of the Weibull - N {exponential} is defined as

f_{X} (x) = \frac{c}{γ} \frac{ϕ (x)}{1 - Φ (x)} {\{\frac{- log (1 - Φ (x))}{γ}\}}^{c - 1} exp (- {\{\frac{- log (1 - Φ (x))}{γ}\}}^{c}) .

(4.1)

Remark 2

(i)
When c = 1, the Weibull - N {exponential} reduces to the exponential-normal distribution with θ = 1/γ.
(ii)
When c = γ = 1, the Weibull - N {exponential} reduces to the normal distribution.
(iii)
When c = 1 and γ ^-1 = n ∈ N, the PDF in (4.1) reduces to the distribution of the minimum order statistics, x ₍₁₎, from a normal random sample of size n.

By using (2.7), the CDF of the Weibull - N{exponential} is given by

F_{X} (x) = 1 - exp (- {\{- (1 / γ) log (1 - Φ (x))\}}^{c}) .

In Figures 1 and 2, various graphs of f_X(x) when μ = 0, σ = 1 and for various values of c and γ are provided. These Figures indicate that the Weibull - N{exponential} PDF can be left skewed, right skewed, or symmetric. Also, the Weibull-N{exponential} is left skewed whenever γ > 1 and right skewed whenever γ < 1. For fixed γ, the peak increases as c increases.

Some properties of the Weibull - N {exponential} are obtained in the following by using the general properties discussed in section 3.

(1)
Quantile function: By using Lemma 2, the quantile function of the Weibull - N {exponential} distribution is given by Q _X(p) = Φ ^{- 1}{1 - exp ( - γ (- log (1 - p))^1/c)}.
(2)
Mode: By using Corollary 1, the mode of Weibull-N{exponential} distribution is the solution of the following equation
$x = μ + σ^{2} h_{ϕ} (x) \{\frac{c - 1}{H_{ϕ} (x)} - c γ^{- c} {(H_{ϕ} (x))}^{c - 1} + 1\} .$
(3)
Shannon entropy: By using Corollary 2 and the fact that μ _T = γ Γ (1 + 1/c) and σ _T=1 + ξ (1 - 1/c) + log (γ/c) (see Song 2001), one can easily obtain the Shannon entropy of Weibull - N{exponential} distribution as
$η_{X} = log (σ \sqrt{2 π}) - γ Γ (1 + 1 / c) + ξ (1 - 1 / c) + log (γ / c) + E {(X - μ)}^{2} / (2 σ^{2}) + 1 .$
(4)
Moments: By using Theorem 3, a series representation of the r ^th moments of the Weibull-N{exponential} distribution can be obtained by replacing M _T(- i) with
$\sum_{k = 0}^{\infty} \frac{{(- 1)}^{k} γ^{k}}{k!} Γ (1 + \frac{k}{c})$

in equation (3.7).

(5)
Mean deviations: By using Theorem 4, the mean deviation from the mean and the mean deviation from the median of Weibull - N{exponential} distribution can be obtained by replacing S (μ,0,i) and S (M,0,i) with $\frac{c}{i^{c} γ^{c}} \sum_{k = 0}^{\infty} \frac{{(- 1)}^{k}}{γ^{ck} i^{ck} k!} Γ [c (k + 1), - i log (1 - Φ (μ))]$ and $\frac{c}{i^{c} γ^{c}} \sum_{k = 0}^{\infty} \frac{{(- 1)}^{k}}{γ^{ck} i^{ck} k!} Γ [c (k + 1), - i log (1 - Φ (M))]$ in equations (3.14) and (3.15) respectively, where $Γ (α, x) = \int_{0}^{x} u^{α - 1} e^{- u} du$ is the incomplete gamma function.

4.2 The exponential - N{log-logistic} distribution

If a random variable T follows the exponential distribution with parameter λ, then f_T(x) = λ e^{- λx}, λ > 0. From (2.10), the PDF of the exponential-N{log-logistic} is defined as

f_{X} (x) = \frac{λ ϕ (x)}{{(1 - Φ (x))}^{2}} exp [\frac{- λ Φ (x)}{1 - Φ (x)}] .

(4.2)

From (2.9), the CDF of (4.2) is given by $F_{X} (x) = 1 - exp [\frac{λΦ (x)}{1 - Φ (x)}] .$

In Figure 3, various graphs of f_X(x) when μ = 0, σ = 1 and for various values of λ are provided. These graphs indicate that the exponential - N {log - logistic} distribution is always left skewed. Also, the skewness increases as λ decreases.

4.3 The logistic - N{logistic} distribution

If a random variable T follows the logistic distribution with parameter λ, then f_T(x) = λ e^{- λx}(1 + e^{- λx})^-2, λ > 0. From (2.12), the PDF of logistic-N{logistic} distribution is defined as

f_{X} (x) = \frac{λ ϕ (x) Φ^{λ - 1} (x) {(1 - Φ (x))}^{λ - 1}}{{[Φ^{λ} (x) + {(1 - Φ (x))}^{λ}]}^{2}} .

(4.3)

From (2.11), the CDF of (4.3) is given by $F_{X} (x) = \frac{Φ^{λ} (x)}{Φ^{λ} (x) + {(1 - Φ (x))}^{λ}} .$

When λ = 1, (4.3) reduces to the normal distribution. In Figure 4, various graphs of f_X(x) when μ = 0, σ = 1 and for various values of λ are provided. These graphs indicate that the PDF of logistic - N {logistic} can be bimodal and the bimodality occurs for small values of λ. Also, it is easy to see from the PDFs in (4.3) that the distribution is symmetric for all values of λ.

4.4 The logistic - N{extreme value} distribution

If a random variable T follows the logistic distribution with parameter λ, then f_T(x) = λ e^{- λx}(1 + e^{- λx})^{- 2}, λ > 0. From (2.14), the PDF of the logistic - N {extreme value} distribution is defined as

f_{X} (x) = \frac{λ h_{ϕ} (x) {H_{ϕ}}^{λ - 1} (x)}{{(1 + {H_{ϕ}}^{λ} (x))}^{2}} .

(4.4)

From (2.13), the CDF of (4.4) is given by $F_{X} (x) = \frac{{H_{ϕ}}^{λ} (x)}{1 + {H_{ϕ}}^{λ} (x)} .$

In Figure 5, various graphs of f_X(x) when μ = 0, σ = 1 and for various values of λ are provided. These graphs indicate that the distribution is always right skewed. Also, the skewness increases as λ decreases.

5 Applications

To illustrate the flexibility of the GN distributions, we fit some GN distributions to a unimodal data set and a bimodal data set. The unimodal data with n = 66 in Table 2 is obtained from Nichols and Padgett (2006) on the breaking stress of carbon fibers of 50 mm in length. Alzaatreh at el. (2013) fitted the data set to the gamma-normal distribution. They showed that the standard gamma-normal distribution with μ = 0 and σ = 1 provides a good fit to the data set. The standard form of exponential - N {exponential}, exponentiated exponential - N {exponential} and Weibull - N {exponential} distributions with μ = 0 and σ = 1 are applied to fit the data set and the results compared with the results from standard gamma-normal distribution. The maximum likelihood estimates, the log-likelihood value, the AIC (Akaike Information Criterion), the Kolmogorov-Smirnov (K-S) test statistic, and the p-value for the K-S statistic for the fitted distributions are reported in Table 3. The results in Table 3 show that all the generalized normal distributions give an adequate fit to the data. However, the K-S values indicate that the gamma - N {exponential} distribution provides the best fit among the distributions. Figure 6 displays the histogram and the fitted density functions for the data.

Table 2 Breaking stress of carbon fibers data

Full size table

Table 3 Parameter estimates for the carbon fibers data

Full size table

The second application is on a bimodal data set obtained from Emlet et al. (1987) on the asteroid and echinoid egg size. The data is available from the first author. The data consists of 88 asteroid species divided into three types; 35 planktotrophic larvae, 36 lecithotrophic larvae, and 17 brooding larvae. Since the logarithm of the egg diameters of the asteroids data has a bimodal shape, Famoye et al. (2004) applied the beta-normal distribution to the logarithm of the data set. We apply the logistic-N{logistic} distribution, which can be bimodal, to fit the same data. The results of the maximum likelihood estimates, the log-likelihood value, the AIC (Akaike Information Criterion), the Kolmogorov-Smirnov (K-S) test statistic, and the p-value for the K-S statistic for the fitted distributions are reported in Table 4. The results in Table 4 show that both the beta-normal and logistic-N{logistic} distributions give an adequate fit to the data. However, the K-S values indicate that the logistic-N{logistic} distribution provides a better fit. Figure 7 displays the histogram and the fitted density functions for the data.

Table 4 Parameter estimates for the asteroids data

Full size table

6 Summary and conclusions

The normal distribution is the most commonly used distribution in both statistical theory and applications. The generalization of the normal distribution is studied using the T - X framework proposed by Alzaatreh et al. (2013). Four types of generalized normal families from the quantile functions of the (i) exponential, (ii) log-logistic, (iii) logistic, and (iv) extreme value distributions are proposed. Some general properties are studied. Four generalized normal distributions are described and some of their properties investigated. It is noticed that the shapes of GN distributions can be symmetric, skewed to the right, skewed to the left or bimodal. This gives the families some flexibility in fitting real world data. Because the GN distributions include the normal distribution as a special case, using the GN distributions to fit data enables one to check if the additional parameters characterize the deviation from the normal distribution. Many types of generalizations of the normal distribution can be derived using the methodology described in this paper. Due to the fact that GN distributions are natural extensions from the normal distribution, statistical modeling by assuming the error term follows some form of GN distribution will be an interesting topic for future research.

References

Alexander C, Cordeiro GM, Ortega EMM, Sarabia JM: Generalized beta-generated distributions. Comput. Stat. Data Anal. 2012, 56(6):1880–1897.
Article MathSciNet Google Scholar
Aljarrah MA, Lee C, Famoye F: A method of generating T-X family of distributions using quantile functions. J. Stat. Distributions Appl. 2014, 1(2):17.
Google Scholar
Alzaatreh A, Famoye F, Lee C: The gamma-normal distribution: Properties and applications. Comput. Stat. Data Anal. 2014, 69(1):67–80.
Article MathSciNet Google Scholar
Alzaatreh A, Lee C, Famoye F: A new method for generating families of continuous distributions. Metron 2013, 71(1):63–79.
Article MathSciNet Google Scholar
Arellano-Valle RB, Gómez HW, Quintanan FA: A new class of skew-normal distributions. Commun. Stat.-Theory Methods 2004, 33: 1465–1480.
Article MathSciNet Google Scholar
Arnold BC, Beaver RJ: Skewed multivariate models related to hidden truncation and/or selective reporting (with discussion). Test 2002, 11: 7–35.
Article MathSciNet Google Scholar
Arnold BC, Castillo E, Jose JM: Distributions with generalized skewed conditionals and mixtures of such distributions. Commun. Stat.-Theory Methods 2007, 36: 1493–1503.
Article MathSciNet Google Scholar
Azzalini A: A class of distributions which includes the normal ones. Scand. J. Stat. 1985, 12: 171–178.
MathSciNet Google Scholar
Balakrishnan N: Discussion of skewed multivariate models related to hidden truncation and/or selective reporting by Arnold & Beaver. Test 11: 37–39.
Choudhury K, Abdul MM: Extended skew generalized normal distribution. Metron 2002, 69: 265–278.
Article MathSciNet Google Scholar
Cordeiro GM, de Castro M: A new family of generalized distributions. J. Stat. Comput. Simulat. 2011, 81(7):883–898.
Article MathSciNet Google Scholar
de Moivre A: Approximatio ad summam ferminorum binomii (a + b)n in seriem expansi. Self-published pamphlet 1733.
Google Scholar
Emlet RB, McEdward LR, Strathmann RR: Echinoderm larval ecology viewed from the egg. In Echinoderm Studies, volume 2. Edited by: Jangoux M, Lawrence JM. AA Balkema, Rotterdam; 1987.
Google Scholar
Eugene N, Lee C, Famoye F: The beta-normal distribution and its applications. Commun. Stat.-Theory Methods 2002, 31(4):497–512.
Article MathSciNet Google Scholar
Famoye F, Lee C, Eugene N: Beta-normal distribution: bimodality properties and applications. J. Mod. Appl. Stat. Meth. 2004, 3(1):85–103.
Google Scholar
Fernández C, Steel MFJ: On Bayesian modeling of fat tails and skewness. J. Am. Stat. Assoc. 1998, 93: 359–371.
MathSciNet Google Scholar
Ferreira JTAS, Steel MFJ: A constructive representation of univariate skewed distributions. J. Am. Stat. Assoc. 2006, 101: 823–829.
Article MathSciNet Google Scholar
Gauss CF: Theoria Motus Corporum Coelestium, pp. 205–224. Perthes u. Besser, Hamburg; 1809. Lib. 2, Sec. III Lib. 2, Sec. III
Google Scholar
Gauss, CF: Bestimmung der genauigkeit der beobachtungen. Zeitschrift Astronomi 1816, 1: 185–197.
Google Scholar
Gupta RC, Gupta RD: Generalized skew normal model. Test 2004, 13: 501–520.
Article MathSciNet Google Scholar
Johnson NL, Kotz S, Balakrishnan N: Continuous Univariate Distributions, volume 1. John Wiley & Sons, New York; 1994.
Google Scholar
Kotz S, Vicari D: Survey of developments in the theory of continuous skewed distributions. Metron 2005, 63: 225–261.
MathSciNet Google Scholar
Lee C, Famoye F, Alzaatreh A: Methods for generating families of univariate continuous distributions in the recent decades. WIREs Comput. Stat. 2013, 5: 219–238.
Article Google Scholar
Nichols MD, Padgett WJ: A bootstrap control for Weibull percentiles. Qual. Reliab. Eng. Int. 2006, 22: 141–151.
Article Google Scholar
Patel JK, Read CB, Handbook of the Normal Distribution Marcel Dekker, New York; 1996.
Google Scholar
Pourahmadi M: Construction of skew-normal random variables: are they linear combination of normals and half-normals? J. Stat. Theory Appl. 2007, 3: 314–328.
MathSciNet Google Scholar
Rényi A: On measures of entropy and information. In Proceedings of the Fourth Berkeley Symposium on Mathematical Statistics and Probability. University of California Press, Berkeley, CA; 1961.
Google Scholar
Sharafi M, Behboodian J: The Balakrishnan skew-normal density. Stat. Pap. 2008, 49: 769–778.
Article MathSciNet Google Scholar
Song KS: Rényi information, loglikelihood and an intrinsic distribution measure. J. Stat. Plann. Infer. 2001, 93: 51–69.
Article Google Scholar
Wolfram . Retrieved on May 29, 2014 http://functions.wolfram.com/GammaBetaErf/InverseErf/06/01/02/0004/ . Retrieved on May 29, 2014
Yadegari I, Gerami A, Khaledi MJ: A generalization of the Balakrishnan skew-normal distribution. Stat. Probability Lett. 2008, 78: 1165–1167.
Article MathSciNet Google Scholar

Download references

Acknowledgments

The authors are very grateful to the Associate Editor and the three anonymous reviewers for various constructive comments and suggestions that have greatly improved the presentation of the paper. The authors particularly thank one of the reviewers for suggesting the unified notation to define the T-R{Y} family of distributions.

Author information

Authors and Affiliations

Department of Mathematics and Statistics, Austin Peay State University, Clarksville, TN, 37044, USA
Ayman Alzaatreh
Department of Mathematics, Central Michigan University, Mount Pleasant, MI, 48859, USA
Carl Lee & Felix Famoye

Authors

Ayman Alzaatreh
View author publications
You can also search for this author in PubMed Google Scholar
Carl Lee
View author publications
You can also search for this author in PubMed Google Scholar
Felix Famoye
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Ayman Alzaatreh.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

The authors, viz AA, CL and FF with the consultation of each other carried out this work and drafted the manuscript together. All authors read and approved the final manuscript.

Ayman Alzaatreh, Carl Lee and Felix Famoye contributed equally to this work.

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Authors’ original file for figure 4

Authors’ original file for figure 5

Authors’ original file for figure 6

Authors’ original file for figure 7

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0), which permits use, duplication, adaptation, distribution, and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Reprints and permissions

About this article

Cite this article

Alzaatreh, A., Lee, C. & Famoye, F. T-normal family of distributions: a new approach to generalize the normal distribution. J Stat Distrib App 1, 16 (2014). https://doi.org/10.1186/2195-5832-1-16

Download citation

Received: 19 February 2014
Accepted: 23 June 2014
Published: 08 July 2014
DOI: https://doi.org/10.1186/2195-5832-1-16

T-normal family of distributions: a new approach to generalize the normal distribution

Abstract

AMS 2010 Subject Classification

1 Introduction

2 T-normal families of distributions

Remark 1.

2.1 Family of GN distributions from the quantile function of exponential distribution (T- N{exponential})

2.2 Family of GN distributions from the quantile function of log-logistic distribution (T- N{log-logistic})

2.3 Family of GN distributions from the quantile function of logistic distribution (T- N{logistic})

2.4 Family of GN distributions from the quantile function of extreme value distribution (T- N{extreme value})

3 Some properties of the T- normal family of distributions

Lemma 1

Proof.

Lemma 2.

Proof.

Theorem 1.

Proof.

Corollary 1.

Theorem 2.

Proof.

Corollary 2.

Proof.

Theorem 3.

Proof.

Theorem 4.

Proof.

4 Some examples of GN families of distributions with different T distributions

4.1 The Weibull - N{exponential} distribution

Remark 2

4.2 The exponential - N{log-logistic} distribution

4.3 The logistic - N{logistic} distribution

4.4 The logistic - N{extreme value} distribution

5 Applications

6 Summary and conclusions

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Additional information

Competing interests

Authors’ contributions

Authors’ original submitted files for images

Rights and permissions

About this article

Cite this article

Share this article

Keywords