Industrial Data Analytics for Diagnosis and Prognosis. Yong Chen. Математика. . Читать онлайн. Литмир. LITMIR.BIZ

Industrial Data Analytics for Diagnosis and Prognosis. Yong Chen

Читать онлайн.

Информация о произведении:

Название Industrial Data Analytics for Diagnosis and Prognosis

Год выпуска 0

isbn 9781119666301

Автор произведения Yong Chen

Жанр Математика

Серия

Издательство John Wiley & Sons Limited

Industrial Data Analytics for Diagnosis and Prognosis - Yong Chen

Скачать книгу

parenthesis less or equal than fraction numerator left parenthesis straight n minus 1 right parenthesis straight p over denominator straight n minus straight p end fraction straight F subscript straight alpha comma straight p comma straight n minus straight p end subscript right curly bracket. end cell end table"/>

which can be interpreted as the standardized distance between the sample mean X̄ and μ0. The distance is standardized by S/n, which is equal to the sample covariance matrix of X̄. When the standardized distance between X̄ and μ0 is beyond the critical value given in the right-hand side of (3.22), the true mean is not likely equal to be μ0 and we reject H₀.

The concept of univariate confidence interval can be extended to multivariate confidence region. For p-dimensional normal distribution, the 100(1 − α)% confidence region for μ is defined as

$table row cell left curly bracket mu vertical line n left parenthesis bold x with bold bar on top minus bold italic mu right parenthesis to the power of T bold S to the power of negative 1 end exponent left parenthesis bold x with bold bar on top minus bold italic mu right parenthesis less or equal than fraction numerator left parenthesis n minus 1 right parenthesis p over denominator n minus p end fraction F subscript alpha comma p comma n minus p end subscript right curly bracket. end cell end table$

It is clear that the confidence region for μ is an ellipsoid centered at x̄. Similar to the univariate case, the null hypothesis H₀ :μ = μ₀ is not rejected at level α if and only if μ₀ is in the 100(1 − α)% confidence region for μ.

The T²-statistic can also be derived as the likelihood ratio test of the hypotheses in (3.20). The likelihood ratio test is a general principle of constructing statistical test procedures and having several optimal properties for reasonably large samples. The detailed study of the likelihood ratio test theory is beyond the scope of this book.

Substituting the MLE of μ and Σ in (3.16) and (3.17), respectively, into the likelihood function in (3.13), it is easy to see

$table row cell max with bold italic mu comma bold capital sigma below L left parenthesis bold italic mu comma bold capital sigma right parenthesis equals fraction numerator 1 over denominator left parenthesis 2 pi right parenthesis to the power of n p divided by 2 end exponent vertical line bold capital sigma with bold hat on top vertical line to the power of n divided by 2 end exponent end fraction e to the power of negative n p divided by 2 end exponent comma end cell end table$

where is the MLE of Σ given in (3.17). Under the null hypothesis H₀ : μ = μ₀, the MLE of Σ with μ = μ₀ fixed can be obtained as

table row cell bold capital sigma with bold hat on top subscript 0 equals 1 over n sum from i equals 1 to n of left parenthesis bold x subscript i minus bold italic mu subscript 0 right parenthesis left parenthesis x subscript i minus bold italic mu subscript 0 right parenthesis to the power of T. end cell end table

It can be seen that stack sum subscript 0 with hat on top is the same as except that X̄ is replaced by μ₀.

The likelihood ratio test statistic is the ratio of the maximum likelihood over the subset of the parameter space specified by H₀ and the maximum likelihood over the entire parameter space. Specifically, the likelihood ratio test statistic of H₀ : μ = μ₀ is

$table row cell L R equals fraction numerator m a x subscript bold capital sigma L left parenthesis bold italic mu subscript 0 comma bold capital sigma right parenthesis over denominator m a x subscript bold italic mu comma bold capital sigma end subscript L left parenthesis bold italic mu comma bold capital sigma right parenthesis end fraction equals left parenthesis fraction numerator vertical line bold capital sigma with bold hat on top vertical line over denominator vertical line bold capital sigma with bold hat on top subscript 0 vertical line end fraction right parenthesis to the power of n divided by 2 end exponent. end cell end table$ (3.23)

The test based on the T²-statistic in (3.21) and the likelihood ratio test is equivalent because it can be shown that

$table row cell L R equals open parentheses 1 plus fraction numerator T squared over denominator n minus 1 end fraction close parentheses to the power of negative n divided by 2 end exponent. end cell end table$ (3.24)

Example 3.2: Hot rolling is among the key steel-making processes that convert cast or semi-finished steel into finished products. A typical hot rolling process usually includes a melting division and a rolling division. The melting division is a continuous casting process that melts scrapped metals and solidifies the molten steel into semi-finished steel billet; the rolling division will further squeeze the steel billet by a sequence of stands in the hot rolling process. Each stand is composed of several rolls. The side_temp_defect data set contains the side temperature measurements on 139 defective steel billets at Stand 5 of a hot rolling process where the side temperatures are measured at 79 equally spaced locations spread along the stand. In this example, we focus on the three measurements at locations 2, 40, and 78, which correspond to locations close to the middle and the two ends of the stands. The nominal mean temperature values at the three locations are 1926, 1851, and 1872, which are obtained based on a large sample of billets without defects. We want to check if the defective billets have significantly different mean side temperature from the nominal values. We can, therefore, test the hypothesis

table row cell H subscript 0 colon bold mu equals open parentheses table row 1926 row 1851 row 1872 end table close parentheses end cell end table

The following R codes calculate the sample mean, sample covariance matrix, and the T²-statistic for the three side temperature measurements.

side.temp.defect <- read.csv("side_temp_defect.csv",

header = F) X <- side.temp.defect[, c(2, 40, 78)] mu0 <- c(1926, 1851, 1872) x.bar <- apply(X, 2, mean) # sample mean S <- cov(X) # sample var-cov matrix n <- nrow(X) p <- ncol(X) alpha = 0.05 T2 <- n*t(x.bar-mu0)%*%solve(S)%*%(x.bar -mu0) F0 <- (n-1)*p/(n-p)*qf(1-alpha, p, n-p) p.value <- 1 - pf((n-p)/((n-1)*p)*T2, p, n-p)

Using the above R codes, the sample mean and sample covariance matrix are obtained as

table row cell top enclose bold x equals open parentheses table row 1930 row 1848 row 1864 end table close parentheses comma space bold S equals open parentheses table row cell 2547.4 end
<p style=

Скачать книгу

Новинки

Популярные

Наши рекомендации

ТОП просматриваемых книг сайта:

Industrial Data Analytics for Diagnosis and Prognosis. Yong Chen

Информация о произведении: