代做MATH2010 Statistical Modelling I SEMESTER 1 EXAMINATION 2020/21代写Java程序

MATH2010 Statistical Modelling I

SEMESTER 1 EXAMINATION 2020/21

1.    [25 marks] Power laws are common to describe natural phenonomen in areas such as physics, biology and economics. A simple power law for a response variable Y   and explanatory variable x is given by Y = xβ . Assuming Y is actually observed     with some error, the linear model

logYi = β log xi + εi                                                         (1)

could be used to estimate β from data (yi , xi ), i = 1, . . . , n, where εi  ~ N(0, σ2 ) and εi , εj are assumed independent for all i ≠ j.

(a)  [7 marks] Show that the least squares estimator of β has the form.

and derive its expected value and variance.     (b)  [8 marks] An alternative estimator is given by

Derive the mean and variance of β, and hence show that β is unbiased but has

variance at least as large as the variance of β(^) .

(c)  [10 marks] The below data set gives the period (yi; seconds) of an oscilating object at the end of a spring for five objects of different masses (xi; kg).

Using these data and assuming model (1), find a 95% confidence interval for β and test the null hypothesis that β = 0.5. Show your working; if you use R, or other software, to answer this question, please clearly state the commands you use.

You may find the following quantities from R useful:

qt(0.975, 3) ## [1] 3.18 qt(0.975, 4) ## [1] 2.78 qt(0.975, 5) ## [1] 2.57

2.    [25 marks] Consider the usual multiple linear regression model Y = + ε ,

with Y an n × 1 response vector, X an n × (k + 1) design matrix, β a k + 1 vector of unknown parameters and ε ~ N (0, σ2In ). Assume least squares will be used to  estimate β .

(a)  [4 marks] Write down expressions for the vector of fitted valuesY(^) and the vector

of residuals R in terms of Y and the hat matrix H.

(b)  [6 marks] Find the expectation and variance-covariance matrix for each ofY(^) and R.

(c)  [8 marks] Show that the cross-covariance matrix cov(Y(^) , R) is a matrix with

every entry zero. Explain the importance of this result for model checking via residual plots.

(d)  [2 marks] Residual plots are commonly used to check linear model assumptions. Which model assumptions are checked by

(i) a plot of residuals against fitted values? (ii) a normal probability plot of residuals?

(e)  [5 marks] Sketch examples of how the residual plots above would give evidence that

(i) the model assumptions are adequate.

(ii) the error distribution has rightskew.

(iii) the variance of error term increases as a function of the mean of Yi.

3.    [25 marks] A study on the taste of cheddar cheese recorded a subjective taste score (taste) and the concentrations of acetic acid (Acetic),lactic acid (Lactic) and hydrogen sulfice (H2S) on 30 samples. A multiple linear model was fitted that regressed taste on the three explanatory variables (with H2S logged before being   included). Partial results from fitting the model are given below, and some quantities from R are given on the next page.

(a)  [2 marks] Write down the fitted regression model.

(b)  [2 marks] How many degrees of freedom are left to estimate the residual standard error?

(c)  [4 marks] Conduct a test to determine if logH2S is a significant explanatory variable at the 5% level.

(d)  [4 marks] Construct a 99% confidence interval for β2 , the coefficient of Lactic in the regression model.

(e)  [3 marks] Test the null hypothesis H0  : β2  = 5 against the alternative H1  : β2 5 at the 1% level of significance.

(f)  [10 marks] Consider the partial output from running the anova command in R using these data.

## Analysis of Variance Table ## ## Model 1: taste ~ 1 ## Model 2: taste ~ Acetic + Lactic + log(H2S) ## Res.Df RSS Df Sum of Sq F Pr(>F) ## 1 7663 ## 2 2697

Using this information, compare the null and full regression models using an

F-test. What is the value for your test statistic, and which model do you prefer?

You may find the following quantities from R useful:

qt(0.95, 26) ## [1] 1.71 qt(0.975, 26) ## [1] 2.06 qt(0.995, 26) ## [1] 2.78 qf(0.95, 3, 26) ## [1] 2.98 qf(0.975, 3, 26) ## [1] 3.67

4.    [25 marks] Data were collected from n = 21 consecutive days at a plant for the

oxidation of ammonia to nitric acid. The response is 10 times the percentage of the ingoing ammonia to the plant that escapes from the absorption column unabsorbed; that is, an (inverse) measure of the over-all efficiency of the plant. There are m = 3 explanatory variables as given in the table below.

A series of eight linear regression models, labelled A, B, . . . , H, are fitted. The table   below shows the residual sum of squares (RSS, to 3 decimal places) of each of these models where the x or - in the columns headed x1 , x2 and x3 indicates whether the    model includes (x) or excludes (-) the corresponding explanatory variable.

(a)  [1 mark] For each model (A, B, . . . H), write down the value of k, the number of explanatory variables.

(b)  [1 mark] Explain why the residual sum of squares (RSS) cannot be used for modelselection.

(c)  [7 marks] For each model (A, B, . . . H), calculate the adjusted R2 . Other than the null model, which model has the smallest value of this quantity?

(d)  [2 marks] How do AIC and BIC overcome the issues inherent with using the RSS to perform model selection? If the models selected using AIC and BIC differ,

which would include more explanatory variables?

(e)  [6 marks] For each model (A, B, . . . H), calculate the value of AIC and BIC using the definitions given in lectures. Determine the final chosen model under each of  AIC and BIC. Comment on any differences between the models selected using

AIC, BIC and adjusted R2 .

(f)  [4 marks] For each of forwards and backwards selection with BIC:

(i) For which models would you not need to calculate BIC?

(ii) Which is the final model chosen?

(g)  [4 marks] How many possible two-way interactions are there between the

explanatory variables in this study? How many models can be constructed

including two-way interactions, both with and without imposing effect heredity?

Learning objectives:

LO1 Use the theory of linear models and matrix algebra to investigate standard and non- standard problems.

LO2 Interpret the output from an analysis including the meaning of interactions and terms based on qualitative factors.

LO3 Understand how to make a critical appraisal of a fitted model.

LO4 Carry out t-tests and calculate confidence intervals by hand and by computer. LO5 Using a variety of procedures for variable selection.

LO6 Fit multiple regression models using the adopted software package.

LO7 Carry out simple linear regression by computer.

LO6 and LO7 are assessed via coursework.









热门主题

课程名

mktg2509 csci 2600 38170 lng302 csse3010 phas3226 77938 arch1162 engn4536/engn6536 acx5903 comp151101 phl245 cse12 comp9312 stat3016/6016 phas0038 comp2140 6qqmb312 xjco3011 rest0005 ematm0051 5qqmn219 lubs5062m eee8155 cege0100 eap033 artd1109 mat246 etc3430 ecmm462 mis102 inft6800 ddes9903 comp6521 comp9517 comp3331/9331 comp4337 comp6008 comp9414 bu.231.790.81 man00150m csb352h math1041 eengm4100 isys1002 08 6057cem mktg3504 mthm036 mtrx1701 mth3241 eeee3086 cmp-7038b cmp-7000a ints4010 econ2151 infs5710 fins5516 fin3309 fins5510 gsoe9340 math2007 math2036 soee5010 mark3088 infs3605 elec9714 comp2271 ma214 comp2211 infs3604 600426 sit254 acct3091 bbt405 msin0116 com107/com113 mark5826 sit120 comp9021 eco2101 eeen40700 cs253 ece3114 ecmm447 chns3000 math377 itd102 comp9444 comp(2041|9044) econ0060 econ7230 mgt001371 ecs-323 cs6250 mgdi60012 mdia2012 comm221001 comm5000 ma1008 engl642 econ241 com333 math367 mis201 nbs-7041x meek16104 econ2003 comm1190 mbas902 comp-1027 dpst1091 comp7315 eppd1033 m06 ee3025 msci231 bb113/bbs1063 fc709 comp3425 comp9417 econ42915 cb9101 math1102e chme0017 fc307 mkt60104 5522usst litr1-uc6201.200 ee1102 cosc2803 math39512 omp9727 int2067/int5051 bsb151 mgt253 fc021 babs2202 mis2002s phya21 18-213 cege0012 mdia1002 math38032 mech5125 07 cisc102 mgx3110 cs240 11175 fin3020s eco3420 ictten622 comp9727 cpt111 de114102d mgm320h5s bafi1019 math21112 efim20036 mn-3503 fins5568 110.807 bcpm000028 info6030 bma0092 bcpm0054 math20212 ce335 cs365 cenv6141 ftec5580 math2010 ec3450 comm1170 ecmt1010 csci-ua.0480-003 econ12-200 ib3960 ectb60h3f cs247—assignment tk3163 ics3u ib3j80 comp20008 comp9334 eppd1063 acct2343 cct109 isys1055/3412 math350-real math2014 eec180 stat141b econ2101 msinm014/msing014/msing014b fit2004 comp643 bu1002 cm2030
联系我们
EMail: 99515681@qq.com
QQ: 99515681
留学生作业帮-留学生的知心伴侣!
工作时间:08:00-21:00
python代写
微信客服:codinghelp
站长地图