Research School of Finance, Actuarial Studies and Statistics
ASSIGNMENT
Semester 2, 2024
STAT7055 Introductory Statistics for Business and Finance
INSTRUCTIONS TO STUDENTS
Due Date
• The assignment is due at 9:00am on Friday October 11.
• Late submission of the assignment is not permitted. An assignment submitted without an extension after the due date will receive a mark of 0.
Writing your Assignment
• The assignment is an individual piece of assessment and must be completed on your own.
• You are not permitted to use any form. of tutoring services (e.g., online, in-person, etc.) or any AI tools (e.g., ChatGPT, etc.).
• You will be required to write a report in an R Markdown document that contains R code (with R code comments), R output and written text. An example of an R Markdown document, which you can use as a template, has been provided on Wattle.
• All R code must have accompanying R code comments that sufficiently describe what the code is doing.
• When answering the assignment questions in your report, you will need to include all your R code and R output that you used to calculate any answers and you must also write your answers in proper sentences. For example, if you are required to calculate a sample mean, then you would include your R code for calculating the sample mean and the R output of the sample mean value and you would also write a proper sentence in the report such as “The sample mean is equal to ...” .
• Make sure to be clear and concise in your answers.
• A good way to approach writing your report is to imagine that you are a statistical consultant and that a client has asked you to do some statistical analyses. When presenting the results of your analyses to the client, you wouldn’t just give them pages of R code, R output, calculations, etc. Rather, you should give them a proper report which clearly outlines and explains the results of the analyses and which also includes the R code and R output used to produce the results.
• Therefore, presentation is very important. Marks will be deducted for poorly presented reports.
• Once you have finished writing your report in your R Markdown document, you will need to render the document by pressing the Knit button in RStudio to create a HTML file of your report.
• Further to the above point, it is good practice to regularly Knit your R Markdown document as you write your report. This is useful for checking that it’s rendering properly.
Submitting your Assignment
• Submission of the assignment will be through Wattle and further details regarding assignment submission will be provided on Wattle.
• For submission you will need to submit two files: the R Markdown file of your report (i.e., a “ .Rmd” file) and the rendered HTML file of your report produced by pressing the Knit button in RStudio (i.e., a “ .html” file).
• Please name your two files as “uNNNNNNN.Rmd” and “uNNNNNNN.html” , where uNNNNNNN is your student number.
• No other file types will be accepted or marked, e.g., “ .R” , “ .docx” , “ .RData” , “ .zip” , etc. In particular, do not submit any compressed files.
Other Important Details
• You may only use built-in functions available in the default installation of R and you are not permitted to use functions in any additional R packages (e.g., ggplot2).
• You must use the appropriate R functions (and not the statistical tables) to calculate any critical values or p-values used when performing any hypothesis tests.
• You must use R for all calculations.
• Round all final numeric answers to 4 decimal places. However, as you will be using R, keep all decimals during all intermediate steps to ensure the accuracy of your final numeric answer.
• Please use the help function if you want to learn more about a particular R function, e.g., enter help(mean) in the R console to learn more about the mean function.
• For questions that require writing mathematical symbols, you are welcome to use short- hand notation, provided you make the meaning clear (e.g., using “Mu” for μ , or “ !=” for ).
• Answers (including hypotheses, explanations, conclusions, etc.) need to be written in the text of the R Markdown document and not in the R code comments or the R output.
• Do not print out entire data sets in your R Markdown document or HTML file, as this will only take up unnecessary space.
Question 1 [11 marks]
A random sample of students undertaking a first year management course was se- lected and for each student, their mid-semester exam mark out of 100 (X1), their fi- nal exam mark out of 100 (X2) and the degree they studied (X3) were recorded. You have been asked to perform. some analyses on this data. The data are stored in the file AssignmentData .RData in the data frame. marks .df.
(a) [3 marks] Construct a single boxplot to describe the distribution of final exam marks for students who are studying a finance degree. Make sure to give your plot a proper descriptive title and an appropriate axis label (do not just use the default title or label). Based on this boxplot, provide a clear description of the distribution of final exam marks for students who are studying a finance degree. Be specific in your description, making sure to mention any interesting and/or important aspects of the distribution.
(b) [4 marks] Test whether the population mean mid-semester exam mark for students studying a commerce degree is greater than the population mean mid-semester exam mark for students studying an economics degree by more than 0.4 marks. State any assumptions that you may need to make (there is no need to formally test any assumptions). Clearly state your hypotheses, making sure to define any parameters, and use a significance level of α = 4%. Do not use any R functions that are designed to perform. hypothesis tests.
(c) [4 marks] For students studying either a business or commerce degree, test whether the population mean final exam mark is greater than the population mean mid- semester exam mark by more than 2.1 marks. State any assumptions that you may need to make (there is no need to formally test any assumptions). Clearly state your hypotheses, making sure to define any parameters, and use a significance level of α = 6%. Do not use any R functions that are designed to perform. hypothesis tests.
Presentation [4 marks]
Marks will be allocated for how well presented your report is, e.g., clear and distinct headings, concise answers with information clearly communicated, all R code sufficiently commented, etc.