代做PSY4101 Assignment 2 (individual work): Data analysis problem set代做Python语言

2025-2026, Term 1

PSY4101 Assignment 2 (individual work): Data analysis problem set

Submit ONE SOFTCOPY (AS A PDF FILE) ON TURNITIN

Instructions

- Write/Type your name and student ID clearly in your submission.

- Mark the problem number and subpart number clearly for each answer you provide.

- Your answers can be handwritten, word-processed (e.g., using Google Docs or MS Word), or in a mix of both. For handwritten answers, please make sure your handwriting is legible.

- For all questions related to null-hypothesis significance tests (NHST), set α = 0.05 unless otherwise specified.

- Please ensure that all numerical answers are rounded to three decimal places.

- For each analysis in jamovi, please provide the syntax generated from jamovi (jamovi user guide for enabling syntax mode). This syntax allows us to trace your analysis, so that, if you make an error, we can identify the type of error and give you partial credit if possible.

- Use of generative AI declaration (Write statements (see Section B of the notes on assessment items for details) below regardless of whether or not you have used AI in any parts of your work.
If you have used any generative artificial intelligence (AI) (e.g., software supported by large language models or LMMs such as ChatGPT) in any part of your work, please include a “use of generative AI statement” at the end of your work to declare when, where, how, and how much you have used them in your work.

- Using generative AI per se does not result in any penalty, but failure to appropriately and adequately acknowledge the use of it could be regarded as plagiarism. If you use generative AI in your work, the more details you include in your statement, the less likely you will face such a penalty.

Below are some example statements:

“I asked ChatGPT questions about the topic, and it gave me a list of readings to start with. Below are the questions I asked ChatGPT and the answers given by ChatGPT…”

“I used generative AI to improve my writing / check for grammatical errors / get suggestions of words/expressions only after I had written the first draft all by myself.”

“I did not use any generative AI in any part of this work.”

Please insert your use of generative AI declaration in the box below:
IMPORTANT: 50% of your final mark will be deducted if the statements are not included

Assignment Scoring Instructions:

- Please note that the total score for this assignment is 125%. However, we will cap the maximum score at 100%. This means that if your score exceeds 100%, it will be recorded as 100%. If your score is 100% or below, it will be recorded as is.

- This structure provides you with "extra room" to earn points, not "extra points." It is meant to encourage you to do your best with less pressure of losing points.

Problem 1 (50%)

Pat is an analyst at a teaching-quality unit of a university and is interested in finding out the factors affecting students’ performance in exams. Specifically, Pat would like to study how personality (measured based on the Big-Five scale), behavior. before exam (measured as the number of hours spent on revision before an exam), and emotion before exam (measured as anxiety level) may influence a student’s exam performance (measured as exam scores). The dataset contains questionnaire data from a sample of 103 students.

Dataset for Problem 1: Problem 1_data.omv

A. (30%) One of Pat’s original hypotheses was that anxiety is a significant predictor of exam scores even after the effects of personality have been controlled for.

To test this hypothesis, first conduct a linear regression with exam scores being the outcome and all 5 personality scores as the predictors.

i. (6%) For each of the following assumptions of linear regression, state whether there is any evidence that the assumption was violated in this regression analysis. Report relevant statistics and/or figures to support your answer. If the assumption was violated, justify whether we should proceed with the analysis.

a. (3%) the linear-independence assumption

b. (3%) the normality assumption

ii. (4%) Write down the regression equation with the coefficient estimates.

iii. (5%) Which personality factor yielded the smallest p value as a predictor in this model? Test whether this factor significantly predicted exam performance and write a sentence to interpret the regression coefficient of this predictor.

iv. (5%) Did the model have significant explanatory power on exam scores? Write a sentence to report how much variability in exam scores was explained by personality and test whether it was significant.

Now, insert anxiety into the model with the 5 personality factors already included in the model.

v. (6%) Did anxiety provide any additional explanatory power, over and above personality, in predicting exam performance? Test this and report the results in no more than 2 sentences.

vi. (4%) Given the regression results above, what would you expect to see if you extract the part (aka semipartial) correlation between anxiety and exam scores with personality factors being partialed out? Briefly explain your expectation and verify it using jamovi.

B. (20%) Before collecting the data, Pat had two other hypotheses, which are stated below. For each of the hypotheses, conduct an appropriate analysis and write a short paragraph in a “result-section” style. (with relevant statistics and/or graph(s)) to support your answer.

i. (10%) Pat hypothesized that revision duration would moderate the effect of anxiety on exam performance by weakening the effect of anxiety on exam performance.
Include a simple-slope analysis to interpret how revision duration moderated the relationship between anxiety and exam performance.

ii. (10%) To explain how neuroticism influences revision duration, Pat hypothesizes that anxiety is a mediator. Report whether the mediation is complete or partial, and whether it is consistent or inconsistent.

Problem 2 (40%)

Charlie is interested in the genetic and environmental effects on the body metrics on newborns and collected data from a sample of 42 cases of newborns, including body metrics of each newborn such as body length, birth weight, head circumference, possible sources of genetic influence such as father’s and mother’s heights, and other environmental factors such as father’s years of education. One key factor that Charlie wanted to focus on was the smoking habit of the parents, especially the mother’s. Charlie hypothesized that a mother’s smoking habit has a negative effect on her newborn’s body metrics.

Dataset for Problem 2: Problem 2_data.omv

A. (12%) Charlie wondered whether the newborn’s body metrics are influenced by a gene-and-environment interaction. Specifically, head circumference could be the result of an interaction between the daily number of cigarettes smoked by the mother and the mother’s height.

Conduct an appropriate analysis with the genetic variable as the original predictor in the model to test this hypothesis. Report your results in a short paragraph, including the interpretation of the coefficient of the interaction effect, supported by a simple-slope analysis.

B. (18%) Charlie would like to find out what factors affect the daily number of cigarettes smoked by the mother as it seems to be an important environmental factor. Charlie noticed that the numbers of cigarettes smoked by the mother and father seem to be correlated, and a potential mediator could be the father’s years of education.

i. (12%) Conduct an analysis to test this hypothesis, and write a short paragraph in “result-section” style. (with relevant statistics and/or graph(s)) to support your answer. Report whether the mediation is complete or partial, and whether it is consistent or inconsistent.

ii. (6%) How strong does the result from this mediation analysis support a causal relationship among the predictor, mediator, and the outcome? Write a few sentences to justify your answer.

C. (10%) One of Charlie’s specific hypotheses was that the newborn’s birth weight would be lower for a smoking mother than that for a non-smoking mother, after controlling for genetic factors.

i. (5%) Test the above hypothesis using an appropriate analysis with father’s and mother’s heights being the two genetic factors to be controlled for. Report your results in a short paragraph.

ii. (2%) Write a sentence to interpret the regression intercept.

iii. (3%) Write a sentence to interpret the regression coefficient of the binary variable of “smoker” (0=non-smoker, 1=smoker) in the full model.

Problem 3 (35%)

You are a research assistant in the Department of Psychology at a large university. The department is conducting a study to understand the factors that influence students' academic performance, specifically their GPA. The university administration is particularly interested in how students' study habits and socioeconomic backgrounds impact their academic success, beyond their prior academic achievement.

To carry out this study, you have access to a dataset from a recent survey of undergraduate students. The survey collected information on their GPA, HKDSE scores, English test scores (IELTS), study habits, and socioeconomic status.

The variables available in the dataset are:

GPA: The students' current college GPA (on a 4.0 scale).

HKDSE: Best 5 scores

IELTS: score of IELTS (Max score = 9).

StudyHours: Average number of study hours per week.

StudyEnv: Quality of the study environment, rated on a scale from 1 (poor) to 10 (excellent).

FamilyIncome: Family's annual income (in HKD1,000 units).

ParentsEdu: Parents' highest education level, coded as follows:

● 1: Primary School or below

● 2: Secondary School

● 3: High diploma or Associate degree

● 4: Bachelor's degree

● 5: Master's degree or above

Dataset for Problem 3: Problem3_data.omv

Note: This dataset is not real and is used for assignment purposes only.

A. The department has tasked you with analyzing this data to identify the key predictors of GPA. Specifically, they want you to control for prior academic achievement (HKDSE) and language ability (IELTS scores) and then test the additional impact of study habits (Quality of the study environment, study hours) and socioeconomic status (family income and parent’s education level).

i. (10%) Conduct a multiple regression analysis to determine whether GPA is predicted by previous test scores, i.e., HKDSE and IELTS scores.

a. (4%) How much of the variance in GPA can be explained by HKDSE and IELTS scores? Report the F-statistic and p-value for the overall model.

b.  (6%) Whether each of the predictors (HKDSE and IELTS scores) significantly predict GPA? Report the findings for each predictor and write a sentence to interpret the regression coefficients of the significant predictor(s).

ii. (20%) Extend your analysis by including additional predictors related to study habits and socioeconomic status into the existing regression model.

a. (4%) How much of the variance in GPA can be explained by all predictors? Report the F-statistic and p-value for the new model.

b. (4%) After accounting for HKDSE and IELTS scores, did study habits and socioeconomic status significantly improve the prediction of college GPA? Write a sentence to report how much variability in GPA was explained by habits and socioeconomic status and test whether it was significant.

c. (12%) In the final model with all predictors included, does each of the study habit and socioeconomic status variables significantly predict GPA with other test scores (HKDSE & IELTS) already in the model? Report the findings for each of these predictors and write a sentence to interpret the regression coefficients of the significant predictor(s).

iii. (5%) Some argue that family income and parents' education may be highly correlated, potentially affecting the results due to one of the assumptions of multiple linear regression models. Which of the assumptions of multiple linear regression relates to this concern? Conduct appropriate test(s) to assess and address this concern.




热门主题

课程名

mktg2509 csci 2600 38170 lng302 csse3010 phas3226 77938 arch1162 engn4536/engn6536 acx5903 comp151101 phl245 cse12 comp9312 stat3016/6016 phas0038 comp2140 6qqmb312 xjco3011 rest0005 ematm0051 5qqmn219 lubs5062m eee8155 cege0100 eap033 artd1109 mat246 etc3430 ecmm462 mis102 inft6800 ddes9903 comp6521 comp9517 comp3331/9331 comp4337 comp6008 comp9414 bu.231.790.81 man00150m csb352h math1041 eengm4100 isys1002 08 6057cem mktg3504 mthm036 mtrx1701 mth3241 eeee3086 cmp-7038b cmp-7000a ints4010 econ2151 infs5710 fins5516 fin3309 fins5510 gsoe9340 math2007 math2036 soee5010 mark3088 infs3605 elec9714 comp2271 ma214 comp2211 infs3604 600426 sit254 acct3091 bbt405 msin0116 com107/com113 mark5826 sit120 comp9021 eco2101 eeen40700 cs253 ece3114 ecmm447 chns3000 math377 itd102 comp9444 comp(2041|9044) econ0060 econ7230 mgt001371 ecs-323 cs6250 mgdi60012 mdia2012 comm221001 comm5000 ma1008 engl642 econ241 com333 math367 mis201 nbs-7041x meek16104 econ2003 comm1190 mbas902 comp-1027 dpst1091 comp7315 eppd1033 m06 ee3025 msci231 bb113/bbs1063 fc709 comp3425 comp9417 econ42915 cb9101 math1102e chme0017 fc307 mkt60104 5522usst litr1-uc6201.200 ee1102 cosc2803 math39512 omp9727 int2067/int5051 bsb151 mgt253 fc021 babs2202 mis2002s phya21 18-213 cege0012 mdia1002 math38032 mech5125 07 cisc102 mgx3110 cs240 11175 fin3020s eco3420 ictten622 comp9727 cpt111 de114102d mgm320h5s bafi1019 math21112 efim20036 mn-3503 fins5568 110.807 bcpm000028 info6030 bma0092 bcpm0054 math20212 ce335 cs365 cenv6141 ftec5580 math2010 ec3450 comm1170 ecmt1010 csci-ua.0480-003 econ12-200 ib3960 ectb60h3f cs247—assignment tk3163 ics3u ib3j80 comp20008 comp9334 eppd1063 acct2343 cct109 isys1055/3412 math350-real math2014 eec180 stat141b econ2101 msinm014/msing014/msing014b fit2004 comp643 bu1002 cm2030
联系我们
EMail: 99515681@qq.com
QQ: 99515681
留学生作业帮-留学生的知心伴侣!
工作时间:08:00-21:00
python代写
微信客服:codinghelp
站长地图