Midterm 2 Study Guide
This midterm is about key probability concepts, as well as two variable relationships and linear modeling.
This is meant to be a representative sampling of the key concepts you will need to know, and it is not meant to be exhaustive. You should make sure that you are comfortable with Quizzes 4-7 and Homework Assignments 7-11.
- What graphs are appropriate when we want to examine the relationship between two variables? What types of variables does each apply to?
- What terms do we use to describe the relationship between two scalar variables?
- What does the correlation coefficient \(r\) measure?
- When we have a regression line fitted to some data, explain the following terms: predicted value, actual value, residual, sum of squared residuals.
- What is the key property that makes the “least squares regression line” special?
- What is the meaning of “r-squared” in the context of the least squares regression line?
- How does the residual plot work? What do we expect from it if we have a suitable fit?
- Outliers that are far in the x direction only and those that are far in the y direction only affect the least squares regression line in very different ways. Explain.
- What are the key characteristics of a random phenomenon?
- What can we say about the probabilities of outcomes in a random phenomenon / probability model? Can they be any numbers? There are some restrictions to them (both on what numbers they can be, and on what their sum must be).
- What is an event and how do we define its probability?
- What is the complement of an event? What is a formula for its probability?
- What is the union of two events, what is the intersection of two events? What rule must their probabilities obey?
- What is the definition of conditional probability? Include a formula and also a more meaningful verbal definition.
- What does the multiplicative rule say?
- When do we say that two events are independent of each other? Provide some examples both of events that are independent of each other and events that are not independent of each other.
- In general when do we tend to multiply probabilities? When do we tend to add them?
- When can I say that \(P(A\cap B) = P(A)P(B)\)?
- When can I say that \(P(A\cup B) = P(A) + P(B)\)?
- What are the different parts in a tree diagram / decision tree? What probabilities do we associate with each part?