↳
Team A will win the game at the 7th match if and only if (1) it wins any 3 of the first 6 matches and (2) it wins the last match. Thus, [C(6,3)*p^3*(1-p)^3] * p = C(6,3)*p^4*(1-p)^3. Less
↳
C(6,3)*p^3*(1-p)^3*p
↳
P({A wins last match} AND {A wins 4 matches out of 7}) = // definition of conditional prob P({A wins last match} | {A wins 4 matches out of 7}) * P({A wins 4 matches out of 7}) The first element of this product is easy to compute: P({A wins last match} | {A wins 4 matches out of 7}) = 4/7 The second element of this product can be computed using a Binomial(prob=p, n=7) distribution, with k=4. P({A wins 4 matches out of 7}) = binom(7, 4) * p^4 * (1-p)^3 So overall, P({A wins last match} AND {A wins 4 matches out of 7}) = 4/7 * binom(7, 4) * p^4 * (1-p)^3 = binom(6, 3) * p^4 * (1-p)^3 That is, the final answer is: binom(6, 3) * p^4 * (1-p)^3 Less
↳
I will also follow to certain stage to see if I can learn something new out of this. If it keep continuing, I would seriously consider changing the job. Less
↳
I will present my case; if he doesn't agree with me, I will still follow him.
↳
repeated measure ANOVA?
↳
Paried T-test, ANOVA, Boostrap........
↳
The first one was by far the most challenging question, I did not get it correct. The second, third, and fourth were easy - but I ran out of time and did not completely answer the fourth. Less
↳
Coding in whiteboard or IDE or third party coding app?
↳
I would say that we should visualize the data first, getting a sense of it. How many attributes it contains? What are the ranges of the attributes? What is the data type for each attribute (ordinal, categorical, ratio, interval)? How might they be related? What are the predictors and how about the response(s)? These are typically the first thing I'd like to know when investigating a data set that I have no previous knowledge of. Less
↳
I said we should clean it up. Try to see which variables we can get rid by checking for collinearity. The interviewer's followup question was - what about even before this? I said "determine the goal as in figure out what we want to accomplish". She then asked "how we do this?" We just kept going in a circle with this question... Less
↳
bootstrap
↳
Technically MLE doesn't exist in this case if \bar{x}<5, but the restricted MLE max{5,\bar{x}} has asymptotic optimal properties and hence that's how the mean is generally estimated. Any standard mathematical statistics 1 course should cover this. Less
↳
Because of the decision boundary
↳
Because linear regression produces values beyond the range [0,1], and also because the underlying assumptions of equal and normally distributed error terms will be violated (errors will be distributed binary instead) Less
↳
I did not answer well.
↳
There are two ways to answer this type of statistical question. Both are valid and should be presented. A.) Enough cars to get everyone around. B.) Begin with a sample population and model the relative rate of car ownership, then extrapolate across the population accounting for a specific confidence interval. A supervised continuous model would be ideal, such as linear regression. A question like this isn't designed for an answer, it's designed to assess critical thinking. Less