Collecting Data (All In context(只能回答本问题,非广泛性))

SRS: Label -> Random number generator (Unique / Different)
Stratified -> Strata / Random Select from each stratas
Cluster -> Cluster / Select n from all clusters
Systematic -> Every 10th

Compare -> 夸一贬一

Bias ->

  • Volunteer
  • Response / non-response
  • Under coverage -> In context (Under / Over : Estimate)

Experiment

  • Factor (x)
  • Treatment: List not assigned
  • Response variable (Recorded)
  • Experiment unites (需要写数字): 380 Volunteers
Block

特征分类,相似
减少Confounding Variable

Blind

Researcher
Subject

1.Random

Balancing
The effect of other variable

2.Replicate

减少 By chance 结果

3.Control

Same Condition

4.Compare

Two or more treatment

Control Group

As a baseline to compare

Placebo

减少心理因素影响

Randomly assigned

Cause and effect

Randomly selected (X Volunteer) (Not Representative)

Inference for population

Exploring Data

Describe Distribution (SCSO) (In context)
抄问题

Probability and Sampling distribution

用 Binomial 先提及(解释用BINS)
Normal mu sigma
Binomial n p

E(x) = np
Sigma(x) = \sqrt(np(1-p))

出现probability先定义 (A是什么,B是什么)
算概率都要过程

Sampling Distribution
xbar mu sigma/sqrt(n) -> CLT(Central limit Theory) 无论总体分布是什么形状,只要样本容量足够大,样本均值的分布就会近似服从正态分布
phat p sqrt(p(1-p)/n)

过程要写出来

Inference

4 Steps

Interval / Test 写名字

Chi - Square

  • goodness of fit test (1 Cat Variable): H0: P_1 = ?, P_2 = ? / Same as claim
    (O - E)
  • Independence (2 Cat Variable) H0: Independent (No association)
  • Homogeneity (2 Cat Variable) H0: Proportion is same

Pvalue 定义: 得到一个 P (Statistic >= Value) =?

Type I / II
Consequence

Combine 2 or more Categories

Apply your Statistical skills to new contexts