CFA – Level 1 – Cheat Sheet – Quantitative Methods

CFA Level I — Quantitative Methods

2025 Program Curriculum · Volume 1 · Modules 1–11 · Complete Exam Reference

Returns
Statistics
Probability
Hypothesis Tests
Regression
Big Data

Module 1 — Interest Rates & Return Measures

Interest Rate ComponentsBuilding the Rate

r = Real risk-free rate + Inflation premium + Default risk premium + Liquidity premium + Maturity premium

Nominal risk-free (approx.)Real rate + Inflation premium

Exact nominal relation(1+nominal) = (1+real)(1+inflation)

Liquidity premiumCompensation for illiquid markets

Maturity premiumSensitivity to rate changes at longer maturities

T-bills proxy for the nominal risk-free rate in their country

Return FormulasKey Equations

Holding Period Return

R = (P₁ − P₀ + I₁) / P₀

Arithmetic Mean

R̄ = (1/T) Σ Rᵢₜ

Geometric Mean

R̄G = [∏(1+Rᵢₜ)]^(1/T) − 1

Harmonic Mean

X̄H = n / Σ(1/Xᵢ)

Always: Arithmetic ≥ Geometric ≥ Harmonic (unless all equal)

When to Use Which MeanMean Selection

ArithmeticSingle-period avg, no compounding

GeometricMulti-period compound growth

HarmonicRates, ratios, P/E, cost averaging

Trimmed/WinsorizedPresence of extreme outliers

Geometric is always ≤ arithmetic; gap grows with variance

Money-Weighted ReturnIRR Method

Σ CFₜ/(1+IRR)ᵗ = 0

Accounts for timing & size of cash flows
Investor inflows = outflows for fund (sign flip)
Two investors, same fund → different MWR if different cash flow timing
Excel: =IRR(values)

MWR ≠ appropriate for comparing portfolio managers — use TWR

Time-Weighted ReturnSub-period Link

Breaks portfolio into sub-periods at each cash flow date
Calculate HPR for each sub-period
Chain-link (compound) all sub-period returns
Take geometric mean of linked returns
Not sensitive to cash flow timing

TWR is preferred for evaluating portfolio managers

Annualizing ReturnsCompounding & Non-Annual

EAR = (1 + periodic rate)ᵐ − 1

Continuous compounding: EAR = eʳ − 1

Non-annual compoundingAdjust rate & periods (r/m, n×m)

Continuously compounded rln(1 + HPR)

Return TypesGross / Net / Real / Leveraged

Gross returnBefore mgmt & admin expenses

Net returnAfter all expenses — what investor earns

After-tax nominalTotal return minus taxes on div/int/gains

Real returnAdjusts for inflation (useful cross-period)

Leveraged returnAmplifies gains AND losses

Module 2 — Time Value of Money in Finance

Fixed Income TVMBond Pricing

PV = Σ PMTₜ/(1+r)ᵗ + FV/(1+r)ᴺ

Discount bondFV only at maturity

YTMIRR of all promised cash flows

Implied return (discount)r = (FV/PV)^(1/t) − 1

Perpetuity: PV = PMT / r

Equity TVMDividend Models

Constant div: PV = D / r

Constant growth (Gordon): PV = D(1+g)/(r−g)

Implied return: r = D₁/PV + g

Implied growth: g = r − D₁/PV

Two-stage modelHigh growth (gs) then stable (gl)

Higher g reduces denominator → higher PV; r must exceed g

Cash Flow AdditivityNo-Arbitrage

Identical future cash flows must have same PV today
Forward rates derived by equating PV of cash flows
FX forward: no-arbitrage rate from interest rate parity
Option pricing: replicating portfolio + no-arbitrage

Principle: Two instruments with identical cash flows must have identical prices

Module 3 — Statistical Measures of Asset Returns

Central TendencyMeasures of Location

MeanΣxᵢ/n — sensitive to outliers

MedianMiddle value — robust to outliers

ModeMost frequent value

Percentile / QuartileDivide sorted data into portions

IQRQ3 − Q1 (middle 50%)

Outlier fenceQ3 + 1.5×IQR (upper) / Q1 − 1.5×IQR (lower)

Box & whisker plot: box = IQR, line inside = median

Dispersion MeasuresRisk Quantification

MAD = Σ|Xᵢ − X̄| / n

s² = Σ(Xᵢ − X̄)² / (n−1)

s = √s² (same units as data)

Semideviation = √[Σ(Xᵢ−B)²/(n−1)] for Xᵢ ≤ B

CV = s / X̄ (risk per unit of return)

Use n−1 (degrees of freedom) for sample variance — ensures unbiased estimate

Distribution ShapeSkewness & Kurtosis

Positive skew (right)Mean > Median > Mode; right fat tail

Negative skew (left)Mean < Median < Mode; left fat tail

Excess kurtosis > 0Leptokurtic — fat tails (more extreme events)

Excess kurtosis < 0Platykurtic — thin tails

Normal distributionSkew = 0, excess kurtosis = 0

Most financial return series are negatively skewed with positive excess kurtosis (fat tails)

CorrelationLinear Relationship

Cov(X,Y) = Σ(Xᵢ−X̄)(Yᵢ−Ȳ) / (n−1)

r_XY = Cov(X,Y) / (sₓ · sᵧ)

Range−1 ≤ r ≤ +1

r = 0No linear relationship

Spurious correlationChance, third-variable, or calculation artifact

Correlation ≠ causation. Cannot measure nonlinear relationships.

Module 4 — Probability Trees & Conditional Expectations

Expected Value & VarianceProbability-Weighted

E(X) = Σ P(Xᵢ) · Xᵢ

σ²(X) = Σ P(Xᵢ) · [Xᵢ − E(X)]²

σ(X) = √σ²(X)

Conditional Expected Value

E(X|S) = Σ P(Xᵢ|S) · Xᵢ

Total Probability Rule

E(X) = E(X|S₁)P(S₁) + E(X|S₂)P(S₂) + …

Bayes’ FormulaUpdating Probabilities

P(Event|New Info) = [P(New Info|Event) / P(New Info)] × P(Event)

Rational method to update prior probabilities with new evidence
P(new info) = unconditional probability — calculated using total probability rule
Used in credit analysis, earnings revision, scenario updating

Probability tree: visualize scenarios → conditional expectations → weighted sum = E(X)

1Prior P

→

2New Evidence

→

3Posterior P

Module 5 — Portfolio Mathematics

Portfolio Return & Variance2-Asset Formulas

E(Rₚ) = w₁E(R₁) + w₂E(R₂)

σ²(Rₚ) = w₁²σ₁² + w₂²σ₂² + 2w₁w₂Cov(R₁,R₂)

Cov(R₁,R₂) = ρ₁₂ · σ₁ · σ₂

3-Asset Extension

σ²(Rₚ) = Σᵢ Σⱼ wᵢwⱼCov(Rᵢ,Rⱼ)

Add 2wᵢwⱼCov terms for each unique pair i≠j

Safety-First & Normal DistributionRoy’s Criterion

SFRatio = [E(Rₚ) − Rₗ] / σₚ

Choose portfolio with highest SFRatio
Rₗ = minimum acceptable (threshold) return
Shortfall risk = P(Rₚ < Rₗ) — minimize this
Higher E(R) and lower σ both reduce shortfall risk
Assumes normally distributed returns

SFRatio is the z-score distance from the threshold — higher is safer

Module 6 — Simulation Methods

Lognormal DistributionAsset Price Model

Asset prices are lognormally distributed (bounded at 0)
If ln(X) is normal → X is lognormal
Positively skewed, right-tailed

Continuously compounded r = ln(P₁/P₀)

Lognormal is appropriate for prices; normal for returns

Monte Carlo SimulationSimulation-Based

Generate thousands of random scenarios using specified distributions
Produces full distribution of outcomes (not just point estimate)
Used for option pricing, risk assessment, retirement planning
Requires specifying all input distributions and correlations

Garbage in → garbage out: model quality depends on distributional assumptions

BootstrappingResampling Method

Repeatedly draw samples with replacement from observed data
No distributional assumptions required
Empirical sampling distribution estimated from actual data
Useful when distributions are unknown or non-normal

Monte Carlo uses assumed distribution; bootstrapping uses actual data

Module 7 — Estimation & Inference

Sampling MethodsProbability vs Non-Probability

Simple randomEach member equally likely; can include luck

Stratified randomPopulation divided into strata; sample from each

ClusterClusters selected; all members within sampled

ConvenienceNon-prob; whatever is easiest (biased)

JudgementalNon-prob; expert selects sample

Non-probability samples may not represent population — selection bias risk

Central Limit TheoremSampling Distribution

For large n (≥30), sample means are approximately normally distributed, regardless of population distribution

Standard error of mean = s / √n

As n increasesStandard error decreases

Population σ knownUse z-distribution

Population σ unknownUse t-distribution (df = n−1)

CLT allows inference about means even when the parent distribution is non-normal

Module 8 — Hypothesis Testing

Hypothesis Test Framework6-Step Process

1State H₀ and Hₐ

2Select the appropriate test statistic (t, z, χ², F)

3Specify significance level α

4State the decision rule (critical values)

5Calculate the test statistic from sample data

6Make the decision (reject / fail to reject H₀)

Fail to reject H₀ ≠ H₀ is true. We never “accept” the null.

Error Types & PowerDecision Errors

Type I error (α)Reject true H₀ (false positive)

Type II error (β)Fail to reject false H₀ (false negative)

Power of test1 − β = P(reject false H₀)

p-valueSmallest α that rejects H₀ with calculated stat

Decreasing α (stricter) reduces Type I but increases Type II errors — trade-off

Reject H₀ if p-value < α, or if |test stat| > critical value

Test of Mean (t-test)Single & Difference

t = (X̄ − μ₀) / (s/√n), df = n−1

Two independent meansPooled variance t-test

Paired means (dependent)t on mean of differences d̄

Paired test more powerful — eliminates between-group variation

Test of VarianceChi-Square & F-test

χ² = (n−1)s² / σ₀², df = n−1

F = s₁² / s₂², df = n₁−1, n₂−1

χ² testSingle population variance

F-testEquality of two variances

Nonparametric TestsDistribution-Free

Used when data are non-normal or ordinal
Spearman rank correlation: nonparametric correlation test
Contingency table: chi-square test of independence
Less powerful than parametric tests when assumptions hold

Spearman rₛ = 1 − 6Σdᵢ²/[n(n²−1)]

Module 10 — Simple Linear Regression

Regression BasicsOLS Estimation

Ŷᵢ = b̂₀ + b̂₁Xᵢ + εᵢ

b̂₁ = Cov(X,Y) / Var(X)

b̂₀ = Ȳ − b̂₁X̄

b̂₁ interpretationChange in Ŷ per unit change in X

b̂₀ interpretationExpected Y when X = 0

R²Proportion of variation in Y explained by X

R² = r²_XY (simple regression)

CLRM AssumptionsRequired for Valid OLS

1Linearity: True relationship is linear in parameters

2Homoskedasticity: Constant variance of errors

3Independence: Errors are uncorrelated across observations

4Normality: Errors are normally distributed

Violations (heteroskedasticity, autocorrelation) distort standard errors and test statistics

ANOVA in RegressionPartitioning Variance

TSSTotal sum of squares (total variation)

RSSRegression SS (explained by model)

SSEError SS (unexplained)

TSS = RSS + SSE

R² = RSS / TSS

F = (RSS/1)/(SSE/(n−2))

Hypothesis Test — Coefficientst-test for Slope

t = b̂₁ / se(b̂₁), df = n−2

CI: b̂₁ ± t* × se(b̂₁)

H₀: β₁ = 0X has no linear effect on Y

F-test (overall) and t-test (slope) equivalent in simple regression

Functional FormsLog Transformations

Lin-LinY = b₀ + b₁X (standard)

Log-Linln(Y) = b₀ + b₁X (Y grows exponentially)

Lin-LogY = b₀ + b₁ln(X) (diminishing returns)

Log-Logln(Y) = b₀ + b₁ln(X) (elasticity = b₁)

Choose form based on economic rationale and residual analysis

Module 11 — Big Data Techniques & Fintech

Big Data CharacteristicsThe 3 Vs

VolumeEnormous size of datasets

VarietyStructured + unstructured (text, image, audio)

VelocitySpeed at which data is generated/processed

SourcesWeb, IoT, social media, financial transactions

Machine Learning TypesAI in Finance

SupervisedLabeled data → predict outcomes

UnsupervisedFind hidden patterns, clusters, no labels

Deep learningNeural networks with many layers

ReinforcementAgent learns via rewards/penalties

Overfitting: model learns noise, not signal — use cross-validation

Text Analytics & NLPUnstructured Data

Sentiment analysis of earnings calls, news
Information extraction from filings (10-K, 10-Q)
Bag-of-words: word frequency features
Named entity recognition (NER)
Applied to earnings surprises, risk signal detection

High-Frequency Exam Traps & Quick Reference

Common Exam TrapsWatch Out For These

Arithmetic vs GeometricMulti-period → always geometric

MWR vs TWRManager evaluation → always TWR

P-value decisionReject if p-value < α (not >)

Two-tailed critical valueSplit α/2 on each tail

Degrees of freedomn−1 for variance, n−2 for regression

“Fail to reject”Never “accept” H₀

Correlation vs CausationHigh r ≠ causal relationship

Positive skewMean > Median — right tail is longer

Quick Formula ReferenceMust-Know Numbers & Formulas

Normal: 68/95/99.7±1σ / ±2σ / ±3σ coverage

z critical (5%, two-tail)±1.96

z critical (1%, two-tail)±2.576

Arith × Harmonic= (Geometric)²

Roy’s SFR optimalMax [E(Rₚ) − Rₗ] / σₚ

CLT min sample sizen ≥ 30 (rule of thumb)

Spearman rₛ1 − 6Σdᵢ²/[n(n²−1)]

EAR from continuouseʳ − 1

The Master Rule: Choose your statistical tool based on what you know. Known population σ → z-test. Unknown σ (almost always) → t-test. Single variance → chi-square. Two variances → F-test. For returns, geometric mean captures actual investor experience across multiple periods; arithmetic mean is only appropriate for single-period expected values. Time-weighted return always benchmarks portfolio managers because it eliminates the distortion from external cash flows that are outside the manager’s control.