Methodology
Five risk dimensions.
One composite score.
The Housing Financial Vulnerability Score (HFVS) aggregates five independently measured risk dimensions from KHS microdata into a single household-level index. Weights are grounded in actuarial precedent and validated via PCA loading structure.
Composite formula
Key Findings
County Risk Atlas
47 Counties Ranked by Composite HFVS
Model Performance
XGBoost achieves AUC 0.989
Four model architectures benchmarked against binary (high vulnerability) and continuous (HFVS score) targets. Gradient boosting methods dominate across all metrics.
Both LightGBM and XGBoost achieve AUC > 0.988 — near-perfect discrimination between vulnerable and non-vulnerable households. Logistic regression (AUC 0.983) confirms that even a linear model captures strong signal from the engineered features.
The near-perfect AUC reflects the highly structured nature of HFVS (it was constructed from the same features used in prediction) and validates the internal consistency of the vulnerability index.
SHAP Feature Importance
What drives vulnerability?
Mutual information scores from the XGBoost model identify which household characteristics most strongly determine HFVS. Savings behaviour and dwelling quality dominate — structural, not geographic, factors lead.
MI contribution by dimension
Mutual information (MI) measures the statistical dependency between each feature and the HFVS target, regardless of relationship type. MI = 0 means complete independence; higher values indicate stronger predictive signal. These scores complement SHAP values computed from the trained XGBoost model, which additionally capture interaction effects and directional contributions.
Household Risk Calculator
Estimate your HFVS score
Enter household characteristics to compute an indicative Housing Financial Vulnerability Score using the same methodology as the dissertation model.
Financial
Tenure
Physical Hazard
Dwelling & Utilities
Fill in the household characteristics and click Compute HFVS Score to see where this household sits in the national vulnerability distribution.