AI4CI logo CASA logo

Local Authority Typology: Methodology and Cluster Profiles

School Attainment Tool — Technical Report

Published

March 19, 2026

1 Overview

This report describes the construction of a Local Authority (LA) typology for secondary schools in England, developed as part of the School Attainment Tool. The typology groups England’s 152 local authorities into a small number of empirically derived clusters based on a wide range of educational, demographic, workforce, and structural indicators. The aim is to help contextualise school-level performance data — recognising that schools operate in very different local environments — and to support comparisons between similar LAs rather than the national average alone.

The analysis covers four academic years (2021–22 to 2024–25) and is built from school-level panel data linked to DfE KS4 performance tables, school workforce census data, Ofsted inspection outcomes, and pupil absence statistics.


2 Data Sources

The typology draws on four administrative datasets published by the Department for Education (DfE):

Dataset Source Years covered
KS4 Performance Tables DfE (via Explore Education Statistics) 2021–22 to 2024–25
School Workforce Census DfE (via Explore Education Statistics) 2021–22 to 2024–25
Pupil Absence Statistics DfE (via Explore Education Statistics API) 2021–22 to 2024–25
Ofsted School Inspections Ofsted (Official Statistics, Aug 2024 snapshot) Most recent inspection
EduBase (school register) DfE Get Information About Schools Oct 2024 snapshot

All data are at school level and are aggregated to local authority level for the purpose of clustering.


3 Indicator Construction

Over 200 LA-level indicators are constructed from the school-level panel. These are grouped into thematic domains.

3.1 Attainment Indicators

The primary attainment measure is the Attainment 8 (ATT8) score, which measures average performance across eight GCSE subjects (English, mathematics, three further EBacc subjects, and three open slots). Indicators include:

Indicator Description
att8_mean Mean ATT8 score across all schools in the LA (averaged across years)
att8_median Median ATT8 score
att8_sd Standard deviation of ATT8 — measures within-LA spread of school performance
att8_cv Coefficient of variation (SD/mean) — relative spread
att8_iqr Inter-quartile range of ATT8
att8_p25, att8_p75 25th and 75th percentile ATT8
att8_fsm_mean Mean ATT8 for disadvantaged pupils (FSM6 classification)
att8_nfsm_mean Mean ATT8 for non-disadvantaged pupils
att8_gap_mean Disadvantage gap — mean difference between non-FSM and FSM ATT8 scores
att8_gap_sd Within-LA variation in the disadvantage gap
gap_driver_index Gap driver index — z(FSM ATT8) minus z(non-FSM ATT8). Negative values indicate the gap is driven by low disadvantaged attainment; positive values indicate it is driven by high non-disadvantaged attainment (see Section 7.7)
att8_girls_mean, att8_boys_mean Mean ATT8 by gender
att8_gender_gap Gender gap in ATT8 (girls minus boys)
att8_eng_mean, att8_mat_mean Mean ATT8 in the English and Maths elements respectively
att8_eng_mat_diff Relative performance on English vs Maths within the LA
p8_mean Mean Progress 8 score — measures value-added relative to prior attainment
p8_fsm_mean, p8_nfsm_mean Progress 8 by disadvantage group
p8_gap_mean Gap in Progress 8 between non-FSM and FSM pupils

Attainment 8 is scored on a scale of approximately 0–90. The national average is around 46–48 points. Progress 8 is a value-added measure centred on zero; positive scores indicate above-average progress given prior attainment.

3.3 Pupil Composition

Indicator Description
pct_fsm_mean Mean % of pupils eligible for Free School Meals in last 6 years (FSM6) — principal disadvantage proxy
pct_fsm_sd, pct_fsm_cv Within-LA spread of FSM rates
pct_eal_mean Mean % of pupils with English as an Additional Language (EAL)
pct_eal_sd, pct_eal_cv Within-LA spread of EAL rates
pct_prior_lo_mean Mean % of KS4 pupils with low prior attainment (KS2)
pct_prior_hi_mean Mean % of KS4 pupils with high prior attainment (KS2)
prior_lo_hi_ratio Ratio of low to high prior attainment — measure of prior attainment composition
pct_sen_ehcp_mean Mean % of pupils with an Education, Health and Care Plan (EHCP)
pct_sen_support_mean Mean % of pupils receiving SEN support (without EHCP)
pct_fsm_ever_mean Mean % of pupils ever eligible for FSM (broader deprivation measure)
totpups_mean Mean total pupils on roll — a proxy for school size
totpups_sd, totpups_cv Within-LA spread of school sizes

FSM6 (Free School Meals in the last six years) is the DfE’s standard measure of disadvantage for performance accountability purposes. It is a cumulative measure — a pupil is counted if they were ever eligible for FSM in any of the previous six years.

3.4 Absence

Indicator Description
abs_overall_mean Mean overall absence rate (% of sessions missed)
abs_overall_sd, abs_overall_cv Within-LA spread of absence rates
abs_persist_mean Mean persistent absence rate (% of pupils missing 10%+ of sessions)
abs_persist_sd, abs_persist_cv Within-LA spread of persistent absence
abs_persist_to_overall_ratio Ratio of persistent to overall absence — indicates severity
abs_trend_slope Annual trend in overall absence rate
abs_persist_trend Annual trend in persistent absence rate

Persistent absence (PA) is defined as missing 10% or more of possible sessions in a school year. It is a key indicator of educational engagement and is used by DfE for school accountability.

3.5 Ofsted Inspection Outcomes

Indicator Description
ofsted_good_plus_pct % of schools rated Good or Outstanding
ofsted_outstanding_pct % of schools rated Outstanding
ofsted_good_pct % of schools rated Good
ofsted_ri_pct % of schools rated Requires Improvement
ofsted_inadequate_pct % of schools rated Inadequate
ofsted_below_good_pct % of schools below Good (RI + Inadequate)
ofsted_numeric_mean Mean numeric Ofsted grade (1=Outstanding to 4=Inadequate)
ofsted_numeric_sd SD of Ofsted grades — within-LA variation in inspection outcomes

Ofsted ratings are based on the most recent inspection outcome for each school as at the August 2024 snapshot.

3.6 Socioeconomic Segregation

A Gorard segregation index is computed at school level for each LA and year. This measures the degree to which FSM pupils are unevenly distributed across schools within the LA, relative to a hypothetical even distribution.

Indicator Description
gorard_mean Mean Gorard segregation index (0 = perfectly even, 1 = complete segregation)
gorard_sd Within-LA variation in segregation
gorard_trend Trend in segregation over time

The Gorard index for LA \(l\) in year \(t\) is computed as: \[G_{lt} = \frac{1}{2} \sum_{s \in l} \left| \frac{n_{s,\text{FSM}}}{N_{l,\text{FSM}}} - \frac{n_{s,\text{total}}}{N_{l,\text{total}}} \right|\] where \(n_{s,\text{FSM}}\) is the number of FSM pupils in school \(s\), \(N_{l,\text{FSM}}\) is the total number of FSM pupils in the LA, and equivalent total-pupil quantities. Higher values indicate greater concentration of disadvantaged pupils in particular schools.

3.7 School Type and Structural Composition

Indicator Description
pct_selective % of schools with selective admissions policy (grammar schools)
pct_comprehensive % of schools with comprehensive (non-selective) admissions
pct_academy % of schools that are academies (any type)
pct_converter_acad % converter academies (previously good schools that converted)
pct_sponsored_acad % sponsored academies (typically underperforming schools taken over by sponsor)
pct_free_school % free schools
pct_va_school % Voluntary Aided schools
pct_vc_school % Voluntary Controlled schools
pct_community % community schools (LA-maintained)
pct_religious % of schools with a religious character
pct_ce % Church of England schools
pct_rc % Roman Catholic schools

3.8 Workforce Indicators

Indicator Description
retention_mean Mean teacher retention (FTE remaining in the same school year-on-year)
retention_sd, retention_cv Within-LA spread of retention
retention_trend Annual trend in teacher retention
sickness_mean Mean average teacher sickness absence days per year
sickness_sd, sickness_cv Within-LA spread of sickness absence
sickness_trend Annual trend in teacher sickness absence
ptr_qual_mean Mean pupil-to-qualified-teacher ratio
ptr_qual_sd Within-LA spread of PTR
ptr_trend Annual trend in PTR
ptr_adult_mean Mean pupil-to-adult ratio (including teaching assistants)
pay_main_mean Mean % of teachers on main pay range
pay_upper_mean Mean % of teachers on upper pay range
leadership_pay_mean Mean % of teachers on leadership pay range
fte_ta_to_teacher_ratio Ratio of teaching assistant FTE to teacher FTE
fte_leadership_prop Leadership teachers as proportion of all teachers
pct_taking_sickness_mean Mean % of teachers taking any sickness absence

Teacher retention is measured as the FTE count of teachers remaining in the same school in the next year’s census, expressed as a proportion of the current year’s teacher count. Higher values = more stable workforce. The pupil-to-qualified-teacher ratio (PTR) measures class sizes relative to qualified teacher supply — higher values indicate more pupils per qualified teacher, which can signal workforce pressure.

3.9 Derived / Composite Indicators

Indicator Description
att8_between_school_sd SD of ATT8 across schools — within-LA inequality in attainment
fsm_att8_cor Within-LA correlation between FSM rate and ATT8 — how tightly linked are disadvantage and attainment
eal_att8_cor Within-LA correlation between EAL rate and ATT8
abs_att8_cor Within-LA correlation between absence rate and ATT8
retention_att8_cor Within-LA correlation between teacher retention and ATT8
avg_years_per_school Average number of data years available per school (churn indicator)

4 Clustering Methodology

4.1 Feature Selection and Pre-processing

All numeric LA-level indicators described in Section 3 are candidates for clustering. Three pre-processing filters are applied sequentially:

  1. Missing data filter — Variables with more than 30% missing values across LAs are excluded. This removes indicators that are sparsely populated (e.g. workforce variables for small LAs).

  2. Near-zero variance filter — Variables with a standard deviation below \(10^{-6}\) are excluded. These contribute no discriminating information.

  3. High collinearity filter — From any pair of variables with \(|r| > 0.97\) (Pearson correlation, using pairwise complete observations), one variable is dropped. This prevents near-duplicate features from dominating the distance metric.

After these filters, the remaining variables are z-score standardised (centred at the mean, scaled to unit variance) using parameters estimated from the full set of 152 LAs. This ensures that each variable contributes equally to the Euclidean distances used by k-means, regardless of its original scale.

Missing values remaining after filtering are imputed with each variable’s column median prior to scaling.

Feature selection summary
Stage Variable count
All numeric indicators ~210+
After >30% missing filter ~180+
After near-zero variance filter ~175+
After |r| > 0.97 collinearity filter ~123 (final)
Final variables used in clustering 123

4.2 Outlier LA Exclusion

A small number of LAs have an unusually low number of secondary schools and would otherwise form singleton clusters or distort the k-means solution. LAs with fewer than 5 unique schools in the panel are excluded from the initial clustering:

LAs excluded from initial k-means (assigned post-hoc)
Local Authority Reason
City of London Fewer than 5 schools in panel data
Isles Of Scilly Fewer than 5 schools in panel data

These LAs are not discarded — after the main clustering is complete, each is assigned to the nearest centroid using Euclidean distance in the scaled feature space.

4.3 Optimal k Selection

K-means was run on the main LA set across k = 5 to 10 clusters, with nstart = 100 random initialisations and iter.max = 300 iterations per run to ensure stable solutions. Three complementary criteria were evaluated:

  • Silhouette width — measures how similar each LA is to its own cluster compared to the nearest other cluster. Higher = better-separated clusters.
  • Calinski-Harabasz (CH) index — ratio of between-cluster variance to within-cluster variance. Higher = more compact and well-separated clusters.
  • WSS elbow — the within-cluster sum of squares. The “elbow” (point of fastest rate of improvement) indicates diminishing returns from adding more clusters.

The final k is chosen as the median of the k values preferred by each of the three criteria, clamped to the range [5, 10].

Cluster selection metrics across candidate k values
k WSS Silhouette Calinski-Harabasz WSS improvement
5 12840.2 0.0738 13.16
6 12292.0 0.0736 12.22 4.3%
7 11892.1 0.0737 11.26 3.3%
8 11513.6 0.0786 10.57 3.2%
9 11207.5 0.0652 9.92 2.7%
10 10936.7 0.0661 9.36 2.4%
k selection criteria
Criterion Preferred k
Best silhouette 8
Best Calinski-Harabasz 5
WSS elbow
**Chosen k (median)** **7**

4.4 Final Clustering

The final k-means solution used:

  • k = 7 clusters
  • nstart = 200, iter.max = 500 (for the final run, to maximise stability)
  • set.seed(42) for reproducibility
  • 151 LAs in the main clustering set; 2 outlier LA(s) assigned post-hoc

The solution explains 31.9% of total variance in the scaled feature matrix (between-cluster SS / total SS).

Final cluster sizes
Cluster Label Number of LAs
1 High EAL / Low Absence 29
2 High Attainment / Segregated 9
3 Segregated / Non-FSM Driven Gap 18
4 High Turnover / Low EAL 37
5 Stable Workforce / Academy-Heavy 27
6 Weak Ofsted / Persistent Absence 11
7 FSM-Driven Gap / Wide Gap 22

5 Radar Chart Variables

The Shiny application displays radar (spider) charts for each cluster and for individual LAs. Ten variables are shown on the radar, chosen to represent the key policy-relevant dimensions:

Radar chart variables
Variable Label Direction Description
att8_mean ATT8 ↑ Higher = better Mean Attainment 8 score
pct_fsm_mean % FSM ↓ Lower = better (inverted) % pupils eligible for Free School Meals (6-year)
abs_overall_mean Absence ↓ Lower = better (inverted) Overall absence rate (%)
ofsted_good_plus_pct Good+ Ofsted ↑ Higher = better % schools rated Good or Outstanding
retention_mean Retention ↑ Higher = better Teacher retention (FTE remaining in same school)
pct_eal_mean % EAL ↑ Higher = better % pupils with English as an Additional Language
att8_fsm_mean FSM ATT8 ↑ Higher = better Mean ATT8 for disadvantaged (FSM) pupils — higher = better outcomes for FSM pupils
gorard_mean Segregation ↓ Lower = better (inverted) Gorard FSM segregation index
sickness_mean Sickness ↓ Lower = better (inverted) Mean teacher sickness days per year
ptr_qual_mean PTR ↓ Lower = better (inverted) Pupil-to-qualified-teacher ratio

Radar values are expressed as z-scores relative to the England average (z = 0 = England mean). For variables where a lower value is better (absence, % FSM, segregation, sickness, PTR), the z-score is negated so that the outward direction on every axis consistently means “better than average.” Values are clipped to the range [−2.5, +2.5] and shifted to [0, 5] for display. The dotted ring at the midpoint represents the England average.

NoteWhy FSM ATT8 replaced the Disadvantage Gap on the radar

In an earlier version of the radar chart, the “Disadvantage Gap” axis was displayed (inverted, so that a narrower gap pointed outward). However, the raw gap conflates two very different situations: an LA might have a wide gap because its FSM pupils do poorly (a policy concern), or because its non-FSM pupils do exceptionally well (less concerning). By showing FSM ATT8 directly, the radar now reveals the actual attainment level of disadvantaged pupils — higher = outward = better. The raw gap, its decomposition (gap_driver_index), and both group-level means remain available in the data table and bar chart.

ImportantReading inverted variables on the radar

Five of the ten radar axes are inverted (marked ↓ in the table above): % FSM, Absence, Segregation, Sickness, and PTR. For these variables, a value that is worse than the England average will appear inside the dotted ring (closer to the centre), even though the underlying z-score is positive (above average).

The same logic applies to all inverted axes: higher absence, higher FSM rates, greater segregation, more sickness days, and larger pupil-to-teacher ratios all plot inward because they represent worse outcomes.


6 Cluster Profiles

6.1 Summary Comparison Table

The table below shows the mean value of each key indicator for each cluster, alongside the England-wide mean.

Cluster profile means on key indicators (England mean shown for reference)
Indicator C1 C2 C3 C4 C5 C6 C7 England
ATT8 mean 47.09 53.85 45.74 43.56 44.18 39.89 43.02 44.86
FSM ATT8 44.83 46.80 39.52 35.57 36.76 33.86 36.10 38.65
Non-FSM ATT8 54.29 56.65 49.97 48.21 48.73 46.16 51.11 50.44
Disadv. gap (pts) 9.46 9.85 10.45 12.64 11.97 12.30 15.01 11.79
Gap driver 0.24 0.01 0.31 -0.04 0.07 0.17 -0.73 0.00
Progress 8 0.29 0.21 -0.06 -0.15 -0.12 -0.45 0.05 -0.02
% FSM 35.50 18.90 27.60 25.60 30.90 38.70 19.70 28.40
% EAL 39.40 20.60 15.40 8.20 20.50 10.10 9.00 18.10
Absence (%) 7.60 7.70 9.10 9.30 9.30 10.30 9.10 8.90
Pers. absence (%) 22.50 21.60 26.70 27.20 27.00 30.60 25.70 25.90
Good+ Ofsted (%) 92.00 87.00 81.00 81.00 78.00 66.00 90.00 83.00
Gorard seg. 0.13 0.25 0.20 0.15 0.15 0.12 0.16 0.16
Retention (FTE) 57.25 57.92 52.75 46.16 60.16 56.14 56.79 54.43
Sickness (days) 7.47 7.46 8.50 8.61 8.56 9.20 7.73 8.23
PTR 16.69 18.06 17.39 17.36 17.59 17.04 17.36 17.30
% Selective 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
% Academy 0.50 0.80 0.70 0.70 0.80 0.70 0.50 0.70
ATT8 trend (pts/yr) -0.54 -0.49 -0.74 -0.90 -0.82 -0.92 -0.72 -0.75

6.2 All-Cluster Radar Comparison

The radar chart below overlays all 7 cluster profiles on a single plot. Each axis represents a z-score relative to the England average (dotted ring). Axes are oriented so that outward = better — for variables where lower is better (e.g. absence, % FSM, segregation, sickness, PTR), the z-score is inverted so that worse-than-average values appear inside the ring. The “FSM ATT8” axis shows disadvantaged pupil attainment directly (higher = outward = better).

Radar profiles for all clusters (outward = better on every axis)

6.3 Cluster Map

The map below shows the geographic distribution of each cluster across English local authorities. Each LA is coloured by its cluster assignment. Click on an LA for a summary popup.

6.4 Individual Cluster Pen Portraits

Each cluster was assigned a descriptive label and pen portrait automatically from the z-score profile. Labels are formed by combining the top two distinguishing features (those with |z| > 0.5 on the priority-ordered indicator list), so that each name captures the most policy-relevant characteristics of that group. Each portrait is accompanied by a radar chart showing that cluster’s profile against the England average.

NoteRadar chart polarity reminder

On every radar axis, outward = better. For inverted variables (% FSM, Absence, Segregation, Sickness, PTR), values that are worse than the England average appear inside the dotted ring — closer to the centre. The “FSM ATT8” axis directly shows disadvantaged pupil attainment (not inverted: higher = outward = better). See Section 5 for full details.

Cluster 1: High EAL / Low Absence

29 local authorities | Cluster 1 (29 local authorities)

ATT8: 47.1 | Gap: 9.5 (FSM: N/A / non-FSM: N/A) | FSM: 35.5% | Absence: 7.6% | Good+ Ofsted: 92% Attainment is above average (ATT8 = 47.1), with a narrower than average disadvantage gap (9.5 points). The narrower gap reflects non-disadvantaged pupils attaining closer to the national average (FSM ATT8: N/A, non-FSM ATT8: N/A). Schools serve higher levels of disadvantage than average (35.5% FSM), with substantially EAL pupils (39.4%). Overall absence is below the national average (7.6%), with persistent absence at 22.5%. Ofsted ratings are strong: 92% of schools are rated Good or Outstanding. There is lower than average FSM-based school segregation within LAs in this group (Gorard index: 0.135). Academy schools make up 50% of schools — lower than the national average. Workforce characteristics include lower teacher sickness absence (7.5 days).

Local authorities in this cluster:

Barking and Dagenham, Barnet, Brent, Camden, Ealing, Enfield, Greenwich, Hackney, Hammersmith and Fulham, Haringey, Harrow, Havering, Hillingdon, Hounslow, Islington, Kensington and Chelsea, Lambeth, Leicester, Lewisham, Manchester, Merton, Newham, Redbridge, Richmond upon Thames, Southwark, Tower Hamlets, Waltham Forest, Wandsworth, Westminster


Cluster 2: High Attainment / Segregated

9 local authorities | Cluster 2 (9 local authorities)

ATT8: 53.9 | Gap: 9.8 (FSM: N/A / non-FSM: N/A) | FSM: 18.9% | Absence: 7.7% | Good+ Ofsted: 87% Attainment is above average (ATT8 = 53.9), with a narrower than average disadvantage gap (9.8 points). Schools serve lower levels of disadvantage than average (18.9% FSM). Overall absence is below the national average (7.7%), with persistent absence at 21.6%. Ofsted ratings are strong: 87% of schools are rated Good or Outstanding. There is higher than average FSM-based school segregation within LAs in this group (Gorard index: 0.248). Academy schools make up 78% of schools — higher than the national average. Workforce characteristics include strong teacher retention (57.9 FTE) and lower teacher sickness absence (7.5 days).

Local authorities in this cluster:

Bexley, Bromley, Buckinghamshire, Kingston upon Thames, Slough, Southend-on-Sea, Sutton, Torbay, Trafford


Cluster 3: Segregated / Non-FSM Driven Gap

18 local authorities | Cluster 3 (18 local authorities)

ATT8: 45.7 | Gap: 10.4 (FSM: N/A / non-FSM: N/A) | FSM: 27.5% | Absence: 9.1% | Good+ Ofsted: 81% Attainment is broadly average (ATT8 = 45.7). The narrower gap reflects non-disadvantaged pupils attaining closer to the national average (FSM ATT8: N/A, non-FSM ATT8: N/A). Ofsted ratings are broadly typical: 81% of schools are rated Good or Outstanding. There is higher than average FSM-based school segregation within LAs in this group (Gorard index: 0.198).

Local authorities in this cluster:

Birmingham, Blackburn with Darwen, Bournemouth, Christchurch and Poole, Calderdale, Essex, Gloucestershire, Kent, Kirklees, Lancashire, Lincolnshire, Liverpool, Medway, Plymouth, Reading, Telford and Wrekin, Walsall, Warwickshire, Wirral


Cluster 4: High Turnover / Low EAL

37 local authorities | Cluster 4 (37 local authorities)

ATT8: 43.6 | Gap: 12.6 (FSM: N/A / non-FSM: N/A) | FSM: 25.6% | Absence: 9.3% | Good+ Ofsted: 81% Attainment is broadly average (ATT8 = 43.6). There are relatively few EAL pupils (8.2%). Ofsted ratings are broadly typical: 81% of schools are rated Good or Outstanding. Workforce characteristics include weaker teacher retention (46.2 FTE) and higher teacher sickness absence (8.6 days).

Local authorities in this cluster:

Bedford, Bury, Central Bedfordshire, Cornwall, County Durham, Croydon, Cumberland, Darlington, Derbyshire, Doncaster, Dudley, East Riding of Yorkshire, Herefordshire, County of, Isles Of Scilly, Leicestershire, Norfolk, North East Lincolnshire, North Lincolnshire, North Tyneside, North Yorkshire, Northumberland, Redcar and Cleveland, Salford, Sefton, Shropshire, Somerset, South Gloucestershire, Staffordshire, Stockton-on-Tees, Suffolk, Sunderland, Swindon, Tameside, Wakefield, Westmorland and Furness, Wigan, Worcestershire


Cluster 5: Stable Workforce / Academy-Heavy

27 local authorities | Cluster 5 (27 local authorities)

ATT8: 44.2 | Gap: 12 (FSM: N/A / non-FSM: N/A) | FSM: 30.9% | Absence: 9.3% | Good+ Ofsted: 78% Attainment is broadly average (ATT8 = 44.2). Overall absence is above the national average (9.3%), with persistent absence at 27%. Ofsted ratings are weaker than average: 78% of schools are rated Good or Outstanding. Academy schools make up 76% of schools — higher than the national average. Workforce characteristics include strong teacher retention (60.2 FTE).

Local authorities in this cluster:

Barnsley, Bolton, Bradford, Coventry, Derby, East Sussex, Gateshead, Kingston upon Hull, City of, Leeds, Luton, Milton Keynes, Newcastle upon Tyne, North Northamptonshire, North Somerset, Nottingham, Nottinghamshire, Oldham, Peterborough, Rotherham, Sandwell, Sheffield, Solihull, Southampton, Thurrock, Warrington, West Northamptonshire, Wolverhampton


Cluster 6: Weak Ofsted / Persistent Absence

11 local authorities | Cluster 6 (11 local authorities)

ATT8: 39.9 | Gap: 12.3 (FSM: N/A / non-FSM: N/A) | FSM: 38.7% | Absence: 10.3% | Good+ Ofsted: 66% Attainment is below average (ATT8 = 39.9), with a broadly average disadvantage gap (12.3 points). Schools serve higher levels of disadvantage than average (38.7% FSM), with relatively few EAL pupils (10.1%). Overall absence is above the national average (10.3%), with persistent absence at 30.5%. Ofsted ratings are weaker than average: 66% of schools are rated Good or Outstanding. There is lower than average FSM-based school segregation within LAs in this group (Gorard index: 0.122). Workforce characteristics include higher teacher sickness absence (9.2 days).

Local authorities in this cluster:

Blackpool, Halton, Hartlepool, Isle of Wight, Knowsley, Middlesbrough, Portsmouth, Rochdale, South Tyneside, St. Helens, Stoke-on-Trent


Cluster 7: FSM-Driven Gap / Wide Gap

22 local authorities | Cluster 7 (22 local authorities)

ATT8: 43 | Gap: 15 (FSM: N/A / non-FSM: N/A) | FSM: 19.7% | Absence: 9.1% | Good+ Ofsted: 90% Attainment is below average (ATT8 = 43), with a wider than average disadvantage gap (15 points). The gap is primarily driven by lower-than-average disadvantaged pupil attainment (FSM ATT8: N/A, non-FSM ATT8: N/A). Schools serve lower levels of disadvantage than average (19.7% FSM), with relatively few EAL pupils (9%). Ofsted ratings are strong: 90% of schools are rated Good or Outstanding. Academy schools make up 52% of schools — lower than the national average. Workforce characteristics include lower teacher sickness absence (7.7 days).

Local authorities in this cluster:

Bath and North East Somerset, Bracknell Forest, Brighton and Hove, Bristol, City of, Cambridgeshire, Cheshire East, Cheshire West and Chester, City of London, Devon, Dorset, Hampshire, Hertfordshire, Oxfordshire, Rutland, Stockport, Surrey, West Berkshire, West Sussex, Wiltshire, Windsor and Maidenhead, Wokingham, York



7 Methodological Notes and Caveats

7.1 Sensitivity to k

The choice of k = 7 reflects a balance between parsimony (a small, interpretable number of groups) and granularity (sufficient nuance to distinguish meaningfully different LA contexts). The cluster metrics indicate that silhouette scores plateau after k = 8, suggesting moderate cluster separation. Users should treat the typology as a descriptive tool rather than a definitive classification.

7.2 Outlier LAs

Authorities with very few schools (below the 5-school threshold) — in practice City of London and Isles Of Scilly — are excluded from the k-means optimisation to avoid them forming artefactual singleton clusters. They are assigned post-hoc to the nearest centroid. Their cluster membership should be interpreted cautiously.

7.3 Cross-year averaging

Indicators are averaged across all available years (2021–22 to 2024–25) before clustering. This means the typology reflects structural characteristics of LAs over the medium term rather than any single year. Year-on-year volatility is captured separately via trend and variability indicators (CV, SD, trend slopes).

7.4 Imputed data (2024–25)

For 2024–25, some pupil composition predictors (e.g. prior attainment, SEN rates) were not available at the time of data collection and were carried forward from 2023–24. Schools with imputed predictors are flagged in the panel data via the has_imputed_predictors column. This affects a small proportion of the KS4 cohort.

7.5 Ecological fallacy

All indicators are LA-level aggregates. Statements about clusters describe the average characteristics of LAs in that group, not individual schools or pupils. Considerable variation exists within each LA and within each cluster.

7.6 Workforce data availability

Some workforce indicators have higher rates of missingness for smaller LAs (particularly those with few schools). The >30% missing data filter removes the worst-affected variables, but residual missingness is handled by median imputation, which may compress variation in workforce-heavy clusters.

7.7 Gap Decomposition

The raw disadvantage gap (att8_gap_mean = non-FSM ATT8 minus FSM ATT8) captures the size of the gap but not its driver. Two LAs can have the same 15-point gap for very different reasons:

  • LA A: FSM pupils score 30 (well below average), non-FSM score 45 (average) — the gap is driven by low disadvantaged attainment, which is a significant policy concern.
  • LA B: FSM pupils score 42 (average), non-FSM score 57 (well above average) — the gap is driven by high non-disadvantaged attainment, a fundamentally different situation.

To capture this distinction, we compute a gap driver index:

\[\text{gap\_driver\_index}_l = z\!\left(\overline{\text{ATT8}}_{l,\,\text{FSM}}\right) - z\!\left(\overline{\text{ATT8}}_{l,\,\text{non-FSM}}\right)\]

where \(z(\cdot)\) denotes standardisation to a z-score against the England LA-level mean and standard deviation. The index is interpreted as follows:

Value Interpretation
Strongly negative (e.g. < −0.5) Gap driven by low disadvantaged attainment — FSM pupils score well below the national average for FSM pupils, while non-FSM pupils are closer to their average
Near zero Both groups contribute symmetrically to the gap
Strongly positive (e.g. > +0.5) Gap driven by high non-disadvantaged attainment — non-FSM pupils score well above average, while FSM pupils perform closer to the national average

The gap_driver_index enters the clustering pipeline as a numeric indicator (subject to the standard collinearity and missing-data filters). It also appears in the bar chart, map, LA profile panel, data table, and cluster pen portraits, alongside the raw gap and both group-level ATT8 means.

The radar chart now shows FSM ATT8 directly (replacing the inverted raw gap axis), giving a clear picture of how disadvantaged pupils are actually performing in each cluster rather than just the gap size.


8 Reproducibility

The typology is fully reproducible. To regenerate it from scratch:

# From the project root:
source("R/07_la_typology.R")

This requires data/panel_data.rds (produced by R/04_compute_derived.R) to be present. The script uses set.seed(42) throughout and saves all outputs to data/. Key parameters:

Parameter Value
Random seed 42
k search range 5 – 10
Final nstart 200
Final iter.max 500
Outlier threshold < 5 schools
Missing data threshold > 30% missing
Collinearity threshold |r| > 0.97
Missing imputation Column median

Report generated automatically from la_typology_report.qmd. Data: DfE 2021–22 to 2024–25.