
Property Intelligence is designed to assist in the process of providing insurance quotes for customers in England, Wales and Scotland by supplying data on residential properties.

Property coverage:

  • Houses, bungalows and flats covered although the majority of testing has been on freehold properties i.e. typically houses and bungalows;

Geographic coverage:

  • England and Wales full coverage;
  • Scotland full coverage but reduced accuracy because of different policies for land and property registration and the publication of data;
  • Northern Ireland limited coverage, just includes estate agent data;
  • Isle of Man and Channel Islands not covered;

Property Intelligence is currently built on a quarterly basis, although this is subject to review. The underlying datasets have a range of update frequencies from monthly upwards. Updates are included in the build as they become available.

Database Fields

Utility fields

The database contains a set of utility fields and a set of feature fields. The utility fields are as follows:

  • UPRN - the Unique Property Reference Number originating from the Ordnance Survey
  • UDPRN - the Unique Delivery Point Reference Number originating from the Royal Mail
  • UMRRN - the Unique Multiple Residence Reference Number originating from the Royal Mail
  • Address1 - a standardised first line of address containing house name or number and street
  • Postcode - a full postcode
  • Easting - Ordnance Survey National Grid Easting
  • Northing - Ordnance Survey National Grid Northing
  • Latitude - latitude in ETRS89 converted from the easting and northing using OSTN02
  • Longitude - longitude in ETRS89 converted from the easting and northing using OSTN02
  • Output area code - Census 2011 Output Area (OA) code from the ONS Postcode Directory
  • Lower super output area code - Census 2011 Lower Super Output Area (LSOA) code from the ONS Postcode Directory
  • Output area code - Census 2021 Output Area (OA) code from the ONS Postcode Directory
  • Lower super output area code - Census 2021 Lower Super Output Area (LSOA) code from the ONS Postcode Directory
  • Country - one of England, Northern Ireland, Scotland, or Wales from the ONS Postcode Directory

Data items

The feature fields in the database are arranged in sets of three:

  • X - this is the data item of interest, for example a number of bedrooms for which X = “bedrooms”;
  • source_X - this is the source of the information, for example the number 2 indicates that this data item is sourced from the Land Registry;
  • p_X - this is a confidence score for the data ranging between 0 and 1. Confidence scores are calculated, where possible, as a “fraction correct” measure against a groundtruth dataset of 36,000 properties supplied by Simple and Open;
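
As an illustration of this naming convention, the sketch below reads one feature together with its source and confidence from a single record. It assumes a record is available as a Python dict keyed by field name. The helper name `read_feature`, the sample values and the 0.7 threshold are hypothetical and are not part of the product.

```python
from typing import Any, NamedTuple, Optional


class Feature(NamedTuple):
    value: Any                   # X, the data item itself
    source: Optional[int]        # source_X, a code from the data source lookup
    confidence: Optional[float]  # p_X, between 0 and 1


def read_feature(record: dict, name: str) -> Feature:
    """Return the (X, source_X, p_X) triple for the feature called `name`."""
    return Feature(
        value=record.get(name),
        source=record.get(f"source_{name}"),
        confidence=record.get(f"p_{name}"),
    )


# Hypothetical record; the values are made up for illustration.
record = {"bedrooms": "3", "source_bedrooms": 4, "p_bedrooms": 0.85}

bedrooms = read_feature(record, "bedrooms")

# Keep the value only if its confidence clears a caller-chosen threshold.
is_trusted = bedrooms.confidence is not None and bedrooms.confidence >= 0.7
```

Grouping the three fields in one object keeps the value from being used without its provenance and confidence to hand.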

The unique key to the database is the UDPRN / UMRRN pair supplied by Royal Mail. The UPRN is also supplied. The list of data items is as follows:

  • Property type - whether the property is semi-detached, detached, terraced or a flat
  • Number of floors - the estimated number of floors in the property based on the height of the building.
  • Number of bedrooms - the number of bedrooms in a property
  • Number of bathrooms - the number of bathrooms in a property
  • Number of rooms in total - the number of rooms excluding bathrooms and kitchens
  • Building construction period - the construction date of a building in one of the following periods: before 1719 (old), 1720-1839 (Georgian), 1840-1919 (Victorian/Edwardian), 1920-1945 (Inter-war), 1946-1979 (Post-war) or 1980 to date (Modern)
  • Year built - the year built, only available for those buildings in the Land Registry Price Paid data, built after 1995
  • Listed building - The grade of listing of a building, if it is listed, using data supplied by English Heritage, Cadw or Historic Scotland
  • Cadastral polygon area - the area of the cadastral parcel in which the building sits expressed in square metres using data from Land Registry
  • Height - the building height in metres
  • Building footprint (square metres) - the approximate footprint of the building expressed in square metres
  • Building volume (cubic metres) - the approximate volume of the building expressed in cubic metres
  • Average roof slope - the average slope of the property roof, can be used to identify properties with flat roofs
  • Flat roof fraction - the estimated fraction of a building which has a flat roof
  • Distance to tree - distance from the nearest tree over 10 metres tall to the property geocode
  • Geocode multiplicity - the number of property geocodes falling within the footprint of the building at 1.8 metres above ground level
  • Floor area (square metres) - the liveable floor area in square metres
  • Last transaction price - the price paid at the last transaction recorded by the Land Registry (England and Wales only, back to 1995)
  • Last transaction date - the date of the last transaction recorded by the Land Registry (England and Wales only, back to 1995)
  • Last transaction duration type - the duration type of the last transaction recorded by the Land Registry (England and Wales only, back to 1995)
  • Estimated current value - estimated current value based on data from Land Registry (England and Wales only, back to 1995)
  • Number of transactions - the number of transactions recorded by the Land Registry (England and Wales only, back to 1995)
  • Estimated council tax band - estimated council tax from price at reference years using Land Registry data (England and Wales only, back to 1995)
  • Within 200 metres of watercourse - flag indicating whether there is a watercourse within 200 metres
  • Distance to watercourse (within 200 metres) - distance (in metres) to a watercourse, if it is within 200 metres
  • Distance to road - the distance to the centre line of the nearest road from the property geocode, not necessarily accessible
  • Road class - road class, as provided by Ordnance Survey
  • Business usage - a flag indicating potential business usage
  • Planning classification - planning classification as per Town and Country Planning (Use Classes) Order 1987 for non-domestic properties
  • Congestion zone - a flag indicating if a property is in the London Congestion Zone
  • Burglary rate - the number of burglaries per property per year averaged over a LSOA (England and Wales only)
  • Storey on which flat sits - storey on which a flat sits. This typically contains N/A where it is not available or applicable or a number which may have been derived from a model based on the text found in the original data source
  • Is top floor flat? - a flag indicating whether a flat is on the top floor of the building
  • Number of extensions - the number of extensions to a property, typically 1 but up to 4
  • Wall type - the type of wall used in construction, possible values cavity wall, solid brick, sandstone, granite, timber frame, system built and SAP05
  • Main central heating fuel - Main central heating fuel, possible values include gas, electricity, oil, coal, LPG, wood, B30K (a biofuel mix) and also ‘not known’ and ‘none’
  • Type of tenure - type of tenure: owner-occupier, rented or social housing
  • Energy rating - Energy rating as indicated in the EPC Energy Certificate
  • EPC Inspection Date - Inspection date indicated in the EPC Energy Certificate
  • Multi-residential property - a flag to identify properties that are multi-residential

Technical details

Technical details for each of these fields are shown in the table below:

| Title | Field name | Data type |
| --- | --- | --- |
| UPRN | UPRN | Integer |
| UDPRN | UDPRN | Integer |
| UMRRN | UMRRN | Integer |
| Address1 | address1 | Text |
| Postcode | postcode | Text |
| Easting | easting | Float |
| Northing | northing | Float |
| Latitude | latitude | Float |
| Longitude | longitude | Float |
| Output area code | OA11CD | Text |
| Lower super output area code | LSOA11CD | Text |
| Output area code | OA21CD | Text |
| Lower super output area code | LSOA21CD | Text |
| Country | country | Text |
| Property type | property_type | Lookup |
| Number of floors | floors | Text |
| Number of bedrooms | bedrooms | Text |
| Number of bathrooms | bathrooms | Text |
| Number of rooms in total | total_rooms | Text |
| Building construction period | age | Lookup |
| Year built | year_built | Text |
| Listed building | listed | Lookup |
| Cadastral polygon area | cadastral | Text |
| Height | height | Text |
| Building footprint (square metres) | footprint | Text |
| Building volume (cubic metres) | volume | Text |
| Average roof slope | avg_roof_slope | Text |
| Flat roof fraction | flat_roof_fraction | Text |
| Distance to tree | distance_to_tree | Text |
| Geocode multiplicity | geocode_multiplicity | Text |
| Floor area (square metres) | floor_area | Text |
| Last transaction price | last_transaction_price | Text |
| Last transaction date | last_transaction_date | Text |
| Last transaction duration type | last_transaction_duration_type | Lookup |
| Estimated current value | est_current_value | Text |
| Number of transactions | n_transactions | Text |
| Estimated council tax band | est_council_tax | Lookup |
| Within 200 metres of watercourse | watercourse_200M | Lookup |
| Distance to watercourse (within 200 metres) | distance_to_water | Text |
| Distance to road | distance_to_road | Text |
| Road class | road_class | Lookup |
| Business usage | business_usage | Lookup |
| Planning classification | planning_classification | Lookup |
| Congestion zone | congestion_zone | Lookup |
| Burglary rate | burglary_rate | Text |
| Storey on which flat sits | flat_floor | Text |
| Is top floor flat? | top_floor_flat | Lookup |
| Number of extensions | extensions | Text |
| Wall type | wall_type | Lookup |
| Main central heating fuel | main_fuel | Lookup |
| Type of tenure | tenure | Lookup |
| Energy rating | energy_rating | Lookup |
| EPC Inspection Date | epc_inspection_date | Text |
| Multi-residential property | is_multires | Text |

Table 1: Technical details for each utility and data field. Lookup fields contain non-negative integers (starting from zero). The source_X fields are lookup fields and the p_X fields are number fields.
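
Many numeric items in Table 1, such as bedrooms, height and floor_area, are typed as Text, so a consumer will usually need to convert them before doing arithmetic. The sketch below shows one cautious way to do that. It assumes that a missing value appears as an empty string or as "N/A"; the document only describes "N/A" for the flat_floor field, so treat that as an assumption for other fields. The function name `parse_number` is hypothetical.

```python
from typing import Optional


def parse_number(text: Optional[str]) -> Optional[float]:
    """Convert a Text field to a float, or return None if it holds no number."""
    if text is None:
        return None
    cleaned = text.strip()
    # Assumed encodings for "no value"; only "N/A" is documented (for flat_floor).
    if cleaned == "" or cleaned.upper() == "N/A":
        return None
    try:
        return float(cleaned)
    except ValueError:
        # Unexpected free text: report it as missing rather than failing.
        return None
```

For example, `parse_number("3")` gives `3.0`, while `parse_number("N/A")` and `parse_number("ground")` both give `None`.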

Lookup tables

Tables 2-16 are the lookup tables relating the numbers found in the database fields to descriptions, for example for the property type, property age, Council Tax band and data source. The Yes/No lookup is used for the ‘watercourse 200M’, ‘congestion zone’ and ‘top floor flat’ fields.

Yes/no lookup

| Description | Value |
| --- | --- |
| No | 0 |
| Yes | 1 |

Table 2: Yes/no lookup

Property type lookup

| Description | Value |
| --- | --- |
| Detached | 0 |
| Semi-detached | 1 |
| Terraced | 2 |
| Flat | 3 |
| Unknown | 4 |

Table 3: Property type lookup

Property age lookup

| Description | Value |
| --- | --- |
| Before 1719 (old) | 0 |
| 1720-1839 (Georgian) | 1 |
| 1840-1919 (Victorian/Edwardian) | 2 |
| 1920-1945 (Inter-war) | 3 |
| 1946-1979 (Post-war) | 4 |
| 1980 to date (Modern) | 5 |
| Not known | 6 |

Table 4: Property age lookup
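
Lookup fields store the integer, not the description, so a consumer has to translate codes back into text. The sketch below does this for the property type and property age lookups, with the mappings copied from Tables 3 and 4. The `decode` helper and its fallback text are hypothetical.

```python
# Mappings copied from Table 3 (property type) and Table 4 (property age).
PROPERTY_TYPE = {
    0: "Detached",
    1: "Semi-detached",
    2: "Terraced",
    3: "Flat",
    4: "Unknown",
}

PROPERTY_AGE = {
    0: "Before 1719 (old)",
    1: "1720-1839 (Georgian)",
    2: "1840-1919 (Victorian/Edwardian)",
    3: "1920-1945 (Inter-war)",
    4: "1946-1979 (Post-war)",
    5: "1980 to date (Modern)",
    6: "Not known",
}


def decode(lookup: dict, code: int, default: str = "Unrecognised code") -> str:
    """Translate a lookup code into its description, with a safe fallback."""
    return lookup.get(code, default)


property_type_label = decode(PROPERTY_TYPE, 3)
age_label = decode(PROPERTY_AGE, 2)
```

The explicit fallback matters because some lookups are not contiguous, so an unexpected code should be reported rather than raise an error mid-batch.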

Council Tax lookup

| Description | Value |
| --- | --- |
| A | 0 |
| B | 1 |
| C | 2 |
| D | 3 |
| E | 4 |
| F | 5 |
| G | 6 |
| H | 7 |
| I | 8 |
| N/A | 100 |

Table 5: Council tax band lookup

Data source lookup

| Description | Value |
| --- | --- |
| Default | 0 |
| Land Registry | 2 |
| Historic England | 3 |
| Estate agent | 4 |
| LIDAR | 7 |
| NROSH multipart | 8 |
| NROSH snapshot | 9 |
| VOA | 12 |
| Heuristic | 14 |
| ML (age) | 15 |
| Naive Bayes (age) | 17 |
| Banded VOA | 18 |
| ML (bedrooms) | 19 |
| VOA (Council Tax) | 20 |
| OS Open Rivers | 21 |
| NB (bedrooms) | 24 |
| OS Open Map | 25 |
| Transport for London | 28 |
| police.uk | 29 |
| Flats modeller | 30 |
| Cadw | 33 |
| Historic Environment Scotland | 34 |
| OS Open Roads | 35 |
| Royal Mail | 36 |
| DCLG | 37 |
| DCLG non-domestic | 38 |
| Prefix flat floor modeller | 42 |
| Flats per floor modeller | 43 |
| Nearest neighbour modeller | 44 |
| DCLG Scotland | 45 |
| DCLG Scotland non-domestic | 46 |
| Financial Services | 48 |

Table 6: Data source lookup

Business usage lookup

| Description | Value |
| --- | --- |
| Domestic | 0 |
| Business | 1 |

Table 7: Business usage lookup

Main fuel lookup

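| Description | Value |
| --- | --- |
| Gas | 0 |
| Electricity | 1 |
| Oil | 2 |
| Not known | 3 |
| Coal | 4 |
| LPG | 5 |
| Wood | 6 |
| None | 7 |
| B30K | 8 |
| Other | 9 |
| Biomass/Biogas | 10 |
| District heating | 11 |
| Waste heat | 12 |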

Table 8: Main fuel lookup

Wall type lookup

| Description | Value |
| --- | --- |
| Cavity wall | 0 |
| Solid brick | 1 |
| Sandstone | 2 |
| Timber frame | 3 |
| Granite | 4 |
| System built | 5 |
| SAP05 | 6 |
| Not known | 7 |

Table 9: Wall type lookup

Planning classification lookup

| Description | Value |
| --- | --- |
| Not known | 0 |
| A1/A2 Retail and Financial/Professional services | 1 |
| A3/A4/A5 Restaurant and Cafes/Drinking Establishments and Hot Food takeaways | 2 |
| B1 Offices and Workshop businesses | 3 |
| B2 to B7 General Industrial and Special Industrial Groups | 4 |
| B8 Storage or Distribution | 5 |
| C1 Hotels | 6 |
| C2 Residential Institutions - Hospitals and Care Homes | 7 |
| C2 Residential Institutions - Residential schools | 8 |
| C2 Residential Institutions - Universities and colleges | 9 |
| C2A Secure Residential Institutions | 10 |
| C3 - Dwelling houses | 11 |
| D1 Non-residential Institutions - Community/Day Centre | 12 |
| D1 Non-residential Institutions - Crown and County Courts | 13 |
| D1 Non-residential Institutions - Education | 14 |
| D1 Non-residential Institutions - Libraries Museums and Galleries | 15 |
| D1 Non-residential Institutions - Primary Health Care Building | 16 |
| D2 General Assembly and Leisure plus Night Clubs and Theatres | 17 |
| Others - Passenger terminals | 18 |
| Others - Emergency services | 19 |
| Others - Miscellaneous 24hr activities | 21 |
| Others - Car Parks 24 hrs | 22 |
| Others - Stand alone utility block | 23 |
| Others - Telephone exchanges | 24 |
| Sui generis | 25 |

Table 10: Planning classification lookup

Road class lookup

| Description | Value |
| --- | --- |
| Unclassified | 0 |
| Not classified | 1 |
| Classified unnumbered | 2 |
| B Road | 3 |
| A Road | 4 |
| Motorway | 5 |
| Unknown | 6 |

Table 11: Road class lookup

Listed building grade lookup

| Description | Value |
| --- | --- |
| Not listed | 0 |
| I or A | 1 |
| II* or B | 2 |
| II or C | 3 |

Table 12: Listed building grade lookup
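
Each listed building code covers two grading schemes. According to the April 2019 release note, grades I, II* and II apply in England or Wales and grades A, B and C apply in Scotland. The sketch below picks the right label using the country utility field. The function name `listed_grade` is hypothetical, and falling back to the combined label for any other country value is an assumption.

```python
# Code-to-grade mappings derived from Table 12 and the April 2019 release note.
ENGLAND_WALES_GRADES = {1: "I", 2: "II*", 3: "II"}
SCOTLAND_GRADES = {1: "A", 2: "B", 3: "C"}
COMBINED_GRADES = {1: "I or A", 2: "II* or B", 3: "II or C"}


def listed_grade(code: int, country: str) -> str:
    """Return the listing grade label appropriate to the property's country."""
    if code == 0:
        return "Not listed"
    if country in ("England", "Wales"):
        grades = ENGLAND_WALES_GRADES
    elif country == "Scotland":
        grades = SCOTLAND_GRADES
    else:
        # Assumption: keep the ambiguous combined label for other countries.
        grades = COMBINED_GRADES
    return grades.get(code, "Unrecognised code")
```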

Tenure lookup

| Description | Value |
| --- | --- |
| Owner-occupier | 0 |
| Rented | 1 |
| Social | 2 |

Table 13: Tenure lookup

Energy rating lookup

| Description | Value |
| --- | --- |
| A | 0 |
| B | 1 |
| C | 2 |
| D | 3 |
| E | 4 |
| F | 5 |
| G | 6 |

Table 14: Energy rating lookup

Last transaction duration type lookup

| Description | Value |
| --- | --- |
| Not known | 0 |
| Freehold | 1 |
| Leasehold | 2 |

Table 15: Last transaction duration type lookup

Multi-residential lookup

| Description | Value |
| --- | --- |
| No | 0 |
| Yes | 1 |

Table 16: Multi-residential lookup

Accuracy

Accuracy for the tested fields calculated using 2025-02_groundtruth on 2025-03-13 00:52:51 against 32233 properties is shown in the table below.

| Field | Accuracy (%) |
| --- | --- |
| Number of bedrooms | 71.3 |
| Number of bathrooms | 77.4 |
| Building construction period | 68.6 |
| Property type | 82.1 |
| Number of floors | 89.9 |

Table 17: Summary accuracy for fields, measured against ‘groundtruth’ properties in England and Wales, excluding flats
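
The accuracy figures are described elsewhere in this document as a “fraction correct” measure against groundtruth. The sketch below shows that measure in its simplest form: the share of properties whose database value equals the groundtruth value. The function name and sample values are hypothetical, and the real evaluation may apply matching rules that are not documented here.

```python
from typing import Sequence


def fraction_correct(predicted: Sequence, truth: Sequence) -> float:
    """Share of positions where the predicted value equals the groundtruth value."""
    if len(predicted) != len(truth):
        raise ValueError("predicted and truth must have the same length")
    if len(truth) == 0:
        raise ValueError("at least one property is required")
    matches = sum(1 for p, t in zip(predicted, truth) if p == t)
    return matches / len(truth)


# Made-up bedroom counts for five properties.
predicted = [3, 2, 4, 3, 1]
truth = [3, 2, 3, 3, 2]

accuracy_pct = round(100 * fraction_correct(predicted, truth), 1)  # 3 of 5 match
```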

Coverage

The following tables show coverage and accuracy by data source for bedrooms, bathrooms, age, property type and number of floors, along with the confidence assigned to these attributes, based on measurements against the 33,000 property groundtruth dataset covering England and Wales.

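| Source | Coverage | Accuracy | Confidence |
| --- | --- | --- | --- |
| DCLG | 0.119 | 0.691 | 0.700 |
| Default | 0.032 | 0.407 | 0.500 |
| Estate agent | 0.463 | 0.790 | 0.850 |
| Flats modeller | 0.003 | 0.213 | 0.600 |
| NB (bedrooms) | 0.378 | 0.655 | 0.640 |
| NROSH multipart | 0.005 | 0.729 | 0.800 |
| NROSH snapshot | 0.000 | 1.000 | 0.800 |
| Overall | 1.000 | 0.713 | 0.741 |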

Table 18: Accuracy and coverage for bedrooms
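
One way to read the Overall row is as a coverage-weighted mean of the per-source rows. The document does not state how Overall is calculated, so this is an assumption, but for the bedrooms table it reproduces the published figures to within rounding (about 0.713 for accuracy and 0.741 for confidence). The figures are copied from the table above and the function name is hypothetical.

```python
# (coverage, accuracy, confidence) per source, copied from the bedrooms table.
BEDROOM_SOURCES = {
    "DCLG": (0.119, 0.691, 0.700),
    "Default": (0.032, 0.407, 0.500),
    "Estate agent": (0.463, 0.790, 0.850),
    "Flats modeller": (0.003, 0.213, 0.600),
    "NB (bedrooms)": (0.378, 0.655, 0.640),
    "NROSH multipart": (0.005, 0.729, 0.800),
    "NROSH snapshot": (0.000, 1.000, 0.800),
}


def coverage_weighted_mean(rows: dict, column: int) -> float:
    """Average one column (1 = accuracy, 2 = confidence), weighted by coverage."""
    total_coverage = sum(row[0] for row in rows.values())
    weighted_sum = sum(row[0] * row[column] for row in rows.values())
    return weighted_sum / total_coverage


overall_accuracy = coverage_weighted_mean(BEDROOM_SOURCES, 1)
overall_confidence = coverage_weighted_mean(BEDROOM_SOURCES, 2)
```

The same calculation applied to the bathrooms table gives an accuracy of about 0.759 rather than the published 0.774, so the Overall rows are evidently not always derived this way.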

| Source | Coverage | Accuracy | Confidence |
| --- | --- | --- | --- |
| Default | 0.560 | 0.800 | 0.730 |
| Estate agent | 0.440 | 0.707 | 0.760 |
| Overall | 1.000 | 0.774 | 0.743 |

Table 19: Accuracy and coverage for bathrooms

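| Source | Coverage | Accuracy | Confidence |
| --- | --- | --- | --- |
| Cadw | 0.000 | 0.667 | 0.630 |
| DCLG | 0.442 | 0.772 | 0.650 |
| DCLG non-domestic | 0.000 | 0.333 | 0.250 |
| Default | 0.010 | 0.500 | 0.470 |
| Heuristic | 0.012 | 0.420 | 0.600 |
| Historic England | 0.004 | 0.532 | 0.540 |
| Land Registry | 0.020 | 0.936 | 0.950 |
| Naive Bayes (age) | 0.303 | 0.718 | 0.721 |
| Overall | 1.000 | 0.686 | 0.637 |
| VOA | 0.209 | 0.461 | 0.469 |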

Table 20: Accuracy and coverage for age

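| Source | Coverage | Accuracy | Confidence |
| --- | --- | --- | --- |
| Banded VOA | 0.054 | 0.626 | 0.595 |
| DCLG | 0.070 | 0.914 | 0.900 |
| Default | 0.001 | 0.375 | 0.540 |
| Estate agent | 0.690 | 0.849 | 0.880 |
| LIDAR | 0.183 | 0.745 | 0.800 |
| Land Registry | 0.000 | 0.000 | 0.920 |
| NROSH multipart | 0.001 | 0.704 | 0.800 |
| NROSH snapshot | 0.000 | 0.000 | 0.800 |
| Overall | 1.000 | 0.821 | 0.851 |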

Table 21: Accuracy and coverage for property_type

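| Source | Coverage | Accuracy | Confidence |
| --- | --- | --- | --- |
| Banded VOA | 0.118 | 0.825 | 0.811 |
| DCLG | 0.104 | 0.960 | 0.940 |
| Default | 0.003 | 0.738 | 0.840 |
| LIDAR | 0.775 | 0.903 | 0.900 |
| Overall | 1.000 | 0.899 | 0.893 |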

Table 22: Accuracy and coverage for floors

Attribute distribution charts

The following charts show the distribution of values for selected fields, for domestic properties, not arising from the default model.

Figure 1: Distribution of property type

Figure 2: Distribution of number of bedrooms

Figure 3: Distribution of number of bathrooms

Figure 4: Distribution of building construction period

Figure 5: Distribution of number of floors

Direct data content

The following table shows the coverage with direct data for the five fields tested against groundtruth.

| Attribute | Percentage direct |
| --- | --- |
| Property type | 88.5 |
| Floors | 75.1 |
| Bedrooms | 59.8 |
| Bathrooms | 36.6 |
| Age | 60.0 |

Table 23: Percentage of data supplied from direct sources rather than modelled

Data recency

Data recency for the Property Intelligence dataset is determined by a number of factors, listed below:

  • The build process for Property Intelligence takes approximately 2 months from start to delivery to customer with quarterly scheduled releases;
  • Individual datasets have a range of update frequencies, some are static and will never be updated, others are yearly, quarterly or monthly;
  • Two datasets, EPC (formerly DCLG) and Estate agent data, have property-level fields which indicate when an inspection was carried out so potentially day-level data on recency could be provided;
  • The LIDAR data is a composite dataset, 80% of which has been collected in the last 10 years;

The table below shows the dates of the datasets used in this version of Property Intelligence along with an indication of the expected update frequency.

| Dataset | Frequency | Date |
| --- | --- | --- |
| Congestion Zone | Once | None |
| DCLG | Quarterly | 2025-03-11 |
| DCLG Scotland | Quarterly | 2024-12-04 |
| ONS Postcode to LSOA/LA lookup | Quarterly | 2025-03-03 |
| Land Registry House Price Index | Monthly | 2024-12-01 |
| Land Registry Cadastral Polygons | Quarterly | 2025-03-02 |
| Land Registry Price Paid | Monthly | 2025-03-03 |
| English Heritage | Yearly | 2024 |
| Historic Environment Scotland | Yearly | 2024 |
| Cadw | Yearly | 2024 |
| NROSH | Once | 2016-12-12 |
| ONSPD | Quarterly | 2025-03-03 |
| ONS rural-urban classification | Once | 2016-12-12 |
| OS Open UPRN | Quarterly | 2025-02-01 |
| OS Open Rivers | Quarterly | 2024-10-01 |
| OS Open Roads | Quarterly | 2024-10-01 |
| Police.uk | Monthly | 2024-12-01 |
| Royal Mail | Monthly | 2025-03-03 |
| VOA | Yearly | 2024 |
| Estate Agent | Monthly | 2025-03-04 |

Table 24: Data recency and frequency by dataset
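
The dates in Table 24 come in three shapes: a full date, a bare year and "None". The sketch below turns them into an approximate age in days relative to a reference date. Treating a bare year as 1 January of that year is an assumption that deliberately overstates the age, and the function names are hypothetical. The reference date used here is the accuracy run date quoted in the Accuracy section.

```python
from datetime import date
from typing import Optional


def parse_dataset_date(text: str) -> Optional[date]:
    """Parse a Table 24 date: 'YYYY-MM-DD', a bare 'YYYY', or 'None'."""
    if text == "None":
        return None
    if len(text) == 4 and text.isdigit():
        # Assumption: a bare year is read as 1 January of that year.
        return date(int(text), 1, 1)
    return date.fromisoformat(text)


def age_in_days(text: str, reference: date) -> Optional[int]:
    """Days between the dataset date and the reference date, if a date is known."""
    dataset_date = parse_dataset_date(text)
    if dataset_date is None:
        return None
    return (reference - dataset_date).days


reference = date(2025, 3, 13)

royal_mail_age = age_in_days("2025-03-03", reference)  # monthly dataset
voa_age = age_in_days("2024", reference)               # yearly dataset, bare year
congestion_age = age_in_days("None", reference)        # no date recorded
```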

The Environment Agency started to systematically cover England for LIDAR measurement in about 2005 and has added, very approximately, 5% coverage in each year since then.

Figure 6: Cumulative percentage of LIDAR coverage
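
Read literally, the rule of thumb above implies roughly linear growth in cumulative coverage. The sketch below expresses that as a back-of-envelope model, with the 2005 start year and 5% annual rate taken from the text and capped at 100%. It is an illustration of the stated approximation only, not a substitute for the measured figures, and the function name is hypothetical.

```python
def approx_lidar_coverage(
    year: int, start_year: int = 2005, rate_per_year: float = 5.0
) -> float:
    """Approximate cumulative LIDAR coverage of England, in percent."""
    years_elapsed = max(0, year - start_year)
    return min(100.0, rate_per_year * years_elapsed)


coverage_2015 = approx_lidar_coverage(2015)  # ten years at about 5% per year
```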

Attributions

This dataset contains Open Data typically provided under the UK government’s OGL3 license. A requirement of this license is that an attribution is provided for the data. These are as follows:

Release notes

March 2025

No new fields have been added in this release but sources have been updated.

February 2025

No new fields have been added in this release but sources have been updated.

January 2025

No new fields have been added in this release but sources have been updated.

December 2024

No new fields have been added in this release but sources have been updated.

November 2024

No new fields have been added in this release but sources have been updated.

October 2024

No new fields have been added in this release but sources have been updated.

September 2024

No new fields have been added in this release but sources have been updated.

July 2024

A new field, IS_MULTIRES, has been added. It indicates if a property is multi-residential based on Royal Mail MRS.

June 2024

A new field, LAST_TRANSACTION_DURATION_TYPE, has been added. It refers to the duration type of the last transaction recorded in the Land Registry Price Paid dataset (England and Wales only, back to 1995). It has two possible values, Freehold or Leasehold.

March 2024

No new fields have been added in this release but sources have been updated.

January 2024

No new fields have been added in this release but sources have been updated.

October 2023

No new fields have been added in this release but sources have been updated.

July 2023

No new fields have been added in this release but sources have been updated.

April 2023

An EPC_INSPECTION_DATE field is added. References to ‘DCLG’, the original department responsible for the EPC Energy Certificate data, are replaced with ‘EPC’ in documentation.

The Census 2021 codings oa21cd and lsoa21cd are added, to sit alongside the Census 2011 codings. Currently the source Open Data used to derive fields in Property Intelligence still uses the Census 2011 codings.

January 2023

No new fields have been added in this release but sources have been updated.

October 2022

No new fields have been added in this release but sources have been updated.

July 2022

No new fields have been added in this release but sources have been updated.

April 2022

No new fields have been added in this release but sources have been updated.

Our supplier of business information which is used to populate the business_usage field has changed.

January 2022

No new fields have been added in this release but sources have been updated.

Floor areas for flats are now included in modelling so that values for neighbouring flats are used if direct data is not available.

As a result of changing our address cleanser to the standard GBG Loqate Verify engine we now include some data from Northern Ireland.

October 2021

No new fields have been added in this release but sources have been updated.

We have added the DCLG Scotland data which provides a significant improvement in accuracy for property type, and property age in Scotland as well as improvements in accuracy to numbers of bedrooms and floors. DCLG Scotland also provides fields including extension count, wall type, main fuel, floor area, total rooms, tenure and energy rating which were not previously populated for Scotland.

There are improvements in the flat floor modeller such that it does not return unreasonably large values (over 90 storeys) or non-numeric values (other than N/A), and floor areas for flats are now included in modelling so that values for neighbouring flats are used if direct data is not available.

As a result of changing our address cleanser to the standard GBG Loqate Verify engine we now include some data from Northern Ireland.

July 2021

No new fields have been added in this release but sources have been updated.

We have introduced modelling for ‘flat_floor’ - the storey a flat sits on which improves the coverage for this field, and introduces new entries to the data sources table.

April 2021

Tenure and energy rating fields have been added. Tenure is a replacement for the previously removed tenancy field. It indicates whether a property is owner-occupied, private rental or social housing. The name has been changed to retain consistency with the underlying dataset.

The property age field has improved direct data content and accuracy as a result of the addition of a new dataset.

January 2021

No new fields have been added in this release but sources have been updated.

October 2020

We have resumed supply of two fields which had been suspended:

  • the congestion zone field from the raw TFL data rather than using a third party as a supplier;
  • the cadastral area (building plot area) which had been suspended due to licensing issues. This is based on the Land Registry INSPIRE Polygons data;

The Land Registry House Price Index is available once again, and thus Estimated Current Values will be up to date unless a sale went through during the period in which the HPI was suspended.
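
The dependence of the estimated current value on the House Price Index can be illustrated with simple index scaling: the last transaction price is multiplied by the ratio of the latest index value to the index value at the time of sale. The document does not describe the actual estimation method, so this is a sketch of a common approach under that assumption, with a hypothetical function name and made-up figures.

```python
def index_price(last_price: float, hpi_at_sale: float, hpi_latest: float) -> float:
    """Scale a historic sale price by the change in a house price index."""
    if hpi_at_sale <= 0:
        raise ValueError("hpi_at_sale must be positive")
    return last_price * hpi_latest / hpi_at_sale


# Made-up example: sold for 200,000 when the index stood at 100; index is now 125.
estimated_value = index_price(200_000, hpi_at_sale=100.0, hpi_latest=125.0)
```

Under this approach, if the index stops being published then `hpi_latest` stays at its last published value and the estimate stops moving, which matches the behaviour described for the period in which the index was suspended.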

July 2020

As noted in the April 2020 Release notes we have removed the following fields from this release:

  • geocode accuracy
  • red route
  • tenancy
  • multiplicity
  • outdoor area
  • building count
  • adult occupants

The Land Registry UK House Price Index has been suspended as of the April 2020 release, which was due to be published in June, because the impact of COVID-19 means that limited transactions are occurring on which to base the Index. The relevant Land Registry Bulletin describing this change is here. This means the estimated current value field will contain the estimated current value at the last release of the House Price Index, 1st March 2020.

April 2020

This build incorporates the OS AddressBase Premium property-level easting / northing and latitude / longitude coordinates, which replace those provided by our previous supplier. This data is supplied under evaluation terms which you have already signed up to.

These will be replaced with coordinates derived from Ordnance Survey Open Source data once this has been released in July 2020.

As a result of recent supplier changes we are also withdrawing a number of fields including the geocode accuracy and red route fields. The Congestion Zone field will remain but not be populated in the next build.

We will also be withdrawing the Tenancy field as a result of other supplier licensing changes.

Finally there are a number of fields which have not been populated for some time including multiplicity, outdoor area, building count and number of adult occupants. All of these fields are present in this build containing default values in most cases but will be removed from the July 2020 build.

January 2020

No new fields have been added in this release but sources have been updated.

October 2019

No new fields have been added in this release but sources have been updated.

July 2019

No new fields have been added in this release but sources have been updated.

April 2019

The listed building field now reports the grade of listing (I, II* or II in England or Wales, A, B or C in Scotland). Previously buildings were just reported listed/not listed.

October 2018

A tenancy field was added in this release which identifies a property as being rented, social housing or owner-occupier.

June 2018

No new fields have been added in this release but sources have been updated. In addition, the documentation provides details of data recency.

February 2018

This release contains further parameters derived from LIDAR data, these include building footprint, building volume, the average roof slope, a flat roof fraction, the distance to the nearest tree over 10 metres high to the property geocode and a geocode multiplicity which counts the number of geocodes within a building. The building footprint is not listed below since it is not a new field but has been re-calculated using LIDAR data.

October 2017

This release incorporates a major new dataset which has brought improved accuracy to numbers of bedrooms, numbers of floors, property type and property age fields as well as introducing a number of new fields, listed below. The cadastral, outdoor area and footprint fields will be populated only with default values from this release onwards for licensing reasons. We hope to re-introduce the building footprint in the February 2018 release.

June 2017

This release sees a switch to using the Royal Mail PAF, Not Yet Built and Multiple Residence files as the base address list, which results in approximately 10% more addresses than earlier releases. In addition, accuracy in identifying flats was improved substantially, and a number of fields pertaining specifically to flats were included.

February 2017

This release introduced the following new fields, with a focus on logistics.

October 2016

This is the first public release of the Property Intelligence dataset.