What is the Global Average SST?

By Andy May

This post can be read in German here.

The global average SST (Sea Surface Temperature) is a very important component of the global average surface temperature for the simple reason that the global ocean covers almost 71% of Earth’s surface. So, we downloaded the gridded SST data from 1850 through 2024 from the Hadley Centre (HadSST v4.1), NOAA (ERSST v5), and NOAA’s SST data repository (ICOADS v3) and then plotted the data in figure 1.

A graph of a graph showing the number of the same numbers

AI-generated content may be incorrect.
Figure 1. Area-weighted global average temperature from ICOADS, HadSST, and ERSST for 1850-2024. Data are downloaded from the respective agencies. Click on the image to see it in full resolution, this is also true for the other figures in this post.

There are good reasons for the large spread of SST values in figure 1 and we will go through some of them in this post, but the original question remains, “What is the global average SST?” The global average surface temperature has supposedly increased around one degree since 1850, but the differences in the records plotted are larger than that.

ICOADS v. 3

ICOADS (International Comprehensive Ocean-Atmospheric Data Set) is the ultimate source of nearly all the data plotted in figure 1. Nearly all the original ship, buoy, and other raw data used by the Hadley Centre and the NOAA ERSST (Extended Reconstructed SST) group comes from ICOADS v. 3. In addition to collecting the raw data they also provide a simple mean gridded product of their own as shown in figure 1 in green. All the global average temperature data plotted in this post is area weighted. The grids utilized by the agencies are latitude and longitude grids and while the latitudes are spaced the same everywhere, the longitudes are not, they are around 111 km apart at the equator and converge to zero at the poles so the area of each grid cell changes with the cosine of the latitude, which I corrected for. The ICOADS simple mean plotted in figure 1 is as close to the raw data as you can get.

HadSST v. 4.1

The Hadley Centre provides a lot of information about the steps they take to reach their final global average SST and at several steps they provide an intermediate product (see here). This is very commendable and educational. We downloaded their final “core” temperature anomaly, the unadjusted anomaly, and the “actuals” dataset in degrees C. These are plotted in figure 2.

A graph of a graph showing the average core of the core

AI-generated content may be incorrect.
Figure 2. Two intermediate HadSST 4.1 products and the final “core” SST product compared. The actual bias-corrected green curve is in degrees and uses the right scale. Data from the Hadley Centre website here.

The process that the Hadley Centre uses to get from the ICOADS raw data plotted in figure 1 to the “core” HadSST product shown in figure 2 is described in several papers (Rayner, et al., 2006), (Kennedy J. J., Rayner, Smith, Parker, & Saunby, 2011), (Kennedy J. J., Rayner, Smith, Parker, & Saunby, 2011b), and (Kennedy J. , Rayner, Atkinson, & Killick, 2019).

Their process is made difficult because their measurement “stations” are constantly moving, except for a few tethered buoys. So, their first step is to construct a reference “climatology” grid. In the case of the Hadley Centre this is a global ocean one degree by one degree grid. Each usable ocean grid cell must have monthly average values for the reference period of 1961-1990. Most of these values are measured from ships. Not all grid cells had actual measurements in the reference period for some years or months and interpolation and some extrapolation in both time and space was required to complete the 1961-1990 climatology (Rayner, et al., 2006).

A special averaging function that trims extreme values, called a winsorized mean, is used to compute the values for each reference and monthly measurement cell (Rayner, et al., 2006). Later the one-degree grids are combined into the larger five-degree grids that are used to produce the averages shown in this post and the final “core” SST anomaly product plotted in figure 2 in orange.

The Hadley Centre starts with the ICOADS v. 3 raw data shown in figure 1, but they exclude some of the data either to use later as a quality control check, or because they consider it inferior data (Kennedy J., Rayner, Atkinson, & Killick, 2019). The usable data is first placed in its grid cell and converted into an anomaly by subtracting the climatology reference mean for the month from the monthly average value (Kennedy J., Rayner, Atkinson, & Killick, 2019). The measurement is obviously not from the same source as the climatology reference value.

After the initial anomalies are computed, the potential biases are computed for the individual values based on the data source. It might be a bucket sample taken over the side of a ship, in this case the type of bucket and bucket insulation, if any, is taken into account. It might be from a ship engine water intake, in that case the location of the thermometer relative to the engine is taken into account, and so on. Using the data they have about each measurement, a possible suite of “realizations” of the biases are generated and the median value of these hypothetical biases is selected to compute a “bias-corrected” anomaly. The resulting bias-corrected actual temperature values are averaged in figure 2 and shown as an “Actual” temperature in degrees C in light green in figure 2. The raw anomalies, without a bias correction, are shown in figure 2 as a dark blue line. It is interesting that the bias corrected actual temperature has a different shape than either the final core anomaly or the uncorrected anomaly. I’m not sure what to make of that.

Since most of the measurements used to compute SST are moving (ships and drifting buoys) and they all have different biases and measurement methods that change over time, building a coherent and consistent SST global average temperature record is a challenge.

ERSST v. 5

The most heavily processed estimate of global ocean temperatures is the ERSST v. 5 reconstruction. We don’t have any intermediate data like we have for the HadSST reconstruction, but we have Boyin Huang, et al.’s description of the process (Huang, et al., 2017). Like the HadSST process, ERSST v. 5 starts with the ICOADS v. 3 dataset. The ERSST team then goes through a process that validates observations, discarding those that do not meet their quality control checks, bias corrects the data, cross checks with neighbors, and excludes outliers. They directly use ARGO data rather than using ARGO as a validation dataset like the HadSST team do. In ERSST, ARGO observations are weighted by 6.8 times the weight of ship observations.

The Areal coverage

The ERSST process utilizes the HadISST data to locate ice cover. When a grid cell is 90% covered with ice or more the SST in the grid cell is set to -1.8°C. Partial coverage is linearly interpolated between the reconstructed grid cell value and -1.8°C (Huang, et al., 2017). Since -1.8°C is the temperature where seawater freezes this makes some sense, but currents do exist under ice caps and clearly the sea surface temperature under the ice is not a uniform -1.8°C. Their assumption is a speculative oversimplification when we are trying to estimate surface warming rates on the order of 0.1°C/decade.

Their reconstruction process includes both interpolation and extrapolation. By making these assumptions, the ERSST reconstructed grid is more complete than the HadSST grid, as shown in figure 3.

A blue and white map

AI-generated content may be incorrect.
Figure 3. The populated grid cells are colored for the HadSST 4.1 and ERSST 5 datasets. The HadSST cells are 5×5 degrees and the ERSST cells are 2×2. In both maps blue means that 10 or more months have values, light blue 6-9 months have values, and green means 3-5 months have values. A white cell has two values or less. The maps are both for 2024.

ERSST provides a very full grid. HadSST is more conservative but still uses interpolation and some extrapolation to produce as full a grid as they can. Compare these two maps to the ICOADS map of the actual data in 2024 shown in figure 4.

A map of the world

AI-generated content may be incorrect.
Figure 4. A map of the 2×2 ICOADS populated cells for 2024. The color scheme is the same as in figure 3.

The ICOADS simple mean temperature uses all values, whereas both HadSST and ERSST reject anomalous values, so the ICOADS coverage as mapped in figure 4, is as good as it gets, at least with respect to the actual measurements.

Comparing the maps in figures 3 and 4 shows us one of the reasons why the ICOADS global average temperature is the highest in figure 1 and the ERSST temperature is the lowest. The ERSST average includes a lot of assumed low values under polar ice that are unused null grid values in the ICOADS and HadSST averages. The differences between HadSST and ICOADS are due, at least in part, to the difference in the cell sizes. The ICOADS cells are 2×2 degrees and the HadSST cells are 5×5. These are areas of 49,000 sq km and 308,000 sq km respectively at the equator. The larger HadSST cell size allows for smaller areas without any measurements to be incorporated into larger cells that have values. In other words, the larger cells spread what data exists over larger areas.

Observations over time

So far, we’ve only looked at the distribution of measurements and final values for 2024. How does the amount of data vary over time? We have detailed data for both ICOADS and HadSST on observations and have plotted it in figure 5.

A group of graphs showing different types of data

AI-generated content may be incorrect.
Figure 5. Reported observations by cell for HadSST 4.1 and ICOADS v3. Plots of the average from 1850-2024 and for 2024 are shown. The HadSST cells are 5×5 and the ICOADS cells are 2×2, which accounts for some of the difference.

Figure 5 shows that HadSST reports a lot more observations per cell than ICOADS, even more than would be expected due to the larger HadSST cell size. Further, at latitude 30S, HadSST reports as many observations as at 30N for 2024, these observations are not seen in the ICOADS 2024 dataset. The ICOADS data is not interpolated or manipulated, thus the observations plotted in the lower half of figure 5 are real. HadSST does not interpolate, extrapolate, or infill cells to the extent that ERSST does, but they obviously do some. The methods used to interpolate, infill, and extrapolate values in HadSST are described in part in Rayner et al. (Rayner, et al., 2006). A lot of this is accomplished when combining the initial one-degree cells into the final five-degree cells.

Discussion

It is apparent that we don’t know what the global average SST is at the accuracy required to detect a warming rate of 0.1°C/decade. The raw data (ICOADS) does not compare well to the processed results, as shown in figures 1 and 2.

The ICOADS simple mean is the closest to the actual measurements and is preferred for that reason. Comparing it (in green) and the HadSST 4.1 Actual bias-corrected values (light blue) in figure 6 to the more highly processed standard anomalies shows that all estimates, anomaly or otherwise, are suspect before 1990 and even afterward.

A graph showing the growth of a stock market

AI-generated content may be incorrect.
Figure 6. The final HadSST 4.1 “core” anomaly compared to the ERSST, ICOADS, and HadSST Actuals converted to anomalies.

In figure 6 the ERSST, ICOADS and HadSST 4.1 actual temperatures are converted to anomalies by subtracting their respective 1961-1990 means from each grid cell value. This is different from the normal procedure of computing each anomaly by subtracting its individual reference mean from each value before processing and populating the grid. The normal procedure was used to build the plotted final median core HadSST 4.1 anomaly in figure 6. However, even though the other anomalies were computed differently, the ICOADS anomaly is a fairly close fit to the ERSST and final HadSST anomalies from the early 1990s to 2024.

The HadSST bias-corrected Actual values, converted to an anomaly, do not match most of the other anomalies very well, even after 1990, which is a bit confusing. The data quality over World War II (WWII) is very poor as discussed by Huang, et al. Kennedy, et al. (2019) also discuss the data quality over this period, especially the sharp drop at the end of war, and say it is due to a large change in the areas sampled by the global fleet. Some of the sudden post-war drop may also be due to a change in the ship sampling methods. This anomaly is hidden in most datasets because the anomalies in most datasets are computed before processing begins masking the measured jump and fall in actual SST. Sometimes when viewing Earth through a microscope you can miss the mountains. The original papers do not include plots of actual temperature, but the WWII data problem can be seen in Kennedy, et al.’s (2019) bias plots in their figures 6, 7, 8, 9, 10, and 13.

The agreement between the estimated anomalies is also poor prior to 1912, which is largely due to poor sampling, the ICOADS observations prior to 1912 generally never exceed 5,000 and peak between 30N and 45N at only 9,000 observations. While good reasons exist for all the problems identified in this post and for all the corrections applied by the HadSST and ERSST teams, this does not mean they are getting the right answer.

Thus, the global average SST, the most important component of global average surface temperature, is largely unknown before the early 1990s and even after 1990 there is some doubt. The doubts about the SST average and the doubts about the land record combined with the very small amount of warming since the beginning of the 20th century (about one degree) cast considerable doubt, at least in my mind, about estimates of modern global warming. I don’t doubt that the world is warmer, on average, since 1900, but I don’t think we know how much warming has occurred with any accuracy. I also don’t think we necessarily have the trend or trends right, is part of the WWII “hump” real? I don’t think we know.

Works Cited

Freeman, E., Woodruff, S., Worley, S., Lubker, S., Kent, E., Angel, W., . . . Smith, S. (2017). ICOADS Release 3.0: a major update to the historical marine climate record. Int. J. Climatol., 37, 2211-2232. doi:10.1002/joc.4775

Huang, B., Thorne, P. W., Banzon, V. F., Boyer, T., Chepurin, G., Lawrimore, J. H., . . . Zhang, H.-M. (2017). Extended Reconstructed Sea Surface Temperature, Version 5 (ERSSTv5): Upgrades, Validations, and Intercomparisons. Journal of Climate, 30(20). doi:10.1175/JCLI-D-16-0836.1

Kennedy, J. J., Rayner, N. A., Smith, R. O., Parker, D. E., & Saunby, M. (2011). Reassessing biases and other uncertainties in sea surface temperature observations measured in situ since 1850; 1. Measurement and sampling uncertainties. Journal of Geophysical Research, 116. Retrieved from https://agupubs.onlinelibrary.wiley.com/doi/full/10.1029/2010JD015218

Kennedy, J. J., Rayner, N. A., Smith, R. O., Parker, D. E., & Saunby, M. (2011b). Reassessing biases and other uncertainties in sea surface temperature observations measured in situ since 1850: 2. Biases and homogenization. J. Geophys. Res., 116. doi:10.1029/2010JD015220

Kennedy, J., Rayner, N. A., Atkinson, C. P., & Killick, R. E. (2019). An ensemble data set of sea-surface temperature change from 1850: the Met Office Hadley Centre HadSST.4.0.0.0 data set. JGR Atmospheres, 124(14). Retrieved from https://agupubs.onlinelibrary.wiley.com/doi/abs/10.1029/2018JD029867

Rayner, N. A., Brohan, P., Parker, D. E., Folland, C. K., Kennedy, J. J., Vanicek, M., . . . Tett, S. F. (2006). Improved Analyses of Changes and Uncertainties in Sea Surface Temperature Measured In Situ since the Mid-Nineteenth Century: The HadSST2 Dataset. J. Climate, 19, 446-469. doi:10.1175/JCLI3637.1

Published by Andy May

Petrophysicist, details available here: https://andymaypetrophysicist.com/about/

18 thoughts on “What is the Global Average SST?

  1. Using ship engine cooling water intake temperature as a measure of sea surface temperature seems problematic to me. The deepest draft commercial ship I am aware of had a draft of about 80 ft. More typical for merchant ships is around 40 or 50 feet. Engine cooling water is typically drawn from near the bottom of the ship. For typical merchant ships, that is dozens of feet below the surface. IR radiation from water at such depths cannot reach the surface as water is essentially black to IR. Only water molecules at or within a very few molecular diameters of the surface can be released to the atmosphere. So how can the temperature of water dozens of feet below the surface impact our atmosphere? During storms, there is likely a lot of mixing which might make the dozens-of-feet deep temperatures close to the absolute surface. During calms there might be very little mixing. I suppose such issues have been evaluated by those involved in the matter. Any comments?

    1. The data is from many different sources, there is some discussion of the statistical spread of the various data sources and their biases in Kennedy, et al., 2019. See the bibliography.

  2. Andy, are you familiar with the works of Bob Weber? If not may I introduce you? I can send you by email his short commentary replete with his interesting graphs. I think you will converge and may want to collaborate.

    Both data sets (Hadley and IOADS) appear (to this old guy familiar with digital signal analysis) to be one set of data overlaid on at least another set of data. You might run these raw data sets (not anomalies) through Fourier transform to see if other systemic variables pop out/separate and then try to figure out what the other variable is. As Denis suggests above, a systematic variance in sampling is a likely suspect. (Direction of the current vis-a-vis ship of buoy?, cloudy days vs clear sky? ENSO proximity?)

    Another suggestion: you might tag and redistribute the cells based on SST per unit surface area. How many square miles of ocean surface at 17 degrees, at 18 degrees, at 19 degrees, etc. Weight the cells less based on rate of extrapolation/interpolation. Weight more if based on rate samples per square miles at a given SST. Do you get a Gaussian distribution around a mean SST?

  3. This comment is by Dave Burton, it was lost in the system for some reason, so I’m posting it. [Ed.]

    Thank you, Andy, for an extremely informative post.

    I have a question: In Figure 5, top row, you show “Average Observations per cell by Latitude, 1850-2024” and “Observations per cell by Latitude 2024.” E.g., for HadSST4.1, at 30°N “Average Observations per cell by Latitude, 1850-2024” is about 250,000 and “Observations per cell by Latitude 2024” is about 1.2 million. So, are those figures the (average) number of measurements taken within PER YEAR, per grid cell at that latitude? (And are those numbers scaled by 1/cosine of latitude?)

    You write that, “all estimates, anomaly or otherwise, are suspect before 1990 and even afterward” and “thus, the global average SST, the most important component of global average surface temperature, is largely unknown before the early 1990s and even after 1990 there is some doubt.” I think you’ve made that point convincingly.

    The first Argo float was deployed in 2000. They didn’t reach 3000 units (i.e., one float per 120,000 km²) until 2007. So I would guess that ocean temperature data since perhaps about 2005 are not garbage. But even now we only have about one float per 90,000 km² of ocean.

    What’s more, even Argo data is subject to quite drastic correction and revision. Do you remember when the Argo floats were measuring a cooling ocean… until Dr. Josh Willis fixed the data? Willis is a passionate climate alarmist, who calls climate change “The Apocalypse.” So when the Argo floats found ocean cooling instead of warming, he looked hard to find an error. Do you think he would have looked as hard if the measurements had initially shown the warming that he expected, instead of cooling that he did not expect?

    Additionally, since 1979 we have satellite datasets for lower troposphere air temperatures, e.g., UAH6.

    Before that, there really is no reliable temperature data for most of the world, and even after that, there’s a lot of uncertainty, as you can see when you compare different global temperature indexes:
    https://sealevel.info/GISS_vs_UAH_and_HadCRUT_1958-2022_woodfortrees3_annot1_1495x783.png

    1. Hi Dave,
      As for your first question:
      Yes, figure 5 is average observations per cell per year, and no I did not weight any of the observation graphs by area, only the temperature graphs. I considered weighting them by area, but could think of no reason to do so, a cell is a cell.

      1. Hi Andy,

        Thanks for posting my comment, and answering my question.

        Here’s a July 2025 paper about the large variations in sea surface temperature indexes:

        https://www.nature.com/articles/s41558-025-02362-6 Menemenlis, S., Vecchi, G.A., Yang, W. et al. Consequential differences in satellite-era sea surface temperature trends across datasets. Nat. Clim. Chang. 15, 897–903 (2025). https://doi.org/10.1038/s41558-025-02362-6
        (fulltext)

        Dave

    2. As for Josh Willis’ blog post on “correcting” the ARGO temperatures here:
      https://earthobservatory.nasa.gov/features/OceanCooling

      If anything, the satellite measurements of incoming and outgoing radiation are less accurate than the ARGO data, and are less related to SST than cloud cover anyway.

      The world ocean is huge and contains 99.9% of the thermal energy stored by the surface of the Earth. We only sample a very small part of it and we don’t sample it very well. My best estimate is the ocean warmed (over all, to about 4,500 meters) about 0.06 deg c/decade from 2002-2020, but that is pretty speculative. The data are not that accurate.

  4. Best climate article in the past few months
    I read over 12 climate and energy articles a day
    ARGO has potential if owners can be trusted

    Future article?
    — List of every sst methodology used in history

    I’ve been looking for a comparison of every sst methodology used in (tested in) the same location to see what trend methodology changes alone could have caused. Been looking since 1997!

Leave a Reply to Andy MayCancel reply

Discover more from Andy May Petrophysicist

Subscribe now to keep reading and get access to the full archive.

Continue reading