2.5: The Role of Observation in Oceanography
- Page ID
- 30032
\( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)
\( \newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\)
( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\)
\( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)
\( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\)
\( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)
\( \newcommand{\Span}{\mathrm{span}}\)
\( \newcommand{\id}{\mathrm{id}}\)
\( \newcommand{\Span}{\mathrm{span}}\)
\( \newcommand{\kernel}{\mathrm{null}\,}\)
\( \newcommand{\range}{\mathrm{range}\,}\)
\( \newcommand{\RealPart}{\mathrm{Re}}\)
\( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)
\( \newcommand{\Argument}{\mathrm{Arg}}\)
\( \newcommand{\norm}[1]{\| #1 \|}\)
\( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)
\( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\AA}{\unicode[.8,0]{x212B}}\)
\( \newcommand{\vectorA}[1]{\vec{#1}} % arrow\)
\( \newcommand{\vectorAt}[1]{\vec{\text{#1}}} % arrow\)
\( \newcommand{\vectorB}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vectorC}[1]{\textbf{#1}} \)
\( \newcommand{\vectorD}[1]{\overrightarrow{#1}} \)
\( \newcommand{\vectorDt}[1]{\overrightarrow{\text{#1}}} \)
\( \newcommand{\vectE}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{\mathbf {#1}}}} \)
\( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)
\(\newcommand{\avec}{\mathbf a}\) \(\newcommand{\bvec}{\mathbf b}\) \(\newcommand{\cvec}{\mathbf c}\) \(\newcommand{\dvec}{\mathbf d}\) \(\newcommand{\dtil}{\widetilde{\mathbf d}}\) \(\newcommand{\evec}{\mathbf e}\) \(\newcommand{\fvec}{\mathbf f}\) \(\newcommand{\nvec}{\mathbf n}\) \(\newcommand{\pvec}{\mathbf p}\) \(\newcommand{\qvec}{\mathbf q}\) \(\newcommand{\svec}{\mathbf s}\) \(\newcommand{\tvec}{\mathbf t}\) \(\newcommand{\uvec}{\mathbf u}\) \(\newcommand{\vvec}{\mathbf v}\) \(\newcommand{\wvec}{\mathbf w}\) \(\newcommand{\xvec}{\mathbf x}\) \(\newcommand{\yvec}{\mathbf y}\) \(\newcommand{\zvec}{\mathbf z}\) \(\newcommand{\rvec}{\mathbf r}\) \(\newcommand{\mvec}{\mathbf m}\) \(\newcommand{\zerovec}{\mathbf 0}\) \(\newcommand{\onevec}{\mathbf 1}\) \(\newcommand{\real}{\mathbb R}\) \(\newcommand{\twovec}[2]{\left[\begin{array}{r}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\ctwovec}[2]{\left[\begin{array}{c}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\threevec}[3]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\cthreevec}[3]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\fourvec}[4]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\cfourvec}[4]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\fivevec}[5]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\cfivevec}[5]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\mattwo}[4]{\left[\begin{array}{rr}#1 \amp #2 \\ #3 \amp #4 \\ \end{array}\right]}\) \(\newcommand{\laspan}[1]{\text{Span}\{#1\}}\) \(\newcommand{\bcal}{\cal B}\) \(\newcommand{\ccal}{\cal C}\) \(\newcommand{\scal}{\cal S}\) \(\newcommand{\wcal}{\cal W}\) \(\newcommand{\ecal}{\cal E}\) \(\newcommand{\coords}[2]{\left\{#1\right\}_{#2}}\) \(\newcommand{\gray}[1]{\color{gray}{#1}}\) \(\newcommand{\lgray}[1]{\color{lightgray}{#1}}\) \(\newcommand{\rank}{\operatorname{rank}}\) \(\newcommand{\row}{\text{Row}}\) \(\newcommand{\col}{\text{Col}}\) \(\renewcommand{\row}{\text{Row}}\) \(\newcommand{\nul}{\text{Nul}}\) \(\newcommand{\var}{\text{Var}}\) \(\newcommand{\corr}{\text{corr}}\) \(\newcommand{\len}[1]{\left|#1\right|}\) \(\newcommand{\bbar}{\overline{\bvec}}\) \(\newcommand{\bhat}{\widehat{\bvec}}\) \(\newcommand{\bperp}{\bvec^\perp}\) \(\newcommand{\xhat}{\widehat{\xvec}}\) \(\newcommand{\vhat}{\widehat{\vvec}}\) \(\newcommand{\uhat}{\widehat{\uvec}}\) \(\newcommand{\what}{\widehat{\wvec}}\) \(\newcommand{\Sighat}{\widehat{\Sigma}}\) \(\newcommand{\lt}{<}\) \(\newcommand{\gt}{>}\) \(\newcommand{\amp}{&}\) \(\definecolor{fillinmathshade}{gray}{0.9}\)The brief tour of theoretical ideas suggests that observations are essential for understanding the ocean. The theory describing a convecting, wind-forced, turbulent fluid in a rotating coordinate system has never been sufficiently well known that important features of the oceanic circulation could be predicted before they were observed. In almost all cases, oceanographers resort to observations to understand oceanic processes.
At first glance, we might think that the numerous expeditions mounted since 1873 would give a good description of the ocean. The results are indeed impressive. Hundreds of expeditions have extended into all oceans. Yet, much of the ocean is poorly explored.
By the year 2000, most areas of the ocean will have been sampled from top to bottom only once. Some areas, such as the Atlantic, will have been sparsely sampled three times: during the International Geophysical Year in 1959, during the Geochemical Sections cruises in the early 1970s, and during the World Ocean Circulation Experiment from 1991 to 1996. All areas will be vastly under-sampled. This is the sampling problem (See box below). Our samples of the ocean are insufficient to describe the ocean well enough to predict its variability and its response to changing forcing. Lack of sufficient samples is the largest source of error in our understanding of the ocean.
The lack of observations has led to a very important and widespread conceptual error:
“The absence of evidence was taken as evidence of absence.” The great difficulty of observing the ocean meant that when a phenomenon was not observed, it was assumed it was not present. The more one is able to observe the ocean, the more the complexity and subtlety that appears.
— Wunsch (2002a)
As a result, our understanding of the ocean is often too simple to be correct.
Sampling error is the largest source of error in the geosciences. It is caused by a set of samples not representing the population of the variable being measured. A population is the set of all possible measurements, and a sample is the sampled subset of the population. We assume each measurement is perfectly accurate.
To determine if your measurement has a sampling error, you must first completely specify the problem you wish to study. This defines the population. Then, you must determine if the samples represent the population. Both steps are necessary.
Suppose your problem is to measure the annual-mean sea-surface temperature of the ocean to determine if global warming is occurring. For this problem, the population is the set of all possible measurements of surface temperature, in all regions in all months. If the sample mean is to equal the true mean, the samples must be uniformly distributed throughout the year and over all the area of the ocean, and sufficiently dense to include all important variability in time and space. This is impossible. Ships avoid stormy regions such as high latitudes in winter, so ship samples tend not to represent the population of surface temperatures. Satellites may not sample uniformly throughout the daily cycle, and they may not observe temperature at high latitudes in winter because of persistent clouds, although they tend to sample uniformly in space and throughout the year in most regions. If daily variability is small, the satellite samples will be more representative of the population than the ship samples.
From the above, it should be clear that oceanic samples rarely represent the population we wish to study. We always have sampling errors.
In defining sampling error, we must clearly distinguish between instrument errors and sampling errors. Instrument errors are due to the inaccuracy of the instrument. Sampling errors are due to a failure to make a measurement. Consider the example above: the determination of mean sea-surface temperature. If the measurements are made by thermometers on ships, each measurement has a small error because thermometers are not perfect. This is an instrument error. If the ships avoids high latitudes in winter, the absence of measurements at high latitude in winter is a sampling error.
Meteorologists designing the Tropical Rainfall Mapping Mission have been investigating the sampling error in measurements of rain. Their results are general and may be applied to other variables. For a general description of the problem see North & Nakamoto (1989).
Selecting Oceanic Data Sets
Much of the existing oceanic data have been organized into large data sets. For example, satellite data are processed and distributed by groups working with NASA. Data from ships have been collected and organized by other groups. Oceanographers now rely more and more on such collections of data produced by others.
The use of data produced by others introduces problems: i) How accurate are the data in the set? ii) What are the limitations of the data set? And, iii) How does the set compare with other similar sets? Anyone who uses public or private data sets is wise to obtain answers to such questions.
If you plan to use data from others, here are some guidelines:
- Use well documented data sets. Does the documentation completely describe the sources of the original measurements, all steps used to process the data, and all criteria used to exclude data? Does the data set include version numbers to identify changes to the set?
- Use validated data. Has accuracy of data been well documented? Was accuracy determined by comparing with different measurements of the same variable? Was validation global or regional?
- Use sets that have been used by others and referenced in scientific papers. Some data sets are widely used for good reason. Those who produced the sets used them in their own published work and others trust the data.
- Conversely, don’t use a data set just because it is handy. Can you document the source of the set? For example, many versions of the digital, 5-minute maps of the sea floor are widely available. Some date back to the first sets produced by the U.S. Defense Mapping Agency, others are from the etopo-5 set. Don’t rely on a colleague’s statement about the source. Find the documentation. If it is missing, find another data set.
Designing Oceanic Experiments
Observations are exceedingly important for oceanography, yet observations are expensive because ship time and satellites are expensive. As a result, oceanographic experiments must be carefully planned. While the design of experiments may not fit well within an historical chapter, perhaps the topic merits a few brief comments because it is seldom mentioned in oceanographic textbooks, although it is prominently described in texts for other scientific fields. The design of experiments is particularly important because poorly planned experiments lead to ambiguous results, they may measure the wrong variables, or they may produce completely useless data.
The first and most important aspect of the design of any experiment is to determine why you wish to make a measurement before deciding how you will make the measurement or what you will measure.
- What is the purpose of the observations? Do you wish to test hypotheses or describe processes?
- What accuracy is required of the observation?
- What resolution in time and space is required? What is the duration of measurements?
Consider, for example, how the purpose of the measurement changes how you might measure salinity or temperature as a function of depth:
- If the purpose is to describe water masses in an ocean basin, then measurements with 20–50 m vertical spacing and 50–300 km horizontal spacing, repeated once per 20–50 years in deep water are required.
- If the purpose is to describe vertical mixing in the open equatorial Pacific, then 0.5–1.0 mm vertical spacing and 50–1000 km spacing between locations repeated once per hour for many days may be required.
Accuracy, Precision, and Linearity
While we are on the topic of experiments, now is a good time to introduce three concepts needed throughout the book when we discuss experiments: precision, accuracy, and linearity of a measurement.
Accuracy is the difference between the measured value and the true value.
Precision is the difference among repeated measurements.
The distinction between accuracy and precision is usually illustrated by the simple example of firing a rifle at a target. Accuracy is the average distance from the center of the target to the hits on the target. Precision is the average distance between the hits. Thus, ten rifle shots could be clustered within a circle 10 cm in diameter with the center of the cluster located 20 cm from the center of the target. The accuracy is then 20 cm, and the precision is roughly 5 cm.
Linearity requires that the output of an instrument be a linear function of the input. Nonlinear devices rectify variability to a constant value. So a nonlinear response leads to wrong mean values. Non-linearity can be as important as accuracy. For example, let \[\begin{align*} Output &= Input + 0.1 (Input)^{2} \\ Input &= a \sin \omega t \end{align*} \]
then \[\begin{align*} Output &= a \sin \omega t + 0.1 (a \sin \omega t)^{2} \\ Output &= Input + \frac{0.1}{2} a^{2} - \frac{0.1}{2} a^{2} \cos 2 \omega t \end{align*} \]
Note that the mean value of the input is zero, yet the output of this nonlinear instrument has a mean value of \(0.05a^{2}\) plus an equally large term at twice the input frequency. In general, if input has frequencies \(\omega_{1}\) and \(\omega_{2}\), then output of a non-linear instrument has frequencies \(\omega_{1} \pm \omega_{2}\). Linearity of an instrument is especially important when the instrument must measure the mean value of a turbulent variable. For example, we require linear current meters when measuring currents near the sea surface where wind and waves produce a large variability in the current.
Sensitivity to other variables of interest.
Errors may be correlated with other variables of the problem. For example, measurements of conductivity are sensitive to temperature. So, errors in the measurement of temperature in salinometers leads to errors in the measured values of conductivity or salinity.