Analysis of differences between the ATLSS High Resolution Topography (HRT) model output
and the USGS High Accuracy Elevation Data (HAED).
by
Scott M. Duke-Sylvester
The Institute for Environmental Modeling
University of Tennessee at Knoxville
sylv@tiem.utk.edu
Introduction
Measuring ground surface elevations in South Florida is a difficult task.
As a result the elevation data available for the region exists only at a
coarse spatial resolution.
This has presented a significant problem when trying the capture the dynamics of biological
components.
Local hydrologic conditions are an important factor in the behavior and
life histories of many of South Florida's flora and fauna.
The local elevation of the ground surface is in turn an important part of local hydrologic
conditions.
Without detailed elevation data there can be no local response by models to
hydrology.
The ATLSS High Resolution Topography (HRT) model has been produced to provide detailed elevation
data as input to various models, both within the ATLSS project and in other projects,
at spatial resolutions which are relevant to biological processes.
The accuracy and sources of errors in the ATLSS HRT are long-standing questions.
The lack of any reliable ground surface elevation data against which the model could be tested
have made it difficult to find meaningful
answers.
To date, the problems with the elevation data sets available to us has been the
lack of error measurements for either the horizontal or vertical data.
In addition, many early data sets either covered very limited regions, or covered large regions
at very coarse spatial resolutions.
The High Accuracy Elevation Data (HAED) project is an effort headed by the
US Geological Survey (USGS) to obtain accurate elevation values for the Everglades
at a high spatial resolution.
The technique used by the HAED to measure both the horizontal and vertical data utilizes
differential GPS.
The differential GPS technology yields measurment accuracies which previously have been difficult to
achieve.
Measurements are taken on a grid where the nominal longitudinal
or latitudinal distance between measurements is 400 meters.
This represents a vast improvement in the resolution at which elevations are known for the
region.
In addition, the error in the horizontal data is less than a centimeter, so the location of
each measurment is known to a high degree of accuracy.
The vertical data are measured utilizing differential GPS providing
15 centimeters of vertical accuracy.
Further information about this data set can be found at:
sofia.usgs.gov/projects/elev_data/.
The data sets provided by the HAED project are broken into a number of regions.
Currently there are 19 regions.
The location and names of these regions are shown in Figure 1.
We have used the data from the USGS project to evaluate the quality of the HRT model outputs
and to understand the sources of errors.
|
|
Figure 1.
Map of south east Florida showing the locations and names of the various regions
for which the HAED project has collected elevation data.
Each colored box represents a single region, each region is shown with a different color.
An index is present in the upper left hand corner of each box which corresponds to one of the
indices listed in the upperleft hand corner of the figure.
The region names given in this figure
reflect the names of the data files obtained from the USGS web site.
The red an blue does represent the locations at which the HAED project has surveyed elevation.
Red dots represent positive elevations relative to NAVD 88.
Blue does represent negative elevation relative the NAVD 88.
Zero NAVD 88 can be taken to be approximatly equal to mean sea level.
|
Method of comparison
The data provided by the HAED project provides three pieces of
information for each data point.
The Universal Transect Mercator (UTM) coordinates for each data point are given, providing the
longitude and latitude of each data point.
The elevation for each point is also given.
Elevations are recorded in meters relative to NAVD 88.
To compare the HRT model output and the HAED data we need HRT elevations for each location given
by the HAED data.
The HRT model does not make predictions about ground surface elevation in urban areas.
For this reason the regions from the HAED project which include urban areas have been
excluded from the analysis.
Of the 19 regions for which HAED project data is available 11 are used for this analysis.
The locations and names of these regions are given in figure 2.
The HRT model is partially based on the ground surface elevation given by the South Florida
Water Management District (SFWMD).
So the elevations given in the SFWMD elevation map were also recorded for each location given
by the HAED data.
The files from which the HRT and SFWMD data were extracted are located in:
/quartz/homes/a/sfla/Nov.4.1997/pseudotopo/NEW/pseudotopo.28m.abs.lan and
/diamond/homes/datafiles/ground/version.2.9/data/ground.surface.elevation.unit-mm.lan
respectively.
Each of these files give ground surface elevation in mm relative NGVD 29.
This was confimed prior to using the data in two ways.
The source code which creates the HRT data as check.
The two data files where compared using PVWave to assure the data files had the
smae range of values.
The units of the HRT and SFWMD elevations for each location were converted to meters relative to
NAVD 88.
For each region, the distribution of elevations given by the HAED, the HRT model and the SFWMD
elevation data were comparied in a number of ways.
Scatter plots for each region where created which show the elevations from one data set
plotted agains one of the two remaining data sets.
This resulted in three scatter plots for each region, one comparing the HAED and SFWMD data,
one comparing the SFWMD and HRT data and one comparing the HAED and HRT data.
For each region the average elevation and 95% range of elevations given by the HAED and HRT
data associated with each habitat type where plotted.
For each region the histograms showing the distribution of elevations for each data set
are graphed together.
For these plots the average and standard deviation were computed for each data set.
The normal distribution for each data set was plotted on the same graph.
To determine if two or more of the averages differed significantly a one way ANOVA was
employed.
Even though the standard deviations are not identical the results of the ANOVA should be
essentially correct (Triola 1994).
The results of this comparison are given in the next section.
Results
The results of the comparison are shown in Figures 3a-3k.
Each figure represents the distribution of elevations for the HAED project, the HRT model and
the SFWMD elevation data for a given region.
The region represented by each figure is listed at the top of each figure.
In each figure the abscissa represents elevation in meters.
The ordinate gives the frequency with which each elevation occurs within a region.
Elevations were aggregated to the nearest tenth of a meter.
The data for the HAED is shown in red, the HRT data is shown in blue and the SFWMD data is
shown in green.
For each data set the actual distribution of elevations is shown by the thick curve.
The thin curve shows the normal distribution based on the mean and variance of the
measured data.
Each figure also shows the mean and standard deviation for each data set.
Each figure shows the value of a one way ANOVA.
The number of degrees of freedom for the numerator and denominator are also show in each figure.
In all cases ANOVA indicates a significant difference between at least two of the
data sets with a 1% chance of a type I error.
The statistics for each region have been gathered in Table 1.
In all regions the average HRT elevations are below the HAED elevation.
In at least three of the regions, Blackwater, Joe Bay and West Lake this is the result
of low SFWMD elevations and a lack of hydroperiod information of certain of the habitat
types present in these regions.
The SFWMD elevations are the starting point for the creation of HRT elevation.
In the three regions listed above the average SFWMD elevation is below the average HAED elevation.
Lower SFWMD elevations are joined with the fact that we do not have hydroperiod values
for some of the habitat types present in these regions.
In particular we do not have hydroperiods for two of the Red Mangrove (Rhizophora mangle) types.
These habitat types consitute a large portion of the area within these regions.
Lacking hydroperiod values for these habitat types the HRT model can not adjust their elevations.
In these cases the HRT model gives locations occupied by Red Mangrove the same elevation the
SFWMD provides.
In addition, many of the habitat types in these regions are assigned hydroperiods
or 365 days.
Since the SFWMD hydrology model already predicts a 365 day hydroperiod for these regions,
the model assumes they are already at an appropriate elevation.
In this way the majority of locations within Blackwater, Joe Bay and West Lake keep the
elevation provided by the SFWMD elevation data.
A few sites have hydroperiod shorter than 365 days, and the model raises these site.
This results in average HRT elevations higher than the SFWMD, but since most site are not
modified the effect is small.
In most of the regions however the HRT model is decreasing the ground surface elevation by
to much.
This is the result of assigning hydroperiod values to habitat types which are long compared
to the hydroperiod predicted by the SFWMD hydrology model.
This problem can be addressed in two ways, which maybe used in concert.
The simplest solution is to assign shorter hydroperiods to the habitat types.
This may mean shortening the maximum hydroperiod for each range, or moving both the maximum
and minimum hydroperiod values.
Another solution is to change how the model chooses a hydroperiod value from the range of
values available for each habitat type.
Currently the HRT model chooses a hydroperiod for a habitat type from the range of hydroperiod
values in proportion the hydroperiod predicted by the SFWMD hydrology model (See Box 1).
Box 1 : How the HRT model chooses hydroperiod values for a habitat type.
|
For each location, the SFWMD hydrology model predicts a hydroperiod between 0 and 365 days.
This value is normalized.
This normalized value is used to choose a hydroperiod for a habitat type from the range of
hydroperiods.
A normalized value of zero corresponds to choosing the shortest hydroperiod available for
a habitat type.
A normalized value of one corresponds to choosing the longest hydroperiod available for
a habitat type.
Interpolation between the max and min hydroperiod is linear.
|
In stead of choosing hydroperiod values on a simple proportional basis the model could use
other information.
The hydroperiod choosen for one habitat type depend on the hydroperiod preferences
for the other types.
For example if a long hydroperiod type is close to a short hydroperiod type, the model
might shift the choose of hydroperiod for the longer type toward the shorter end of the
hydroperiod range for that type.
Similarly the hydroperiod choosen for the short hydroperiod type might come from the longer
end of its range due to its proximity to the long hydroperiod type.
Alernatively the model could be designed to choose a hydroperiod with a tendency toward
a particular value with in a given range or based on a weighting of the values within the
range.
A bias in the choice of hydroperiod would be parameterized to reflect the hydroperiod preferences
of a habitat type.
Both of these approaches add realizem into the HRT model and would allow the model
to predict average elevations which are better fit to HAED elevations.
The difference in the standard deviations between data sets is less consistant from region to
region.
However some patterns are present.
The regions can be broken into two groups.
Those where are dominated by positive HAED elevations and those dominated by negative HAED
elevations.
Recall that HAED elevations are measured relative to NAVD 88, so negative elevation represent
elevations below this base line.
The regions dominated by positive HAED elevations are Coopertown, Custard, Haileah, Long Island,
Pennsuco and Taylor.
The regions dominated by negative HAED elevations are Blackwater Joe Bay, Madeira, Mahogany, and
West Lake.
This partitioning can be seen from figure 1 by observing which regions contain more red (positive
elevation) or blue (negative elevation) dots.
If we partitioned the regions based on positive or negative SFWMD elevation dominance
the groups would be different, since most of the elevations provided by the SFWMD are positive.
However the range of SFWMD elevations in the HAED positive group are an order of magnatude
larger than the SFWMD elevations for the HAED negative group.
Specifically, in regions dominated by negative HAED elevations the SFWMD elevations for those
regions are close to zero meters.
In regions dominated by positive HAED elevations the SFWMD elevations are at least a meter
above NAVD 88.
This break is important because it effects the output of the HRT model.
With the regions partitioned according to positive or negative HAED elevation we see that for
positive HAED elevation regions the standard deviation of elevation predicted by the
HRT model is larger than both the HAED and SFWMD standard deviations.
Infact the standard deviation for the HAED data is between the standard deviation for the
SFWMD and HRT data.
In these regions the HRT model has created to much diversity in the elevations.
While this indicates that the current HRT data set isn't a very good match to the
HAED data, it implies that there is enough flexability in the model which can be used
to better match the HAED data.
In particular the variation in elevations produced by the HRT model can be effected by
changing the range of hydroperiods assigned to each region.
By making the range of hydroperiods assigned to habitat types more similar the range of
elevations produced by the HRT model can be narrowed.
The results in a normalization of hydroperiod values. (?)
This sort of change to the hydroperiod ranges is possible since it intails narrowing the
diversity of hydroperiod ranges within a range of values which is already present with in
the data.
Narrowing the range of hydroperiod maybe effected by the process of shifting the range
of hydroperiods described earlier in as much as moving a hydroperiod range to far in one direction
or another may force a narrowing.
For regions in the negative HAED elevation group the picture is less clear.
In many of the regions the standard deviation of HRT elevations is larger than
the standard deviation of SFWMD elevations.
However, the standard deviation of the HAED elevations is no longer consistantly braketed
by the HRT and SFWMD data.
In these regions the HRT model under estimates the diversity of elevation.
More diveristy can be added to the elevation by diversifing the range of hydroperiod
values assigned to each habitat type.
There are limits to this diversification since hydroperiod values are constraned to the
range 0 to 365.
Overlap in hydroperiod ranges can also minimize diversity by allowing differnt habitat types
to have similar or identical hydroperiods.
The process of diversification can also conflict with the processes or normalization described
above.
It is not clear if these two processes can be achived simultainiously.
In addition this process may not be compatable with shifting the entire range of hydroperiods
to improve the fit of average hydroperiods.
In two of the regions, Madeira and West Lake the standard deviation of the HRT elevations is
less than the standard devation of the SFWMD elevations.
This result is a consequence of limitation in the input data sets.
In these regions the hydroperiod predicted by the SFWMD hydrology model is less than 365 days.
For most of the habitat types in these regions 365 days is the given hydroperiod.
Under these conditions, the HRT model does not modify the elevation of the habitat types since
their hydroperiod requirements are met at the elevation given by the SFWMD elevation data.
|
|
Figure 3a: Elevation distribution for region 1, Blackwater (bl990415.cor).
In this figure, the spike in the HRT data covers a identical spike in the SFWMD data.
|
|
|
Figure 3b: Elevation distribution for region 2, Coopertown (cooperct.cor).
|
|
|
Figure 3c: Elevation distribution for region 3, Custard (custard.cor).
|
|
|
Figure 3d: Elevation distribution for region 8, Hialeah (hialeah.cor).
|
|
|
Figure 3e: Elevation distribution for region 10, Joe Bay (jb990414.cor).
|
|
|
Figure 3f: Elevation distribution for region 11, Long Island (long_is.cor).
|
|
|
Figure 3g: Elevation distribution for region 12, Madeira (mb990412.cor).
|
|
|
Figure 3h: Elevation distribution for region 13, Mahogany (mh990412.cor).
|
|
|
Figure 3i: Elevation distribution for region 14, Pennsuco (penncoop.cor).
|
|
|
Figure 3j: Elevation distribution for region 18, Taylor Slough (ts990326.cor).
|
|
|
Figure 3k: Elevation distribution for region 19, West Lake (wl990412.cor).
|
Table 1. A listing of the statistics for each data set taken over each region.
The first column gives the name of the region.
The name of the source data file is given in parentheses.
The second column gives the figure number with which the statistics in the row are associated.
The next three columns give the average and mean for each of the USGS HAED data, the
ATLSS HRT data and the SFWMD data.
The last column gives the value of the ANOVA F-statistic.
All ANOVA values are significant, implying at least two of the data sets differ.
| |
Elevation |
| |
USGS HAED |
SFWMD |
HRT |
| Region Name |
Figure |
Average (meters) | Std. Dev. (meters) |
Average (meters) | Std. Dev. (meters) |
Average (meters) | Std. Dev. (meters) |
| Blackwater (bs990415.cor)
|
3a |
-0.266351 | 0.182032 |
-0.455230 | 0.0179738 |
-0.439824 | 0.0803711 |
| Coopertown (cooperct.cor)
|
3b |
1.43660 | 0.150940 |
1.50764 | 0.0749559 |
1.23106 | 0.229077 |
| Custard (custard.cor)
|
3c |
2.02151 | 0.174405 |
1.98807 | 0.129999 |
1.71736 | 0.171437 |
| Haileah (hialeah.cor)
|
3d |
1.19104 | 0.144416 |
1.25566 | 0.118832 |
0.802258 | 0.175113 |
| Joe Bay (jb990414.cor)
|
3e |
-0.280117 | 0.203776 |
-0.359333 | 0.0576803 |
-0.312632 | 0.105798 |
| Long Island (long_is.cor)
|
3f |
1.79149 | 0.180347 |
1.69341 | 0.104615 |
1.45278 | 0.262085 |
| Madeira (mb990412.cor)
|
3g |
-0.297962 | 0.257835 |
-0.274946 | 0.166931 |
-0.315746 | 0.125111 |
| Mahogany (mh990412.cor)
|
3h |
-0.232075 | 0.308445 |
-0.214740 | 0.0861255 |
-0.554640 | 0.181581 |
| Pennsuco (penncoop.cor)
|
3i |
1.49988 | 0.186719 |
1.55554 | 0.163086 |
1.33601 | 0.299984 |
| Taylor (ts990326.cor)
|
3j |
0.114706 | 0.290985 |
0.198750 | 0.319642 |
-0.176738 | 0.321766 |
| West Lake (wl990412.cor)
|
3k |
-0.165463 | 0.326811 |
-0.349140 | 0.0913071 |
-0.365879 | 0.0780477 |
Discussion
Copyright © 2000 Scott M. Duke-Sylvester.
Last Updated : 2000/04/26 19:50:22
Version : 1.14