TimeSeries Data Mining of Minimum Design Height for River Bridge Deck Using Seasonal Trend Decomposition
✅ Paper Type: Free Essay  ✅ Subject: Computer Science 
✅ Wordcount: 7238 words  ✅ Published: 8th Feb 2020 
Abstract. Using the traditional calculation of minimum design height (MDH) could not provide the effective decision support for engineers under complex situations. This research proposes a timeseries data mining system for the decomposition and analysis of seasonal trends of hydrologic fluctuations, using stagedischarge curves, LogPearson frequency curves, and plotting positions. The research purpose is to provide the decision support based on precise time series for bridge design and feasibility analysis. Prior to conducting the gageheight design for bridges, this study investigates the construction site of the bridge at Kinston Mines of Illinois River in the United States and processes the correlation tests between gage heights and hydrologic discharge. In addition, the study simulates and verifies the system outcomes using MannKendall Mutation, stability tests, and nonlinear regressions. The results show the correlation coefficient of 0.9719 for the discharge trends from 1994 to 2017. In addition, the study selects the bestfitted theoretical frequency curve based on the most appropriate convergence to the empirical frequency curve. Overall, this research achieves satisfactory results for the MDH design and has substantial influence for engineers and designers to meet design requirements and save project costs. The data mining method sheds light upon data analysis for engineering design.
 Introduction
The minimum design height (MDH) of bridge deck is always an inevitable factor in the feasibility study of a new river bridge [1, 2]. The traditional methods to calculate the MDH are usually based on the statistical means of multiple observations on hydraulic data [37]. However, the current practice of bridge maintenance requires the considerations of the hydrological fluctuations of rivers [5, 6]. Particularly, hydrologic discharge is the volume of water flowing past a given point in the stream in a given period of time which is an important indicator for river changes [5, 6, 8]. It is named after streamflow by fluvial hydrologists and plays a critical role in the study of hydrogeological cycles. The most common form of discharge is water flow derived from net precipitation and melting glaciers [8]. Bridge design and maintenance should consider gage height or stage, which is the water height in the stream above a reference point [5, 6, 8, 9]. Empirically, there is a quantitative relation between gage height and discharge in a specific crosssection of river, which can be manifested by StageDischarge Curve [10]. However, in the real world, transport systems connect geographical isolation by highway, railway, and airlines. Hence, it’s undoubted that bridges, which connect two points over two sides of rivers or valleys at specific heights, are significant components of the transportation systems. In the planning and design phases of river bridges, engineers and planners should always consider flooding argumentation in order to make sure that this newlyconstructed bridge meet the specific flooding control requirement. This ensures that it will not be submerged by a onceinacentury flood [10, 11]. Based on a design flood, bridge engineers should define the flood discharge for the particular project according to historical timeseries data. Then they the graph the stagedischarge curve correspondingly to find the conforming gage height. In addition, when deciding on the minimum vertical height from the lowest point of the bridge center and surface water, engineers and planners should take into account the navigable clearance height if this waterway is eligible for river transportation [12]. For example, on Illinois River in the United States, the minimum design height of bridge deck consists of designed flood level and navigable clearance height [3].
Researchers noticed the knowledge gap between the traditional methods of bridge design and the current needs of cost control on maintenances and repair. Engineers, designers and planners need to have improved understanding on how to estimate and predict the possible river flow changes. For example, Amirreza and Arash stated briefly that the relationship between stage and discharge was one of the methods used to measure the continuous river flow [5]. Researchers proposed ideas for the estimation of stagedischarge curve in open channels [5, 6]. For instance, the model of Ahmadi et al. introduced the rating curve taking advantage of the discharge information at a referenced water level [6]. Another method was suggested by Fread DL using numerical solutions of the dynamic, unsteady, nonuniform flow equations [10, 11]. Recent studies proposed inverse methods based on Manning’s equation to estimate the discharge from remotely sensed observations of water surface elevation, width and slope [12, 13]. Researchers discussed that useful discharge estimates could be retrieved for certain types of river reaches using satellite images. The estimates provided information on the river widths and the loglinear relationships of atastation hydraulic geometries (AHG) [14]. The logPearson type Ⅲ distribution is the upgraded transformation of Pearson type Ⅲ distribution and currently proposed to do floodfrequency analysis by U.S. federal agencies as described by B17.IACWD1982 [15]. The method of moment’s estimator in log space might be suitable to estimate the parameters of the distribution [7]. Nevertheless, researchers devoted much time and effort to update the methods to the accuracy of stagedischarge curve and estimation of distribution parameters. Furthermore, few scholars gathered those two methods in practice though both are functional tools for flooding control or flood argumentation [12, 13]. In this study, the authors consider using the combination of these two technical means to determine the minimum design river bridge height and explicitly explain the feasibilities.

Literature Review
 StageDischarge Curve
In hydrology, a rating curve is a graph of discharge versus stage for a given spot on the stream at a gouge station [9]. Generally, the rating curve is plotted as stage on Yaxis versus discharge on Xaxis [8]. At a specific gauge station, the stream discharge is measured and the stage can also be acquired by water level gauge. Theoretically, the curve scope diminishes gradually with a level off to 0 [8, 9].
2.2. LogPearson Ⅲ Frequency Curve
LogPearson Ⅲ distribution (LP3) is the updated transformation of Pearson Ⅲ distribution (P3) [13, 14, 16]. In term of P3, there are shape and location parameters α, β and ε with the probability density function in Equation (1):
$f\left(\mathrm{x}\right)=\frac{{\beta}^{\alpha}}{\u0433\left(\mathrm{\alpha}\right)}{\left(\frac{x\u2013\epsilon}{\beta}\right)}^{\alpha \u20131}{e}^{\u2013\beta (x\u2013\epsilon )}\mathit{}$
(1)
where $\mathrm{\alpha}$
> 0, (xε)/β > 0, $\u0433\left(\mathrm{\alpha}\right)$
= complete gamma function.
The density function can be confirmed if these three parameters defined. The parameters of the P3 distribution are functions of the first three population moments ( $\stackrel{\u0305}{x}$
, ${\sigma}_{x}$
and ${\gamma}_{x}$
): $\mathrm{\alpha}=\frac{4}{{\gamma}_{x}^{2}}\mathrm{};\beta =\frac{{\sigma}_{x}{\gamma}_{x}}{2};\mathit{\epsilon}=\stackrel{\u0305}{x}(1\u20132\frac{{\sigma}_{x}}{{\gamma}_{x}})$
, in which, $\mathrm{}\stackrel{\u0305}{x}$
, ${\sigma}_{x}$
and ${\gamma}_{x}$
are sample mean, the variance coefficient and skewness coefficient(Cs) respectively. Similarly, the LP3 distribution has probability density function:
$f\left(\mathrm{q}\right)=\frac{1}{\left\beta \right\mathrm{q}\u0433\left(\mathrm{\alpha}\right)}{\left(\frac{\mathit{ln\; q}\u2013\epsilon}{\beta}\right)}^{\alpha \u20131}\bullet {e}^{\u2013\left(\frac{\mathit{ln\; q}\u2013\epsilon}{\beta}\right)}\mathit{}$
(2)
Where, $\mathrm{ln\; q}=\mathrm{x}$
has a P3 distribution with parametersα, β and ε. The density function is defined for α > 0, and 0 < q < exp [ $\epsilon $
] for $\mathrm{}{\gamma}_{x}0$
, and exp [ $\epsilon $
] $\le $
q for $\mathrm{}{\mathit{\gamma}}_{x}0$
. When $\mathrm{}{\gamma}_{x}=0$
, the PL3 distribution converges to a LN distribution.
However, the differences between LP3 and P3 are not limited to the density functions. As far as variance coefficient (Cv) or ${\sigma}_{x}$
for LP3, it should be calculated as follow:
$\mathit{}{\sigma}_{x}=\sqrt{\frac{\sum _{i=1}^{n}{({K}_{i}\u20131)}^{2}}{n\u20131}}\mathit{}$
(3)
In which, ${K}_{i}$
is modulus coefficient; n is sample size. In term of skewness coefficient ${\sigma}_{x}$
(Cs), it can be estimated by Cv. For instance, $\mathrm{Cs}=\mathrm{}\left(2\mathrm{\uff5e}3\right)\bullet \mathrm{Cv}$
. In hydrological computation, when a specific frequency P given, the random response variable ${x}_{p}$
is acquired through an analysis of frequency density curve. In practice, the complicated process of calculation has been simplified in order to make it more efficient. When conducting frequency computation, based on the calculated Cv value, a hypothetical Cs should be given in order to find the $\mathrm{\varnothing}$
with a specific frequency value of P. Then, values of $\mathrm{\varnothing}$
and Cv can be substituted into formula 5 to get the according $\mathrm{}\mathrm{ln}{x}_{p}$
.
$\varnothing =\frac{\mathit{ln\; x}\u2013\mathit{ln}\stackrel{\u0305}{x}}{{C}_{v}\bullet \mathit{ln\; x}}\mathit{}$
(4)
$\mathit{}\mathrm{ln}{x}_{p}=\mathit{ln}\stackrel{\u0305}{x}\bullet \left(1+{C}_{v}\bullet \varnothing \right)\mathit{}$
(5)
where, $\mathrm{\varnothing}$
is standardized value, named coefficient of deviation mean.
 Method Design
In this research, the following steps are used for the calculation and analysis of time series of the seasonal trends of hydrological data. The steps can help to calculate the minimum design height of river bridge deck minus navigable clearance.
 Rank the observed values of discharge data from high to low first, calculate the empirical frequency of each discharge item and graph empirical curve thereafter.
 Define hydrological frequency distribution curve (LogPearson Ⅲ Frequency Curve).

Work out the values of
$\mathrm{ln}\stackrel{\u0305}{x}$
, Cv and Cs by using Central Moment Method and suppose them as the subjects for first attempt. Due to large error comes out when calculating Cs by Central Moment, it is generally estimated by 2 – 3 times of Cv value.

According to the identified
$\mathrm{ln}\stackrel{\u0305}{x}$
, Cv and Cs, ∅ can be found in the appendix chart then ln x ̅ will be figured out in the end. Plot $\mathrm{ln}\stackrel{\u0305}{x}$
as yaxis versus P as xaxis, LogPearson frequency curve is graphed.
 Change the Cs then find the decent LogPearson frequency curve which fits the empirical curve well. Then, take advantage of those parameters of this wellfitted curve as population statistics for practical applying.
 Calculate the design discharge value with a designated frequency.

Data Collection and Analysis
 Sources of data
All the data used in this study is derived from USGS. A gauge station on the stream at Kinston Mines recorded monthly data of stage and discharge of Illinois River from October 1993 to July 2018. Due to malfunction of stage measurement instrument or teletransmission failure, especially before 2002, there were some deletions of monthly data of gauge height. But, the stage data is sufficient for this study because the lost data only stands for a small proportion.
4.2. Correlation Test
Prior to graph stagedischarge curve, it’s necessary to test the correlation between gage height and discharge to ensure that there is a certain relation between these two factors. The monthly average data of gage height and discharge is used to do correlation test; the result of correlation test indicates that the correlation coefficient is 0.9719, which is shown in table 1.
Table 1. Result of Correlation Analysis between Gage Height and Discharge.
Gage Height 
Discharge 

Gage Height 
1 
– 
Discharge 
0.9719 
1 
4.3. MannKendall Mutation and Stability Test
MannKendall (MK) test (Mann 1945; Kendall 1975), a nonparametric way, in which is widely utilized to evaluate the statistically significant trends in hydrological and climatological time series. So as to make sure whether there are extreme values of discharge data and if the time series discharge data are acceptable within a specific interval, MK test is conducted. With a significant level $\mathrm{\alpha}=0.05$
and ${\mathrm{}U}_{0.05}=\pm 1.96$
, figure 1 shows that the discharge of Illinois River at Kinston Mines has a distinct uptrend according to both UFk and UBk curves. In addition, there are three UFk excess values which are assigned with the year 2011, 2016 and 2017 respectively. The difference between UFk and upper limit ${\mathrm{}U}_{0.05}=1.96$
is most significant. Hence, it’s affirmative that there is at least one monthly average extreme discharge data in 2017 and 2017 should be selected as typical year for subsequent definition of design water level.
4.4. StageDischarge Curve and Empirical Frequency Curve
Due to some lost data points of gage height, of which some assigned discharge data is not paired, so these discharge data points were not undertaken in drawing the stagedischarge curve. In the end, the data points of gage height and discharge used for drawing following curve (as shown in figure 2) were onetoone correspondence (the data will be available upon request). Based on the observed values of discharge, the data is ranked from high to low and serial numbers i (i=1, 2, …24 ) are given to mark those newlyranked data. Figure 3 shows the empirical frequency calculated by following formula:
Pi= [i/(n+1)]*100% (6)
4.5. Bestfitted Plotting Position and Design gage height
According to the methods to Plotting Position, three different attempts of plotting position are carried out, of which the results are shown in figures 4, 5 and 6 respectively. Figures 7 and 8 combine the outcomes of those three attempts in order to analyze which curve with a set of Cv and Cs is bestfitted and shrinks towards the empirical frequency curve. Obviously, the outcome of first attempt is most ideal, so the Cv and Cs are chose to be estimated parameters of $\mathrm{}\mathrm{ln}{x}_{p}=\mathrm{}\mathit{ln}\mathrm{}\stackrel{\u0305}{x}\bullet \left(1+{C}_{v}\bullet \mathrm{\varnothing}\right)$
. Furthermore, this curve will be used to do flood forecasting at Kinston Mines of Illinois River. In table 2, the $\mathit{ln}\stackrel{\u0305}{\mathit{Q}}$
is the $\mathit{ln}\mathrm{}\stackrel{\u0305}{x}$
in that formula, which is a constant, 10.49, calculated from the average of annual maximum discharge values within 24 years.
Figure 1 MK Test of Annual Average Discharge from 1994 to 2017.
Figure 2 StageDischarge Curve Based on Observed Data Points.
Figure 3 Empirical Frequency Curve
Figure 2First Attempt of Plotting Positions
Figure 3 Second Attempt of Plotting Positions
Table 2 Three Attempts for Bestfitted Plotting Positions
Frequency P (%) 
First Attempt 
Second Attempt 
Third Attempt 

$\textcolor[rgb]{}{\mathit{ln}}\stackrel{\u0305}{\textcolor[rgb]{}{\mathit{Q}}}$ =10.49 Cv=0.10 Cs=2Cv=0.20 
$\textcolor[rgb]{}{\mathit{ln}}\stackrel{\u0305}{\textcolor[rgb]{}{\mathit{Q}}}$ =10.49 Cv=0.10 Cs=3Cv=0.30 
ln $\stackrel{\u0305}{\textcolor[rgb]{}{\mathit{Q}}}$ =10.49 Cv=0.20 Cs=2Cv=0.40 

∅ 
Lnx_{p} 
∅ 
Lnx_{p} 
∅ 
Lnx_{p} 

(1) 
(2) 
(3) 
(4) 
(5) 
(6) 
(7) 
0.01 
4.16 
14.89 
4.38 
15.12 
4.61 
15.36 
0.1 
3.38 
14.07 
3.52 
14.22 
3.67 
14.37 
0.2 
3.12 
13.80 
3.24 
13.92 
3.36 
14.05 
0.5 
2.76 
13.42 
2.86 
13.52 
2.95 
13.62 
1 
2.47 
13.11 
2.54 
13.19 
2.62 
13.27 
2 
2.16 
12.79 
2.21 
12.84 
2.26 
12.89 
3 
1.96 
12.58 
2.00 
12.62 
2.04 
12.66 
5 
1.70 
12.30 
1.73 
12.33 
1.75 
12.36 
10 
1.30 
11.88 
1.31 
11.89 
1.32 
11.90 
20 
0.83 
11.39 
0.82 
11.38 
0.82 
11.38 
25 
0.65 
11.20 
0.64 
11.19 
0.64 
11.19 
30 
0.50 
11.04 
0.48 
11.02 
0.47 
11.01 
40 
0.22 
10.75 
0.20 
10.73 
0.19 
10.71 
50 
0.03 
10.48 
0.05 
10.46 
0.07 
10.44 
60 
0.28 
10.22 
0.30 
10.20 
0.31 
10.19 
70 
0.55 
9.94 
0.56 
9.93 
0.57 
9.92 
75 
0.69 
9.79 
0.70 
9.78 
0.71 
9.77 
80 
0.85 
9.62 
0.85 
9.62 
0.85 
9.62 
85 
1.03 
9.43 
1.03 
9.43 
1.03 
9.43 
90 
1.26 
9.19 
1.24 
9.21 
1.23 
9.22 
95 
1.59 
8.84 
1.55 
8.89 
1.52 
8.92 
97 
1.79 
8.63 
1.75 
8.67 
1.70 
8.73 
99 
2.18 
8.22 
2.10 
8.31 
2.03 
8.38 
99.9 
2.81 
7.56 
2.67 
7.71 
2.54 
7.84 
Figure 4Third Attempt of Plotting Positions
Based on the data points in Appendix A, the maximum monthly average discharge was 53630 ft3/s in 2009. If it is amplified by 1.5 times, there is a new data, 80445 ft3/s. When a return period, 10 years, is taken into consideration, the design frequency P is 0.1 or 10% then the assigned value of $\mathrm{ln}{x}_{p}$
11.2 can be estimated by plotting position curve of first attempt. Afterwards, ${x}_{p}$
is worked out by exp[11.2], of which the result is 73130.44 ft3/s. The estimated discharge is exactly the design discharge with a 10year return period. Finally, the design gage height, 23.0 ft, is acquired through a lookup of stagedischarge curve.
Figure 5 Combined Plotting Positions
Figure 6 Regression Charts
 Conclusions
This study discusses the minimum design height of a river bridge. The following are some findings.
 In terms of the correlation test between gage height and discharge, the correlation coefficient is significantly closed to 1. In other words, it means the stagedischarge curve is drawn based on reliable data points.
 The MK test was well applied to verify the annual discharge variation trend. Precisely, the discharge of Illinois River at Kinston Mines has an upward tendency from 1994 to 2017, which means the according gage height will also goes up to some extent. Hence, when doing estimation of design gage height at Kinston, Mines, the design discharge should be amplified slightly greater.
 Compared Pearson Frequency Curve with LogPearson Frequency Curve, the principles of each other is largely identical but with minor differences. LogPearson Frequency Curve is satisfyingly used to do plotting positions.
 Research shows that design discharge with a designated return period can be deduced by the theoretical frequency curve, and then the design gage height is worked out through stagedischarge curve. The design discharge with a 10year return period corresponds to a 23ft gage height at Kinston Mines of Illinois River. Furthermore, when considering the minimum height of a hypothetical river bridge, it can be defined by navigable clearance and design gage height.
This hypothetical study is for the minimum design height of river bridge deck and for the first time, shows the possibility to combine the function of stagedischarge curve, LogPearson Ⅲ Frequency Curve or Pearson Ⅲ frequency curve. In general, this study is an exploration to connect hydrological system and transport, which inspires more researchers to step forwards an interdisciplinary domain. There are still several limitations in this study. For instance, the study subject is merely a hypothetical object, so there is no data material collected about navigation system, such as the ship height. In addition, in the process of plotting positions, it’s inevitable to generate some personal errors due to the curves were delineated manually. In future research, the authors plan to collect navigable clearance data to complete the study.
 Appendix
 References
[1] Cheng YM, Leu SS. Integrating data mining with KJ method to classify bridge construction defects. Expert Systems with Applications. 2011;38(6):714350.
[2]. Wong KY. Design of a structural health monitoring system for longspan bridges. Structure and Infrastructure Engineering. 2007;3(2):16985.
[3]. IDOT. Standard Specifications for Road and Bridge Construction. Illinois Department of Transportation. http://www.idot.illinois.gov/Assets/uploads/files/DoingBusiness/ManualsGuides&Handbooks/Highways/Construction/StandardSpecifications/Standard%20Specifications%20for%20Road%20and%20Bridge%20Construction%202016.pdf2016. p. 1225.
[4]. AASHTO L. AASHTO LRFD bridge design specifications. Transportation (Amst) American Association of State Highway and Transportation Officials, Inc: Washington, DC. 2007.
[5]. Kavousizadeh A, Ahmadi A. Highperformance approach for estimating stagedischarge curves in the open channels. Journal of hydrology. 2018;565:197213.
[6]. Maghrebi MF, Ahmadi A. Stagedischarge prediction in natural rivers using an innovative approach. Journal of hydrology. 2017;545:17281.
[7]. Haan CT. Statistical methods in hydrology: The Iowa State University Press; 1977.
[8]. Kennedy E. Discharge ratings at gaging stations: US Geological Survey Techniques of WaterResources Investigations, book 3, chap. A10. 1984;59.
[9]. Herschy RW. Hydrometry: principles and practices. 1999.
[10]. Fread D. A dynamic model of stagedischarge relations affected by changing discharge. 1973.
[11]. Fread D. COMPUTATION OF STAGE‐DISCHARGE RELATIONSHIPS AFFECTED BY UNSTEADY FLOW 1. JAWRA Journal of the American Water Resources Association. 1975;11(2):21328.
[12]. Durand M, Neal J, Rodríguez E, Andreadis KM, Smith LC, Yoon Y. Estimating reachaveraged discharge for the River Severn from measurements of river water surface elevation and slope. Journal of Hydrology. 2014;511:92104.
[13]. Garambois PA, Monnier J. Inference of effective river properties from remotely sensed observations of water surface. Advances in Water Resources. 2015;79:10320.
[14]. Gleason CJ, Smith LC. Toward global mapping of river discharge using satellite images and atmanystations hydraulic geometry. Proceedings of the National Academy of Sciences. 2014;111(13):478891.
[15]. Pierson TC. Hyperconcentrated flow—transitional process between water flow and debris flow. Debrisflow hazards and related phenomena: Springer; 2005. p. 159202.
[16]. Turban E, Sharda R, Delen D, Efraim T. Decision support and business intelligence systems: Pearson; 2014.
Cite This Work
To export a reference to this article please select a referencing stye below:
Related Services
View allDMCA / Removal Request
If you are the original writer of this essay and no longer wish to have your work published on UKEssays.com then please: