Dự báo lún theo thời gian cho xây dựng đường ở Bắc Ninh và Hải Dương bằng mạng nơ ron nhân tạo

Tóm tắt Dự báo lún theo thời gian cho xây dựng đường ở Bắc Ninh và Hải Dương bằng mạng nơ ron nhân tạo: ...(yj). The training of the multi-layer feed-forward neural network starts at the input layer, after that, a learning rule is used to obtain the network output (Figure 1). The weights and the bias are adjusted in order to get the smallest possible error between the desired output and the o... + e -x )). The 5000 training cycles (epochs) is used to terminate the training process, which is the same as the training cycles previously published by Shahin [7]. This number basically satisfies the requirement that there is no increase in ĐỊA KỸ THUẬT - TRẮC ĐỊA Tạp chí KHCN Xây ... TRẮC ĐỊA Tạp chí KHCN Xây dựng - số 3/2021 67 Table 5. Performance of the models for the second case Name of Method Equation (or Model architecture) R squared (traning data) MSE Asaoka 1*9987.09164.2  titi SS 0.9980 2.9 Asaoka + Polynomial 1 3825 *9598.0 *10*2.2*10*3 *02...

pdf9 trang | Chia sẻ: Tài Phú | Ngày: 19/02/2024 | Lượt xem: 37 | Lượt tải: 0download
Nội dung tài liệu Dự báo lún theo thời gian cho xây dựng đường ở Bắc Ninh và Hải Dương bằng mạng nơ ron nhân tạo, để tải tài liệu về máy bạn click vào nút DOWNLOAD ở trên
. So sánh kết quả dự đoán 
của các mô hình với kết quả thực tế có thể thấy: 
nên sử dụng mạng nơ-ron nhân tạo (ANN) để dự 
báo lún theo thời gian trong quá trình thi công 
đường. 
Từ khóa: Dự đoán, mạng nơ ron nhân tạo 
(ANN), lún mặt đất, xây dựng đường. 
1. Introduction 
Artificial neural network is a branch of the 
‘artificial intelligence’, which also includes case-
based reasoning, expert systems, and genetic 
algorithms [19]. The efficient manipulation of large 
amounts of data and the ability to generalize results 
are the main advantages of neural networks [23]. 
Thus, the applications of the artificial neural network 
have been found in innumerable fields [2, 4, 10-15, 
18, 21, 22]. 
In the last few decades, artificial neural networks 
(ANNs) have been successfully applied to virtually 
every problem in the predictions of the settlement 
problems. Alkroosh and Nikraz [10] simulated the 
load-settlement behavior of pile foundations 
embedded in sand and mixed soils (subjected to 
axial loads) by artificial neural networks (ANNs). The 
results indicated that the ANN model performed very 
well after comparing predictions from the ANN with 
predictions of number of currently adopted load-
transfer methods. Ismail and Jeng [13] developed a 
high-order neural network (HON) for simulating the 
pile load–settlement curve using properties of the 
pile and SPT data along the depth of pile 
embedment as inputs. The HON model presented 
better predictions than the predictions of elastic and 
hyperbolic models. Shahin, Jaksa [8] presented a 
new hand-calculation design formula for settlement 
prediction of shallow foundations on granular soils 
based on a more accurate settlement prediction 
from an artificial neural network model. The model 
had five inputs (the footing width, net applied footing 
load, average blow count, obtained using a standard 
penetration test (SPT) over the depth of influence of 
the foundation as a measure of soil compressibility, 
footing geometry, and footing embedment ratio), 
and one output (foundation settlement). Shahin, 
ĐỊA KỸ THUẬT - TRẮC ĐỊA 
62 Tạp chí KHCN Xây dựng - số 3/2021 
Jaksa [20] developed a set of charts incorporating 
the uncertainty associated with the ANN method, 
which enable the designer to make informed 
decisions about the level of risk correlated with 
predicted settlements. 
It can be seen that there is not much research 
on predicting the settlement over time. In this paper, 
an ANN model is developed and compared with two 
traditional methods (Asaoka method and the method 
of Asaoka combined with Polynomial) in predicting 
the settlement of two cases of road construction. 
The parameters of two traditional methods are 
referenced from Tran, Nguyen [9]. The criterions to 
evaluate the accuracy of the models are the R 
squared (R
2
) and the mean square error (MSE). The 
relative conclusions would be drawn by comparing 
the criterions of these models, and the predictions 
data of these models with the monitoring data. 
2. Overview of artificial neural networks 
Artificial neural networks (ANNs) are numerical 
modeling techniques inspired by the functioning of 
the human brain and nervous system [7]. 
The purpose of ANNs is similar to conventional 
statistical models, which is to determine the 
relationship between the model inputs and 
corresponding outputs. However, ANNs only use the 
data and do not require predefined mathematical 
equations of the relationship between the inputs 
data and outputs data. This allows ANN to get past 
the limitations of the conventional models. 
A Multi-layer feed-forward with the back-
propagation algorithm training is used in this study 
[6]. The multi-layer feed-forward neural network is 
composed of several processing elements (called 
nodes or neurons). The processing elements are 
fully or partially connected via connection weights, 
and they are often classified into layers: an input 
layer; an output layer; and hidden layers (layers in 
between). 
Many authors have already described the 
structure and operation of ANNs. Figure 1 shows 
the structure and operation of an ANN depicted by 
Shahin [7]. At each processing element, the input 
from the processing element of the previous layer 
(xi) is multiplied by an adjustable connection weight 
(wji), and weighted inputs are summed and a bias 
(θj) is added or subtracted. The combined input (Ij) is 
then passed through a non-linear transfer function 
(f(.)) (e.g. sigmoidal function or tanh function) to 
produce the output of the processing element (yj). 
The training of the multi-layer feed-forward 
neural network starts at the input layer, after that, a 
learning rule is used to obtain the network output 
(Figure 1). The weights and the bias are adjusted in 
order to get the smallest possible error between the 
desired output and the output which is obtained from 
the preview step. As soon as the training phase is 
accomplished, the trained model would be validated 
by an independent testing set. Several of the steps 
used to develop an ANN are discussed by Maier 
and Dandy [5]. 
Figure 1. Structure and operation of an ANN [7] 
3. Development of the ANN model 
The ANN model is created with the aid of the 
software package PYTHON Version 3.6. The 
constructed sites of two roads were located in Bac 
Ninh and Hai Duong province with different soil 
types and geotechnical conditions. The data used to 
ĐỊA KỸ THUẬT - TRẮC ĐỊA 
Tạp chí KHCN Xây dựng - số 3/2021 63 
calibrate and validate the ANN model are shown in Table 1 and Table 2. 
. 
Table 1. Road surface settlement over time in the first case [9] 
No 
Time 
(day) 
Field data 
(mm) 
No 
Time 
(day) 
Field data 
(mm) 
No 
Time 
(day) 
Field data 
(mm) 
1 1 -2 14 63 -60 27 115 -290 
2 6 -6 15 68 -65 28 118 -295 
3 10 -9 16 71 -70 29 125 -310 
4 14 -13 17 75 -94 30 131 -325 
5 20 -19 18 78 -112 31 138 -340 
6 24 -29 19 82 -125 32 145 -360 
7 27 -32 20 86 -141 33 152 -381 
8 31 -35 21 92 -155 34 159 -395 
9 35 -37 22 96 -172 35 166 -411 
10 39 -42 23 100 -189 36 173 -425 
11 46 -49 24 104 -214 37 179 -439 
12 52 -54 25 108 -242 
13 57 -56 26 110 -262 
Table 2. Road surface settlement over time in the second case [9] 
No 
Time 
(day) 
Field data 
(mm) 
No 
Time 
(day) 
Field data 
(mm) 
No 
Time 
(day) 
Field data 
(mm) 
1 4 -1 17 280 -52 33 515 -92 
2 23 -3 18 298 -53 34 522 -95 
3 43 -4 19 319 -54 35 531 -99 
4 61 -6 20 333 -55 36 547 -103 
5 90 -7 21 350 -57 37 566 -108 
6 111 -8 22 363 -58 38 578 -112 
7 140 -10 23 375 -60 39 587 -115 
8 158 -13 24 384 -61 40 599 -117 
9 165 -19 25 398 -65 41 613 -118 
10 172 -25 26 403 -68 42 636 -118 
11 181 -28 27 417 -74 43 662 -121 
12 197 -35 28 438 -78 44 683 -123 
13 209 -38 29 459 -81 45 704 -125 
14 226 -45 30 474 -82 46 726 -126 
15 237 -47 31 494 -86 47 782 -127 
16 254 -50 32 502 -89 
3.1 Data division and preprocessing 
Monitoring data includes time and settlement, 
thus, the time was chosen as an input variable and 
settlement was chosen as an output variable. 
The data have been divided into two subsets, 
training set for model calibration and validation set 
for model verification. All the methods in this paper 
are developed for the purpose of predicting the 
settlement at the next step. Thus, the data from the 
beginning to 159 days and the data from the 
beginning to 683 days are allocated for the training 
data in the first and the second case, respectively. 
The last three monitoring data are used for model 
verification. 
In order to minimize the dimension and to make 
sure all variables get equal attention during training, 
the preprocessing is conducted by scaling the input 
and output variables between 0.0 and 1.0. The 
scaled value of each variable x, xn, is calculated as 
follows: 
xn = x / xmax [1] 
Where: xmax is maximum values of each variable x. 
ĐỊA KỸ THUẬT - TRẮC ĐỊA 
64 Tạp chí KHCN Xây dựng - số 3/2021 
3.2 Model architecture, weight optimization and stopping criterion 
Figure 2. Variation of loss against epoch 
Figure 3. Effect of number of hidden layer nodes on performance of ANN model 
The model geometry (i.e. the number of hidden 
layers, the number of hidden nodes in each layer) 
and weight optimization (i.e. learning rate and 
momentum term) play a major role in the 
development of the ANN models. 
 Hornik, Stinchcombe [3] noted that a network 
with one hidden layer can approximate any 
continuous function provided that sufficient 
connection weights are used. Thus, one hidden 
layer is used in this ANN model. 
ReLU and tanh are selected as transfer 
functions in the hidden and output layers (ReLU and 
tanh are activation functions, which outputs a small 
value for small inputs, and a larger value if its inputs 
exceed a threshold; ReLu: f(x) = max(0,x); Tanh: 
f(x) = (e
x 
- e
-x
)/(e
x 
+ e
-x
)). The 5000 training cycles 
(epochs) is used to terminate the training process, 
which is the same as the training cycles previously 
published by Shahin [7]. This number basically 
satisfies the requirement that there is no increase in 
ĐỊA KỸ THUẬT - TRẮC ĐỊA 
Tạp chí KHCN Xây dựng - số 3/2021 65 
the error. The training loss at the end of the training 
process does not fluctuate and does not increase 
(see Figure 2). 
 Caudill [1] noted that 2I+1 hidden layer nodes 
are the upper limit needed to map any continuous 
function for a network with I number of inputs. 
However, based on the effect of the number of 
hidden nodes on the performance of ANN model 
(Figure 3), the ANN model with 5000 hidden nodes 
has the lowest prediction error (the highest value of 
the R squared and the lowest value of the mean 
square error). The number of hidden nodes using for 
ANN model in this paper is much more than the 
number of hidden nodes recommended and used by 
Caudill [1] before. 
The learning rate is a tuning parameter in an 
optimization algorithm that determines the step size 
at each iteration while moving toward a minimum of 
a loss function [17]. The effect of learning rate on 
the performance of ANN models is shown in Figure 
4. It can be seen that the ANN model with the 
learning rate of 0.01 has the lowest prediction error. 
The prediction errors tend to increase at the larger 
learning rate. The gradient descent optimization 
algorithm is Adam optimizer. It already incorporates 
something like momentum, thus, the momentum 
term is not examined. 
Figure 4. Effect of learning rate on performance of ANN model 
4. Comparison of ANN model with difference 
methods 
The performance of the ANN models and two 
traditional methods for the first case is summarized 
in Table 3. It may be seen that the ANN 1 model 
performs well; as it has the highest value of R 
squared (0.9996), and lowest value of mean 
squared errors (7.6) for the training data set. 
However, the next three predicted data (validation 
set) of the ANN 1 model does not even draw a line 
of best fit between predicted data and field data 
(Figure 4 and Figure 5). It also validates that, like all 
empirical models, ANNs perform best in 
interpolation rather than extrapolation [16]. 
ĐỊA KỸ THUẬT - TRẮC ĐỊA 
66 Tạp chí KHCN Xây dựng - số 3/2021 
Table 3. Performance of the models for first case 
Name of Method Equation (or Model architecture) 
R squared 
(traning data) 
MSE 
Asaoka 1*0423.10372.6  titi SS 
0.9976 37.4 
Asaoka + 
Polynomial 
1
32
*9096.0
*474.4*0117.0
*4776.0838.9



ti
ii
iti
S
tt
tS
0.9985 24.0 
ANN 1 5000 Hidden nodes; lr 0.01; epochs 5000 0.9996 7.6 
ANN 2 5000 Hidden nodes; lr 0.008; epochs 5000 0.9992 51.7 
Table 4. Field data and Prediction data for first case 
Time 
(day) 
Measurement (mm) 
Prediction (Diviation) (mm) 
Asaoka Asaoka + Polynomial ANN 1 
166 -411 -418 (-7) -407 (4) -406 (5) 
173 -425 -434 (-9) -419 (6) -414 (11) 
179 -439 -449 (-10) -429 (10) -418 (21) 
Figure 5. Settlement for first case 
The method and the model performs well in 
predicting the next three data (validation set) is the 
Asaoka combined with Polynomial (AP) and ANN 2 
(with 5000 Hidden nodes, learning rate 0.008 and 
epochs 5000). Though their criterions to evaluate 
the accuracy of the models in the training data set 
are not the best (the values of R squared are not the 
highest and the values of mean squared errors are 
not the lowest). This may due to the training data is 
not enough in this case. 
Table 5 shows the performance of the ANN 1 
model and two traditional methods for the second 
case. The ANN 1 model also performs well for the 
training data, as it has the highest value of R 
squared (0.9993), and lowest value of mean 
squared errors (1.2). However, in this case, the next 
three predicted data of ANN 1 model have a very 
small deviation from the field data (Table 6). It may 
be because the road has moved to the stage of the 
commissioning work. All the data given by the ANN 
1 model creates the best fit line between predicted 
data and field data (Figure 6). Thus, when the 
training data is enough, the ANN method provides a 
more accurate prediction than traditional methods. 
ĐỊA KỸ THUẬT - TRẮC ĐỊA 
Tạp chí KHCN Xây dựng - số 3/2021 67 
Table 5. Performance of the models for the second case 
Name of Method Equation (or Model architecture) 
R squared 
(traning data) 
MSE 
Asaoka 1*9987.09164.2  titi SS 0.9980 2.9 
Asaoka + 
Polynomial 
1
3825
*9598.0
*10*2.2*10*3
*0207.0159.1





ti
ii
iti
S
tt
tS
 0.9981 2.8 
ANN 1 5000 Hidden nodes; lr 0.01; epochs 5000 0.9993 1.2 
Table 6. Field data and Prediction data for the second case 
Time 
(day) 
Measurement (mm) 
Prediction (Diviation) (mm) 
Asaoka Asaoka + Polynomial ANN 1 
704 -125 -126 (-1) -127 (-2) -125 (0) 
726 -126 -128 (-2) -129 (-3) -126 (0) 
782 -127 -129 (-2) -130 (-3) -127 (0) 
Figure 6. Settlement for the second case 
5. Conclusion 
After comparing the accuracy of ANN method 
with the accuracy of traditional methods (Asaoka 
method and the method of Asaoka combined with 
Polynomial) for predicting the settlement of two case 
histories, the following conclusions can be drawn. 
- The ANN model with 5000 hidden nodes has 
the lowest prediction error in predicting settlement 
over time (time is input variable and settlement is 
output variable). The number of hidden nodes is 
much more than the hidden nodes discussed by 
Caudill [1]; 
- The criterions to evaluate the accuracy of the 
ANN model are much better than other models, 
however, the accuracy of the next three predicted 
data of the ANN models (validation set) is not good 
if it does not have enough training data (i.e. 34 data 
in the first case history); 
- If the training data is enough (i.e. 44 data in the 
second case history) the accuracy the ANN model 
for the next three predicted data (validation set) is 
extremely high, and ANN model should be used for 
predicting settlement over time without knowing 
parameters related to the surcharge and soil. 
ĐỊA KỸ THUẬT - TRẮC ĐỊA 
68 Tạp chí KHCN Xây dựng - số 3/2021 
REFERENCES 
1. Caudill Maureen (1988). "Neural networks primer, 
Part III". AI Expert, 3 (6), pp 53-59. 
2. Goh ATC and Kulhawy FH (2003). "Neural network 
approach to model the limit state surface for reliability 
analysis". Can Geotech Jounal, 40 (6), pp 1235-1244. 
3. Hornik Kurt, Stinchcombe Maxwell, and White Halbert 
(1989). "Multilayer feedforward networks are universal 
approximators". Neural networks, 2 (5), pp 359-366. 
4. Kim Y and Kim B (2008). "Prediction of relative crest 
settlement of concrete-faced rockfill dams analyzed 
using an artificial neural network model". Comput 
Geotech, 35 (3), pp 313-322. 
5. Maier HR and Dandy GC (2000), Application of 
artificial neural networks to forecasting of surface 
water quality variables: issues, applications and 
challenges, in Artificial neural networks in hydrology. 
Springer. p. 287-309. 
6. Rumelhart David E, Hinton Geoffrey E, and Williams 
Ronald J (1985), Learning internal representations by 
error propagation. California Univ San Diego La Jolla 
Inst for Cognitive Science. 
7. Shahin Mohamed A (2010). "Intelligent computing for 
modeling axial capacity of pile foundations". Canadian 
Geotechnical Journal, 47 (2), pp 230-243. 
8. Shahin Mohamed A, Jaksa Mark B, and Maier Holger 
R (2002). "Artificial neural network based settlement 
prediction formula for shallow foundations on granular 
soils". Australian Geomechanics: Journal and News 
of the Australian Geomechanics Society, 37 (4), pp 
45. 
9. Tran Quang Hoc, Nguyen Thanh Le, and Tong Thi 
Hanh (2017). "Study and establish subsidence 
forecast models in accordance with analysis and 
forecast soft ground subsidence from monitoring 
results". Journal of Mining and Earth Sciences, 58 / 
2017 pp 93-100. 
10. Alkroosh I. and Nikraz H. (2011). "Simulating pile 
load-settlement behavior from CPT data using 
intelligent computing". Central European Journal of 
Engineering, 1 (3), pp 295-305. 
11. Baziar Mohammad Hassan, Azizkandi Alireza Saeedi, 
and Kashkooli Armin (2015). "Prediction of pile 
settlement based on cone penetration test results: an 
ANN approach". KSCE Journal of Civil Engineering, 
19 (1), pp 98-106. 
12. Hajihassani M, et al. (2019). "3D prediction of 
tunneling-induced ground movements based on a 
hybrid ANN and empirical methods". Engineering with 
Computers, pp 1-19. 
13. Ismail Abdussamad and Jeng Dong-Sheng (2011). 
"Modelling load–settlement behaviour of piles using 
high-order neural network (HON-PILE model)". 
Engineering Applications of Artificial Intelligence, 24 
(5), pp 813-821. 
14. Kung GT, et al. (2007). "A neural network approach to 
estimating deflection of diaphram walls caused by 
excavation in clays". Comput Geotech, 34 (5), pp 
385-396. 
15. Kuo YL, et al. (2009). "ANN-based model for 
predicting the bearing capacity of strip footing on 
multi-layered cohesive soil". Computers and 
Geotechnics, 36 (3), pp 503-516. 
16. Masters Timothy (1993), Practical neural network 
recipes in C++. Morgan Kaufmann. 
17. Murphy Kevin P (2012), Machine learning: a 
probabilistic perspective. MIT press. 
18. Nejad F Pooya, et al. (2009). "Prediction of pile 
settlement using artificial neural networks based on 
standard penetration test data". Computers and 
geotechnics, 36 (7), pp 1125-1133. 
19. Rahul, et al. (2015). "Evaluation of dump slope 
stability of a coal mine using artificial neural network". 
Geomechanics and Geophysics for Geo-Energy and 
Geo-Resources, 1 (3), pp 69-77. 
20. Shahin MA, Jaksa MB, and Maier HR (2005). "Neural 
network based stochastic design charts for settlement 
prediction". Canadian Geotechnical Journal, 42 (1), 
pp 110-120. 
21. Shahin Mohamed A, Jaksa Mark B, and Maier Holger 
R (2009). "Recent advances and future challenges for 
artificial neural systems in geotechnical engineering 
applications". Advances in Artificial Neural Systems, 
pp 5. 
22. Suwansawat Suchatvee and Einstein Herbert H 
(2006). "Artificial neural networks for predicting the 
maximum surface settlement caused by EPB shield 
tunneling". Tunnelling and underground space 
technology, 21 (2), pp 133-150. 
23. Verma A. K. and Sirvaiya Abhinav (2016). 
"Comparative analysis of intelligent models for 
prediction of Langmuir constants for CO2 adsorption 
of Gondwana coals in India". Geomechanics and 
Geophysics for Geo-Energy and Geo-Resources, 2 
(2), pp 97-109. 
Ngày nhận bài: 18/8/2021. 
Ngày nhận bài sửa: 07/9/2021. 
Ngày chấp nhận đăng: 08/9/2021. 
ĐỊA KỸ THUẬT - TRẮC ĐỊA 
Tạp chí KHCN Xây dựng - số 3/2021 69 

File đính kèm:

  • pdfdu_bao_lun_theo_thoi_gian_cho_xay_dung_duong_o_bac_ninh_va_h.pdf