|
|
Prediction of Fake Toll-free Logistic Vehicles Based on Historical Traffic Data |
LIU Yu-gang1,2,4, ZHENG Shuai1,2, XU Xu-dong1,2, WANG Tian-bi1,2, YE Jin-song3 |
1. School of Transportation and Logistics, Southwest Jiaotong University, Chengdu Sichuan 610031, China;
2. National Engineering Laboratory of Integrated Transportation Big Data Application Technology, Southwest Jiaotong University, Chengdu Sichuan 610031, China;
3. China Academy of Transportation Science, Beijing 100088, China;
4. Institute of Transportation Development Strategy and Planning of Sichuan Province, Chengdu Sichuan 610001, China |
|
|
Abstract At present, the "green channel" policy is fully implemented. However, the detection is relatively lagging, and truck drivers are susceptible to fake the toll-free logistics vehicles (TFLVs), causing huge losses to the operation. In order to improve the accuracy and efficiency of TFLVs inspection, this paper establishes a toll evasion prediction model for fake TFLVs based on the historical TFLVs traffic dataset derived from the highway network toll collection system. First, according to the importance and reliability, we used the data mining technology to differentiate and extract the data attributes. And the spatiotemporal characteristics and other characteristics of fake TFLVs through-traffic entering and exiting toll booths are studied and analyzed based on the preprocessed data. Then use the Borderline-SMOTE oversampling method to balance the pass data set, use the ChiMerge algorithm to discretize continuous attributes. In order to ensure the effective matching and the correlation between the large contribution attributes and the results, correlation item test and collinearity test are used for discrete related attributes. Finally, adopting the processed discrete TFLVs data which passed correlation item test and collinearity test, and using the decision tree to establish the prediction model. The classification results of the escape behavior prediction model and other models are compared by using the historical TFLVs data set. The results show that the accuracy of the escape behavior prediction model proposed in this paper is 83.4%, which is higher than that of the Logistic regression model (61.8%) and the Random forest model (81%).This research model can effectively warn of fake TFLVs, and on the basis of simplifying the inspection process of TFLVs, reducing the probability of the occurrence of fake TFLVs evasion, which has practical application significance.
|
Received: 07 February 2021
|
Fund:Supported by the National Natural Science Foundation of China (Nos.51774241, 71704145); Sichuan Youth Science and Technology Innovation Research Team Project under Grant (No.2020JDTD0027); Humanity and Social Science Foundation of Ministry of Education of China (No.18YJCZH138) |
Corresponding Authors:
LIU Yu-gang
E-mail: liuyugang@swjtu.edu.cn
|
|
|
|
[1] CHEN Li. What Should We Do to Check the "Green Pass" after Withdrawing the Station?[J]. China Highway, 2019(14):30-31. (in Chinese)
[2] NIU Jian-qiang, MENG Hong-fei, YANG Rui-rui. Study on Density Detect Scheme of Expressway Greenchannels[J]. Transportation Standardization, 2011(S2):82-84. (in Chinese)
[3] XIONG Wen-lei. Research of Image Recognition of Radioactive Sources for Carrying Cargo by Toll-free Logistics Vehicle Based on Convolutional Neural Networks[D]. Xi'an:Chang'an University, 2019. (in Chinese)
[4] SUN Xiao-ning. Research and Application of Multi-view Portable Green Pass Detection System for Beijing-Zhangjiakou Expressway[J]. TranspoWorld, 2019(4):47-49. (in Chinese)
[5] CHEN Hao-tai. Study on Vehicle Inspection of Expressway Green Channel Based on Logistic Regression[D]. Xi'an:Chang'an University, 2017. (in Chinese)
[6] REN Wen-long. Study of Toll Audit in Networked Expressway Based on Neural Network[D]. Xi'an:Chang'an University, 2010. (in Chinese)
[7] SHEN Chang-chun. Research on Inspection Model of Freeway Green Channel Vehicle Based on BP Neural Network Ensemble with Bagging[D]. Xi'an:Chang'an University, 2018. (in Chinese)
[8] LEI Yi, CHEN Wen-rui, DENG Tao. Discussion on Scheme of Green Channel Vehicle Cargo Detection Points in Jiangxi Province[J]. China ITS Journal, 2015(12):132-134,137. (in Chinese)
[9] ZHANG Meng. Study on Layout Planning of Expressway Transport of Fresh Farm Products by Green Channel in Henan Province[D]. Zhengzhou:Henan Agricultural University,2016. (in Chinese)
[10] BATISTA G E, PRATI R C, MONARD M C. A Study of the Behavior of Several Methods for Balancing Machine Learning Training Data[J]. ACM SIGKDD Explorations Newsletter, 2004, 6(1):20-29.
[11] TAO Xin-min, HAO Si-yuan, ZHANG Dong-xue, et al. Overview of Classification Algorithms for Unbalanced Data[J]. Journal of Chongqing University of Posts and Telecommunications:Natural Science Edition,2013,25(1):105-114,125. (in Chinese)
[12] CHAWLA N V, BOWYER K W, HALL L O, et al. SMOTE:Synthetic Minority Over-sampling Technique[J]. Journal of Artificial Intelligence Research, 2002, 16(1):321-357.
[13] KERBER R. ChiMerge:Discretization of Numeric Attributes[C]//Proceedings of the 10th National Conference on Artificial Intelligence. San Jose:[s.n.],1992.
[14] HANLEY J A, MCNEIL B J. The Meaning and Use of the Area under a Receiver Operating Characteristic (ROC) Curve[J]. Radiology, 1982, 143(1):29-36.
[15] CHEN Wei-zhong, NI Zong-zan, PAN Xiao-ping, et al. Receiver Operating Characteristic Curves to Determine the Optimal Operating Point and Doubtable Value Interval[J]. Modern Preventive Medicine, 2005, 32(7):729-731. (in Chinese)
[16] GAO Hui-xuan. Some Method on Treating the Collinearity of Independent Variables in Multiple Linear Regression[J]. Journal of Applied Statistics and Management, 2016, 19(5):49-55. (in Chinese)
[17] GUI Qing-ming, YAO Shao-wen, GU Yong-wei, et al. A New Method to Diagnose Multicollinearity Based on Condition Index and Variance Decomposition Proportion (CIVDP[J]. Acta Geodaetica et Cartographica Sinica, 2006, 35(3):210-214. (in Chinese)
[18] WU Xi-zhi. Applied Regression and Classification:Based on R[M]. Beijing:China Renmin University Press, 2016. (in Chinese)
[19] WANG Ji-chuan, GUO Zhi-gang. Logistic Regression Models:Methods and Application[M]. Beijing:Higher Education Press, 2001. (in Chinese) |
|
|
|