A Fundamental Overview of SOTA-Ensemble Learning Methods for Deep Learning: A Systematic Literature Review

Marco Klaiber* (Aalen University of Applied Sciences, 73430 Aalen, Germany)
* Corresponding author


The rapid growth in popularity of Deep Learning (DL) continues to open up new use cases and opportunities, with methods evolving rapidly and new fields emerging from the convergence of different algorithms. For this systematic literature review, we considered the most relevant peer-reviewed journal and conference papers on state-of-the-art Ensemble Learning (EL) methods for application in DL, whose combination is also expected to give rise to new methods. The EL methods relevant to this work are described in detail, and the popular combination strategies as well as the individual tuning and averaging procedures are presented. A comprehensive overview of the various limitations of EL is then provided, culminating in the formulation of research gaps for future scholarly work, which is the goal of this review. This work fills a research gap for upcoming work in EL by presenting in detail, and making accessible, the fundamental properties of the chosen methods. This should further deepen the understanding of this complex topic and, following the maxim of ensemble learning, enable better results through an ensemble of knowledge.


Ensemble Learning; Bagging; Boosting; Deep Learning; Machine Learning; Predictive Performance; CNN
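Among the keywords above, bagging (bootstrap aggregating) is the combination strategy the review covers in most detail. As a minimal, hypothetical sketch (not taken from the paper; the toy 1-D dataset and decision-stump learner are illustrative assumptions), bagging trains each weak learner on a bootstrap resample and combines them by majority vote:

```python
import random

random.seed(42)

# Toy 1-D dataset: the true label is 1 when x > 0.5; 15% of labels are flipped.
data = []
for _ in range(300):
    x = random.random()
    y = 1 if x > 0.5 else 0
    if random.random() < 0.15:
        y = 1 - y
    data.append((x, y))

train, test = data[:200], data[200:]

def fit_stump(sample):
    """Pick the threshold (from a coarse grid) that best splits the sample."""
    best_t, best_acc = 0.5, -1.0
    for t in [i / 20 for i in range(1, 20)]:
        acc = sum((x > t) == (y == 1) for x, y in sample) / len(sample)
        if acc > best_acc:
            best_t, best_acc = t, acc
    return best_t

def predict(stumps, x):
    """Majority vote over the individual stump predictions."""
    votes = sum(1 for t in stumps if x > t)
    return 1 if votes * 2 >= len(stumps) else 0

# Bagging: train each stump on a bootstrap resample of the training set.
ensemble = [fit_stump(random.choices(train, k=len(train))) for _ in range(25)]

single_acc = sum(predict(ensemble[:1], x) == y for x, y in test) / len(test)
bagged_acc = sum(predict(ensemble, x) == y for x, y in test) / len(test)
print(f"single stump: {single_acc:.2f}  bagged ensemble: {bagged_acc:.2f}")
```

Because each resample draws with replacement, the stumps see slightly different data and their voted prediction tends to be more stable than any single stump, which is the variance-reduction effect bagging is used for.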




DOI: 10.31763/sitech.v2i2.549





Y. Chen, Y. Wang, Y. Gu, X. He, P. Ghamisi, and X. Jia, “Deep learning ensemble for hyperspectral image classification,” IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, vol. 12, no. 6, pp. 1882–1897, 2019.

H. Greenspan, B. van Ginneken, and R. M. Summers, “Guest editorial deep learning in medical imaging: Overview and future promise of an exciting new technique,” IEEE Transactions on Medical Imaging, vol. 35, no. 5, pp. 1153–1159, 2016.

W. Liu, M. Zhang, Z. Luo, and Y. Cai, “An ensemble deep learning method for vehicle type classification on visual traffic surveillance sensors,” IEEE Access, vol. 5, pp. 24417–24425, 2017.

I. Kononenko, “Machine learning for medical diagnosis: history, state of the art and perspective,” Artificial Intelligence in Medicine, vol. 23, no. 1, pp. 89–109, 2001.

J. Latif, C. Xiao, A. Imran, and S. Tu, “Medical imaging using machine learning and deep learning algorithms: A review,” in 2019 2nd International Conference on Computing, Mathematics and Engineering Technologies (iCoMET), pp. 1-5, 2019.

I. E. Naqa and M. J. Murphy, “What is machine learning?” in Machine Learning in Radiation Oncology. Springer International Publishing, pp. 3–11, 2015.

C.-B. Ha and H.-K. Song, “Signal detection scheme based on adaptive ensemble deep learning model,” IEEE Access, vol. 6, pp. 21342–21349, 2018.

Y. Ren, L. Zhang, and P. Suganthan, “Ensemble classification and regression-recent developments, applications and future directions,” IEEE Computational Intelligence Magazine, vol. 11, no. 1, pp. 41–53, 2016.

S.-J. Lee, T. Chen, L. Yu, and C.-H. Lai, “Image classification based on the boost convolutional neural network,” IEEE Access, vol. 6, pp. 12755–12768, 2018.

Z. Zhou, “Ensemble learning,” Encyclopedia of Biometrics, pp. 270–273, 2009.

L. Wen, L. Gao, and X. Li, “A new snapshot ensemble convolutional neural network for fault diagnosis,” IEEE Access, vol. 7, pp. 32037–32047, 2019.

Y. LeCun, Y. Bengio, and G. Hinton, “Deep learning,” Nature, vol. 521, no. 7553, pp. 436–444, 2015.

X. Dong, Z. Yu, W. Cao, Y. Shi, and Q. Ma, “A survey on ensemble learning,” Frontiers of Computer Science, vol. 14, no. 2, pp. 241–258, 2019.

Y. Wang, Y. Yang, Y.-X. Liu, and A. A. Bharath, “A recursive ensemble learning approach with noisy labels or unlabeled data,” IEEE Access, vol. 7, pp. 36459–36470, 2019.

F. Huang, J. Lu, J. Tao, L. Li, X. Tan, and P. Liu, “Research on optimization methods of ELM classification algorithm for hyperspectral remote sensing images,” IEEE Access, vol. 7, pp. 108070–108089, 2019.

O. Sagi and L. Rokach, “Ensemble learning: A survey,” Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, vol. 8, no. 4, pp. 1-18, 2018.

T. G. Dietterich, “Machine-learning research,” AI Magazine, vol. 18, no. 4, pp. 1-40, 1997.

S. Wan and H. Yang, “Comparison among methods of ensemble learning,” in 2013 International Symposium on Biometrics and Security Technologies, IEEE, pp. 286-290, 2013.

F. Huang, G. Xie, and R. Xiao, “Research on ensemble learning,” in 2009 International Conference on Artificial Intelligence and Computational Intelligence, IEEE, pp. 249-252, 2009.

L. Breiman, “Bagging predictors,” Machine Learning, vol. 24, no. 2, pp. 123–140, 1996.

Y.-H. Na, H. Jo, and J.-B. Song, “Learning to grasp objects based on ensemble learning combining simulation data and real data,” in 2017 17th International Conference on Control, Automation and Systems (ICCAS), pp. 1030-1034, 2017.

M. A. Dede, E. Aptoula, and Y. Genc, “Deep network ensembles for aerial scene classification,” IEEE Geoscience and Remote Sensing Letters, vol. 16, no. 5, pp. 732–735, 2019.

G. Webb and Z. Zheng, “Multistrategy ensemble learning: reducing error by combining ensemble learning techniques,” IEEE Transactions on Knowledge and Data Engineering, vol. 16, no. 8, pp. 980–991, 2004.

H. Guan and X. Xue, “Robust online visual tracking via a temporal ensemble framework,” in 2016 IEEE International Conference on Multimedia and Expo (ICME), IEEE, pp. 1-6, 2016.

N. Yu, L. Qian, Y. Huang, and Y. Wu, “Ensemble learning for facial age estimation within non-ideal facial imagery,” IEEE Access, vol. 7, pp. 97938–97948, 2019.

J. Zilly, J. M. Buhmann, and D. Mahapatra, “Glaucoma detection using entropy sampling and ensemble learning for automatic optic cup and disc segmentation,” Computerized Medical Imaging and Graphics, vol. 55, pp. 28–41, 2017.

S. A. Gyamerah, P. Ngare, and D. Ikpe, “On stock market movement prediction via stacking ensemble learning method,” in 2019 IEEE Conference on Computational Intelligence for Financial Engineering & Economics (CIFEr), pp. 1-8, 2019.

S. Džeroski and B. Ženko, “Is combining classifiers with stacking better than selecting the best one?” Machine Learning, vol. 54, no. 3, pp. 255–273, 2004.

L. Breiman, “Random forests,” Machine Learning, vol. 45, no. 1, pp. 5–32, 2001.

R. Buettner, S. Sauer, C. Maier, and A. Eckhardt, “Towards ex ante prediction of user performance: A novel NeuroIS methodology based on real-time measurement of mental effort,” in 2015 48th Hawaii International Conference on System Sciences, pp. 532-544, 2015.

H. Liang, L. Song, and X. Li, “The rotate stress of steam turbine prediction method based on stacking ensemble learning,” in 2019 IEEE 19th International Symposium on High Assurance Systems Engineering (HASE), pp. 146-149, 2019.

A. P. Piotrowski and J. J. Napiorkowski, “A comparison of methods to avoid overfitting in neural networks training in the case of catchment runoff modelling,” Journal of Hydrology, vol. 476, pp. 97–111, 2013.

T. G. Dietterich, “Ensemble methods in machine learning,” in Multiple Classifier Systems. Springer Berlin Heidelberg, pp. 1–15, 2000.

Y. Freund and R. E. Schapire, “A decision-theoretic generalization of on-line learning and an application to boosting,” Journal of Computer and System Sciences, vol. 55, no. 1, pp. 119–139, 1997.

B. Zhang, Y. Yang, C. Chen, L. Yang, J. Han, and L. Shao, “Action recognition using 3d histograms of texture and a multi-class boosting classifier,” IEEE Transactions on Image Processing, vol. 26, no. 10, pp. 4648–4660, 2017.

Z.-H. Zhou, J. Wu, and W. Tang, “Ensembling neural networks: Many could be better than all,” Artificial Intelligence, vol. 137, no. 1-2, pp. 239–263, 2002.

Y. Zhao, J. Li, and L. Yu, “A deep learning ensemble approach for crude oil price forecasting,” Energy Economics, vol. 66, pp. 9–16, 2017.

G. Huang, Y. Li, G. Pleiss, Z. Liu, J. E. Hopcroft, and K. Q. Weinberger, “Snapshot ensembles: Train 1, get m for free,” arXiv preprint arXiv:1704.00109, pp. 1–14, 2017.

Y. Freund, “Boosting a weak learning algorithm by majority,” Information and Computation, vol. 121, no. 2, pp. 256–285, 1995.

E. Bauer and R. Kohavi, “An empirical comparison of voting classification algorithms: Bagging, boosting, and variants,” Machine Learning, vol. 36, no. 1, pp. 105–139, 1999.

J. R. Quinlan, “Bagging, boosting, and c4.5,” in Proceedings of the Thirteenth National Conference on Artificial Intelligence. AAAI Press, pp. 725–730, 1996.

R. J. Tibshirani and B. Efron, “An introduction to the bootstrap,” Monographs on statistics and applied probability, vol. 57, pp. 1–436, 1993.

A. Kabir, C. Ruiz, and S. A. Alvarez, “Mixed bagging: A novel ensemble learning framework for supervised classification based on instance hardness,” in 2018 IEEE International Conference on Data Mining (ICDM), pp. 1073-1078, 2018.

B. Krawczyk and M. Woźniak, “Wagging for combining weighted one-class support vector machines,” Procedia Computer Science, vol. 51, pp. 1565–1573, 2015.

G. I. Webb, “Multiboosting: A technique for combining boosting and wagging,” Machine Learning, vol. 40, no. 2, pp. 159–196, 2000.

D. H. Wolpert, “Stacked generalization,” Neural Networks, vol. 5, no. 2, pp. 241–259, 1992.

T. K. Ho, “The random subspace method for constructing decision forests,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 20, no. 8, pp. 832–844, 1998.

Y. Lin and Y. Jeon, “Random forests and adaptive nearest neighbors,” Journal of the American Statistical Association, vol. 101, no. 474, pp. 578–590, 2006.

I. Loshchilov and F. Hutter, “SGDR: Stochastic gradient descent with warm restarts,” arXiv preprint arXiv:1608.03983, pp. 1–16, 2016.

P. K. Chan and S. J. Stolfo, “A comparative evaluation of voting and meta-learning on partitioned data,” in Machine Learning Proceedings 1995. Elsevier, pp. 90–98, 1995.

J. Zheng, X. Cao, B. Zhang, X. Zhen, and X. Su, “Deep ensemble machine for video classification,” IEEE Transactions on Neural Networks and Learning Systems, vol. 30, no. 2, pp. 553–565, 2019.



Copyright (c) 2021 Marco Klaiber

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

Science in Information Technology Letters
ISSN 2722-4139
Published by the Association for Scientific Computing Electronics and Engineering (ASCEE)
W : http://pubs2.ascee.org/index.php/sitech
E : andri@ascee.org, andri.pranolo.id@ieee.org

