[1] | Lessmann, S. (2004). Solving Imbalanced Classification Problems with Support Vector Machines. In IC-AI (Vol. 4, pp. 214-220). |
[2] | Tang, Y., Zhang, Y. Q., Chawla, N. V., & Krasser, S. (2008). SVMs modeling for highly imbalanced classification. IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), 39(1), 281-288. |
[3] | López, V., Fernández, A., Moreno-Torres, J. G., & Herrera, F. (2012). Analysis of preprocessing vs. cost-sensitive learning for imbalanced classification. Open problems on intrinsic data characteristics. Expert Systems with Applications, 39(7), 6585-6608. |
[4] | Yan, Y., Liu, R., Ding, Z., Du, X., Chen, J., & Zhang, Y. (2019). A parameter-free cleaning method for SMOTE in imbalanced classification. IEEE Access, 7, 23537-23548. |
[5] | Lin, E., Chen, Q., & Qi, X. (2020). Deep reinforcement learning for imbalanced classification. Applied Intelligence, 1-15. |
[6] | Ayiko, R., Antai, D., & Kulane, A. (2009). Trends and determinants of under-five mortality in Uganda. East African journal of public health, 6(2), 136-140. |
[7] | Nasejje, J. B., Mwambi, H. G., & Achia, T. N. (2015). Understanding the determinants of under-five child mortality in Uganda including the estimation of unobserved household and community effects using both frequentist and Bayesian survival analysis approaches. BMC public health, 15(1), 1003. |
[8] | Sreeramareddy, C.T., Kumar, H.N., & Sathian, B. (2013). Time Trends and Inequalities of Under-Five Mortality in Nepal: A Secondary Data Analysis of Four Demographic and Health Surveys between 1996 and 2011. PLoS ONE, 8(11): e79818. doi:10.1371/journal.pone.0079818. |
[9] | Gawande, R., Indulkar, S., Keswani, H., Khatri, M., & Saindane, P. (2019). Analysis and Prediction of Child Mortality in India. International Research Journal of Engineering and Technology, 6(3), 5071-5074. |
[10] | Zhang, X., Tang, F., Ji, J., Han, W., & Lu, P. (2019). Risk Prediction of Dyslipidemia for Chinese Han Adults Using Random Forest Survival Model. Clinical Epidemiology, 11, 1047. |
[11] | Cassy, A., Saifodine, A., Candrinho, B., do Rosário Martins, M., da Cunha, S., Pereira, F. M., & Gudo, E. S. (2019). Care-seeking behaviour and treatment practices for malaria in children under 5 years in Mozambique: a secondary analysis of 2011 DHS and 2015 IMASIDA datasets. Malaria journal, 18(1), 115. |
[12] | Liu, V. (2019). Predicting ovarian cancer survival times: Feature selection and performance of parametric, semi-parametric, and random survival forest methods. Master Thesis, Simon Fraser University. |
[13] | Kenya National Bureau of Statistics, Ministry of Health[Kenya], National AIDS Control Council [Kenya], Kenya Medical Research Institute, National Council for Population and Development [Kenya], ICF International. Kenya demographic and health survey 2014. Nairobi, Kenya, 2015. |
[14] | Corsi, D. J., Neuman, M., Finlay, J. E., & Subramanian, S. (2012). Demographic and health surveys: A profile. International Journal of Epidemiology, 41, 1602–1613. |
[15] | Stekhoven, D. J., & Bühlmann, P. (2012). MissForest—non-parametric missing value imputation for mixed-type data. Bioinformatics, 28(1), 112–118. |
[16] | Ali, H., Salleh, M. N. M., Saedudin, R., Hussain. K., & Mushtaq, M. F. (2019). Imbalance class problems in data mining: a review. Indonesian Journal of Electrical Engineering and Computer Science. 14(2), 1560-1571. |
[17] | Galar, M., Ferńandez, A., Barrenechea, E., Bustince, H., & Herrera, F. (2012). A Review on Ensembles for the Class Imbalance Problem: Bagging-, Boosting-, and Hybrid-Based Approaches. IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS—PART C: APPLICATIONS AND REVIEWS. |
[18] | Fernández, H. A., García, L. S., Galar, M., Prati, R. C., Krawczyk, B., & Herrera, F. (2018). Learning from Imbalanced Data Sets. Springer, Gewerbestrasse 11, 6330 Cham, Switzerland. |
[19] | Zhao, Y., Cen, Y. Data Mining Applications with R; Academic Press: Cambridge, MA, USA, 2013; ISBN 9780124115118. |
[20] | Datta, S., Das, S. Near-Bayesian support vector machines for imbalanced data classification with equal or unequal misclassification costs. Neural Netw. 70, 39–52 (2015). |
[21] | Ertekin, S., Huang, J., Bottou, L., Giles, C.L.: Learning on the border: active learning in imbalanced data classification. In: Proceedings of the Sixteenth ACM Conference on Information and Knowledge Management, CIKM 2007, Lisbon, 6–10 Nov 2007, pp. 127–136 (2007). |
[22] | Cateni, S., Colla, V., Vannucci, M. A method for resampling imbalanced datasets in binary classification tasks for real-world problems. Neurocomputing 2014, 135, 32–41. |
[23] | He, H., Garcia, E.A. Learning from imbalanced data. IEEE Trans. Knowl. Data Eng. 2009. |
[24] | Japkowicz, N.; Stephen, S. The class imbalance problem: A systematic study. Intell. Data Anal. 2002. |
[25] | Olson, D.L. Data Set Balancing. In: Shi Y., Xu W., Chen Z. (eds) Data Mining and Knowledge Management. CASDMKM 2004. Lecture Notes in Computer Science, 3327, 71-80, (2005). Springer, Berlin, Heidelberg. https://doi.org/10.1007. |
[26] | Ofek, N., Rokach, L., Stern, R., Shabtai, A. Fast-CBUS: A fast clustering-based undersampling method for addressing the class imbalance problem. Neurocomputing 2017, 243, 88–102. |
[27] | Fiorentini, N.; Losa, M. Handling Imbalanced Data in Road Crash Severity Prediction by Machine Learning Algorithms. Infrastructures 2020, 5, 61. |
[28] | Chawla, N.V., Cieslak, D.A., Hall, L.O., Joshi, A.: Automatically countering imbalance and its empirical relationship to cost. Data Min. Knowl. Disc. 17(2), 225–252 (2008) |
[29] | Estabrooks, A., Jo, T., Japkowicz, N. A multiple resampling method for learning from imbalanced data sets. Comput. Intell. 20(1), 18–36 (2004). |
[30] | Batista, G.E.A.P.A., Prati, R.C., Monard, M.C.: A study of the behaviour of several methods for balancing machine learning training data. SIGKDD Explor. 6(1), 20–29 (2004). |
[31] | Yen, S.J., Lee, Y.S. Cluster-based under-sampling approaches for imbalanced data distributions. Expert Syst. Appl. 2009, 36, 5718–5727. |
[32] | Chawla, N.V., Bowyer, K.W., Hall, L.O., & Kegelmeyer, W.P. (2002). Synthetic Minority Over-sampling Technique. Journal of Artificial Intelligence Research, 16, 321-357. |
[33] | Torgo, L. (2010). Data Mining using R: learning with case studies. CRC Press (ISBN: 9781439810187). http://www.dcc.fc.up.pt/~ltorgo/DataMiningWithR. |
[34] | Lunardon, N., Menardi, G., & Torelli, N. (2013). R package ROSE: Random Over-Sampling Examples (version 0.0-3). Università di Trieste and Università di Padova, Italia. http://cran.r-project.org/web/packages/ROSE/index.html. [p79]. |
[35] | Ishwaran, H., Kogalurt, U. B., Blackstone, E. H., & Lauer, M.S. (2008). Random Survival Forests. The Annals of Applied Statistics, 2(3), 841-860. |
[36] | Breiman, L. (2003b). Setting up, using, and understanding random forests V4.0. https://www.stat.berkeley.edu/~breiman/Using_random_forests_v4.0.pdf. |
[37] | Weathers, W. & Cutler, R. (2017). Comparison of Survival Curves Between Cox Proportional Hazards, Random Forests, and Conditional Inference Forests in Survival Analysis. All Graduate Plan B and other reports, 927. https://digitalcommons.usu.edu/gradreports/927. |
[38] | Cox, D. R. (1972). Regression models and life-tables. Journal of the Royal Statistical Society. Series B (Methodological), 34(2), 187 {220. URL: http://www.jstor.org/stable/2985181. |
[39] | Harrell, F. E., Califf, R. M., Pryor, D. B., Lee, K.L. & Rosati, R.A. (1982). Evaluating the yield of medical tests. Journal of American Medical Association, 247(18), 2543—2546. |
[40] | Afrin, K., Illangovan G., Srivatsa S. S., and Bukkapatnam S. T. (2018) Balanced random survival forests for extremely unbalanced, right censored data," arXiv preprint arXiv: 1803.09177. |