[1] | M. Khalilia, S. Chakraborty and M. Popescu “Predicting Disease Risks from Highly Imbalanced Data Using Random Forest”, BMC Medical Informatics and Decision Modelling, vol. 11, no. 51, pp.1-13, 2011. |
[2] | S. Ayme and J. Schmidtke “Networking For Rare Diseases: An Necessity for Europe”, Bundesgesundheitsblatt Gesundheitsforschung Gesundheitsschutz, pp. 1477-1483, 2007. |
[3] | P.K. Chan and S. J. Stolfo “Toward Scalable Learning with Non-Uniform Class and Cost Distributions: A Case Study in Credit Card Fraud Detection”, in Proceedings of the Fourth International Conference on Knowledge Discovery and Data Mining, pp.164–168, AAAI Press, 1998. |
[4] | B. Busser, W. Daelemans and A. Bosch “Machine Learning Of Word Pronunciation: The Case Against Abstraction”, Sixth European Conference on Speech Communication and Technology-EUROSPEECH, Budapest, Hungary, 1999. |
[5] | M. Kubat, R. C. Holte and S. Matwin (1998) “Machine Learning for the Detection of Oil Spills in Satellite Radar Images”, Machine Learning, vol. 30, no. 2, pp. 195-215. |
[6] | J. Quigley, T. Bedford and L. Walls “Estimating Rate of Occurrence of Rare Events with Empirical Bayes: A Railway Application”, Reliability Engineering and System Safety, vol. 92, no. 5, pp. 619-627, 2007. |
[7] | T.B. Trafalis, H. Ince and M.B. Richman “Tornado Detection with Support Vector Machines”, International Conference on Computational Science, pp. 289-298, 2003. |
[8] | Y. Zurynski, K. Frith, H. Leonard and E. Elliot “Rare Childhood Diseases: How Should We Respond”, Arch Dis Child, vol.93, pp. 1071-1074, 2008. |
[9] | A. Briggs, R. Nixon, S. Dixon and S. Thompson (2005) “Parametric Modelling of Cost Data: Some Simulation Evidence”, Health Economics, vol. 14, no. 4, pp. 421-428. |
[10] | W. Manning “Dealing with Skewed Data on Costs and Expenditures” Chapter 41, pp. 439-454. Jones A.M. (2006) The Elgar Companion to Health Economics, Second Edition, 2006. |
[11] | A. Manca and S. Palmer (2005) “Handling Missing Data in Patient-Level Cost-Effectiveness Analysis Alongside Randomized Clinical Trials”, Appl Health Econ Health Policy, vol. 4, no. 2, pp. 66-75. |
[12] | L. Rokach “Ensemble Based Classifiers” Artificial Intelligence Review, vol. 33, no.1, pp. 1-39, 2010. |
[13] | R. Chattamvelli “Data Mining Methods”, Alpha Science International, Oxford, UK, 2009. |
[14] | N. S. Altman “An Introduction to Kernel and Nearest-Neighbor Nonparametric Regression”, The American Statistician. vol. 46, no. 3, pp. 175–185, 1992. |
[15] | S. Russell and P. Norvig [1995]. Artificial Intelligence: A Modern Approach (2nd Ed.). Prentice Hall. ISBN 978-0137903955, 2003. |
[16] | P. J. Lisboa and A. F. G. Taktak “The Use of Artificial Neural Networks in Decision Support in Cancer: A systematic review”, Neural Networks, vol. 19, no. 4, pp. 408-415, 2006. |
[17] | L. Breiman “Bagging Predictors”, Machine Learning, vol. 24, no. 2, pp. 123-140, 1996. |
[18] | Y. Freund and R. E. Schapire “Experiments with a New Boosting Algorithm”, http://www.public.asu.edu/~jye02/CLASSES/Fall-2005/PAPERS/boosting-icml.pdf, 1996, Accessed on: 15.3.2017. |
[19] | D. Opitz and R. Maclin “Popular Ensemble Methods: An Empirical Study”, Journal of Artificial Intelligence Research, vol. 11, pp. 169-198, 1999. |
[20] | E. Bauer and R. Kohavi “An Empirical Comparison of Voting Classification Algorithms: Bagging, Boosting and Variants”, Machine Learning, vol. 36, no.1, pp. 105-139, 1999. |
[21] | D. A. Davis, N. V. Chawla N., Blumm, N. Christakis, and A. L. Barabasi “Predicting Individual Disease Risk Based on Medical History”, Proceedings of the 17th ACM Conference on Information and Knowledge Management, pp. 769-778, 2008. |
[22] | S. T., Moturu, W. G. Johnson and L. Huan “Predicting Future High-Cost Patients: A Real World Risk Modeling Application”, Bioinformatics and Biomedicine, BIBM, IEEE International Conference, 2007. |
[23] | D. Mantzaris, G. C. Anastassopoulos and D. K. Lymberopoulos (2008) “Medical Disease Prediction Using Artificial Neural Networks”, http://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=4696782, 2008, Accessed on: 15.3.2017. |
[24] | P. L. Hebert, L. S. Geiss and E. F. Tierney, M. M. Engelgau, B. P. Yawn and A. M. McBean “Identifying Persons with Diabetes Using Medicare Claims Data”, Am J Med Qual, vol. 14, no. 6, pp. 270-277, 1999. |
[25] | W. Yu, T. Liu, R. Valdez, M. Gwinn and M. J. Khoury “Application of Support Vector Machine Modeling for Prediction of Common Diseases: The Case of Diabetes and Pre-Diabetes”, BMC Medical Informatics and Decision Making, 10(1), pp.1-7, 2010. |
[26] | W. Zhang, F. Zeng, X. Wu, X. Zhang and R. Jiang “A Comparative Study of Ensemble Learning Approaches in the Classification of Breast Cancer Metastasis”, Bioinformatics, System Biology and Intelligent Computing International Conference, pp. 242-245, 2009. |
[27] | Hall J. Guyton and Hall Textbook of Medical Physiology (12th ed.). Philadelphia, Pa.: Saunders/Elsevier. ISBN 978-1-4160-4574-8, 2011. |
[28] | UCI Machine Learning Repository, Center for Machine Learning and Intelligent Systems, Thyroid Disease Data Set, https://archive.ics.uci.edu/ml/datasets/Thyroid+Disease, Accessed on: 9.3.2017. |
[29] | T. Hastie, R. Tibshirani and J. Friedman, The Elements of Statistical Learning Data Mining, Inference and Prediction, Springer, Second Edition, 2009. |
[30] | M. Maalouf and T. B. Trafalis “Robust Weighted Kernel Logistic Regression in Imbalanced and Rare Events Data”, Computational Statistics & Data Analysis, vol. 55, no. 1, pp. 168-183, 2011. |
[31] | G. E. P. Box “Science and Statistics”, J Am Statist Assoc, vol. 71, pp. 791-799, 1976. |
[32] | M. R. Nester “An Applied Statistician’s Creed”, Appl Statist, vol. 45, no. 4, pp. 4001-410, 1996. |
[33] | E. L. Cohen, C. A. Caburnay, D. A. Luke, S. Rodgers, G. T. Cameron and M. W. Kreuter (2004) “Cancer Coverage in General-Audience and Black Newspapers”, Health Communication, vol. 23, no. 5, pp. 427-435, 2004. |
[34] | J. R. Quinlan “Bagging, Boosting and C4.5”, Proceedings of the Thirteenth National Conference on Artificial Intelligence, pp. 725–730, 1996. |
[35] | P. Horton and K. Nakai “Better Prediction of Protein Cellular Localization Sites with the k Nearest Neighbors Classifier”, ISMB-97 Proceedings, pp. 147-151, 1997. |
[36] | C. M. Ma, W. S. Yang and B. W. Cheng “How the Parameters Of K-Nearest Neighbor Algorithm Impact On The Best Classification Accuracy: In Case of Parkinson Dataset”, Journal of Applied Sciences, vol. 14, no. 2, pp. 171-176, 2014. |
[37] | V. Chernozhukov, D. Chetverikov, M. Demirer, E. Duflo, C. Hansen and W. Newey “Double Machine Learning for Treatment and Causal Parameters”, Cornell University Library, https://arxiv.org/abs/1608.00060, 2016, Accessed on: 15.3.2017. |