An Intelligent Cardiac Ailment Prediction Using Efficient ROCK Algorithm and K- Means & C4.5 Algorithm
##plugins.themes.bootstrap3.article.main##
The data mining techniques have the ability to discover hidden patterns or correlation among the objects in the medical data. There are many areas that adapt data mining techniques, namely marketing, stock, health care sector and so on. In the health care industry produces gigantic quantities of data that clutches complex information relating to the sick person and their medical conditions. The data mining has an infinite potential to make use of healthcare data more effectually and efficiently to predict various kinds of disease. The present-time healthcare industry heart ailment is a term that assigns to an enormous number of health care circumstances related to heart. These medical circumstances relate to the unexpected health circumstance that straight control the cardiac. In this paper we are using a ROCK algorithm because it uses Jaccard coefficient on the contrary using the distance measures to find the similarity between the data or documents to classify the clusters and the contrivance for classifying the clusters based on the similarity measure shall be used over a given set of data. Afterward, C4.5 algorithm is used as the training algorithm to show the rank of a cardiac ailment with the decision tree. The C4.5 can be referred as the statistic classifier as well as this algorithm uses avail radio for feature selection and to build the decision tree. The C4.5 algorithm is widely used because of its expeditious classification and high exactitude. Lastly, the cardiac ailment database is clustered using the K-means clustering, which will alienate the data convenient to cardiac sickness from the database.
Downloads
References
-
Jiawei Han, Micheline Kamber, Jian Pei., “Data mining concepts and techniques “, 3rd ed, ISBN 978-0-12-381479-1, Morgan Kaufmann Publishers is an imprint of Elsevier. 225Wyman Street,Waltham, MA 02451, USA, 2012.
Google Scholar
1
-
Yusuf Perwej, “An Experiential Study of the Big Data,” for published in the International Transaction of Electrical and Computer Engineers System (ITECES), USA, ISSN (Print): 2373-1273 ISSN (Online): 2373-1281, Vol. 4, No. 1, page 14-25, March 2017 DOI:10.12691/iteces-4-1-3.
Google Scholar
2
-
D. Luo, C. Ding, H. Huang, "Parallelization with Multiplicative Algorithms for Big Data Mining", Proc. IEEE 12th Int'l Conf Data Mining, pp. 489-498, 2012.
Google Scholar
3
-
Marco Viceconti, Peter Hunter, Rod Hose, "Big Data big knowledge: big data for personalised health care", IEEE Journal of Biomedical and Health Informatics, no. 99, February 2015.
Google Scholar
4
-
Tan, P., Steinbach, M. and Kumar, V. Introduction to Data Mining, Addison-Wesley, Boston, 2006.
Google Scholar
5
-
Yusuf Perwej, Mohammed Y. Alzahrani, F. A. Mazarbhuiya, Md. Husamuddin, “The State of the Art Cardiac Illness Prediction Using Novel Data Mining Technique” International Journal of Engineering Sciences & Research Technology (IJESRT), ISSN: 2277-9655, Vol. 7, Issue 2, Page no. 725-739, February -2018. DOI: 10.5281/zenodo.1184068
Google Scholar
6
-
Boris Milovic, Milan Milovic, "Prediction and Decision Making in Health Care using Data Mining", International Journal of Public Health Science (IJPHS), vol. 1, no. 2, pp. 69-78, December 2012.
Google Scholar
7
-
Fryar CD, Chen T, Li X. Prevalence of Uncontrolled Risk Factors for Cardiovascular Disease: United States, NCHS Data Brief, No. 103. Hyattsville, MD: National Center for Health Statistics, Centers for Disease Control and Prevention, US Dept of Health and Human Services; 2012.
Google Scholar
8
-
Jyoti Soni et.al. Predictive Data Mining for Medical Diagnosis: An Overview of Heart Disease Prediction; International Journal of Computer Applications (0975 – 8887) Volume 17– No.8, March 2011.
Google Scholar
9
-
Shadab Adam Pattekari and Asma Parveen, prediction system for heart disease using naïve bayes, International Journal of Advanced Computer and Mathematical Sciences, ISSN 2230-9624. Vol 3, Issue 3, pp 290-294, 2012.
Google Scholar
10
-
Ms. Ishtake S.H, Prof. Sanap S.A., “Intelligent Heart Disease Prediction System Using Data Mining Techniques”, International J. of Healthcare & Biomedical Research, Volume: 1, Issue: 3, Pages 94-101, April 2013
Google Scholar
11
-
M.A.Nishara Banu and B.Gomathy,” Disease Forecasting System Using Data Mining Methods”, ISBN: 978-1-4799-3966-4, pp: 130-133, 2014
Google Scholar
12
-
Hlaudi Daniel Masethe, Mosima Anna Masethe-prediction of Heart Disease using Classification Algorithms; Proceedings of the World Congress on Engineering and Computer Science 2014.
Google Scholar
13
-
J.Vijayashree and N.Ch.SrimanNarayanaIyengar,” Heart Disease Prediction System Using Data Mining and Hybrid Intelligent Techniques: A Review”, International Journal of Bio-Science and Bio-Technology , Vol.8, No.4, pp. 139-148, 2016
Google Scholar
14
-
Zeinab Arabasadi, Roohallah Alizadehsani, Mohamad Roshanzamir, Hossein Moosaei, Ali Asghar Yarifard, Computer aided decision making for heart disease detection using hybrid neural network-Genetic algorithm, Computer Methods and Programs in Biomedicine, Volume 141, Pages 19-26, April 2017
Google Scholar
15
-
M. Al-Razgan, C. Domeniconi, and D. Barbara, “Random Subspace Ensembles for Clustering Categorical Data,” Supervised and Unsupervised Ensemble Methods and Their Applications, pp. 31-48, Springer, 2008.
Google Scholar
16
-
S. Guha, R. Rastogi, and K. Shim,” ROCK: A Robust Clustering Algorithm for Categorical Attributes”, 15th International Conference on Data Engineering, pp. 512-521, 2000
Google Scholar
17
-
Qiongbing Zhang, Lixin Ding, Shanshan Zhang, “A Genetic Evolutionary ROCK Algorithm” International Conference on Computer Application and System Modeling (ICCASM), 2010
Google Scholar
18
-
B Mobasher, R Cooley, S Jaideep et al., Comments on decision tree, New York:IEEE Press, 1999.
Google Scholar
19
-
Quinlan, J. R. C4.5: Programs for Machine Learning. Morgan Kaufmann Publishers, 1993.
Google Scholar
20
-
J. Macqueen, "Some methods for classification and analysis of multivariate observations", 5th Berkeley Symp. Math. Statist. Prob, pp. 281-297, 1967.
Google Scholar
21
-
M. Erisoglu, N. Calis, and S. Sakallioglu, "A New Algorithm for Initial Cluster Centers in K-means Agorithm, " Pattern Recognition Letters, vol. 32, no. 14, pp. 1701-1705, 2011.
Google Scholar
22
-
K.A. Abdul Nazeer, M.P. Sebastian, "Improving the Accuracy and Efficiency of the k-means Clustering Algorithm", Proceeding of the World Congress on Engineering, vol. 1, 2009
Google Scholar
23
-
Y Jiang, C H. Y. Zhang, "Clustering Algorithm for Data-Mining[J]", Journal of Electronic and Information, vol. 27, no. 4, pp. 655-622, 2005.
Google Scholar
24
-
J Dong, M. Qi. K-means Optimization Algorithm for Solving Clustering Problem. Knowledge Discovery and Data Mining, pp52-55, 2009.
Google Scholar
25
-
David W. Aha & Dennis Kibler. "Instance-based prediction of heart-disease presence with the Cleveland database."
Google Scholar
26
-
B. Kovalerchuk, E. Vityaev, and J. Ruiz, "Consistent and complete data and "expert" mining in medicine'. In: Cios, K. (ed.) Medical Data Mining and Knowledge Discovery, Springer, Heidelberg, pp. 238-280, 2001.
Google Scholar
27