##plugins.themes.bootstrap3.article.main##

The data mining techniques have the ability to discover hidden patterns or correlation among the objects in the medical data. There are many areas that adapt data mining techniques, namely marketing, stock, health care sector and so on. In the health care industry produces gigantic quantities of data that clutches complex information relating to the sick person and their medical conditions. The data mining has an infinite potential to make use of healthcare data more effectually and efficiently to predict various kinds of disease. The present-time healthcare industry heart ailment is a term that assigns to an enormous number of health care circumstances related to heart. These medical circumstances relate to the unexpected health circumstance that straight control the cardiac.  In this paper we are using a ROCK algorithm because it uses Jaccard coefficient on the contrary using the distance measures to find the similarity between the data or documents to classify the clusters and the contrivance for classifying the clusters based on the similarity measure shall be used over a given set of data. Afterward, C4.5 algorithm is used as the training algorithm to show the rank of a cardiac ailment with the decision tree. The C4.5 can be referred as the statistic classifier as well as this algorithm uses avail radio for feature selection and to build the decision tree. The C4.5 algorithm is widely used because of its expeditious classification and high exactitude. Lastly, the cardiac ailment database is clustered using the K-means clustering, which will alienate the data convenient to cardiac sickness from the database.

Downloads

Download data is not yet available.

References

  1. Jiawei Han, Micheline Kamber, Jian Pei., “Data mining concepts and techniques “, 3rd ed, ISBN 978-0-12-381479-1, Morgan Kaufmann Publishers is an imprint of Elsevier. 225Wyman Street,Waltham, MA 02451, USA, 2012.
     Google Scholar
  2. Yusuf Perwej, “An Experiential Study of the Big Data,” for published in the International Transaction of Electrical and Computer Engineers System (ITECES), USA, ISSN (Print): 2373-1273 ISSN (Online): 2373-1281, Vol. 4, No. 1, page 14-25, March 2017 DOI:10.12691/iteces-4-1-3.
     Google Scholar
  3. D. Luo, C. Ding, H. Huang, "Parallelization with Multiplicative Algorithms for Big Data Mining", Proc. IEEE 12th Int'l Conf Data Mining, pp. 489-498, 2012.
     Google Scholar
  4. Marco Viceconti, Peter Hunter, Rod Hose, "Big Data big knowledge: big data for personalised health care", IEEE Journal of Biomedical and Health Informatics, no. 99, February 2015.
     Google Scholar
  5. Tan, P., Steinbach, M. and Kumar, V. Introduction to Data Mining, Addison-Wesley, Boston, 2006.
     Google Scholar
  6. Yusuf Perwej, Mohammed Y. Alzahrani, F. A. Mazarbhuiya, Md. Husamuddin, “The State of the Art Cardiac Illness Prediction Using Novel Data Mining Technique” International Journal of Engineering Sciences & Research Technology (IJESRT), ISSN: 2277-9655, Vol. 7, Issue 2, Page no. 725-739, February -2018. DOI: 10.5281/zenodo.1184068
     Google Scholar
  7. Boris Milovic, Milan Milovic, "Prediction and Decision Making in Health Care using Data Mining", International Journal of Public Health Science (IJPHS), vol. 1, no. 2, pp. 69-78, December 2012.
     Google Scholar
  8. Fryar CD, Chen T, Li X. Prevalence of Uncontrolled Risk Factors for Cardiovascular Disease: United States, NCHS Data Brief, No. 103. Hyattsville, MD: National Center for Health Statistics, Centers for Disease Control and Prevention, US Dept of Health and Human Services; 2012.
     Google Scholar
  9. Jyoti Soni et.al. Predictive Data Mining for Medical Diagnosis: An Overview of Heart Disease Prediction; International Journal of Computer Applications (0975 – 8887) Volume 17– No.8, March 2011.
     Google Scholar
  10. Shadab Adam Pattekari and Asma Parveen, prediction system for heart disease using naïve bayes, International Journal of Advanced Computer and Mathematical Sciences, ISSN 2230-9624. Vol 3, Issue 3, pp 290-294, 2012.
     Google Scholar
  11. Ms. Ishtake S.H, Prof. Sanap S.A., “Intelligent Heart Disease Prediction System Using Data Mining Techniques”, International J. of Healthcare & Biomedical Research, Volume: 1, Issue: 3, Pages 94-101, April 2013
     Google Scholar
  12. M.A.Nishara Banu and B.Gomathy,” Disease Forecasting System Using Data Mining Methods”, ISBN: 978-1-4799-3966-4, pp: 130-133, 2014
     Google Scholar
  13. Hlaudi Daniel Masethe, Mosima Anna Masethe-prediction of Heart Disease using Classification Algorithms; Proceedings of the World Congress on Engineering and Computer Science 2014.
     Google Scholar
  14. J.Vijayashree and N.Ch.SrimanNarayanaIyengar,” Heart Disease Prediction System Using Data Mining and Hybrid Intelligent Techniques: A Review”, International Journal of Bio-Science and Bio-Technology , Vol.8, No.4, pp. 139-148, 2016
     Google Scholar
  15. Zeinab Arabasadi, Roohallah Alizadehsani, Mohamad Roshanzamir, Hossein Moosaei, Ali Asghar Yarifard, Computer aided decision making for heart disease detection using hybrid neural network-Genetic algorithm, Computer Methods and Programs in Biomedicine, Volume 141, Pages 19-26, April 2017
     Google Scholar
  16. M. Al-Razgan, C. Domeniconi, and D. Barbara, “Random Subspace Ensembles for Clustering Categorical Data,” Supervised and Unsupervised Ensemble Methods and Their Applications, pp. 31-48, Springer, 2008.
     Google Scholar
  17. S. Guha, R. Rastogi, and K. Shim,” ROCK: A Robust Clustering Algorithm for Categorical Attributes”, 15th International Conference on Data Engineering, pp. 512-521, 2000
     Google Scholar
  18. Qiongbing Zhang, Lixin Ding, Shanshan Zhang, “A Genetic Evolutionary ROCK Algorithm” International Conference on Computer Application and System Modeling (ICCASM), 2010
     Google Scholar
  19. B Mobasher, R Cooley, S Jaideep et al., Comments on decision tree, New York:IEEE Press, 1999.
     Google Scholar
  20. Quinlan, J. R. C4.5: Programs for Machine Learning. Morgan Kaufmann Publishers, 1993.
     Google Scholar
  21. J. Macqueen, "Some methods for classification and analysis of multivariate observations", 5th Berkeley Symp. Math. Statist. Prob, pp. 281-297, 1967.
     Google Scholar
  22. M. Erisoglu, N. Calis, and S. Sakallioglu, "A New Algorithm for Initial Cluster Centers in K-means Agorithm, " Pattern Recognition Letters, vol. 32, no. 14, pp. 1701-1705, 2011.
     Google Scholar
  23. K.A. Abdul Nazeer, M.P. Sebastian, "Improving the Accuracy and Efficiency of the k-means Clustering Algorithm", Proceeding of the World Congress on Engineering, vol. 1, 2009
     Google Scholar
  24. Y Jiang, C H. Y. Zhang, "Clustering Algorithm for Data-Mining[J]", Journal of Electronic and Information, vol. 27, no. 4, pp. 655-622, 2005.
     Google Scholar
  25. J Dong, M. Qi. K-means Optimization Algorithm for Solving Clustering Problem. Knowledge Discovery and Data Mining, pp52-55, 2009.
     Google Scholar
  26. David W. Aha & Dennis Kibler. "Instance-based prediction of heart-disease presence with the Cleveland database."
     Google Scholar
  27. B. Kovalerchuk, E. Vityaev, and J. Ruiz, "Consistent and complete data and "expert" mining in medicine'. In: Cios, K. (ed.) Medical Data Mining and Knowledge Discovery, Springer, Heidelberg, pp. 238-280, 2001.
     Google Scholar