Big Data, Decision Tree Induction, and Image Analysis for the Discovery of Decision Rules for Colon Examination

Abstract— The aim of our research was to develop a method that allows us automatically to discover the decision rules for diagnosing medical images in normal tissue images and images showing a polyp. We used a data set of images that came from an endoscope video system used for colon examination.  The data set contains 283 normal tissue images and 61 polyp images. The 283 normal images consist of dark regions and reflection. One must decide if the image shows a polyp or not.  This is a two-class problem. The unequal number of the data in the two classes makes our problem to an unbalanced data set problem. The polyps in the images were identified and selected by a “well-trained” medical expert. Based on these medical images, we study the behavior of two different statistical texture descriptors, the co-occurrence matrix-texture descriptor and our novel Random set texture descriptor. We review the theory of both texture descriptors and then we apply them to our medical data set. We used a decision-tree induction method to learn the classification rules based on our tool “Decision Master”. In both cases, for the full unequally distributed data set and for the balanced data set, we achieved the best error rate based the Random-set texture descriptor. The performance of the co-occurrence matrix-texture descriptor was worse. For statistical based texture descriptors large enough texture are necessary that cannot always guaranteed for medical objects. Since the co-occurrence matrix is based on higher order statistic that might be the reason for the worse performance. The results show that decision tree induction and image analysis based on our novel texture descriptor is an excellent method to mine medical images for the decision rules even when the data set is unbalanced, but not only that makes our Random-set based texture descriptor favorable. It also gives a flexible way to describe the appearance of the medical objects in symbolic terms, the computation time is less, and it can be set up as software module that can be flexible used in different systems.

Keywords— Image Analysis, Endoscope Images, Colon Examination, Polyp Images, Decision Tree Induction, Random Set Texture Descriptor, Co-occurrence Texture Descriptor, Unbalanced Data Set Problem.

Click here to Download Full Paper

Engineering Journal: Big Data, Decision Tree Induction, and Image Analysis for the Discovery of Decision Rules for Colon Examination

AD Publications is a rapidly growing academic publisher in the fields of Engineering, Medical-Health, Environmental Science and Agriculture Research. AD Publications is a registered organization broad-based open access and publishes most exciting researches with respect to the subjects of our journals. The Journals is being indexed and abstracted by all major global current awareness and alerting services.
The organization aims at undertaking, co- coordinating and promoting research and development. It provides professional and academic guidance in the field of basic education, Higher Education as well in the Technical Education. Our Aims is to Promote and support, High Quality basic, Scientific Research and development in fields of Engineering, Medical-Health, Environmental Science and Agriculture Research and to Generate Public awareness, provide advice to scholar’s researchers and communicate research outcomes.

Some Important Links About Research Journal
International Journal
Agriculture Journal
Medical Journal
Environmental Journal
Engineering Journal

Translate »