data mining

Classification Trees and Predicting Breast Cancer Diagnosis

Classification trees are used with a categorical response variable. The goal of a classification tree is to derive a model that predicts to which category a particular subject or individual belongs, based one or more explanatory factors. For example, we could use a classification tree to predict the diagnosis (Benign or Malignant) of a particular patient based upon information obtained by doctors through scanned images. These classification trees are displayed as a decision tree that has a start node which then branches into other nodes.

Redesigning Computer Architecture to Optimize Performance on Data Mining Applications

Data mining is the process of extracting useful information or patterns from large raw sets of data. In recent years the amount of data being collected has increased tremendously, which has resulted in the development of new and more complex data mining algorithms to go through the vast data. However, the rate of growth of the new computer systems does not equal the growth of the datasets and the complexity of these data mining algorithms.