Write a report comparing and contrasting clustering-based and classification-based approaches. You must include citations and references formatted per APA 7 when discussing the two. Besides the class materials, you can find more information on simple learn. Include 3 academic references.

Global Information Technology

Data mining refers to the computational process of exploring and analyzing large amounts of data in order to discover useful information [14, 15, 6, 3, 7, 4, 5, 1]. There are four main types of data mining tasks: association rule learning, clustering, classification, and regression. We have identified that these types of data mining tasks are useful in each of the research strands discussed in this research proposal. Data comes in two forms: labelled and unlabelled. Labelled data has a specially designated attribute (the label), and the aim is to use the given data to predict the value of that attribute for new data. Unlabelled data has no such designated attribute. This distinction is what separates classification from clustering. The first two data mining tasks, association rule learning and clustering, work with unlabelled data and are known as unsupervised learning. The last two data mining tasks, classification and regression, work with labelled data and are known as supervised learning.
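
To make the labelled versus unlabelled distinction concrete, the following is a minimal Python sketch. The toy records, the labels "low" and "high", and the helper name `predict_1nn` are illustrative assumptions, not taken from the cited sources. A one-nearest-neighbour rule stands in here for classification in general; it is only one of many possible classifiers.

```python
import math

# Labelled data: each record pairs its features with a designated
# attribute (the label). Values are hypothetical, for illustration only.
labelled = [
    ((1.0, 1.2), "low"),
    ((0.8, 1.0), "low"),
    ((4.9, 5.1), "high"),
    ((5.2, 4.8), "high"),
]

# Unlabelled data: the same kind of features, but no designated attribute.
unlabelled = [features for features, _ in labelled]


def predict_1nn(training, point):
    """Classification (supervised): return the label of the training
    record whose features are closest to `point` (Euclidean distance)."""
    _, nearest_label = min(
        training, key=lambda record: math.dist(record[0], point)
    )
    return nearest_label


# Predict the designated attribute for a new, unseen record.
prediction = predict_1nn(labelled, (4.5, 5.0))
print(prediction)
```

A classifier of this kind cannot be built from `unlabelled` alone, because there is no attribute value to predict. That is the situation in which clustering applies.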

Association rule learning is concerned with finding interesting relationships and correlations among the values of variables [3, 1]. A typical application of association rule learning is to discover customer purchase behavior from market basket transaction data. The primary aim is usually to find associations and correlations among the various items purchased by consumers.

Data clustering is a form of unsupervised learning and refers to the process of dividing a set of items into homogeneous groups, or clusters, such that items in the same cluster are similar to each other and items from different clusters are distinct [12, 11]. During the past six decades, many clustering algorithms have been developed by researchers from different fields of study. Most clustering algorithms are formulated as optimization problems. Further detail on clustering algorithms can be found in the scholarly literature.
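
As an illustration of clustering formulated as an optimization problem, the sketch below uses k-means (Lloyd's algorithm), which seeks cluster centres that minimize the within-cluster sum of squared distances. The text above does not single out a particular algorithm, so k-means is an assumption chosen for its simplicity. The toy points, the starting centres, and the function names `assign`, `wcss`, and `kmeans` are likewise hypothetical.

```python
import math


def assign(points, centroids):
    """Give each point the index of its nearest centroid."""
    return [
        min(range(len(centroids)), key=lambda j: math.dist(p, centroids[j]))
        for p in points
    ]


def wcss(points, centroids, labels):
    """Objective function: within-cluster sum of squared distances."""
    return sum(
        math.dist(p, centroids[label]) ** 2
        for p, label in zip(points, labels)
    )


def kmeans(points, centroids, iterations=10):
    """Lloyd's algorithm: alternate assignment and centroid update."""
    centroids = list(centroids)
    for _ in range(iterations):
        labels = assign(points, centroids)
        for j in range(len(centroids)):
            members = [p for p, label in zip(points, labels) if label == j]
            if members:  # keep the old centroid if a cluster is empty
                centroids[j] = tuple(
                    sum(coord) / len(members) for coord in zip(*members)
                )
    return centroids, assign(points, centroids)


# Unlabelled toy data (hypothetical values) forming two visible groups.
points = [(1.0, 1.0), (1.2, 0.8), (0.9, 1.1),
          (5.0, 5.0), (5.1, 4.9), (4.8, 5.2)]
initial = [points[0], points[3]]  # assumed starting centres

centroids, labels = kmeans(points, initial)
print(labels)
print(wcss(points, centroids, labels))
```

Unlike the classification case, no labels are supplied. The groups emerge only from the similarity between items, and the cluster indices carry no meaning beyond telling the groups apart.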