b Data selection: Select the data which will be useful for mining
c Data Cleaning: Remove the error in the data selected
d Data Extraction: Extracting relevant information from the data
e Data Interpretation: Interpreting the results obtained.
c Decision trees
d Neural network
e Genetic Algorithm
Interview and survey
The data selected can serve the purpose for which it is selected
A representative sample is a small amount which represents the characteristics of the larger entity accurately.
The researcher can avoid bias by
Doing a preliminary research and asking open ended questions
Clearly outlining the population for which the study is to be conducted
The researcher should have complete understanding of all the statistical techniques before starting the research.
The consequence of improperly collected data are
The researcher will not be able to answer research questions inaccurately.
The researcher will not be able to repeat and the validate the study
The researcher will lose trust and will not be consulted for further studies.
Data selection is dependent on purpose for which data will be used, potential reuse, timeframe for which the data will be used, budget for data selection.
Data Mining. (n.d.). Retrieved from
Representative Sample. (n.d.). Retrieved from
Tips for Overcoming Researcher Bias. (2013) . Retrieved from
Five steps to decide what data to keep. (n.d.). Retrieved from