You are an intern at Nutritionist centre ‘Smile clinic’, who has been tasked to analyse customers data to
see whether they eat rice in their daily mail for healthy diet point of view. The clininc has a copy of
Microsoft Excel and has just downloaded a free copy of the open source SPSS data mining software. The
company has used Microsoft Excel before but not SPSS.
You need to produce a report giving an evaluation how many customers of the Smile Clinic do eat rice.
How many customers are Male and Female? Also, what is Mean, Median of the ages in the data. Further
what is the Mean & Median of participants do eat rice. Show findings on Pie, Bar or Histogram as well.
The dataset is provided in xls and csv format: smile_clinic.xls, smile_clinic.csv.
2.1 Using the smile_clinic.csv provided in conjunction with SPSS give a specific example of clustering.
Show your workings with screenshots and explain you results.
2.2 Explain the most common data mining methods that can be used in business with real world
examples.
2.3 You will need to discuss the advantages/disadvantages of SPSS over Excel. This should be a mix of
theoretical argument as well as practical argument using the csv and text files for the dataset.
Word count guideline & marking criteria:
1. Processing the data, analysing the data and visualising the data (60 mark) 1500 – 1650 words
2. Data mining & analysis (40 marks) 1000 – 1100 words

SPSS – 30 days free subscription trialURL – SPPS – 30 days free subscription trial

on the main page click on ‘Try SPSS Statistics for free’

Enter your personal details, afterwards download the software according to the devise you have, Windows or Macbook.
How to enter & process the data of Assessment 2 into SPSS :