Create a word count style MapReduce program (CLOUD COMPUTING).
In this assignment, you will create a word-count-style MapReduce program that runs on the Cloudera Hortonworks Data Platform (HDP) or AWS Elastic MapReduce (EMR).
- Download Elon Musk’s tweets from the attachment.
- Develop your MapReduce program with the default FileInputFormat or your own CSVInputFormat (10 bonus points for creating the CSVInputFormat).
- You can use either the Hadoop MapReduce framework or the Spark framework.
- Your program should:
  - Count how many times Elon mentions each user (@) in his tweets.
  - Count how many times Elon refers to each topic (#) in his tweets.
- Test the program on your local computer using IntelliJ IDEA.
- Run the program on HDP or EMR.
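
The two counting tasks above follow the usual word-count pattern: the map step emits a (key, 1) pair for every @mention and #hashtag in a tweet, and the reduce step sums the values per key. The following is a minimal plain-Python sketch of that logic for checking the counting rules locally; it is not a Hadoop or Spark job. Several things here are assumptions and not part of the assignment: the column name `text`, the regular expressions that define a mention and a hashtag, the choice to lowercase keys so that @NASA and @nasa are counted together, and the made-up `SAMPLE` rows, which are not taken from the real dataset. The function names `map_tweet`, `reduce_counts`, and `count_from_csv` are hypothetical.

```python
import csv
import io
import re
from collections import Counter
from itertools import chain

# Assumed definitions: "@" or "#" followed by word characters, and not
# preceded by a word character (so e-mail addresses like a@b.com are skipped).
MENTION = re.compile(r"(?<!\w)@(\w+)")
HASHTAG = re.compile(r"(?<!\w)#(\w+)")


def map_tweet(text):
    """Map step: emit (key, 1) for every mention and hashtag in one tweet."""
    for user in MENTION.findall(text):
        yield ("@" + user.lower(), 1)  # lowercasing is an assumption
    for tag in HASHTAG.findall(text):
        yield ("#" + tag.lower(), 1)


def reduce_counts(pairs):
    """Reduce step: sum the values emitted for each key."""
    totals = Counter()
    for key, value in pairs:
        totals[key] += value
    return totals


def count_from_csv(csv_text, column="text"):
    """Parse CSV text, run the map step on each row, then reduce."""
    # csv.DictReader handles quoted fields that contain commas or newlines,
    # which naive line-by-line splitting would break apart.
    reader = csv.DictReader(io.StringIO(csv_text))
    pairs = chain.from_iterable(map_tweet(row[column]) for row in reader)
    return reduce_counts(pairs)


# Made-up sample rows for illustration only (hypothetical "id,text" layout).
SAMPLE = '''id,text
1,"Thanks @NASA and @SpaceX! #Mars"
2,"Great work @nasa, see you on #mars #Starship"
3,"No mentions here, email me at a@b.com"
'''

if __name__ == "__main__":
    for key, total in sorted(count_from_csv(SAMPLE).items()):
        print(key, total)
```

In a real Hadoop or Spark job the same two functions would correspond to the mapper and the reducer (or to a `flatMap` followed by a per-key sum). The quoted-field handling in the sketch also hints at why a custom CSVInputFormat can be useful: a tweet that contains a line break inside a quoted field may be split into separate records by a purely line-based input format.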