Click here to Skip to main content
15,886,199 members
Please Sign up or sign in to vote.
0.00/5 (No votes)
See more:
What do you understand by Word Count implementation via Hadoop framework? Explain in detail


What I have tried:

I am not able to implement the Word Count implementation via the Hadoop framework?
Posted
Updated 28-Jul-18 2:04am
Comments
Richard MacCutchan 28-Jul-18 7:22am    
See the Hadoop documentation, it should be explained there.

1 solution

"Word Count Implementation will be as follows:
For ex: Input File 1 contains data: "This is December Month."
              Input File 2 contains data:  "December is the last month of the year."

Step 1: Mapper will generate the following below output:
Input File 1 output
<this, 1>
<is, 1>
<December, 1>
<Month, 1>
Input File 2 output
<December, 1>
<is, 1>
<the, 1>
<last, 1>
<month, 1>
<of, 1>
<the, 1>
<year, 1>

Step2: Combiner/sorting is performed on the both the input files individually:
Input File 1 output:
<this, 1>
<is, 1>
<December, 1>
<Month, 1>
Input File 2 output
<December, 1>
<is, 1>
<the, 2>
<last, 1>
<month, 1>
<of, 1>
<year, 1>

Step3: Reducer will combine the output:
<this, 1>
<is, 2>
<December, 2>
<Month, 2>
<the, 2>
<of, 1>
<year, 1>

Final Output:
This 2 times
Is 2 times
December 2 times
Month 2 times
The 2 times
Of 1 times
Year 1 times

Assignment II - 3rd December
"
 
Share this answer
 

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)



CodeProject, 20 Bay Street, 11th Floor Toronto, Ontario, Canada M5J 2N8 +1 (416) 849-8900