
Steps to create jar file and execute word count problem in mapper reducer

1. First open Eclipse -> File -> New -> Java Project -> name it WordCount -> Finish.

2. Create three Java classes in the project: WCDriver (containing the main method), WCMapper, and WCReducer.

3. You have to include two sets of reference libraries for that:

Right-click on the project -> Build Path -> Configure Build Path. You can see the Add External JARs option on the right-hand side.
3.1 Go to C:\hadoop-3.3.6\share\hadoop\common and select all the JAR files listed in this folder.
3.2 Go to C:\hadoop-3.3.6\share\hadoop\mapreduce and select all the JAR files listed in this folder.
3.3 Click Apply.

4. Create a class named WCMapper in the WordCount project.

Mapper code: copy this program into the WCMapper class file.

// Importing libraries
import java.io.IOException;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapred.MapReduceBase;
import org.apache.hadoop.mapred.Mapper;
import org.apache.hadoop.mapred.OutputCollector;
import org.apache.hadoop.mapred.Reporter;

public class WCMapper extends MapReduceBase
        implements Mapper<LongWritable, Text, Text, IntWritable> {

    // Map function: emits (word, 1) for every word in the input line
    public void map(LongWritable key, Text value,
                    OutputCollector<Text, IntWritable> output,
                    Reporter rep) throws IOException {
        String line = value.toString();

        // Splitting the line on spaces
        for (String word : line.split(" ")) {
            if (word.length() > 0) {
                output.collect(new Text(word), new IntWritable(1));
            }
        }
    }
}
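To see what the map step emits, the same tokenize-and-emit loop can be run as plain Java without Hadoop (the class name and sample line below are made up for illustration):

```java
public class MapDemo {
    public static void main(String[] args) {
        // Same splitting logic as WCMapper, applied to one sample line
        String line = "deer bear river deer";
        for (String word : line.split(" ")) {
            if (word.length() > 0) {
                // Where WCMapper would call output.collect(...), we just print
                System.out.println(word + "\t1");
            }
        }
    }
}
```

Each occurrence of a word produces its own (word, 1) pair; the framework then groups the pairs by key before handing them to the reducer.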

5. Reducer code: copy this program into the WCReducer class file.

// Importing libraries
import java.io.IOException;
import java.util.Iterator;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapred.MapReduceBase;
import org.apache.hadoop.mapred.OutputCollector;
import org.apache.hadoop.mapred.Reducer;
import org.apache.hadoop.mapred.Reporter;

public class WCReducer extends MapReduceBase
        implements Reducer<Text, IntWritable, Text, IntWritable> {

    // Reduce function: sums the counts collected for each word
    public void reduce(Text key, Iterator<IntWritable> value,
                       OutputCollector<Text, IntWritable> output,
                       Reporter rep) throws IOException {
        int count = 0;

        // Counting the frequency of each word
        while (value.hasNext()) {
            IntWritable i = value.next();
            count += i.get();
        }

        output.collect(key, new IntWritable(count));
    }
}
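The reduce step can likewise be checked standalone; this sketch replaces Hadoop's IntWritable iterator with plain Integers (the key "deer" and its values are invented for the example):

```java
import java.util.Arrays;
import java.util.Iterator;

public class ReduceDemo {
    public static void main(String[] args) {
        // Values the framework would pass for key "deer" if the
        // mapper had emitted (deer, 1) twice
        Iterator<Integer> values = Arrays.asList(1, 1).iterator();

        // Same summing loop as WCReducer
        int count = 0;
        while (values.hasNext()) {
            count += values.next();
        }
        System.out.println("deer\t" + count); // prints "deer	2"
    }
}
```

The reducer never sees individual (word, 1) pairs; the shuffle phase has already grouped all values for a key into one iterator.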

6. Driver code: copy this program into the WCDriver class file. Note that ToolRunner passes along only the arguments remaining after the generic options, so the input and output paths arrive as args[0] and args[1].

// Importing libraries
import java.io.IOException;
import org.apache.hadoop.conf.Configured;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapred.FileInputFormat;
import org.apache.hadoop.mapred.FileOutputFormat;
import org.apache.hadoop.mapred.JobClient;
import org.apache.hadoop.mapred.JobConf;
import org.apache.hadoop.util.Tool;
import org.apache.hadoop.util.ToolRunner;

public class WCDriver extends Configured implements Tool {

    public int run(String[] args) throws IOException {
        if (args.length < 2) {
            System.out.println("Please give valid inputs");
            return -1;
        }

        JobConf conf = new JobConf(WCDriver.class);
        // args[0] is the input path, args[1] is the output path
        FileInputFormat.setInputPaths(conf, new Path(args[0]));
        FileOutputFormat.setOutputPath(conf, new Path(args[1]));
        conf.setMapperClass(WCMapper.class);
        conf.setReducerClass(WCReducer.class);
        conf.setMapOutputKeyClass(Text.class);
        conf.setMapOutputValueClass(IntWritable.class);
        conf.setOutputKeyClass(Text.class);
        conf.setOutputValueClass(IntWritable.class);
        JobClient.runJob(conf);
        return 0;
    }

    // Main method
    public static void main(String[] args) throws Exception {
        int exitCode = ToolRunner.run(new WCDriver(), args);
        System.out.println(exitCode);
    }
}
7. Now make a JAR file:
Right-click on the project -> Export -> select JAR file as the export destination -> name the JAR file (WordCount.jar) -> click Next -> finally click Finish. Then copy this file into
C:/hadoop-3.3.6/share/hadoop/mapreduce/

8. Create a text file named test.txt containing some repeated words.
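For example, a minimal test.txt could be created from the command line (shown with Unix-style commands; on Windows cmd, `type` replaces `cat`, and the word list is arbitrary):

```shell
# Create a small input file whose words repeat (content is arbitrary)
echo "deer bear river deer car river deer" > test.txt
cat test.txt
```

Any text file works; repeated words just make the counts in the final output easy to verify by eye.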

9. Copy the data file into an HDFS input directory (create the directory first if it does not already exist):

C:\hadoop-3.3.6\sbin>hadoop fs -mkdir /input3
C:\hadoop-3.3.6\sbin>hadoop fs -put C:/Users/IIITK/Documents/files/test.txt /input3

10. List the contents of the HDFS input directory:

C:\hadoop-3.3.6\sbin>hadoop fs -ls /input3/

11. Display the contents of the test.txt file (`hadoop dfs` is deprecated; use `hadoop fs`):

C:\hadoop-3.3.6\sbin>hadoop fs -cat /input3/test.txt

12. Run the WordCount.jar file saved in the shared directory of Hadoop:
C:\hadoop-3.3.6\sbin>hadoop jar C:/hadoop-3.3.6/share/hadoop/mapreduce/WordCount.jar WCDriver /input3 /output3

13. Display the output stored in the /output3 directory (the old mapred API writes the reducer's results to part-00000; `-cat` on the bare directory fails):

C:\hadoop-3.3.6\sbin>hadoop fs -cat /output3/part-00000

14. We can also see the output in the browser:

Open localhost:9870, go to Utilities -> Browse the file system, and navigate to /output3.
