mrjob lets you write MapReduce jobs in Python 2.6+/3.3+ and run them on several platforms. You can:

  • Write multi-step MapReduce jobs in pure Python
  • Test on your local machine
  • Run on a Hadoop cluster
  • Run in the cloud using Amazon Elastic MapReduce (EMR)

mrjob is licensed under the Apache License, Version 2.0.

To get started, install with pip:

pip install mrjob

and begin reading the tutorial below.



Module Index

Search Page