mrjob.job ========= .. py:module:: mrjob.job .. currentmodule:: mrjob.job .. autoclass:: MRJob .. _writing-one-step-jobs: One-step jobs ------------- .. automethod:: MRJob.mapper .. automethod:: MRJob.reducer .. automethod:: MRJob.combiner .. automethod:: MRJob.mapper_init .. automethod:: MRJob.mapper_final .. automethod:: MRJob.reducer_init .. automethod:: MRJob.reducer_final .. automethod:: MRJob.combiner_init .. automethod:: MRJob.combiner_final .. automethod:: MRJob.mapper_cmd .. automethod:: MRJob.reducer_cmd .. automethod:: MRJob.combiner_cmd .. _writing-multi-step-jobs: Multi-step jobs --------------- .. automethod:: MRJob.steps .. automethod:: MRJob.mr .. automethod:: MRJob.jar Running the job --------------- .. automethod:: MRJob.run .. automethod:: MRJob.__init__ .. automethod:: MRJob.make_runner Parsing output -------------- .. automethod:: MRJob.parse_output_line Counters and status messages ---------------------------- .. automethod:: MRJob.increment_counter .. automethod:: MRJob.set_status Setting protocols ----------------- .. autoattribute:: MRJob.INPUT_PROTOCOL .. autoattribute:: MRJob.INTERNAL_PROTOCOL .. autoattribute:: MRJob.OUTPUT_PROTOCOL .. automethod:: MRJob.input_protocol .. automethod:: MRJob.internal_protocol .. automethod:: MRJob.output_protocol Command-line options -------------------- See :ref:`writing-cl-opts` for information on adding command line options to your job. See :doc:`guides/configs-reference` for a complete list of all configuration options. .. automethod:: MRJob.configure_options .. automethod:: MRJob.add_passthrough_option .. automethod:: MRJob.add_file_option .. automethod:: MRJob.load_options .. automethod:: MRJob.is_mapper_or_reducer .. autoattribute:: MRJob.OPTION_CLASS .. _job-configuration: Job runner configuration ------------------------ .. automethod:: MRJob.job_runner_kwargs .. automethod:: MRJob.local_job_runner_kwargs .. automethod:: MRJob.emr_job_runner_kwargs .. automethod:: MRJob.hadoop_job_runner_kwargs .. automethod:: MRJob.generate_passthrough_arguments .. automethod:: MRJob.generate_file_upload_args .. automethod:: MRJob.mr_job_script Running specific parts of jobs ------------------------------ .. automethod:: MRJob.run_job .. automethod:: MRJob.run_mapper .. automethod:: MRJob.run_reducer .. automethod:: MRJob.run_combiner .. automethod:: MRJob.show_steps .. _hadoop-config: Hadoop Configuration -------------------- .. autoattribute:: MRJob.HADOOP_INPUT_FORMAT .. automethod:: MRJob.hadoop_input_format .. autoattribute:: MRJob.HADOOP_OUTPUT_FORMAT .. automethod:: MRJob.hadoop_output_format .. autoattribute:: MRJob.JOBCONF .. automethod:: MRJob.jobconf .. autoattribute:: MRJob.PARTITIONER .. automethod:: MRJob.partitioner Hooks for testing ----------------- .. currentmodule:: mrjob.job .. automethod:: MRJob.sandbox .. automethod:: MRJob.parse_output .. automethod:: MRJob.parse_counters