* 0.3.10 "David Caro " MINOR 075edb02: Merge pull request #116 from spirosdelviniotis/hepcrawl_wsp_unit_using_pipeline MINOR 0bcc8654: tests: update WSP unit tests - use pipeline output MINOR 22069eac: applied requested changes * 0.3.9 "David Caro " MINOR 35d4e007: Merge branch 'spirosdelviniotis-hepcrawl_wsp_unit_pipeline_tests' MINOR d3db0119: tests: add wsp test using the full pipeline output MINOR b55481c5: WSP: remove dublicated validation for item MINOR 14983cd0: tests: re-enable xfailled unit tests for WSP * 0.3.8 "David Caro " MINOR 06d664e3: Merge pull request #115 from spirosdelviniotis/hepcrawl_environment_handler_fixture MINOR 0f8b8d05: tests: create environment handler fixture MINOR a3971d23: tests: applied requested changes * 0.3.7 "David Caro " MINOR 670c0704: Merge pull request #110 from spirosdelviniotis/hepcrawl_rm_json_writer_pipeline MINOR 9f62cacf: pipelines: remove unused JsonWriterPipeline * 0.3.6 "David Caro " MINOR e7c64a6c: Merge pull request #99 from david-caro/bump_schemas_to_31 MINOR 0ac45ae7: packaging: bump inspire-schemas to 31 * 0.3.5 "David Caro " MINOR f7bbd1e4: Merge pull request #97 from spirosdelviniotis/hepcrawl_refactor_tests_folders MINOR 7b3ff5d0: tests: refactored tests folders * 0.3.4 "Jacopo Notarstefano " MINOR 26a4281f: Merge pull request #94 from david-caro/fix_preprint_date_format MINOR 2827ce5d: pep8: fix the items module MINOR fce333fb: global: correct the preprint_date format * 0.3.3 "David Caro " MINOR 897851db: Merge pull request #93 from david-caro/pipeline_return_hep_record MINOR 4e387b42: pipeline: return the valid hep record * 0.3.2 "David Caro " MINOR f35d99c9: Merge pull request #92 from david-caro/use_latest_schemas MINOR e8cfb2de: global: use latest schemas * 0.3.1 "David Caro " MINOR 15a6798c: travis: add deploy on tags too * 0.3.0 "David Caro " FEATURE c7213c67: Merge pull request #87 from rikirenz/literature-builder-refactor FEATURE f1362c95: tests: add xfail for the non-compatible tests FEATURE 3078cc16: tests: add integration tests for crawler2hep FEATURE 7fb0309c: setup: upgrade schema version FEATURE 75c06b0e: hepcrawl: remove schema validation FEATURE 1953fea7: crawler2hep: add module to create valid HEP record * 0.0.39 "David Caro " MINOR 1799f9ce: Merge pull request #86 from david-caro/arxiv_tests_use_pipeline_results MINOR 1256a2f3: pos: test records after pipeline processing MINOR d6336b2b: arxiv: test records after pipeline processing * 0.0.38 "David Caro " MINOR 68ed5b7e: Merge pull request #83 from david-caro/send_results_directly_to_inspire MINOR 8f2b3409: pipelines: send data payload through api * 0.0.37 "Thorsten Schwander " MINOR e8ac22ad: TLS URL adjustments for PoS * 0.0.36 "David Caro " MINOR 35aa34d5: Merge pull request #82 from david-caro/use_new_inspire_schemas MINOR 99b8ec91: schema: split categories into arxiv/inspire FIXED ISSUES: http://github.com/inspirehep/hepcrawl/issues/79 * 0.0.35 "David Caro " MINOR cc9e86dc: Merge pull request #81 from david-caro/add_release_notes MINOR 81215040: packaging: add release notes to the docs * 0.0.34 "David Caro " MINOR 232fd42e: Merge pull request #76 from david-caro/add_scarpy_config MINOR 7744d858: gitignore: ignore authors and changelog files MINOR 563d9b75: packaging: add scrapy config files to package * 0.0.33 "David Caro " MINOR dc3ece89: Merge pull request #72 from david-caro/pin_scrapyd MINOR c2d2c1a4: packaging: pin scrapyd version * 0.0.32 "David Caro " MINOR ef804ef4: Merge pull request #68 from david-caro/remove_ugly_param MINOR 65977f49: version remove unneded breaking extra param * 0.0.31 "David Caro " MINOR 5b84f3ea: Merge pull request #67 from david-caro/use_autosemver MINOR a1285891: packaging: use autosemver * 0.0.30 "David Caro " MINOR 2fa644fe: Merge pull request #65 from david-caro/little_scrapyd_fix MINOR bb9b8f99: packaging: use scrapyd for the scrapyd-deploy conf * 0.0.29 "David Caro " MINOR daf593dc: Merge pull request #58 from david-caro/validate_schemas MINOR 5c9d1654: arxiv_spider: fix affiliations format when empty MINOR ef071d6f: tests: adapt arxiv tests for schema validation MINOR 441e0d28: global: fix submission info MINOR 453bb778: arxiv_spider: adapt report numbers to schema MINOR 9a71bffd: wsp_spider: adapt license to schemas MINOR ab380a94: tests: fix arxiv report_number tests MINOR 14289548: pep8: some small fixes MINOR 40ea8bc5: global: Add schema validation on arxiv spider MINOR 15df85b9: global: add schema validation to wps spider MINOR 1c646b56: global: adapt licenses to schema MINOR daaafc0c: gitignore: add vim swapfiles MINOR 3f6f8a92: tests: correct arxiv tests to match schema MINOR 50490884: tests: skip find_links due to external link down MINOR b18ebbfe: global: fix report number MINOR 68c9e53c: pipelines: adapted to the schemas and validation MINOR 3e4a5f04: global: fix the journal year according to schema MINOR 504820fd: arxiv_spider: adapt report_numbers to schema MINOR 192768b9: tests: ensure there's at least one parsed record MINOR 91ed44b9: global: do the validation at the pipeline too * 0.0.28 "David Caro " MINOR 7c6e021f: Merge pull request #57 from bittirousku/fix_base MINOR 111d9ffe: base_spider: format changes in BASE metadata * 0.0.27 "David Caro " MINOR c89f28d4: Merge pull request #59 from david-caro/add_app_prefix_to_env_vars MINOR 40f4f1af: config: Use 'APP' prefixed env vars * 0.0.26 "David Caro " MINOR 0088c65f: Merge pull request #43 from bittirousku/edpharvest MINOR 23b61492: loaders: more subtitle input loaders MINOR 4d508f03: utils: add functionality to extract section from journal title MINOR 23dbbfb3: spiders: new EDP Sciences spider * 0.0.25 "Samuele Kaplun " MINOR 9a02072e: Merge pull request #50 from kaplun/pr/46 MINOR 1d707d50: arXiv_spider: collaboration, comment... FIXED ISSUES: http://github.com/inspirehep/hepcrawl/issues/46 * 0.0.24 "Eamonn Maguire " MINOR e3df926b: Merge pull request #45 from inspirehep/new-crawler-install-guide MINOR fe282063: install: updated installation instructions * 0.0.23 "Jacopo Notarstefano " MINOR d9cb6d6a: Merge pull request #44 from mihaibivol/fix-thesis MINOR 1a845213: thesis: adapt to new schema * 0.0.22 "Jan Åge Lavik " MINOR 41affdda: Merge pull request #42 from jalavik/fixrelatives MINOR dcf48575: global: use relative xpath expressions * 0.0.21 "Jan Åge Lavik " MINOR 5b094cc7: Merge pull request #41 from jalavik/arxiv_fix MINOR bd2cd4c8: arxiv_spider: fix xpath expressions to be relative MINOR 4f0cd6cb: loaders: more title input loaders * 0.0.20 "Samuele Kaplun " MINOR 29e7776b: Merge pull request #40 from kaplun/split_pages MINOR 8ac1497a: general: journal_pages -> journal_fpage/lpage * 0.0.19 "Jan Åge Lavik " MINOR 88d88bc3: Merge pull request #36 from bittirousku/hindawiharvest MINOR 8058d6ad: dnb_spider: small fixes MINOR aa29fcf7: spiders: new Hindawi spider MINOR 6d98447e: iop_spider: small fixes MINOR 1de859e3: utils: improve `get_mime_type` * 0.0.18 "Jan Aage Lavik " MINOR 5a22d063: pipelines: allow existing publication_info * 0.0.17 "Jan Aage Lavik " MINOR 9d747cbd: items: data model update to references * 0.0.16 "Jan Aage Lavik " MINOR 72a24555: settings: better default options for INSPIRE pipeline * 0.0.15 "Jan Aage Lavik " MINOR 6a5b5ef3: docs: add system level packages note * 0.0.14 "Jan Aage Lavik " MINOR 955fa919: pipelines: field_categories.source as 'publisher' * 0.0.13 "Jan Aage Lavik " MINOR 157bb85b: loaders: page_nr should be a list * 0.0.12 "Jan Aage Lavik " MINOR c1d2f3e5: arxiv_spider: extraction updates and docs * 0.0.11 "Jan Åge Lavik " MINOR 5e7b0ffe: Merge pull request #35 from bittirousku/clean_tests MINOR d8767abe: spiders: improve documentation MINOR 30651eff: tests: clean tests * 0.0.10 "Jan Åge Lavik " MINOR 97172afe: Merge pull request #34 from jalavik/field_categories_etc MINOR 0489bb66: utils: ftputil usage MINOR 4be11bd2: items: rename subject_terms -> field_categories MINOR a0306e3c: inputs: use unicode literal MINOR f0d75a26: items: update urls * 0.0.9 "Jan Aage Lavik " MINOR 842aadca: pos_spider: metadata extraction fix * 0.0.8 "Jan Åge Lavik " MINOR c6451d00: Merge pull request #31 from bittirousku/mitharvest MINOR e0fcd970: spiders: fix thesis supervisor getting MINOR 7e915448: tests: improvements to `get_node` MINOR f1dc29e7: inputs: new `parse_thesis_supervisors` loader function MINOR 958fac49: spiders: new MIT spider MINOR d8c715fe: MANIFEST: add tar.gz and pdf * 0.0.7 "Jan Åge Lavik " MINOR 62de34d8: Merge pull request #29 from bittirousku/iopharvest MINOR 796fdff7: spiders: new IOP spider MINOR a8df04eb: inputs: improve whitespace stripping * 0.0.6 "Jan Åge Lavik " MINOR 4ac7c675: Merge pull request #28 from bittirousku/infnharvest MINOR 6044640f: spiders: new INFN spider * 0.0.5 "Jan Åge Lavik " MINOR 4900b5fe: Merge pull request #32 from bittirousku/fix_scrapy_requests MINOR 859711b6: spiders: fix issue with scrapy selectors in request.meta * 0.0.4 "Henrik Vesterinen " MINOR 7455aa6f: spiders: new MAGIC spider * 0.0.3 "Jan Aage Lavik " MINOR e2695403: docs: major update * 0.0.2 "Jan Aage Lavik " MINOR e5bcc416: dateutils: fixes and updates * 0.0.1 "Jan Aage Lavik " MINOR 7bf00c5e: aps: parameter and pagination support