Welcome to Pyteomics tutorial!¶
What is Pyteomics?¶
Pyteomics is a collection of lightweight and handy tools for Python that help to handle various sorts of proteomics data. Pyteomics provides a growing set of modules to facilitate the most common tasks in proteomics data analysis, such as:
- calculation of basic physico-chemical properties of polypeptides:
- mass and isotopic distribution
- charge and pI
- chromatographic retention time
- access to common proteomics data:
- MS or LC-MS data
- FASTA databases
- search engines output
- easy manipulation of sequences of modified peptides and proteins
The goal of the Pyteomics project is to provide a versatile, reliable and well-documented set of open tools for the wide proteomics community. One of the project’s key features is Python itself, an open source language increasingly popular in scientific programming. The main applications of the library are reproducible statistical data analysis and rapid software prototyping.
- Pyteomics is hosted at the following sites:
- Python package @ Python Package Index: http://pypi.python.org/pypi/pyteomics
- project documentation @ python.org: http://pythonhosted.org/pyteomics
- source code @ bitbucket: http://hg.theorchromo.ru/pyteomics
- mailing list @ Google: https://groups.google.com/group/pyteomics
Feedback & Support¶
Please email to pyteomics@googlegroups.com with any questions about Pyteomics. You are welcome to use the BitBucket issue tracker to report bugs, request features, etc.
Citation¶
Pyteomics is distributed under Apache License version 2.0.
When using or redistributing Pyteomics, or parts of it, please cite the following paper:
Goloborodko, A.A.; Levitsky, L.I.; Ivanov, M.V.; and Gorshkov, M.V. (2013) “Pyteomics - a Python Framework for Exploratory Data Analysis and Rapid Software Prototyping in Proteomics”, Journal of The American Society for Mass Spectrometry, 24(2), 301–304. DOI: 10.1007/s13361-012-0516-6
Relation to the other proteomics data analysis tools¶
Our goal is to create an infrastructure for proteomics data analysis within Python ecosystem. Pyteomics is not a proteomic search engine, nor does it any data conversion. There are other tools for that. Pyteomics does not aim to substitute any of these, but rather to coexist and complement them.
Contents:¶
- Introduction
- How to install Pyteomics
- Peptide sequence formats. Parser module
- Peptide properties: mass, charge, chromatographic retention
- Data access
- Pyteomics API documentation
- parser - operations on modX peptide sequences
- mass - molecular masses and isotope distributions
- achrom - additive model of polypeptide chromatography
- electrochem - electrochemical properties of polypeptides
- fasta - manipulations with FASTA databases
- mzml - reader for mass spectrometry data in mzML format
- mgf - read and write MS/MS data in Mascot Generic Format.
- pepxml - pepXML file reader
- mzid - mzIdentML file reader
- auxiliary - common functions and objects
- pylab_aux - auxiliary functions for plotting with pylab
- Combined examples
- History of changes