PyAnnote is a Python module for collaborative annotation of multimedia content.
It provides a (hopefully) easy-to-use Python API to store and manipulate temporal annotations of multimedia content. Applications that can benefit from PyAnnote include video shot (or scene) boundary detection, speaker diarization, audio indexing based on automatic speech transcription or audio/visual person recognition, etc.
Various evaluation metrics are implemented, for the most common automatic audio/video processing tasks such as temporal segmentation, speech activity detection, speaker diarization and identification, etc.
A limited set of automatic annotation algorithms are available for now, that should be increased as PyAnnote evolve in the future.
PyAnnote was started in 2012 by Herve Bredin (bredin@limsi.fr) in the framework of the REPERE challenge, focusing on audiovisual people recognition in TV videos.
PyAnnote is distributed under the GNU General Public License:
Copyright 2012 Herve BREDIN (bredin@limsi.fr)
PyAnnote is free software: you can redistribute it and/or modify
it under the terms of the GNU General Public License as published by
the Free Software Foundation, either version 3 of the License, or
(at your option) any later version.
PyAnnote is distributed in the hope that it will be useful,
but WITHOUT ANY WARRANTY; without even the implied warranty of
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
GNU General Public License for more details.
You should have received a copy of the GNU General Public License
along with PyAnnote. If not, see <http://www.gnu.org/licenses/>.