PyAnnote

PyAnnote is a Python module for collaborative annotation of multimedia content.

It provides a (hopefully) easy-to-use Python API to store and manipulate temporal annotations of multimedia content. Applications that can benefit from PyAnnote include video shot (or scene) boundary detection, speaker diarization, audio indexing based on automatic speech transcription or audio/visual person recognition, etc.

Various evaluation metrics are implemented, for the most common automatic audio/video processing tasks such as temporal segmentation, speech activity detection, speaker diarization and identification, etc.

A limited set of automatic annotation algorithms are available for now, that should be increased as PyAnnote evolve in the future.

Author(s)

PyAnnote was started in 2012 by Herve Bredin (bredin@limsi.fr) in the framework of the REPERE challenge, focusing on audiovisual people recognition in TV videos.

Licence

PyAnnote is distributed under the GNU General Public License:

Copyright 2012 Herve BREDIN (bredin@limsi.fr)

    PyAnnote is free software: you can redistribute it and/or modify
    it under the terms of the GNU General Public License as published by
    the Free Software Foundation, either version 3 of the License, or
    (at your option) any later version.

    PyAnnote is distributed in the hope that it will be useful,
    but WITHOUT ANY WARRANTY; without even the implied warranty of
    MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
    GNU General Public License for more details.

    You should have received a copy of the GNU General Public License
    along with PyAnnote.  If not, see <http://www.gnu.org/licenses/>.

Indices and tables

Table Of Contents

Next topic

Getting started

This Page