Επιτάχυνση ενός αλγόριθμου διαχωρισμού ομιλίας/ μουσικής με χρήση FPGA

Φαρδής, Στέφανος-Αντώνιος Π.

FPGA-based acceleration of a speech / music discrimination algorithm

Master Thesis

Author

Φαρδής, Στέφανος-Αντώνιος Π.

Date

2014

Abstract

This thesis objective is the implementation of a speech-music discrimination algorithm. This refers to the identification and classification of audio stream segments - or else frames - as speech or music. The implemented method is based on extracting from the audio recording, a feature called spectral entropy. As soon as spectral entropy has been extracted from an audio frame, mean value and standard deviation are calculated. Frames are classified depending on standard deviation's value, and specifically whether it is bigger than a predefined threshold. The method was implemented on Xilinx ML403 development board, which includes an FPGA of the Virtex 4 family. The developed system is based on the PowerPC processor that resides within the FPGA. Parts of the algorithm, and specifically audio frames sampling and frequency spectrum calculation are implemented directly in hardware for performance reasons.

Postgraduate Studies Programme

Προηγμένα Συστήματα Πληροφορικής

Department

Σχολή Τεχνολογιών Πληροφορικής και Επικοινωνιών. Τμήμα Πληροφορικής

Number of pages

Language

Greek

URI

https://dione.lib.unipi.gr/xmlui/handle/unipi/7007

Collections

Τμήμα Πληροφορικής

Show full item record

Except where otherwise noted, this item's license is described as
Attribution-NonCommercial-NoDerivatives 4.0 Διεθνές