Ground parrot vocalisation can be treated as an audio event. Test-based diverse density multiple instance learning (TB-DD-MIL) is proposed for detecting this event in audio files recorded in the field. The method is motivated by the ability of multiple instance learning to learn from incomplete training data. Spectral features suitable for encoding the vocal source information of ground parrot vocalisation are also investigated. The proposed method was benchmarked on a dataset collected under various environmental conditions, and an audio detection evaluation scheme is proposed. The evaluation includes a study of the performance of the various vocal source features and a comparison with other classification techniques. Experimental results indicated that spectral bandwidth is the most appropriate feature for encoding ground parrot calls and that the proposed TB-DD-MIL method outperformed other existing classification methods.
DOI: http://dx.doi.org/10.1121/1.4999318