Speaker diarisation evaluation
We present a new evaluation technique based on segment matches using the F-measure. This gives the user a deeper insight into how well matched the hypothesised segments are to the reference segments.
- Accounts for reference errors using a collar and merging segments of the same speaker occurring within 0.25 seconds of each other.
- Segments must match if the start and end boundaries of the hypothesised segment lies on the start and end boundaries (+/- collar) of the reference segment
- Different distributions (uniform, triangular, Gaussian) on the reference boundary can be considered
- Speaker mapping finds a score or cost involving the number of segments matched between a reference speaker and an hypothesis label and searches every possible combination of pairs, finding the combination of pairs with the lowest cost