Speaker diarisation evaluation

Speaker diarisation evaluation

We present a new evaluation technique based on segment matches using the F-measure. This gives the user a deeper insight into how well matched the hypothesised segments are to the reference segments.

  • Accounts for reference errors using a collar and merging segments of the same speaker occurring within 0.25 seconds of each other.
  • Segments must match if the start and end boundaries of the hypothesised segment lies on the start and end boundaries (+/- collar) of the reference segment
  • Different distributions (uniform, triangular, Gaussian) on the reference boundary can be considered
  • Speaker mapping finds a score or cost involving the number of segments matched between a reference speaker and an hypothesis label and searches every possible combination of pairs, finding the combination of pairs with the lowest cost

Publications

Back to Top