Recall is the Proper Evaluation Metric for Word Segmentation

Yan Shao, Christian Hardmeier, Joakim Nivre

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract / Description of output

We extensively analyse the correlations and drawbacks of conventionally employed evaluation metrics for word segmentation. Unlike in standard information retrieval, precision favours under-splitting systems and therefore can be misleading in word segmentation. Overall, based on both theoretical and experimental analysis, we propose that precision should be excluded from the standard evaluation metrics and that the evaluation score obtained by using only recall is sufficient and better correlated with the performance of word segmentation systems.
Original languageEnglish
Title of host publicationProceedings of the Eighth International Joint Conference on Natural Language Processing (Volume 2: Short Papers)
Place of PublicationTaipei, Taiwan
PublisherAsian Federation of Natural Language Processing
Pages86-90
Number of pages5
Volume2
ISBN (Electronic)978-1-948087-01-8
Publication statusPublished - 1 Dec 2017
EventThe 8th International Joint Conference on Natural Language Processing - Taipei, Taiwan, Province of China
Duration: 27 Nov 20171 Dec 2017
http://ijcnlp2017.org/site/page.aspx?pid=901&sid=1133&lang=en

Conference

ConferenceThe 8th International Joint Conference on Natural Language Processing
Abbreviated titleIJCNLP 2017
Country/TerritoryTaiwan, Province of China
CityTaipei
Period27/11/171/12/17
Internet address

Fingerprint

Dive into the research topics of 'Recall is the Proper Evaluation Metric for Word Segmentation'. Together they form a unique fingerprint.

Cite this