School of Computing

A comparative evaluation of a new unsupervised sentence boundary detection approach on documents in english and portuguese

Jan Strunk, Carlos N. Silla Jr., and Celso A. A. Kaestner

In Computational Linguistics and Intelligent Text Processing, volume 3878 of Lecture Notes in Computer Science, pages 182-196. Springer, February 2006 [doi].

Abstract

In this paper, we describe a new unsupervised sentence boundary detection system and present a comparative study evaluating its performance against di erent systems found in the literature that have been used to perform the task of automatic text segmentation into sentences for English and Portuguese documents. The results achieved by this new approach were as good as those of the previous systems, especially considering that the method does not require any additional training resources.

Download publication 124 kbytes (PDF)

Bibtex Record

@inproceedings{2903,
author = {Jan Strunk and Carlos N. Silla Jr. and Celso A. A. Kaestner},
title = {A Comparative Evaluation of a New Unsupervised Sentence Boundary Detection Approach on Documents in English and Portuguese},
month = {February},
year = {2006},
pages = {182-196},
keywords = {determinacy analysis, Craig interpolants},
note = {},
doi = {10.1007/11671299_16},
url = {http://www.cs.kent.ac.uk/pubs/2006/2903},
    publication_type = {inproceedings},
    submission_id = {14299_1242432094},
    booktitle = {Computational Linguistics and Intelligent Text Processing},
    volume = {3878},
    series = {Lecture Notes in Computer Science},
    publisher = {Springer},
    refereed = {yes},
}

School of Computing, University of Kent, Canterbury, Kent, CT2 7NF

Enquiries: +44 (0)1227 824180 or contact us.

Last Updated: 21/03/2014