eBangor

Universal text preprocessing and postprocessing for PPM using Alphabet Adjustment

Alhawiti, K. and Teahan, W.J. (2014) Universal text preprocessing and postprocessing for PPM using Alphabet Adjustment. In: Proceedings of the Data Compression Conference, Snowbird, Utah, 26 - 28 March 2014.

Full-text not available from this repository.

Abstract

In this paper, we introduce several new universal pre-processing techniques to improve Prediction by Partial Matching (PPM) compression of UTF-8 encoded natural language text. Each method 'adjusts' the alphabet in some manner (for example, by expanding or reducing it) before the compression algorithm is applied to the amended text.
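
As an illustration only, the idea of expanding the alphabet before compression and restoring it afterwards can be sketched as follows. This is not the paper's method, whose details are not given in this record; it is a minimal sketch that assumes one possible adjustment, namely replacing the most frequent bigraphs with unused symbols. The names `build_table`, `preprocess` and `postprocess`, the parameter `k`, and the use of the Unicode Private Use Area for the new symbols are all assumptions made for this example. The compressor itself is omitted.

```python
from collections import Counter

# Assumption for this sketch: new symbols are drawn from the Unicode
# Private Use Area (U+E000..U+F8FF), which natural language text rarely uses.
PUA_START = 0xE000
PUA_END = 0xF8FF


def build_table(text, k=16):
    """Map the k most frequent bigraphs in text to private-use symbols."""
    if k > PUA_END - PUA_START + 1:
        raise ValueError("k exceeds the number of private-use code points")
    if any(PUA_START <= ord(c) <= PUA_END for c in text):
        raise ValueError("text already contains private-use code points")
    counts = Counter(text[i:i + 2] for i in range(len(text) - 1))
    return {
        bigraph: chr(PUA_START + n)
        for n, (bigraph, _) in enumerate(counts.most_common(k))
    }


def preprocess(text, table):
    """Expand the alphabet: greedily replace known bigraphs, left to right."""
    out = []
    i = 0
    while i < len(text):
        pair = text[i:i + 2]
        if pair in table:
            out.append(table[pair])
            i += 2
        else:
            out.append(text[i])
            i += 1
    return "".join(out)


def postprocess(text, table):
    """Undo preprocess: expand every substituted symbol back to its bigraph."""
    inverse = {symbol: bigraph for bigraph, symbol in table.items()}
    return "".join(inverse.get(c, c) for c in text)
```

In this sketch the amended text returned by `preprocess` would be encoded as UTF-8 and passed to the compressor, and `postprocess` would be applied to the decompressed output. The substitution table has to be available to the decoder, and whether any such adjustment actually improves PPM compression is an empirical question that this example does not address.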

Item Type: Conference or Workshop Item (UNSPECIFIED)
Subjects: Research Publications
Departments: College of Physical and Applied Sciences > School of Computer Science
Date Deposited: 09 Dec 2014 16:32
Last Modified: 23 Sep 2015 02:59
ISSN: 1068-0314
URI: http://e.bangor.ac.uk/id/eprint/393
Identification Number: DOI: 10.1109/DCC.2014.12
