Faculty Advisor

Sarkozy, Gabor N.

Faculty Advisor

Selkow, Stanley M.

Center

Budapest, Hungary

Abstract

Our MQP aimed to introduce finite-state-machine-based techniques for natural language processing into Hunspell, the world's premier open-source spell checker, which is used in several prominent projects such as Firefox and OpenOffice. We created compact, machine-readable finite-state transducer representations of 26 of the most commonly used languages on Wikipedia. We then created an automata-based spell checker. In addition, we implemented a transducer-based stemmer, which will be used in future work on transducer-based morphological analysis.

Publisher

Worcester Polytechnic Institute

Date Accepted

April 2010

Major

Computer Science

Major

Mathematical Sciences

Project Type

Major Qualifying Project

Accessibility

Unrestricted

Advisor Department

Computer Science
