Resources for download
In accordance with the philosophy of our center, the following open-source materials, the results of our projects, are freely available for anyone interested.
Software
- Hunpos is a HMM based open source part-of-speech tagger.
- Hunmorph is an open source tool and programming library for spell-checking, stemming and morphological analysing of agglutinative, german and other languages.
- hunalign is a language independent sentence level aligner to build parallel corpora.
Language resources
- The Hunglish Corpus is a sentence-aligned Hungarian-English parallel corpus published under the Creative Commons Attribution license.
- The Hungarian Webcorpus is a gigaword corpus of Hungarian gathered from the web.
- The Hunglish dictionary is a machine readable English-Hungarian bilingual lexicon.
- morphdb.hu is a Hungarian morphological database for use with Hunmorph morphological analyzer.