| | |
Summary: Application of Different Learning Methods to
Hungarian Partofspeech Tagging
Tam'as Horv'ath 1 , Zolt'an Alexin 2 , Tibor Gyim'othy 3 , and Stefan Wrobel 4;1
1 German National Research Center for Information Technology, GMD AiS.KD,
Schloß Birlinghoven, D53754 Sankt Augustin, tamas.horvath@gmd.de
2 Dept. of Applied Informatics, J'ozsef Attila University,
P.O.Box 652, H6701 Szeged, alexin@inf.uszeged.hu
3 Research Group on Artificial Intelligence, Hungarian Academy of Sciences,
Aradi v'ertanuk tere 1, H6720 Szeged, gyimi@inf.uszeged.hu
4 OttovonGuerickeUniversit¨at Magdeburg, IWS,
P.O.Box 4120, D39106 Magdeburg, wrobel@iws.cs.unimagdeburg.de
Abstract. From the point of view of computational linguistics, Hungar
ian is a difficult language due to its complex grammar and rich morphol
ogy. This means that even a common task such as partofspeech tagging
presents a new challenge for learning when looked at for the Hungarian
language, especially given the fact that this language has fairly free word
order. In this paper we therefore present a case study designed to illus
trate the potential and limits of current ILP and nonILP algorithms
on the Hungarian POStagging task. We have selected the popular C4.5
and Progol systems as propositional and ILP representatives, adding ex
|