Chris Pollett >
Students > [Bio] [Blog] |
Implemented a Segmenter for Yioop to segment Chinese sentence into wordsDescriptionThis is the project related to Chinese NLP. Since I am not good at Machine learning and cannot implement the conditional random Field by myself. I find an alternative way to do it by combining a statistic way and a dictionary search way. The result seems good. Academia Sinica(AS) dataset has an accuracy of 96.5% and Peking University(PKU) dataset has an accuracy of 89.8%. Results: |