Word Segmentation for Japanese and Indonesian


Here are some word segmentation tools for Japanese and Indonesian, with their reference papers where available.

A. Japanese

KyTea

  1. Graham Neubig, Yosuke Nakata, Shinsuke Mori. Pointwise Prediction for Robust, Adaptable Japanese Morphological Analysis. The 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies (ACL-HLT). Portland, Oregon, USA. June 2011.
  2. Graham Neubig, Shinsuke Mori. Word-based Partial Annotation for Efficient Corpus Construction. The Seventh International Conference on Language Resources and Evaluation (LREC 2010). Malta. May 2010.

MeCab

ChaSen

  1. Yuji Matsumoto, Kazuma Takaoka, and Masayuki Asahara. ChaSen Morphological Analyzer version 2.4.0. 2007.

JUMAN

Juman++

  1. Arseny Tolmachev, Daisuke Kawahara and Sadao Kurohashi:
    Juman++: A Morphological Analysis Toolkit for Scriptio Continua,
    Proceedings of EMNLP 2018: Conference on Empirical Methods in Natural Language Processing, System Demonstrations, pp.54–59, Brussels, Belgium, (2018.11).

Kuromoji

B. Bahasa Indonesia

MorphInd

  1. Septina Dian Larasati, Vladislav Kuboň, and Daniel Zeman. Indonesian Morphology Tool (MorphInd): Towards an Indonesian Corpus. SFCM 2011. August 2011. Zurich, Switzerland. To appear in the Springer CCIS proceedings of the Workshop on Systems and Frameworks for Computational Morphology.

POS Tagger for Bahasa Indonesia

  1. Fam Rashel, Andry Luthfi, Arawinda Dinakaramani, and Ruli Manurung. Building an Indonesian Rule-Based Part-of-Speech Tagger.

Sastrawi

