A Simple and Effective Neural Model for Joint Word Segmentation and POS Tagging

IEEE/ACM Transactions on Audio Speech and Language Processing ◽

10.1109/taslp.2018.2830117 ◽

2018 ◽

Vol 26 (9) ◽

pp. 1528-1538 ◽

Author(s):

Meishan Zhang ◽

Nan Yu ◽

Guohong Fu

Keyword(s):

Neural Model ◽

Word Segmentation ◽

A Coarse-to-Fine Labeling Framework for Joint Word Segmentation, POS Tagging, and Constituent Parsing

10.18653/v1/2021.conll-1.23 ◽

2021 ◽

Author(s):

Yang Hou ◽

Houquan Zhou ◽

Zhenghua Li ◽

Yu Zhang ◽

Min Zhang ◽

...

Keyword(s):

Word Segmentation ◽

POS Tagging ◽

MiNgMatch—A Fast N-gram Model for Word Segmentation of the Ainu Language

Information ◽

10.3390/info10100317 ◽

2019 ◽

Vol 10 (10) ◽

pp. 317 ◽

Author(s):

Karol Nowakowski ◽

Michal Ptaszynski ◽

Fumito Masui

Keyword(s):

Language Processing ◽

High Performance ◽

Computational Cost ◽

Neural Model ◽

Word Segmentation ◽

Coarse Grained ◽

Endangered Language ◽

Modelling Techniques ◽

Series Of Experiments ◽

Word segmentation is an essential task in automatic language processing for languages where there are no explicit word boundary markers, or where space-delimited orthographic words are too coarse-grained. In this paper we introduce the MiNgMatch Segmenter—a fast word segmentation algorithm, which reduces the problem of identifying word boundaries to finding the shortest sequence of lexical n-grams matching the input text. In order to validate our method in a low-resource scenario involving extremely sparse data, we tested it with a small corpus of text in the critically endangered language of the Ainu people living in northern parts of Japan. Furthermore, we performed a series of experiments comparing our algorithm with systems utilizing state-of-the-art lexical n-gram-based language modelling techniques (namely, Stupid Backoff model and a model with modified Kneser-Ney smoothing), as well as a neural model performing word segmentation as character sequence labelling. The experimental results we obtained demonstrate the high performance of our algorithm, comparable with the other best-performing models. Given its low computational cost and competitive results, we believe that the proposed approach could be extended to other languages, and possibly also to other Natural Language Processing tasks, such as speech recognition.
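
The reduction described in the abstract above (finding the shortest sequence of lexical n-grams that matches the input text) can be illustrated with a small dynamic-programming sketch. This is not the authors' MiNgMatch implementation: the function name `segment`, the toy table `NGRAMS`, and the English example are hypothetical, and the sketch assumes each n-gram is stored as a mapping from its unsegmented form to its segmented form.

```python
from typing import Dict, List, Optional, Tuple

# Hypothetical toy n-gram table: unsegmented form -> segmented form.
# A real table would be extracted from a segmented training corpus.
NGRAMS: Dict[str, str] = {
    "the": "the",
    "cat": "cat",
    "sat": "sat",
    "thecat": "the cat",  # a 2-gram covering two words at once
}


def segment(text: str, ngrams: Dict[str, str]) -> Optional[str]:
    """Cover `text` with as few n-gram entries as possible.

    Returns the segmented string, or None if no sequence of table
    entries matches the whole input.
    """
    n = len(text)
    # best[i] holds (number of n-grams used, segmented pieces) for text[:i],
    # or None if text[:i] cannot be covered by table entries.
    best: List[Optional[Tuple[int, List[str]]]] = [None] * (n + 1)
    best[0] = (0, [])
    for end in range(1, n + 1):
        for start in range(end):
            prev = best[start]
            if prev is None:
                continue
            piece = text[start:end]
            if piece not in ngrams:
                continue
            count = prev[0] + 1
            current = best[end]
            # Keep the covering that uses strictly fewer n-grams.
            if current is None or count < current[0]:
                best[end] = (count, prev[1] + [ngrams[piece]])
    final = best[n]
    return None if final is None else " ".join(final[1])


print(segment("thecatsat", NGRAMS))
print(segment("thedog", NGRAMS))
```

In this toy table the input `thecatsat` can be covered by three unigrams or by the 2-gram `thecat` followed by `sat`; the sketch prefers the covering with fewer entries. How unmatched spans and ties are handled is a design choice that the abstract does not specify, so here an unmatched input simply yields `None`.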

Thai personal named entity extraction without using word segmentation or POS tagging

2009 Eighth International Symposium on Natural Language Processing ◽

10.1109/snlp.2009.5340914 ◽

2009 ◽

Author(s):

P. Sutheebanjard ◽

W. Premchaiswadi

Keyword(s):

Word Segmentation ◽

Entity Extraction ◽

Named Entity ◽

POS Tagging ◽

Named Entity Extraction

LM Enhanced BiRNN-CRF for Joint Chinese Word Segmentation and POS Tagging

Natural Language Processing and Chinese Computing - Lecture Notes in Computer Science ◽

10.1007/978-3-319-99501-4_9 ◽

2018 ◽

pp. 105-116

Author(s):

Jianhu Zhang ◽

Gongshen Liu ◽

Jie Zhou ◽

Cheng Zhou ◽

Huanrong Sun

Keyword(s):

Word Segmentation ◽

Chinese Word ◽

Chinese Word Segmentation ◽

Exploiting Heterogeneous Annotations for Weibo Word Segmentation and POS Tagging

Natural Language Processing and Chinese Computing - Lecture Notes in Computer Science ◽

10.1007/978-3-319-25207-0_46 ◽

2015 ◽

pp. 495-506 ◽

Author(s):

Jiayuan Chao ◽

Zhenghua Li ◽

Wenliang Chen ◽

Min Zhang

Keyword(s):

Word Segmentation ◽

Encoding multi-granularity structural information for joint Chinese word segmentation and POS tagging

Pattern Recognition Letters ◽

10.1016/j.patrec.2020.07.017 ◽

2020 ◽

Vol 138 ◽

pp. 163-169

Author(s):

Ling Zhao ◽

Ailian Zhang ◽

Ying Liu ◽

Hao Fei

Keyword(s):

Structural Information ◽

Word Segmentation ◽

Chinese Word ◽

Chinese Word Segmentation ◽

Research on the Method and System of Word Segmentation and POS Tagging for Ancient Chinese Medicine Literature

2019 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) ◽

10.1109/bibm47256.2019.8983361 ◽

2019 ◽

Author(s):

Xianjun Fu ◽

Ting Yuan ◽

Xuebo Li ◽

Zhenguo Wang ◽

Yang Zhou ◽

...

Keyword(s):

Chinese Medicine ◽

Word Segmentation ◽

An effective joint model for Chinese word segmentation and POS tagging

Proceedings of the 2016 International Conference on Intelligent Information Processing - ICIIP '16 ◽

10.1145/3028842.3028877 ◽

2016 ◽

Author(s):

Heng-Jun Wang ◽

Nian-Wen Si ◽

Cheng Chen

Keyword(s):

Joint Model ◽

Word Segmentation ◽

Chinese Word ◽

Chinese Word Segmentation ◽

Depicting a Neural Model for Lemmatization and POS Tagging of Words from Palaeographic Stone Inscriptions

2021 5th International Conference on Intelligent Computing and Control Systems (ICICCS) ◽

10.1109/iciccs51141.2021.9432315 ◽

2021 ◽

Author(s):

S. Ezhilarasi ◽

P. Uma Maheswari

Keyword(s):

Neural Model ◽

A Fine-Grained Domain Adaption Model for Joint Word Segmentation and POS Tagging

10.18653/v1/2021.emnlp-main.291 ◽

2021 ◽

Author(s):

Peijie Jiang ◽

Dingkun Long ◽

Yueheng Sun ◽

Meishan Zhang ◽

Guangwei Xu ◽

...

Keyword(s):

Word Segmentation ◽

Fine Grained ◽

POS Tagging ◽

Domain Adaption
