Download PDFOpen PDF in browser

A Corpus-based Study of Japanese Verb Paradigms (Preliminary results)

12 pagesPublished: March 18, 2019

Abstract

The Japanese language has a great variety of verb inflectional suffixes (auxiliaries), each having conjugation of their own. In this paper we propose a corpus-based approach to studying Japanese verb paradigms. Such an approach benefits from identifying possible verb forms on big data of written language. Description of methods and tools used for building databases of verbs and auxiliaries and for parsing verb 7-grams from a Japanese N-gram Corpus is presented.

Keyphrases: corpus based study, japanese dictionary, japanese language, ngrams corpus, verb conjugation, verb paradigm study

In: Gerhard Wohlgenannt, Ruprecht von Waldenfels, Svetlana Toldova, Ekaterina Rakhilina, Denis Paperno, Olga Lyashevskaya, Natalia Loukachevitch, Sergei O. Kuznetsov, Olga Kultepina, Dmitry Ilvovsky, Boris Galitsky, Ekaterina Artemova and Elena Bolshakova (editors). Proceedings of Third Workshop "Computational linguistics and language science", vol 4, pages 33-44.

BibTeX entry
@inproceedings{CLLS2018:Corpus_based_Study_Japanese,
  author    = {Anna Novoselova and Alexander Kostyrkin},
  title     = {A Corpus-based Study of Japanese Verb Paradigms (Preliminary results)},
  booktitle = {Proceedings of Third Workshop "Computational linguistics and language science"},
  editor    = {Gerhard Wohlgenannt and Ruprecht von Waldenfels and Svetlana Toldova and Ekaterina Rakhilina and Denis Paperno and Olga Lyashevskaya and Natalia Loukachevitch and Sergei O. Kuznetsov and Olga Kultepina and Dmitry Ilvovsky and Boris Galitsky and Ekaterina Artemova and Elena Bolshakova},
  series    = {EPiC Series in Language and Linguistics},
  volume    = {4},
  publisher = {EasyChair},
  bibsource = {EasyChair, https://easychair.org},
  issn      = {2398-5283},
  url       = {/publications/paper/8Smc},
  doi       = {10.29007/tvck},
  pages     = {33-44},
  year      = {2019}}
Download PDFOpen PDF in browser