Releases: PyThaiNLP/pythainlp-corpus

OSCAR word freq icu v1.0

11 Jun 11:46
a217e7d

OSCAR word frequency v0.1, built using ICU word tokenization

Authors: Korakot Chaovavanich @korakot

from https://web.facebook.com/groups/colab.thailand/permalink/1524070061101680/?_rdc=1&_rdr
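
As a rough illustration of how a word-frequency list of this kind can be produced, the sketch below counts tokens across lines of text. It is not the script used for this release: `tokenize` is a hypothetical whitespace stand-in for ICU word tokenization, and the sample lines are invented.

```python
from collections import Counter


def tokenize(text):
    # Hypothetical stand-in: plain whitespace splitting.
    # The release itself was built with ICU word tokenization.
    return text.split()


def word_freq(lines):
    """Count how often each token occurs across all lines."""
    counts = Counter()
    for line in lines:
        counts.update(tokenize(line))
    return counts


# Invented sample input, for illustration only.
sample = ["a b a", "b c a"]
freq = word_freq(sample)

# Print tokens from most to least frequent.
for word, n in freq.most_common():
    print(word, n)
```

Swapping in a real Thai word tokenizer only requires replacing the body of `tokenize`; the counting logic stays the same.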

TNC unigram 201712 and bi/tri-gram 201705

11 Jun 11:18
4450a19
  • 201705_2gram.txt: State
  • 201705_3gram.txt: State
  • tncwordfreq-201712.xlsx: State

This is a mirror of the TNC word frequency data from the Thai National Corpus (TNC).

Web: http://www.arts.chula.ac.th/ling/tnc/searchtnc/

LST20 CLS v0.2

03 Oct 09:41
0ed930f

  • Change feature names

LST20 v0.2.3

16 Sep 20:32
a57265e

  • pos_lst20_perceptron-v0.2.3.pkl: State
  • pos_lst20_unigram-v0.2.3.json: State

LST20 Part-of-speech

  • port LST20 model

LST20 CLS v0.1

16 Sep 20:33
a57265e
lst20-cls-v0.1

Update db.json

LST20 v0.2.2

18 Aug 22:14
  • Rename taggers to pos_lst20_unigram and pos_lst20_perceptron, following the convention of other POS taggers in PyThaiNLP
  • Minify Unigram JSON file
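
For readers unfamiliar with the term, minifying a JSON file means re-serializing it without indentation or spaces after separators. The sketch below shows one way to do that with Python's standard `json` module. It assumes, purely for illustration, that the unigram file is a flat word-to-tag mapping; the two entries and the helper name `minify_json` are hypothetical and are not taken from the released file.

```python
import json


def minify_json(text):
    """Re-serialize JSON text with no indentation or extra whitespace."""
    data = json.loads(text)
    # separators=(",", ":") drops the default spaces after commas and colons;
    # ensure_ascii=False keeps Thai characters readable instead of \u escapes.
    return json.dumps(data, ensure_ascii=False, separators=(",", ":"))


# Hypothetical two-entry word-to-tag mapping, pretty-printed.
pretty = json.dumps({"แมว": "NN", "กิน": "VV"}, ensure_ascii=False, indent=4)
small = minify_json(pretty)

print(len(pretty), len(small))
print(small)
```

The minified text parses to the same data as the original, so consumers of the file are unaffected; only the size on disk changes.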

LST20 v0.2

11 Aug 13:37
654a49d
lst20-v0.2

Update db.json

LST20 v0.1

11 Aug 12:23
5bde88c
lst20-v0.1

Update LICENSE

wiki_lm_lstm v0.32

13 Jun 14:59
a2087b3

thai2fit_wv v0.1

14 Jun 09:31
a2087b3
