PyThaiNLP v4.0.0-beta1

Pre-release
@wannaphong wannaphong released this 01 Apr 16:28
b881672

This post gives the change log for PyThaiNLP 4.0. PyThaiNLP published its first version, 0.0.4, to PyPI 6 years ago, so PyThaiNLP 4.0 has a special codename: PyThaiNLP 4.0 (Real).

This release is the first beta release of PyThaiNLP 4.0.

Schedule

  • Beta release: 1 April 2023
  • Production release: 14 April 2023

See 4.0 Milestone.

What is new?

Deprecation and other API changes

  • Delete all LST20 model #728
  • Change pythainlp.tools.misspell to pythainlp.tools.misspell.misspell 947c7be

Improve

  • Reduce import time #719
  • Fix broken numeric data format (#652) #723

Tokenizer

  • Add blackboard cls #732
  • Add rule to TCC and Change TCC rule for newmm #741

Tag

  • Add blackboard pos_tag #733
  • Add ThaiNER 2.0 #781

Util

  • Add pythainlp.util.count_thai_chars #748
  • Add thai_strptime and convert_years #767
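
For readers unfamiliar with what these utilities deal with, here is a minimal, self-contained sketch of the two underlying ideas: converting between Buddhist Era (BE) and Anno Domini (AD) years, and counting Thai characters in a string. It does not call PyThaiNLP, and the helper names (be_to_ad, ad_to_be, count_thai_block_chars) are hypothetical and chosen only for illustration. The real pythainlp.util functions may take different arguments and return richer results, so check the library documentation before relying on any of this.

```python
# Illustrative sketch only: these helpers are hypothetical and are NOT
# PyThaiNLP's implementation or API.

BE_OFFSET = 543  # Buddhist Era year = Anno Domini year + 543


def be_to_ad(be_year: int) -> int:
    """Convert a Buddhist Era year to an Anno Domini year."""
    return be_year - BE_OFFSET


def ad_to_be(ad_year: int) -> int:
    """Convert an Anno Domini year to a Buddhist Era year."""
    return ad_year + BE_OFFSET


def count_thai_block_chars(text: str) -> int:
    """Count characters that fall in the Unicode Thai block (U+0E00 to U+0E7F)."""
    return sum(1 for ch in text if "\u0e00" <= ch <= "\u0e7f")


print(be_to_ad(2566))                         # 2023
print(ad_to_be(2023))                         # 2566
print(count_thai_block_chars("ภาษาไทย NLP"))  # 7
```

The sketch returns a single total for Thai characters; a per-category breakdown (consonants, vowels, tone marks, and so on) would need a lookup table for each class.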

Transliterate

  • Add Thai2Rom ONNX model #743

Khavee

  • Add khavee to pythainlp #777
  • Add aek/too checker function to khavee #779

Parse

  • Add ud_goeswith #757

Corpus

  • Add new science word #763

Full Changelog: v3.1.0...v4.0.0-beta1