You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: _pages/about.md
+28-2
Original file line number
Diff line number
Diff line change
@@ -4,8 +4,34 @@ title: About PyThaiNLP
4
4
permalink: /about/
5
5
---
6
6
7
-
PyThaiNLP Project is a Thai Natural Language Processing project. We build softwares and datasets for Thai language. Our Main Project is PyThaiNLP.
7
+
PyThaiNLP Project is an open source community for natural Language Processing project in the Thai language. We build softwares and datasets for Thai language. Our Main Project is PyThaiNLP.
8
8
9
-
PyThaiNLP is a Python package for text processing and linguistic analysis, similar to NLTK with focus on Thai language. PyThaiNLP started at 2017.
9
+
## About Project
10
+
11
+
Our project are open source. We create softwares, models, and datasets for Thai language to public and are open source license.
12
+
13
+
**See all our project at [pythainlp.org/projects/](https://pythainlp.org/projects/)**
- Convenient character and word classes, like Thai consonants (pythainlp.thai_consonants), vowels (pythainlp.thai_vowels), digits (pythainlp.thai_digits), and stop words (pythainlp.corpus.thai_stopwords) -- comparable to constants like string.letters, string.digits, and string.punctuation
26
+
- Thai linguistic unit segmentation/tokenization, including sentence (sent_tokenize), word (word_tokenize), and subword segmentations based on Thai Character Cluster (subword_tokenize)
27
+
- Thai part-of-speech tagging (pos_tag)
28
+
- Thai spelling suggestion and correction (spell and correct)
29
+
- Thai transliteration (transliterate)
30
+
- Thai soundex (soundex) with three engines (lk82, udom83, metasound)
31
+
- Thai collation (sort by dictionary order) (collate)
32
+
- Read out number to Thai words (bahttext, num_to_thaiword)
Copy file name to clipboardExpand all lines: index.md
+3-21
Original file line number
Diff line number
Diff line change
@@ -4,32 +4,14 @@ layout: default
4
4
5
5
Welcome to The Official PyThaiNLP Project Website.
6
6
7
-
The PyThaiNLP Project is a Thai Natural Language Processing project. We build softwares and datasets for Thai language. Our Main Project is PyThaiNLP.
7
+
PyThaiNLP Project is an open source community for natural Language Processing project in the Thai language. We build softwares and datasets for Thai language. Our Main Project is PyThaiNLP that is a Python package for text processing and linguistic analysis on Thai language.
8
+
9
+
See more about the projec: [pythainlp.org/about](https://pythainlp.org/about)
8
10
9
11
**See all our project at [pythainlp.org/projects/](https://pythainlp.org/projects/)**
PyThaiNLP is a Python package for text processing and linguistic analysis, similar to nltk, with focus on Thai language.
14
-
15
-
## PyThaiNLP Features
16
-
- Convenient character and word classes, like Thai consonants (pythainlp.thai_consonants), vowels (pythainlp.thai_vowels), digits (pythainlp.thai_digits), and stop words (pythainlp.corpus.thai_stopwords) -- comparable to constants like string.letters, string.digits, and string.punctuation
17
-
- Thai linguistic unit segmentation/tokenization, including sentence (sent_tokenize), word (word_tokenize), and subword segmentations based on Thai Character Cluster (subword_tokenize)
18
-
- Thai part-of-speech tagging (pos_tag)
19
-
- Thai spelling suggestion and correction (spell and correct)
20
-
- Thai transliteration (transliterate)
21
-
- Thai soundex (soundex) with three engines (lk82, udom83, metasound)
22
-
- Thai collation (sort by dictionary order) (collate)
23
-
- Read out number to Thai words (bahttext, num_to_thaiword)
0 commit comments