Metadata-Version: 2.1
Name: nlpo3
Version: 1.1.3
Summary: Python binding for nlpO3 Thai language processing library
Home-page: https://github.com/PyThaiNLP/nlpo3/
Author: Thanathip Suntorntip, Arthit Suriyawongkul, Wannaphong Phatthiyaphaibun
Author-email: wannaphong@yahoo.com
License: Apache-2.0
Description: 
        Python binding for nlpO3, a Thai natural language processing library in Rust.
        
        
        - Thai word tokenizer
        - use maximal-matching dictionary-based tokenization algorithm and honor Thai Character Cluster boundaries
        - 2.5x faster than similar pure Python implementation
        - built-in dictionary included (62,000 words, a copy from PyThaiNLP)
        - support custom dictionary
        
        
        
        ```bash
        pip install nlpo3
        ```
        
        
        Tokenization using default dictionary:
        ```python
        from nlpo3 import segment
        
        segment("สวัสดีครับ")  # returns ["สวัสดี", "ครับ"]
        ```
        
        Load file `path/to/dict.file` to memory and assigned it with name `custom_dict`.
        Then tokenize a text with `custom_dict` dictionary:
        ```python
        from nlpo3 import load_dict, segment
        
        load_dict("path/to/dict.file", "custom_dict")
        segment("สวัสดีครับ", "custom_dict")
        ```
        
        For more documentation, go [https://github.com/PyThaiNLP/nlpo3](https://github.com/PyThaiNLP/nlpo3)
Keywords: thai,tokenizer,nlp,word-segmentation,pythainlp
Platform: UNKNOWN
Classifier: Development Status :: 5 - Production/Stable
Classifier: Programming Language :: Python :: 3 :: Only
Classifier: Programming Language :: Python :: 3.6
Classifier: Programming Language :: Python :: 3.7
Classifier: Programming Language :: Python :: 3.8
Classifier: Programming Language :: Python :: 3.9
Classifier: Intended Audience :: Developers
Classifier: License :: OSI Approved :: Apache Software License
Classifier: Natural Language :: Thai
Classifier: Topic :: Text Processing :: Linguistic
Classifier: Topic :: Software Development :: Libraries :: Python Modules
Requires-Python: >=3.6
Description-Content-Type: text/markdown
