Metadata-Version: 2.1
Name: baikalnlpy
Version: 1.0.0
Summary: The baikal nlp python client library
Home-page: https://baikal.ai/nlpdemo
License: BSD-3-Clause
Keywords: NLP,Korean,Deep Learning,POS tagger,baikal.ai
Author: Gihyun YUN
Author-email: gih2yun@baikal.ai
Requires-Python: >=3.6,<4.0
Classifier: Development Status :: 5 - Production/Stable
Classifier: Intended Audience :: Developers
Classifier: Intended Audience :: Education
Classifier: Intended Audience :: Information Technology
Classifier: Intended Audience :: Science/Research
Classifier: License :: OSI Approved :: BSD License
Classifier: Natural Language :: Korean
Classifier: Operating System :: MacOS
Classifier: Operating System :: Microsoft :: Windows
Classifier: Operating System :: OS Independent
Classifier: Operating System :: POSIX
Classifier: Operating System :: Unix
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.6
Classifier: Programming Language :: Python :: 3.7
Classifier: Programming Language :: Python :: 3.8
Classifier: Programming Language :: Python :: 3.9
Classifier: Topic :: Scientific/Engineering
Classifier: Topic :: Scientific/Engineering :: Artificial Intelligence
Classifier: Topic :: Scientific/Engineering :: Information Analysis
Classifier: Topic :: Software Development
Classifier: Topic :: Software Development :: Libraries
Classifier: Typing :: Typed
Requires-Dist: baikalai-apis (>=0.9.1,<0.10.0)
Requires-Dist: googleapis-common-protos (>=1.56.0,<2.0.0)
Requires-Dist: grpcio (>=1.32.0,<2.0.0)
Requires-Dist: protobuf (>=3.19.4,<4.0.0)
Project-URL: Repository, https://github.com/baikalai/baikalnlpy
Description-Content-Type: text/markdown

# What is this?

`baikalnlpy` is the python 3 library for baikal NLP.

Baikal NLP is a Korean NLP,
which provides tokenizing, POS tagging for Korean.

## How to install

```shell
pip3 install baikalnlpy
```

## How to get baikal NLP
- Go to https://license.baikal.ai/.
  - With registration, for the first time, you can get a free license for 3 months.
- Or use docker image.
```shell
docker pull baikalai/baikal-nlp:v1.7.0
```

## How to use

```python
import sys
import google.protobuf.text_format as tf
from baikalnlpy import Tagger

my_tagger = Tagger('localhost') # If you have your own localhost baikal NLP. 
# or
tagger = Tagger() # With smaller public cloud instance, it may be slow.

# print results. 
res = tagger.tags(["안녕하세요.", "반가워요!"])

# get protobuf message.
m = res.msg()
tf.PrintMessage(m, out=sys.stdout, as_utf8=True)
print(tf.MessageToString(m, as_utf8=True))
print(f'length of sentences is {len(m.sentences)}')
## output : 2
print(f'length of tokens in sentences[0] is {len(m.sentences[0].tokens)}')
print(f'length of morphemes of first token in sentences[0] is {len(m.sentences[0].tokens[0].morphemes)}')
print(f'lemma of first token in sentences[0] is {m.sentences[0].tokens[0].lemma}')
print(f'first morph of first token in sentences[0] is {m.sentences[0].tokens[0].morphemes[0]}')
print(f'tag of first morph of first token in sentences[0] is {m.sentences[0].tokens[0].morphemes[0].tag}')
# print number

# get json object
jo = res.as_json()
print(jo)

# get tuple of pos tagging.
pa = res.pos()
print(pa)
# another methods
ma = res.morphs()
print(ma)
na = res.nouns()
print(na)
va = res.verbs()
print(va)

# custom dictionary
cust_dic = tagger.custom_dict("my")
cust_dic.copy_np_set({'내고유명사', '우리집고유명사'})
cust_dic.copy_cp_set({'코로나19'})
cust_dic.copy_cp_caret_set({'코로나^백신', '"독감^백신'})
cust_dic.update()

tagger.set_domain('my')
tagger.pos('코로나19는 언제 끝날까요?')
```

