Metadata-Version: 2.1
Name: ruqia
Version: 0.0.15
Summary: Arabic NLP
Home-page: https://github.com/Ruqyai/Ara-NLP-lib
Author: Ruqiya Bin Safi
Author-email: myacount05@gmail.com
Project-URL: Bug Reports, https://github.com/Ruqyai/Ara-NLP-lib/issues
Project-URL: Become a sponsor, https://github.com/sponsors/Ruqyai
Project-URL: Source, https://github.com/Ruqyai/Ara-NLP-lib
Keywords: Arabic,NLP,development
Classifier: Development Status :: 3 - Alpha
Classifier: Intended Audience :: Developers
Classifier: Topic :: Software Development :: Build Tools
Classifier: License :: OSI Approved :: MIT License
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.7
Classifier: Programming Language :: Python :: 3.8
Classifier: Programming Language :: Python :: 3.9
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3 :: Only
Requires-Python: >=3.7, <4
Description-Content-Type: text/markdown
Requires-Dist: peppercorn
Provides-Extra: dev
Requires-Dist: check-manifest ; extra == 'dev'
Provides-Extra: test
Requires-Dist: coverage ; extra == 'test'

# Ruqia lib
This library is used for Arabic NLP: it processes, prepares, and cleans Arabic text.

It is dedicated to Arabic language processing and includes a number of functions for cleaning text, among other tasks.

## Install
```
pip install ruqia
```
## Use
The package is installed as ruqia but imported as ruqiya:
```
from ruqiya import ruqiya
```
## All Functions

## Clean the text 
The clean_text function applies all of these functions:

1. remove_emails
2. remove_URLs
3. remove_mentions
4. hashtags_to_words
5. remove_punctuations
6. normalize_arabic
7. remove_diacritics
8. remove_stop_words
9. remove_emojis

```
text_cleaned1=ruqiya.clean_text(text)
print(text_cleaned1)
```
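
As a rough illustration of how a combined cleaner like this can be built, the sketch below chains three of the listed steps using plain regular expressions. The name `clean_text_sketch` and every pattern in it are assumptions made for this example, not the library's actual implementation. It also suggests one plausible reason to remove emails before mentions: a mention pattern applied first would cut `example@email.com` in half.

```python
import re

# Hypothetical stand-ins for three of the listed steps. The patterns are
# illustrative assumptions, not the library's own.
STEPS = [
    ("emails", re.compile(r"\S+@\S+\.\S+")),
    ("urls", re.compile(r"https?://\S+")),
    ("mentions", re.compile(r"@\w+")),
]

def clean_text_sketch(text: str) -> str:
    # Apply each removal in order, replacing matches with a space.
    for _name, pattern in STEPS:
        text = pattern.sub(" ", text)
    # Collapse the whitespace left behind by the removals.
    return " ".join(text.split())

sample = "mail example@email.com site https://pypi.org/project/ruqia/ by @Ru0Sa"
print(clean_text_sketch(sample))  # prints "mail site by"
```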
## Remove repeating characters
```
text_cleaned2=ruqiya.remove_repeating_char(text)
print(text_cleaned2)
```
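
To make the idea concrete, here is a minimal sketch of collapsing repeated characters with a backreference. The helper name `collapse_repeats_sketch` and the threshold of three or more repeats are assumptions for illustration; the library's own rule may differ.

```python
import re

# Assumption: a run of three or more identical characters is treated as
# elongation and collapsed to one. Doubled letters are left untouched.
_REPEAT = re.compile(r"(.)\1{2,}")

def collapse_repeats_sketch(text: str) -> str:
    return _REPEAT.sub(r"\1", text)

print(collapse_repeats_sketch("coooool!!!"))  # prints "col!"
```

Keeping doubled letters intact matters for Arabic, where words such as الله legitimately contain the same letter twice in a row.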
## Remove punctuations
```
text_cleaned3=ruqiya.remove_punctuations(text)
print(text_cleaned3)
```
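
One way such a step could work is a translation table that deletes a fixed set of characters. The sketch below is a hedged example: the name `remove_punctuations_sketch` and the chosen character set (ASCII punctuation plus three Arabic marks) are assumptions, not the library's definition of punctuation.

```python
import string

# Assumed character set: ASCII punctuation plus the Arabic comma (U+060C),
# Arabic semicolon (U+061B), and Arabic question mark (U+061F).
_PUNCT = string.punctuation + "\u060C\u061B\u061F"
_TABLE = str.maketrans("", "", _PUNCT)

def remove_punctuations_sketch(text: str) -> str:
    # Characters mapped to None in the table are deleted.
    return text.translate(_TABLE)

print(remove_punctuations_sketch("hello, world!\u061F"))  # prints "hello world"
```

Because `string.punctuation` includes `#`, `@`, and `_`, a step like this would also strip hashtag and mention markers if it ran before they were handled.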
## Normalize Arabic

```
text_cleaned4=ruqiya.normalize_arabic(text)
print(text_cleaned4)
```
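
Arabic normalization usually means mapping letter variants to one canonical form. The sketch below shows a common convention, assumed here for illustration only: alef variants become bare alef, alef maqsura becomes yeh, and teh marbuta becomes heh. The name `normalize_arabic_sketch` is hypothetical and the library's mapping may include other rules.

```python
# Assumed mapping, given as code points to avoid ambiguity.
_NORMALIZE = str.maketrans({
    "\u0622": "\u0627",  # alef with madda above -> alef
    "\u0623": "\u0627",  # alef with hamza above -> alef
    "\u0625": "\u0627",  # alef with hamza below -> alef
    "\u0649": "\u064A",  # alef maqsura -> yeh
    "\u0629": "\u0647",  # teh marbuta -> heh
})

def normalize_arabic_sketch(text: str) -> str:
    return text.translate(_NORMALIZE)

print(normalize_arabic_sketch("أهلا"))
```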
## Remove diacritics

```
text_cleaned5=ruqiya.remove_diacritics(text)
print(text_cleaned5)
```
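
Diacritics (tashkeel) occupy a contiguous range in the Unicode Arabic block, so a single character class can strip them. The following is a minimal sketch under that assumption; `remove_diacritics_sketch` is a hypothetical name, and the range U+064B to U+0652 covers the basic marks from fathatan to sukun only.

```python
import re

# Basic Arabic diacritics: fathatan (U+064B) through sukun (U+0652).
_DIACRITICS = re.compile("[\u064B-\u0652]")

def remove_diacritics_sketch(text: str) -> str:
    return _DIACRITICS.sub("", text)

print(remove_diacritics_sketch("مَرْحَبًا"))
```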
## Remove stop words

```
text_cleaned6=ruqiya.remove_stop_words(text)
print(text_cleaned6)
```
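
Stop-word removal is typically a token filter against a word list. The sketch below assumes whitespace tokenization and uses a tiny, made-up list; the library ships its own list, which is not reproduced here, and `remove_stop_words_sketch` is a hypothetical name.

```python
# A tiny, hypothetical stop-word list; a real list would be much longer.
SAMPLE_STOP_WORDS = frozenset({"في", "من", "هل", "هي"})

def remove_stop_words_sketch(text: str, stop_words=SAMPLE_STOP_WORDS) -> str:
    # Split on whitespace and keep only tokens not in the stop list.
    return " ".join(tok for tok in text.split() if tok not in stop_words)

print(remove_stop_words_sketch("أهلا بك في المكتبة"))
```

Note that this simple version also collapses line breaks and repeated spaces, since it rejoins the tokens with single spaces.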
## Remove emojis

```
text_cleaned7=ruqiya.remove_emojis(text)
print(text_cleaned7)
```
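
Emoji removal can be approximated by deleting characters in the code point ranges where most emoji live. The sketch below is deliberately rough: the two ranges are an assumption, they are not exhaustive (flags and joiner sequences are not handled), and `remove_emojis_sketch` is a hypothetical name rather than the library's code.

```python
import re

# Assumed ranges: U+1F300 to U+1FAFF (pictographs, emoticons, and related
# blocks) and U+2600 to U+27BF (miscellaneous symbols and dingbats).
_EMOJI = re.compile("[\U0001F300-\U0001FAFF\u2600-\u27BF]")

def remove_emojis_sketch(text: str) -> str:
    return _EMOJI.sub("", text)

print(remove_emojis_sketch("hi \U0001F44B there \U0001F600"))
```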

## Remove mentions

```
text_cleaned8=ruqiya.remove_mentions(text)
print(text_cleaned8)
```
## Convert any hashtags to words

```
text_cleaned9=ruqiya.hashtags_to_words(text)
print(text_cleaned9)
```
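
Converting a hashtag to words can be sketched as dropping the `#` and turning underscores into spaces. This is an assumption about the intended behavior, shown with a hypothetical helper `hashtags_to_words_sketch`; the library may treat edge cases differently.

```python
import re

# \w is Unicode-aware for str patterns, so it matches Arabic letters,
# digits, and the underscore.
_HASHTAG = re.compile(r"#(\w+)")

def hashtags_to_words_sketch(text: str) -> str:
    # Keep the tag body, replacing underscores with spaces.
    return _HASHTAG.sub(lambda m: m.group(1).replace("_", " "), text)

print(hashtags_to_words_sketch("#arabic_nlp rocks"))  # prints "arabic nlp rocks"
```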

## Remove hashtags

```
text_cleaned10=ruqiya.remove_hashtags(text)
print(text_cleaned10)
```
## Remove emails

```
text_cleaned11=ruqiya.remove_emails(text)
print(text_cleaned11)
```
## Remove URLs

```
text_cleaned12=ruqiya.remove_URLs(text)
print(text_cleaned12)
```
## Example
```
from ruqiya import ruqiya

text="""
!!أهلا وسهلا بك 👋 في الإصدار الأول من مكتبة رقيا
هل هي المرة الأولى التي تستخدم فيها المكتبة😀؟!!
معلومات التواصل 
ايميل
example@email.com
الموقع
https://pypi.org/project/ruqia/
تويتر
@Ru0Sa
وسم
#معالجة_العربية
"""
text_cleaned1=ruqiya.clean_text(text)
print(text_cleaned1)

text_cleaned2=ruqiya.remove_repeating_char(text)
print(text_cleaned2)

text_cleaned3=ruqiya.remove_punctuations(text)
print(text_cleaned3)

text_cleaned4=ruqiya.normalize_arabic(text)
print(text_cleaned4)

text_cleaned5=ruqiya.remove_diacritics(text)
print(text_cleaned5)

text_cleaned6=ruqiya.remove_stop_words(text)
print(text_cleaned6)

text_cleaned7=ruqiya.remove_emojis(text)
print(text_cleaned7)

text_cleaned8=ruqiya.remove_mentions(text)
print(text_cleaned8)

text_cleaned9=ruqiya.hashtags_to_words(text)
print(text_cleaned9)

text_cleaned10=ruqiya.remove_hashtags(text)
print(text_cleaned10)

text_cleaned11=ruqiya.remove_emails(text)
print(text_cleaned11)

text_cleaned12=ruqiya.remove_URLs(text)
print(text_cleaned12)

```
