Metadata-Version: 2.1
Name: py-any2text-parser
Version: 1.0.1
Summary: Any2Text - any format to text using Unstructured.io
Author-email: Vanson Leung <vanson@vanportdev.com>
Classifier: Development Status :: 5 - Production/Stable
Classifier: License :: OSI Approved :: MIT License
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.9
Requires-Python: >=3.9
Description-Content-Type: text/markdown

# py-any2text-parser

## install

`pip install py-any2text-parser`

## usage

```

```



# developer only below:

## initialize

```shell
python3 -m venv ./venv
source venv/bin/activate
pip install -r requirements.txt
```

## test

```shell
python3 test_pdf2text.py
```

## usage

```python
from pdf2text.pdf2text import extract_pdf_file_to_text

file_path = "/Users/user/Downloads/AUDIT_MATERIALS/budget_materials/personal/2021/2021 03 remarks 2.pdf"
  
with open(file_path, "rb") as file:
  text_data, text = extract_pdf_file_to_text(
    filename="abc.pdf",
    file=file,
    meta_data_mapping = {
        "document_category": "DEF",
    }
  )
  
  print(text_data, text)
```

## develop - upload to pypi

```
pip install twine build
python3 -m build
twine upload dist/*
```
