Metadata-Version: 2.1
Name: document_LayoutBased_clustering_tool
Version: 0.1.2
Summary: A package for document layout-based clustering using text shading and feature extraction.
Author: Harish Kumar S
Author-email: harishkumar56278@gmail.com
Classifier: Programming Language :: Python :: 3
Classifier: License :: OSI Approved :: MIT License
Classifier: Operating System :: OS Independent
Requires-Python: >=3.7
Description-Content-Type: text/markdown
License-File: LICENSE
Requires-Dist: pypdfium2
Requires-Dist: pillow
Requires-Dist: numpy
Requires-Dist: opencv-python
Requires-Dist: tqdm
Requires-Dist: pandas
Requires-Dist: scikit-learn
Requires-Dist: tensorflow
Requires-Dist: matplotlib
Requires-Dist: openpyxl

# Document Layout-Based Clustering Tool

This Python package provides methods for:
- **Text Shading**: Extract document layout features.
- **Feature Extraction**: Extract deep learning features from document images.
- **Clustering**: Perform K-Means or Hierarchical clustering on extracted features.

## Installation
```bash
pip install document-LayoutBased-clustering-tool

```
document_LayoutBased_clustering_tool_0.1.0
├─ LICENSE
├─ README.md
├─ document_clustering
│  ├─ __init__.py
│  ├─ cluster_viewer.py
│  ├─ feature_extraction_and_clustering.py
│  └─ textshade.py
├─ pyproject.toml
├─ requirements.txt
├─ setup.py
└─ test
   └─ test.py

```
