Metadata-Version: 2.1
Name: dict-fr-AU-DELA
Version: 2021.9.4
Summary: EDITABLE French dictionaries from Laboratoire d'Automatique Documentaire et Linguistique (LADL)
Home-page: https://github.com/HubTou/dict-fr-AU-DELA/
Author: Hubert Tournier
Author-email: nobody@nowhere.invalid
License: LGPLLR
Project-URL: Bug Tracker, https://github.com/HubTou/dict-fr-AU-DELA/issues
Keywords: pnu-project,dictionary
Platform: UNKNOWN
Classifier: Development Status :: 5 - Production/Stable
Classifier: Environment :: Console
Classifier: Intended Audience :: Developers
Classifier: Intended Audience :: End Users/Desktop
Classifier: License :: Other/Proprietary License
Classifier: Natural Language :: French
Classifier: Operating System :: OS Independent
Classifier: Topic :: Text Processing
Classifier: Topic :: Text Processing :: Linguistic
Requires-Python: >=3.0
Description-Content-Type: text/markdown
License-File: data/dict-fr-AU-DELA-License

# Installation
pip install [dict-fr-AU-DELA](https://pypi.org/project/dict-fr-AU-DELA/)

# EDITABLE French dictionaries from Laboratoire d'Automatique Documentaire et Linguistique (LADL)

## DESCRIPTION
Starting from the original [inflected form DELA French dictionary](https://infolingu.univ-mlv.fr/DonneesLinguistiques/Dictionnaires/telechargement.html),
provided by the former [Laboratoire d'Automatique Documentaire et Linguistique](https://infolingu.univ-mlv.fr/LADL/Historique.html) (LADL),
now integrated into [Institut Gaspard Monge](https://igm.univ-gustave-eiffel.fr/) (IGM) of the [Université Gustave Eiffel](https://www.univ-gustave-eiffel.fr/),
this repository contains:
* *modified* dictionary data, publicly [editable here](https://github.com/HubTou/dict-fr-AU-DELA/tree/main/data);
* a Python package gathering the results for exploitation by other tools.

The selected original dictionary is the inflected form DELA French dictionary in UTF-16 LE encoding,
from March 16, 2006, with 683.824 simple entries for 102.073 different lemmas and 108.436 compounded entries for 83.604 different lemmas.

## FILES
All files are installed in Python's */usr/local* equivalent, under *share/dict*.

### Original files

Filename|Description
---|---
dict-fr-AU-DELA-License|Lesser General Public License For Linguistic Resources

### Generated files

Filename|Description
---|---
dict-fr-AU-DELA|Modified inflected form DELA French dictionary in UTF-8 encoding and Unix-style end of lines
dict-fr-AU-DELA.ascii|French words and compound words list (unaccented)
dict-fr-AU-DELA.unicode|742.889 entries French words and compound words list (accented)
dict-fr-AU-DELA.combined|French words and compound words list (with both accented and unaccented words)
dict-fr-AU-DELA-proper_nouns.ascii|French proper nouns list (unaccented, sometimes compounded)
dict-fr-AU-DELA-proper_nouns.unicode|823 entries French proper nouns list (accented, sometimes compounded)
dict-fr-AU-DELA-proper_nouns.combined|French proper nouns list (with both accented and unaccented words, sometimes compounded)
dict-fr-AU-DELA-common-words.ascii|French common words list (unaccented)
dict-fr-AU-DELA-common-words.unicode|641.759 entries French common words list (accented)
dict-fr-AU-DELA-common-words.combined|French common words list (with both accented and unaccented words)
dict-fr-AU-DELA-common-compound-words.ascii|French common compound words list (unaccented)
dict-fr-AU-DELA-common-compound-words.unicode|100.320 entries French common compound words list (accented)
dict-fr-AU-DELA-common-compound-words.combined|French common compound words list (with both accented and unaccented words)

Beside manual edits, apart from the *dict-fr-AU-DELA* file, these generated files went through the following transformations:
* removal of escape backslashes
* removal of lemma and grammatical info from *dict-fr-AU-DELA*
* lossless conversion of accents for the *\*-ascii* versions
* combination of the *\*-ascii* and *\*-unicode* versions into the *\*-combined* ones (without duplicates)

## SEE ALSO
[spell(1)](https://www.freebsd.org/cgi/man.cgi?query=spell) like tools,
[anagram(6)](https://github.com/HubTou/anagram/blob/main/README.md),
[conjuguer(1)](https://github.com/HubTou/conjuguer/blob/main/README.md)

## HISTORY
DELA means "Dictionnaire Electronique du LADL" (LADL's electronic dictionaries). These dictionaries were initiated by the lab's founder, [Maurice Gross](https://fr.wikipedia.org/wiki/Maurice_Gross).

This modified version of the [original DELA dictionary](https://github.com/HubTou/dict-fr-DELA) was necessary because our [PNU project](https://github.com/HubTou/PNU)'s
[conjuguer](https://github.com/HubTou/conjuguer) command made it clear that there were errors in some verb conjugations.

It was naturally called AU-DELA, a pun meaning beyond DELA ("au-delà" in French being translated as "beyond").

I wrote an [history of Unix & French dictionaries](https://github.com/HubTou/PNU/wiki/Les-dictionnaires-sous-Unix) (in French only),
which covers this dictionary and many others.

## LICENSE
The original contents, as well as this package, are licensed under the [Lesser General Public License For Linguistic Resources](http://infolingu.univ-mlv.fr/DonneesLinguistiques/Lexiques-Grammaires/lgpllr.html).

## AUTHORS
[Laboratoire d'Automatique Documentaire et Linguistique](https://infolingu.univ-mlv.fr/LADL/Historique.html) (LADL) for the original contents.

[Hubert Tournier](https://github.com/HubTou) for the package and some initial changes.

The GitHub community for further changes.



