Metadata-Version: 1.1
Name: PyHDFS
Version: 0.1.2
Summary: Pure Python WebHDFS client
Home-page: https://github.com/jingw/pyhdfs
Author: Jing Wang
Author-email: jingwang13@gmail.com
License: MIT License
Description: =====================
        Python WebHDFS client
        =====================
        
        Because the world needs `yet <https://github.com/spotify/snakebite>`_ `another <https://github.com/ProjectMeniscus/pywebhdfs>`_ `way <https://pypi.python.org/pypi/hdfs>`_ to talk to HDFS from Python.
        
        Usage
        =====
        
        This library provides a Python client for `WebHDFS <https://hadoop.apache.org/docs/current/hadoop-project-dist/hadoop-hdfs/WebHDFS.html>`_. NameNode HA is supported if you pass in both NameNodes. Any failed operation will raise some subclass of ``HdfsException``.
        
        .. code-block:: python
        
            from pyhdfs import HdfsClient
            client = HdfsClient(hosts='namenode1.example.com:50070,namenode2.example.com:50070')
            print(client.list_status('/'))
        
        You can also pass the hostname as part of the URI:
        
        .. code-block:: python
        
            from pyhdfs import HdfsClient
            client = HdfsClient()
            print(client.list_status('//namenode1.example.com:50070;namenode2.example.com:50070/'))
        
        The methods and return values generally map directly to `WebHDFS endpoints <https://hadoop.apache.org/docs/current/hadoop-project-dist/hadoop-hdfs/WebHDFS.html>`_. The client also provides convenience methods that mimic Python ``os`` methods and HDFS CLI commands (e.g. ``walk`` and ``copy_to_local``).
        
        ``pyhdfs`` logs all HDFS actions at the INFO level, so turning on INFO level logging will give you a debug record for your application.
        
        For more information, see the `full API docs <http://pyhdfs.readthedocs.io/en/latest/>`_.
        
        Installing
        ==========
        
        ``pip install pyhdfs``
        
        You'll need Python 2.7 or Python 3.
        
        Development testing
        ===================
        
        .. image:: https://travis-ci.org/jingw/pyhdfs.svg?branch=master
            :target: https://travis-ci.org/jingw/pyhdfs
        
        .. image:: http://codecov.io/github/jingw/pyhdfs/coverage.svg?branch=master
            :target: http://codecov.io/github/jingw/pyhdfs?branch=master
        
        First get an environment with HDFS. The `Cloudera QuickStart VM <http://www.cloudera.com/content/cloudera/en/documentation/core/latest/topics/cloudera_quickstart_vm.html>`_ works fine for this. (Note that the VM only comes with Python 2.6, so you might want to use your host and forward port 50070.)
        
        WARNING: The tests create and delete ``hdfs://localhost/tmp/pyhdfs_test``.
        
        Python 3::
        
            virtualenv3 --no-site-packages env3
            source env3/bin/activate
            pip3 install -e .
            pip3 install -r dev_requirements.txt
            py.test
        
        And again for Python 2 (after ``deactivate``)::
        
            virtualenv2 --no-site-packages env2
            source env2/bin/activate
            pip2 install -e .
            pip2 install -r dev_requirements.txt
            py.test
        
Platform: UNKNOWN
Classifier: Intended Audience :: Developers
Classifier: License :: OSI Approved :: MIT License
Classifier: Operating System :: OS Independent
Classifier: Programming Language :: Python
Classifier: Programming Language :: Python :: 2
Classifier: Programming Language :: Python :: 3
Classifier: Topic :: System :: Filesystems
