Metadata-Version: 2.1
Name: idna
Version: 3.7
Summary: Internationalized Domain Names in Applications (IDNA)
Author-email: Kim Davies <kim+pypi@gumleaf.org>
Requires-Python: >=3.5
Description-Content-Type: text/x-rst
Classifier: Development Status :: 5 - Production/Stable
Classifier: Intended Audience :: Developers
Classifier: Intended Audience :: System Administrators
Classifier: License :: OSI Approved :: BSD License
Classifier: Operating System :: OS Independent
Classifier: Programming Language :: Python
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3 :: Only
Classifier: Programming Language :: Python :: 3.5
Classifier: Programming Language :: Python :: 3.6
Classifier: Programming Language :: Python :: 3.7
Classifier: Programming Language :: Python :: 3.8
Classifier: Programming Language :: Python :: 3.9
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Classifier: Programming Language :: Python :: 3.12
Classifier: Programming Language :: Python :: Implementation :: CPython
Classifier: Programming Language :: Python :: Implementation :: PyPy
Classifier: Topic :: Internet :: Name Service (DNS)
Classifier: Topic :: Software Development :: Libraries :: Python Modules
Classifier: Topic :: Utilities
Project-URL: Changelog, https://github.com/kjd/idna/blob/master/HISTORY.rst
Project-URL: Issue tracker, https://github.com/kjd/idna/issues
Project-URL: Source, https://github.com/kjd/idna

Internationalized Domain Names in Applications (IDNA)
=====================================================

Support for the Internationalized Domain Names in
Applications (IDNA) protocol as specified in `RFC 5891
<https://tools.ietf.org/html/rfc5891>`_. This is the latest version of
the protocol and is sometimes referred to as “IDNA 2008”.

This library also provides support for Unicode Technical
Standard 46, `Unicode IDNA Compatibility Processing
<https://unicode.org/reports/tr46/>`_.

This acts as a suitable replacement for the “encodings.idna”
module that comes with the Python standard library, but which
only supports the older superseded IDNA specification (`RFC 3490
<https://tools.ietf.org/html/rfc3490>`_).

Basic functions are simply executed:

.. code-block:: pycon

    >>> import idna
    >>> idna.encode('ドメイン.テスト')
    b'xn--eckwd4c7c.xn--zckzah'
    >>> print(idna.decode('xn--eckwd4c7c.xn--zckzah'))
    ドメイン.テスト

Installation
------------

This package is available for installation from PyPI:

.. code-block:: bash

    $ python3 -m pip install idna

Usage
-----

For typical usage, the ``encode`` and ``decode`` functions take a
domain name argument and convert it to A-labels or U-labels
respectively.

.. code-block:: pycon

    >>> import idna
    >>> idna.encode('ドメイン.テスト')
    b'xn--eckwd4c7c.xn--zckzah'
    >>> print(idna.decode('xn--eckwd4c7c.xn--zckzah'))
    ドメイン.テスト

You may use the codec encoding and decoding methods by importing the
``idna.codec`` module:

.. code-block:: pycon

    >>> import idna.codec
    >>> print('домен.испытание'.encode('idna2008'))
    b'xn--d1acufc.xn--80akhbyknj4f'
    >>> print(b'xn--d1acufc.xn--80akhbyknj4f'.decode('idna2008'))
    домен.испытание

Conversions can be applied on a per-label basis using the ``ulabel`` or
``alabel`` functions if necessary:

.. code-block:: pycon

    >>> idna.alabel('测试')
    b'xn--0zwm56d'
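
As a minimal sketch of the reverse direction, the following converts a
single label to its A-label and back again with ``ulabel``. The helper
name ``roundtrip_label`` is hypothetical and is not part of the library.

```python
import idna


def roundtrip_label(label):
    """Convert one label to an A-label and back (hypothetical helper)."""
    a_label = idna.alabel(label)    # bytes in "xn--" form
    u_label = idna.ulabel(a_label)  # Unicode string again
    return a_label, u_label


a_label, u_label = roundtrip_label('测试')
print(a_label)
print(u_label)
```

Both functions operate on one label at a time, so a full domain name
should still go through ``encode`` and ``decode``.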

Compatibility Mapping (UTS #46)
+++++++++++++++++++++++++++++++

As described in `RFC 5895 <https://tools.ietf.org/html/rfc5895>`_, the
IDNA specification does not normalize the different ways a user may
type a domain name. This functionality, known as a “mapping”, is
considered by the specification to be a local user-interface issue
distinct from IDNA conversion functionality.

This library provides one such mapping that was developed by the
Unicode Consortium. Known as `Unicode IDNA Compatibility Processing
<https://unicode.org/reports/tr46/>`_, it provides both a regular
mapping for typical applications and a transitional mapping to
help migrate from older IDNA 2003 applications.

For example, “Königsgäßchen” is not a permissible label as *LATIN
CAPITAL LETTER K* is not allowed (nor are capital letters in general).
UTS 46 will convert this into lower case prior to applying the IDNA
conversion.

.. code-block:: pycon

    >>> import idna
    >>> idna.encode('Königsgäßchen')
    ...
    idna.core.InvalidCodepoint: Codepoint U+004B at position 1 of 'Königsgäßchen' not allowed
    >>> idna.encode('Königsgäßchen', uts46=True)
    b'xn--knigsgchen-b4a3dun'
    >>> print(idna.decode('xn--knigsgchen-b4a3dun'))
    königsgäßchen

Transitional processing provides conversions to help transition from
the older 2003 standard to the current standard. For example, in the
original IDNA specification, the *LATIN SMALL LETTER SHARP S* (ß) was
converted into two *LATIN SMALL LETTER S* (ss), whereas in the current
IDNA specification this conversion is not performed.

.. code-block:: pycon

    >>> idna.encode('Königsgäßchen', uts46=True, transitional=True)
    b'xn--knigsgsschen-lcb0w'

Implementers should use transitional processing with caution, only in
rare cases where conversion from legacy labels to current labels must be
performed (i.e. IDNA implementations that pre-date 2008). For typical
applications that just need to convert labels, transitional processing
is unlikely to be beneficial and could produce unexpected incompatible
results.
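
One way an application might apply the regular mapping to user-typed
input is sketched below. The wrapper name ``encode_user_input`` is
hypothetical; it only passes the ``uts46`` and ``transitional`` arguments
shown above to ``idna.encode``, leaving transitional processing off.

```python
import idna


def encode_user_input(name):
    """Encode a user-typed name after UTS 46 mapping (hypothetical helper).

    Transitional processing is deliberately disabled, so the sharp s
    in the input is preserved rather than rewritten as "ss".
    """
    return idna.encode(name, uts46=True, transitional=False)


encoded = encode_user_input('Königsgäßchen')
print(encoded)
print(idna.decode(encoded))
```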

``encodings.idna`` Compatibility
++++++++++++++++++++++++++++++++

Function calls from the Python built-in ``encodings.idna`` module are
mapped to their IDNA 2008 equivalents using the ``idna.compat`` module.
Simply substitute the ``import`` clause in your code to refer to the new
module name.
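
A sketch of that substitution, assuming the existing code calls the
``ToASCII`` and ``ToUnicode`` functions of ``encodings.idna``:

```python
# Before: from encodings.idna import ToASCII, ToUnicode
from idna.compat import ToASCII, ToUnicode

a_label = ToASCII('测试')      # now follows IDNA 2008 rules
u_label = ToUnicode(a_label)  # converts the A-label back to Unicode
print(a_label)
print(u_label)
```

Because the IDNA 2008 rules are stricter, input that the built-in module
accepted may now raise an exception, so call sites are worth re-testing
after the switch.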

Exceptions
----------

All errors that arise during conversion, as defined by the
specification, should raise an exception derived from the
``idna.IDNAError`` base class.

More specific exceptions may be raised: ``idna.IDNABidiError``
when the error reflects an illegal combination of left-to-right and
right-to-left characters in a label; ``idna.InvalidCodepoint`` when
a specific codepoint is an illegal character in an IDN label (i.e.
INVALID); and ``idna.InvalidCodepointContext`` when the codepoint is
illegal based on its positional context (i.e. it is CONTEXTO or CONTEXTJ
but the contextual requirements are not satisfied).
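
The sketch below shows one way to distinguish these cases, catching the
specific exceptions before the ``idna.IDNAError`` base class. The
function ``classify_failure`` and its return strings are hypothetical
names chosen for illustration.

```python
import idna


def classify_failure(name):
    """Return a short reason why encoding failed, or None on success."""
    try:
        idna.encode(name)
    except idna.IDNABidiError:
        return 'bidi'        # illegal mix of LTR and RTL characters
    except idna.InvalidCodepointContext:
        return 'context'     # CONTEXTJ/CONTEXTO rule not satisfied
    except idna.InvalidCodepoint:
        return 'codepoint'   # character not allowed in a label
    except idna.IDNAError:
        return 'other'       # any other conversion error
    return None


print(classify_failure('Königsgäßchen'))
print(classify_failure('example.com'))
```

Catching only ``idna.IDNAError`` is sufficient when the caller does not
need to tell the failure modes apart.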

Building and Diagnostics
------------------------

The IDNA and UTS 46 functionality relies upon pre-calculated lookup
tables for performance. These tables are derived by evaluating each
codepoint against the eligibility criteria in the respective standards,
and are computed using the command-line script ``tools/idna-data``.

This tool will fetch relevant codepoint data from the Unicode repository
and perform the required calculations to identify eligibility. There are
three main modes:

* ``idna-data make-libdata``. Generates ``idnadata.py`` and
  ``uts46data.py``, the pre-calculated lookup tables used for IDNA and
  UTS 46 conversions. Implementers who wish to track this library against
  a different Unicode version may use this tool to manually generate a
  different version of the ``idnadata.py`` and ``uts46data.py`` files.

* ``idna-data make-table``. Generates a table of the IDNA disposition
  (e.g. PVALID, CONTEXTJ, CONTEXTO) in the format found in Appendix
  B.1 of RFC 5892 and the pre-computed tables published by `IANA
  <https://www.iana.org/>`_.

* ``idna-data U+0061``. Prints debugging output on the various
  properties associated with an individual Unicode codepoint (in this
  case, U+0061) that are used to assess the IDNA and UTS 46 status of a
  codepoint. This is helpful in debugging or analysis.

The tool accepts a number of arguments, described using ``idna-data
-h``. Most notably, the ``--version`` argument allows the specification
of the version of Unicode to be used in computing the table data. For
example, ``idna-data --version 9.0.0 make-libdata`` will generate
library data against Unicode 9.0.0.

Additional Notes
----------------

* **Packages**. The latest tagged release version is published in the
  `Python Package Index <https://pypi.org/project/idna/>`_.

* **Version support**. This library supports Python 3.5 and higher.
  As this library serves as a low-level toolkit for a variety of
  applications, many of which strive for broad compatibility with older
  Python versions, there is no rush to remove older interpreter support.
  Removing support for older versions should be well justified in that the
  maintenance burden has become too high.

* **Python 2**. Python 2 is supported by version 2.x of this library.
  While active development of the version 2.x series has ended, fixes
  for notable issues may be backported to 2.x. Use "idna<3" in your
  requirements file if you need this library for a Python 2 application.

* **Testing**. The library has a test suite based on each rule of the
  IDNA specification, as well as tests that are provided as part of the
  Unicode Technical Standard 46, `Unicode IDNA Compatibility Processing
  <https://unicode.org/reports/tr46/>`_.

* **Emoji**. Support for emoji domains in this library is an occasional
  request. Encoding of symbols like emoji is expressly prohibited by
  the technical standard IDNA 2008, and emoji domains are broadly phased
  out across the domain industry due to associated security risks. For
  now, applications that need to support these non-compliant labels
  may wish to consider trying the encode/decode operation in this library
  first, and then falling back to using ``encodings.idna``. See `the GitHub
  project <https://github.com/kjd/idna/issues/18>`_ for more discussion.
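
The fallback suggested in the emoji note could be sketched as follows.
The function name ``encode_with_fallback`` is hypothetical, and the
standard library's ``idna`` codec is assumed to be the IDNA 2003
implementation from ``encodings.idna``; the snowman symbol stands in for
any label that IDNA 2008 rejects.

```python
import idna


def encode_with_fallback(name):
    """Try IDNA 2008 first, then the stdlib IDNA 2003 codec (sketch)."""
    try:
        return idna.encode(name)
    except idna.IDNAError:
        # Non-compliant under IDNA 2008: defer to the built-in codec,
        # which implements the older specification.
        return name.encode('idna')


print(encode_with_fallback('example.com'))
print(encode_with_fallback('☃.example'))
```

Labels produced by the fallback path are not valid under IDNA 2008, so
an application may want to log or flag them rather than treat them as
ordinary names.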