2.1. uniseg.codepoint
— Code Point
Unicode code point.
- uniseg.codepoint.chr(cp: int, /) str
Return the unicode object represents the code point integer cp
>>> chr(0x61) == 'a' True >>> chr(0x20b9f) == '\U00020b9f' True
- uniseg.codepoint.code_points(s: str, /) Iterator[str]
Iterate every Unicode code points of the unicode string s
>>> s = 'hello' >>> list(code_points(s)) == ['h', 'e', 'l', 'l', 'o'] True >>> s = 'abc\U00020b9f\u3042' >>> list(code_points(s)) == ['a', 'b', 'c', '\U00020b9f', '\u3042'] True
- uniseg.codepoint.ord(c: str, index: int | None = None, /) int
Return the integer value of the Unicode code point c
>>> ord('a') 97 >>> ord('\u3042') 12354 >>> ord('\U00020b9f') 134047 >>> ord('abc') Traceback (most recent call last): ... TypeError: need a single Unicode code point as parameter
It returns the result of built-in ord() when c is a single str object for compatibility:
>>> ord('a') 97
When index argument is specified (to not
None
), this function treats c as a Unicode string and returns integer value of code point atc[index]
(or may bec[index:index+2]
):>>> ord('hello', 0) 104 >>> ord('hello', 1) 101 >>> ord('a\U00020b9f', 1) 134047