Class Poco::Unicode

Library: Foundation
Package: Text
Header: Poco/Unicode.h

Description

This class contains enumerations and static utility functions for dealing with Unicode characters and their properties.

For more information on Unicode, see <http://www.unicode.org>.

The implementation is based on the Unicode support functions in PCRE.

Member Summary

Member Functions: isAlpha, isDigit, isLower, isPunct, isSpace, isUpper, properties, toLower, toUpper

Nested Classes

struct CharacterProperties

This structure holds the character properties of an Unicode character.

Enumerations

Anonymous

UCP_MAX_CODEPOINT = 0x10FFFF

CharacterCategory

Unicode character categories.

CharacterType

Unicode character types.

UCP_LOWER_CASE_LETTER

UCP_MODIFIER_LETTER

UCP_OTHER_LETTER

UCP_TITLE_CASE_LETTER

UCP_UPPER_CASE_LETTER

UCP_CONNECTOR_PUNCTUATION

UCP_DASH_PUNCTUATION

UCP_CLOSE_PUNCTUATION

UCP_FINAL_PUNCTUATION

UCP_INITIAL_PUNCTUATION

UCP_OTHER_PUNCTUATION

UCP_OPEN_PUNCTUATION

UCP_CURRENCY_SYMBOL

UCP_MODIFIER_SYMBOL

UCP_MATHEMATICAL_SYMBOL

UCP_OTHER_SYMBOL

UCP_LINE_SEPARATOR

UCP_PARAGRAPH_SEPARATOR

UCP_SPACE_SEPARATOR

Script

Unicode 7.0 script identifiers.

UCP_CANADIAN_ABORIGINAL

UCP_EGYPTIAN_HIEROGLYPHS

UCP_IMPERIAL_ARAMAIC

UCP_INSCRIPTIONAL_PAHLAVI

UCP_INSCRIPTIONAL_PARTHIAN

UCP_OLD_SOUTH_ARABIAN

UCP_MEROITIC_HIEROGLYPHS

UCP_CAUCASIAN_ALBANIAN

UCP_OLD_NORTH_ARABIAN

Member Functions

isAlpha

static bool isAlpha(
int ch
);

Returns true iff the given character is a letter.

isDigit

static bool isDigit(
int ch
);

Returns true iff the given character is a numeric character.

isLower

static bool isLower(
int ch
);

Returns true iff the given character is a lowercase character.

isPunct

static bool isPunct(
int ch
);

Returns true iff the given character is a punctuation character.

isSpace

static bool isSpace(
int ch
);

Returns true iff the given character is a separator.

isUpper

static bool isUpper(
int ch
);

Returns true iff the given character is an uppercase character.

properties

static void properties(
int ch,
CharacterProperties & props
);

Return the Unicode character properties for the character with the given Unicode value.

toLower

static int toLower(
int ch
);

If the given character is an uppercase character, return its lowercase counterpart, otherwise return the character.

toUpper

static int toUpper(
int ch
);