com.ibm.icu.text

Java Source Code / Java Documentation
1. 6.0 JDK Core
2. 6.0 JDK Modules
3. 6.0 JDK Modules com.sun
4. 6.0 JDK Modules com.sun.java
5. 6.0 JDK Modules sun
6. 6.0 JDK Platform
7. Ajax
8. Apache Harmony Java SE
9. Aspect oriented
10. Authentication Authorization
11. Blogger System
12. Build
13. Byte Code
14. Cache
15. Chart
16. Chat
17. Code Analyzer
18. Collaboration
19. Content Management System
20. Database Client
21. Database DBMS
22. Database JDBC Connection Pool
23. Database ORM
24. Development
25. EJB Server geronimo
26. EJB Server GlassFish
27. EJB Server JBoss 4.2.1
28. EJB Server resin 3.1.5
29. ERP CRM Financial
30. ESB
31. Forum
32. GIS
33. Graphic Library
34. Groupware
35. HTML Parser
36. IDE
37. IDE Eclipse
38. IDE Netbeans
39. Installer
40. Internationalization Localization
41. Inversion of Control
42. Issue Tracking
43. J2EE
44. JBoss
45. JMS
46. JMX
47. Library
48. Mail Clients
49. Net
50. Parser
51. PDF
52. Portal
53. Profiler
54. Project Management
55. Report
56. RSS RDF
57. Rule Engine
58. Science
59. Scripting
60. Search Engine
61. Security
62. Sevlet Container
63. Source Control
64. Swing Library
65. Template Engine
66. Test Coverage
67. Testing
68. UML
69. Web Crawler
70. Web Framework
71. Web Mail
72. Web Server
73. Web Services
74. Web Services apache cxf 2.0.1
75. Web Services AXIS2
76. Wiki Engine
77. Workflow Engines
78. XML
79. XML UI
Java
Java Tutorial
Java Open Source
Jar File Download
Java Articles
Java Products
Java by API
Photoshop Tutorials
Maya Tutorials
Flash Tutorials
3ds-Max Tutorials
Illustrator Tutorials
GIMP Tutorials
C# / C Sharp
C# / CSharp Tutorial
C# / CSharp Open Source
ASP.Net
ASP.NET Tutorial
JavaScript DHTML
JavaScript Tutorial
JavaScript Reference
HTML / CSS
HTML CSS Reference
C / ANSI-C
C Tutorial
C++
C++ Tutorial
Ruby
PHP
Python
Python Tutorial
Python Open Source
SQL Server / T-SQL
SQL Server / T-SQL Tutorial
Oracle PL / SQL
Oracle PL/SQL Tutorial
PostgreSQL
SQL / MySQL
MySQL Tutorial
VB.Net
VB.Net Tutorial
Flash / Flex / ActionScript
VBA / Excel / Access / Word
XML
XML Tutorial
Microsoft Office PowerPoint 2007 Tutorial
Microsoft Office Excel 2007 Tutorial
Microsoft Office Word 2007 Tutorial
Java Source Code / Java Documentation » Internationalization Localization » icu4j » com.ibm.icu.text 
com.ibm.icu.text
Extensions and enhancements to java.text to support unicode transforms, UnicodeSet, surrogate char utilities, UCA collation, normalization, break iteration (rule and dictionary based), enhanced number format, international string searching, and arabic shaping.

  • Unicode Transforms (Transliteration) convert between different representations of unicode text.
  • UnicodeSet provides set operations on unicode characters and strings, and are representable as compact expressions.
  • Surrogate character utilities (UTF16) provides indexing and substring operations on text containing surrogates.
  • UCA collation implements the current Unicode Collation Algorithm. Data for many locale-specific collations is provided.
  • Normalization supports standard Unicode normalization forms as well as FCD.
  • Break iteration supports character, word, line, sentences, and title case breaks.
  • Enhanced number format to 'spell out' numbers, padding and rounding control.
  • Internationally sensitive string searching, uses collation.
  • Arabic shaping converts between shaped and unshaped arabic characters and digits.
Java Source File NameTypeComment
AnyTransliterator.javaClass A transliterator that translates multiple input scripts to a single output script.
ArabicShaping.javaClass Shape Arabic text on a character basis.

ArabicShaping performs basic operations for "shaping" Arabic text.

ArabicShapingException.javaClass Thrown by ArabicShaping when there is a shaping error.
BreakDictionary.javaClass This is the class that represents the list of known words used by DictionaryBasedBreakIterator.
BreakIterator.javaClass A class that locates boundaries in text.
BreakIteratorFactory.javaClass
BreakTransliterator.javaClass Inserts the specified characters at word breaks.
CanonicalIterator.javaClass This class allows one to iterate through all the strings that are canonically equivalent to a given string.
CharsetDetector.javaClass CharsetDetector provides a facility for detecting the charset or encoding of character data in an unknown format. The input data can either be from an input stream or an array of bytes. The result of the detection operation is a list of possibly matching charsets, or, for simple use, you can just ask for a Java Reader that will will work over the input data.

Character set detection is at best an imprecise operation.

CharsetMatch.javaClass This class represents a charset that has been identified by a CharsetDetector as a possible encoding for a set of input data.
CharsetRecognizer.javaClass Abstract class for recognizing a single charset. Part of the implementation of ICU's CharsetDetector. Each specific charset that can be recognized will have an instance of some subclass of this class.
CharsetRecog_2022.javaClass class CharsetRecog_2022 part of the ICU charset detection imlementation.
CharsetRecog_mbcs.javaClass CharsetRecognizer implemenation for Asian - double or multi-byte - charsets. Match is determined mostly by the input data adhering to the encoding scheme for the charset, and, optionally, frequency-of-occurence of characters.

Instances of this class are singletons, one per encoding being recognized.

CharsetRecog_sbcs.javaClass This class recognizes single-byte encodings.
CharsetRecog_Unicode.javaClass This class matches UTF-16 and UTF-32, both big- and little-endian.
CharsetRecog_UTF8.javaClass
ChineseDateFormat.javaClass A concrete DateFormat for com.ibm.icu.util.ChineseCalendar . This class handles a ChineseCalendar-specific field, ChineseCalendar.IS_LEAP_MONTH.
ChineseDateFormatSymbols.javaClass A subclass of DateFormatSymbols for ChineseDateFormat .
CollationElementIterator.javaClass

CollationElementIterator is an iterator created by a RuleBasedCollator to walk through a string.

CollationKey.javaClass

A CollationKey represents a String under the rules of a specific Collator object.

CollationParsedRuleBuilder.javaClass Class for building a collator from a list of collation rules.
CollationRuleParser.javaClass
Collator.javaClass

Collator performs locale-sensitive string comparison.

CollatorReader.javaClass

Internal reader class for ICU data file uca.icu containing Unicode Collation Algorithm data.

This class simply reads uca.icu, authenticates that it is a valid ICU data file and split its contents up into blocks of data for use in com.ibm.icu.text.Collator.

CollatorServiceShim.javaClass
ComposedCharIter.javaClass ComposedCharIter is an iterator class that returns all of the precomposed characters defined in the Unicode standard, along with their decomposed forms.
CompoundTransliterator.javaClass A transliterator that is composed of two or more other transliterator objects linked together.
CurrencyFormat.javaClass Temporary internal concrete subclass of MeasureFormat implementing parsing and formatting of CurrencyAmount objects.
DateFormat.javaClass DateFormat is an abstract class for date/time formatting subclasses which formats and parses dates or time in a language-independent manner. The date/time formatting subclass, such as SimpleDateFormat, allows for formatting (i.e., date -> text), parsing (text -> date), and normalization.
DateFormatSymbols.javaClass DateFormatSymbols is a public class for encapsulating localizable date-time formatting data, such as the names of the months, the names of the days of the week, and the time zone data. DateFormat and SimpleDateFormat both use DateFormatSymbols to encapsulate this information.

Typically you shouldn't use DateFormatSymbols directly. Rather, you are encouraged to create a date-time formatter with the DateFormat class's factory methods: getTimeInstance, getDateInstance, or getDateTimeInstance. These methods automatically create a DateFormatSymbols for the formatter so that you don't have to.

DateTimePatternGenerator.javaClass This class provides flexible generation of date format patterns, like "yy-MM-dd".
DecimalFormat.javaClass DecimalFormat is a concrete subclass of NumberFormat that formats decimal numbers.
DecimalFormatSymbols.javaClass This class represents the set of symbols (such as the decimal separator, the grouping separator, and so on) needed by DecimalFormat to format numbers.
DecompData.javaClass
DictionaryBasedBreakIterator.javaClass A subclass of RuleBasedBreakIterator that adds the ability to use a dictionary to further subdivide ranges of text beyond what is possible using just the state-table-based algorithm.
DigitList.javaClass DigitList handles the transcoding between numeric values and strings of characters.
EscapeTransliterator.javaClass A transliterator that converts Unicode characters to an escape form.
FunctionReplacer.javaClass A replacer that calls a transliterator to generate its output text. The input text to the transliterator is the output of another UnicodeReplacer object.
IDNA.javaClass IDNA API implements the IDNA protocol as defined in the IDNA RFC. The draft defines 2 operations: ToASCII and ToUnicode.
LowercaseTransliterator.javaClass A transliterator that performs locale-sensitive toLower() case mapping.
MeasureFormat.javaClass A formatter for Measure objects.
MessageFormat.javaClass MessageFormat provides a means to produce concatenated messages in language-neutral way.
NameUnicodeTransliterator.javaClass A transliterator that performs name to character mapping.
NFRule.javaClass A class represnting a single rule in a RuleBasedNumberFormat.
NFRuleSet.javaClass A collection of rules used by a RuleBasedNumberFormat to format and parse numbers.
NFSubstitution.javaClass An abstract class defining protocol for substitutions.
NormalizationTransliterator.javaClass
Normalizer.javaClass Unicode Normalization

Unicode normalization API

normalize transforms Unicode text into an equivalent composed or decomposed form, allowing for easier sorting and searching of text. normalize supports the standard normalization forms described in Unicode Standard Annex #15 — Unicode Normalization Forms. Characters with accents or other adornments can be encoded in several different ways in Unicode.
NullTransliterator.javaClass A transliterator that leaves text unchanged.
NumberFormat.javaClass NumberFormat is the abstract base class for all number formats.
NumberFormatServiceShim.javaClass
Punycode.javaClass
Quantifier.javaClass
RawCollationKey.javaClass

Simple class wrapper to store the internal byte representation of a CollationKey.

RBBIDataWrapper.javaClass

Internal class used for Rule Based Break Iterators

This class provides access to the compiled break rule data, as it is stored in a .brk file.

RBBINode.javaClass This class represents a node in the parse tree created by the RBBI Rule compiler.
RBBIRuleBuilder.javaClass
RBBIRuleParseTable.javaClass Generated Java File.
RBBIRuleScanner.javaClass This class is part of the Rule Based Break Iterator rule compiler.
RBBISetBuilder.javaClass
RBBISymbolTable.javaClass
RBBITableBuilder.javaClass
RBNFChinesePostProcessor.javaClass A post-processor for Chinese text.
RBNFPostProcessor.javaClass Post processor for RBNF output.
RemoveTransliterator.javaClass A transliterator that removes characters.
Replaceable.javaInterface Replaceable is an interface representing a string of characters that supports the replacement of a range of itself with a new string of characters.
ReplaceableContextIterator.javaClass Implementation of UCaseProps.ContextIterator, iterates over a Replaceable.
ReplaceableString.javaClass ReplaceableString is an adapter class that implements the Replaceable API around an ordinary StringBuffer.

Note: This class does not support attributes and is not intended for general use.

RuleBasedBreakIterator.javaClass Rule Based Break Iterator This is a port of the C++ class RuleBasedBreakIterator from ICU4C.
RuleBasedCollator.javaClass

RuleBasedCollator is a concrete subclass of Collator.

RuleBasedNumberFormat.javaClass

A class that formats numbers according to a set of rules.

RuleBasedTransliterator.javaClass RuleBasedTransliterator is a transliterator that reads a set of rules in order to determine how to perform translations.
SCSU.javaInterface An interface defining constants for the Standard Compression Scheme for Unicode (SCSU) as outlined in Unicode Technical Report #6.
author:
   Stephen F.
SearchIterator.javaClass

SearchIterator is an abstract base class that defines a protocol for text searching.

SimpleDateFormat.javaClass SimpleDateFormat is a concrete class for formatting and parsing dates in a locale-sensitive manner.
StringCharacterIterator.javaClass StringCharacterIterator implements the CharacterIterater protocol for a String.
StringMatcher.javaClass An object that matches a fixed input string, implementing the UnicodeMatcher API.
StringPrep.javaClass StringPrep API implements the StingPrep framework as described by RFC 3454. StringPrep prepares Unicode strings for use in network protocols. Profiles of StingPrep are set of rules and data according to which the Unicode Strings are prepared.
StringPrepParseException.javaClass Exception that signals an error has occurred while parsing the input to StringPrep or IDNA.
StringReplacer.javaClass A replacer that produces static text as its output.
StringSearch.javaClass

StringSearch is the concrete subclass of SearchIterator that provides language-sensitive text searching based on the comparison rules defined in a RuleBasedCollator object.

StringSearch uses a version of the fast Boyer-Moore search algorithm that has been adapted to work with the large character set of Unicode.

SymbolTable.javaInterface An interface that defines both lookup protocol and parsing of symbolic names.

This interface is used by UnicodeSet to resolve $Variable style references that appear in set patterns.

TitlecaseTransliterator.javaClass A transliterator that converts all letters (as defined by UCharacter.isLetter()) to lower case, except for those letters preceded by non-letters.
TransformTransliterator.javaClass
TransliterationRule.javaClass A transliteration rule used by RuleBasedTransliterator. TransliterationRule is an immutable object.

A rule consists of an input pattern and an output string.

TransliterationRuleSet.javaClass A set of rules for a RuleBasedTransliterator.
Transliterator.javaClass Transliterator is an abstract class that transliterates text from one format to another.
TransliteratorIDParser.javaClass Parsing component for transliterator IDs.
TransliteratorParser.javaClass
TransliteratorRegistry.javaClass
UCharacterIterator.javaClass Abstract class that defines an API for iteration on text objects.This is an interface for forward and backward iteration and random access into a text object.
UFormat.javaClass An abstract class that extends java.text.Format to provide additional ICU protocol, specifically, the getLocale() API.
UForwardCharacterIterator.javaInterface Interface that defines an API for forward-only iteration on text objects. This is a minimal interface for iteration without random access or backwards iteration.
UnescapeTransliterator.javaClass A transliterator that converts Unicode escape forms to the characters they represent.
UnicodeCompressor.javaClass A compression engine implementing the Standard Compression Scheme for Unicode (SCSU) as outlined in Unicode Technical Report #6.

The SCSU works by using dynamically positioned windows consisting of 128 consecutive characters in Unicode.

UnicodeDecompressor.javaClass A decompression engine implementing the Standard Compression Scheme for Unicode (SCSU) as outlined in Unicode Technical Report #6.

USAGE

The static methods on UnicodeDecompressor may be used in a straightforward manner to decompress simple strings:

 byte [] compressed = ...
UnicodeFilter.javaClass UnicodeFilter defines a protocol for selecting a subset of the full range (U+0000 to U+FFFF) of Unicode characters.
UnicodeMatcher.javaInterface UnicodeMatcher defines a protocol for objects that can match a range of characters in a Replaceable string.
UnicodeNameTransliterator.javaClass A transliterator that performs character to name mapping.
UnicodeReplacer.javaInterface UnicodeReplacer defines a protocol for objects that replace a range of characters in a Replaceable string with output text.
UnicodeSet.javaClass A mutable set of Unicode characters and multicharacter strings.
UnicodeSetIterator.javaClass UnicodeSetIterator iterates over the contents of a UnicodeSet.
UppercaseTransliterator.javaClass A transliterator that performs locale-sensitive toUpper() case mapping.
UTF16.javaClass

Standalone utility class providing UTF16 character conversions and indexing conversions.

Code that uses strings alone rarely need modification. By design, UTF-16 does not allow overlap, so searching for strings is a safe operation.

www.java2java.com | Contact Us
Copyright 2009 - 12 Demo Source and Support. All rights reserved.
All other trademarks are property of their respective owners.