Class LdLocale
It represents a IETF BCP 47 tag, but does not implement all the features. Features can be added as needed.
It is constructed through the fromString(java.lang.String)
factory method. The toString()
method
produces a parseable and persistable string.
The class is immutable.
The java.util.Locale cannot be used because it has issues for historical reasons, notably the
script code conversion for Hebrew, Yiddish and Indonesian, and more. If one needs a Locale,
it is simple to create one based on this object.
The ICU ULocale cannot be used because a) it has issues too (for our use case) and b) we're not
using ICU in here [yet].
This class does not perform any modifications on the input. The input is used as is, and the getters return it in exactly the same way. No standardization, canonicalization, cleaning.
The input is validated syntactically, but not for code existence. For example the script code must be a valid ISO 15924 like "Latn" or "Cyrl", in correct case. But whether the code exists or not is not checked. These code standards are not fixed, simply because regional entities like Countries can change for political reasons, and languages are living entities. Therefore certain codes may exist at some point in time only (be introduced late, or be deprecated or removed, or even be re-assigned another meaning). It is not up to us to decide whether Kosovo is a country in 2015 or not. If one needs to only work with a certain range of acceptable codes, he can validate the codes through other classes that have knowledge about the codes.
Language: as for BCP 47, the iso 639-1 code must be used if there is one. For example "fr" for French. If not, the ISO 639-3 should be used. It is highly discouraged to use 639-2. Right now this class enforces a 2 or 3 char code, but this may be relaxed in the future.
Script: Only ISO 15924, no discussion.
Region: same as for BCP 47. That means ISO 3166-1 alpha-2 and "UN M.49". I can imagine relaxing it in the future to also allow 3166-2 codes. In most cases the "region" is a "country".
-
Field Summary
Fields -
Constructor Summary
Constructors -
Method Summary
Modifier and TypeMethodDescriptionprivate static String
assignLang
(String s) boolean
static @NotNull LdLocale
fromString
(@NotNull String string) @NotNull String
@NotNull com.google.common.base.Optional<String>
@NotNull com.google.common.base.Optional<String>
int
hashCode()
private static boolean
looksLikeGeoCode3166_1
(String string) private static boolean
looksLikeGeoCodeNumeric
(String string) private static boolean
looksLikeScriptCode
(String string) toString()
The output of this can be fed to the fromString() method.
-
Field Details
-
language
-
script
-
region
-
-
Constructor Details
-
LdLocale
-
-
Method Details
-
fromString
- Parameters:
string
- The output of the toString() method.- Returns:
- either a new or possibly a cached (immutable) instance.
-
looksLikeScriptCode
-
looksLikeGeoCode3166_1
-
looksLikeGeoCodeNumeric
-
assignLang
-
toString
The output of this can be fed to the fromString() method. -
getLanguage
- Returns:
- ISO 639-1 or 639-3 language code, eg "fr" or "gsw", see class header.
-
getScript
- Returns:
- ISO 15924 script code, eg "Latn", see class header.
-
getRegion
- Returns:
- ISO 3166-1 or UN M.49 code, eg "DE" or 150, see class header.
-
equals
-
hashCode
public int hashCode()
-