Home | Trees | Indices | Help |
|
---|
|
Provides an operator for the Mandarin Gwoyeu Romatzyh romanisation.
Features:
Limitations:
Gwoyeu Romatzyh renders rhotacised syllables (Erlhuah) by trying to give the actual pronunciation. As the effect of r-colouring looses the information of the underlying etymological syllable conversion between the r-coloured form back to the underlying form can not be done in an unambiguous way. As furthermore finals i, iu, in, iun contrast in the first and the second tone but not in the third and the forth tone conversion between different tones (including the base form) cannot be made in a general manner: 小鸡儿 sheau-jiel is different to 小街儿 sheau-jie’l but 几儿 jieel equals 姐儿 jieel (see Chao).
Thus this ReadingOperator lacks the general handling of syllable renderings and many methods narrow the range of syllables allowed. Unlike the original forms without r-colouring for Erlhuah forms the combination of a plain syllable with a specific tone is limited to the data given in the source, so operations involving tones may return with an UnsupportedError if the given syllable isn't found with that tone.
See Also:
'ₒ'
?
|
|||
|
|||
list |
|
||
str |
|
||
bool |
|
||
list of tuple |
|
||
list of str |
|
||
int |
|
||
tuple of str |
|
||
str |
|
||
tuple |
|
||
str |
|
||
dict |
|
||
list |
|
||
bool |
|
||
str |
|
||
set of str |
|
||
set of str |
|
||
list of str |
|
||
bool |
|
||
Inherited from Inherited from Inherited from Inherited from Inherited from |
|
|||
dict |
|
||
dict |
|
|
|||
Inherited from |
|
|||
READING_NAME =
Unique name of reading |
|||
TONES =
|
|||
SYLLABLE_STRUCTURE = re.compile(r'^ Regular expression describing the syllable structure in GR (C,V,C). |
|||
_syllableToneLookup = None Holds the tonal syllable to plain syllable & tone lookup table. |
|||
_abbrConversionLookup = None Holds the abbreviated entity lookup table. |
|||
DB_RHOTACISED_FINAL_MAPPING =
Database fields for tonal Erlhuah syllables. |
|||
DB_RHOTACISED_FINAL_MAPPING_ZEROINITIAL =
Database fields for tonal Erlhuah syllables with i, u and iu medials. |
|||
DB_RHOTACISED_FINAL_APOSTROPHE =
Default apostrophe used by GR syllable data in database for marking the longer and back vowel in rhotacised finals. |
|||
Inherited from |
|
|||
Inherited from |
|
Creates an instance of the GROperator.
|
Returns the reading operator's default options. The default implementation returns an empty dictionary. The keyword 'dbConnectInst' is not regarded a configuration option of the operator and is thus not included in the dict returned.
|
Takes a string written in GR and guesses the reading dialect. The options
|
Returns a set of tones supported by the reading. These tones don't necessarily reflect the tones of the underlying language but may defer to reflect notational or other features. The default implementation will raise a NotImplementedError.
|
Composes the given list of basic entities to a string. Applies an apostrophe between syllables if the second syllable has a zero-initial.
|
Checks if the given decomposition follows the romanisation format strictly to allow unambiguous decomposition. The romanisation should offer a way/protocol to make an unambiguous decomposition into it's basic syllables possible as to make the process of appending syllables to a string reversible. The testing on compliance with this protocol has to be implemented here. Thus this method can only return true for one and only one possible decomposition for all strings.
|
Takes a string written in the romanisation and returns the possible segmentations as a tree of syllables. The tree is represented by tuples
|
Removes apostrophes between two syllables for a given decomposition.
|
Gets the tone number of the tone or the etymological tone if it is a neutral or optional neutral tone.
|
Splits the given plain syllable into consonants-vowels-consonants.
|
Gets the entity with tone mark for the given plain entity and tone. This method only works for plain syllables that are not r-coloured (Erlhuah forms) as due to the depiction of Erlhuah in GR the information about the base syllable is lost and pronunciation partly varies between different syllables. Use getRhotacisedTonalEntity() to get the tonal entity for a given etymological (base) syllable.
|
Splits the entity into an entity without tone mark (plain entity) and the entity's tone. The default implementation will raise a NotImplementedError.
|
Gets the r-coloured entity (Erlhuah form) with tone mark for the given plain entity and tone. Not all entity-tone combinations are supported.
To Do (Fix): Build lookup for performance reasons. |
Gets the abbreviated form lookup table.
|
Gets a list of abbreviated GR spellings.
|
Returns true if the given entity is an abbreviated spelling. Reading entities will be handled as being case insensitive.
|
Converts the given abbreviated GR spelling to the original form. Non-abbreviated forms will returned unchanged. Takes care of capitalisation.
To Do (Fix): Move this method to the Converter, AmbiguousConversionError not needed for import here then |
Gets the list of plain entities supported by this reading without r-coloured forms (Erlhuah forms). Different to getReadingEntities() the entities will carry no tone mark.
|
Gets a set of full entities supported by the reading excluding abbreviated forms.
|
Gets a set of all entities supported by the reading. The list is used in the segmentation process to find entity boundaries.
|
Returns true if the given entity is recognised by the romanisation operator, i.e. it is a valid entity of the reading returned by the segmentation method. Reading entities will be handled as being case insensitive.
|
|
TONES
|
SYLLABLE_STRUCTURERegular expression describing the syllable structure in GR (C,V,C).
|
DB_RHOTACISED_FINAL_MAPPINGDatabase fields for tonal Erlhuah syllables.
|
DB_RHOTACISED_FINAL_MAPPING_ZEROINITIALDatabase fields for tonal Erlhuah syllables with i, u and iu medials.
|
Home | Trees | Indices | Help |
|
---|
Generated by Epydoc 3.0.1 on Tue May 19 22:38:37 2009 | http://epydoc.sourceforge.net |