Trees | Indices | Help |
|
---|
|
Utility functions used by to prepare an arabic text to search and index .
|
|||
Indivudual Functions | |||
---|---|---|---|
unicode. |
|
||
unicode. |
|
||
unicode. |
|
||
unicode. |
|
||
unicode. |
|
||
Normalize One Function | |||
unicode. |
|
|
|||
__package__ =
|
|
Strip vowel from a text and return a result text. The striped marks are :
Example: >>> text=u"الْعَرَبِيّةُ" >>> strip_tashkeel(text) العربية
|
Strip tatweel from a text and return a result text. Example: >>> text=u"العـــــربية" >>> strip_tatweel(text) العربية
|
Normalize Hamza forms into one form, and return a result text. The converted letters are :
Example: >>> text=u"أهؤلاء من أولئكُ" >>> normalize_hamza(text) اهءلاء من اولءكُ
|
Normalize Lam Alef ligatures into two letters (LAM and ALEF), and return a result text. Some systems present lamAlef ligature as a single letter, this function convert it into two letters, The converted letters into LAM and ALEF are :
Example: >>> text=u"لانها لالئ الاسلام" >>> normalize_lamalef(text) لانها لالئ الاسلام
|
Normalize some spellerrors like, TEH_MARBUTA into HEH,ALEF_MAKSURA into YEH, and return a result text. In some context users omit the difference between TEH_MARBUTA and HEH, and ALEF_MAKSURA and YEh. The conversions are:
Example: >>> text=u"اشترت سلمى دمية وحلوى" >>> normalize_spellerrors(text) اشترت سلمي دميه وحلوي
|
Normalize input text and return a result text. Normalize a text by :
Example: >>> text=u'أستشتري دمـــى آلية لأبنائك قبل الإغلاق' >>> normalize_searchtext(text) استشتري دمي اليه لابناءك قبل الاغلاق
|
Trees | Indices | Help |
|
---|
Generated by Epydoc 3.0.1 on Wed Feb 15 13:46:35 2017 | http://epydoc.sourceforge.net |