Apache OpenOffice (AOO) Bugzilla – Issue 63572
Rubies (phonetics guide) dialog should recognize word stems
Last modified: 2013-02-07 22:37:57 UTC
Rubies dialog is not fully functional. Japanese language support is on. User enters text in Japanese, and selects it. User invokes "Format/Asian phonetics guide..." from menu. What's supposed to happen is that the software identifies kanji word stems ("base text") in the selected text, and puts them one to each box in the "base text" column of the dialog box, and in the right-hand column, it should insert the furigana ("ruby text") for each base text item. The user can edit them by inserting an alternative pronunciation, or by deleting the furigana for some items. Then the user clicks "Apply", and rubies (furigana) are applied to all the word stems accordingly. There are several ways in which this doesn't work properly in OOo: 1) Instead of picking up just word stems, it picks up whole words as they appear in dictionaries, so if you write 行きます, (ikimasu -- the present or future tense of "to go"), it picks up 行き, (iki -- the gerund of "to go"), when it should pick up 行, (i -- the stem of "to go"). It leads to incorrect positioning of furigana. Picking up only word stems shouldn't be hard from the programming point of view, since they are sequences of kanji characters, and can be identified by the simple expedient of checking the unicode value of the characters to see if they are in the range of kanji characters, as opposed to kana, space, punctuation, or some other (or by using a dictionary subset that contains only kanji stems, no inflected forms or words spelled in kana). 2) The software doesn't produce the furigana, so the user has to type them all in by hand. From the way the software picks up words for the "base text" fields, it is evident that it is using a dictionary. If that dictionary contains entries for the pronunciation of each word contained therein, it ought to be a small matter to look up the pronunciation and insert that into the associated "ruby text" field. Sadly, the software doesn't bother, and the user is left with the tedious job of doing it by hand. In some situations, this could involve looking up a separate dictionary. 3) The system is completely fooled by punctuation marks. If the user selects an area of text that includes a comma (、), full-stop (。), or other punctuation mark, all the rest of the selected area, including the punctuation mark will be inserted into a single ruby text field. This adds to the tedium of adding furigana to text, because only one phrase can be handled at a time, rather than a substantial passage of text being all handled at once with a single invocation of the dialog. 4) The software ought to remember the furigana entered by the user, so that it can be automatically used again the next time the pronunciation guide dialog box is invoked. If the ruby dialog were fully functional, a user could select a whole page of text, invoke the phonetics guide once, click "Apply", and see furigana for all the words in passage. As it stands now, the user would have to invoke the dialog dozens of times, and type in the pronunciation for each word by hand, in a procedure more tedious than it would be to type out the whole passage twice, once with kanji and once without.
Reassigned to SBA.
I am sad to see that this issue is still marked "unconfirmed" after so long. It appears that the OOo developers don't appreciate the importance of furigana (aka "rubi") to those of us who wish to write or read documents in Japanese. It is *fundamental*. If you scan through the bug list, there are lots of queries about why the feature doesn't appear to work properly, or seems to be incomplete. In one of those reports, someone incorrectly states that other WPs don't have proper furigana/rubi feature, so why should OOo? As a matter of fact, Word 2000 has the feature. It makes a HUGE difference, and it would not be a huge programming project. Public domain dictionaries already exist, and the basic framework for the feature also already exists in the code. I would install OOo 1.x, Japanese version, but it appears that it and 2.x can't live side-by-side on the same PC (I need to have an English language UI available). Therefore, I'm giving up on OOo for the time being, and getting a beta copy of Word 2007. Maybe by the time the beta licence runs out, this issue will have been fixed, and bilingual Western/Japanese WP will be feasible with OOo.
"resolved remind" isn't a good status if the problem is still an issue - however I cannot reproduce. When using "行きます" and selecting it and using the Ruby feature, all four characters/symbols are entered individually in the dialog (see screenshot) Please do not write multiple request/bugs into one single issue this makes dealing with these issue much, much harder. Please also keep in mind that most developers and QA-People don't have knowledge of complex scripts or asian text at all. Regarding 2) I don't think that OOo uses a dictionary right now. I can reproduce 3), but please file a seperate, dedicated issue for this. For 4) please file a seperate issue as well.
Created attachment 38278 [details] no combined stuff in the dialog
The issue type "defect" means "This is a bug to fix" because "Defined behavior does not work as designed". But this one looks more like a huge pile of feature implementations are missing. I adjusted the summary to give a hint what this issue is about. I change the issue type to "Feature" and confirm. Confirming means: This is a valid request for a re-work of the ruby implementation and its UI in general. But... It is as valid as thousands of other existing requests that "only" wait for developer resources to come by... Reassigned to requirements.
Created attachment 72666